Skip to content
View bwang008's full-sized avatar

Block or report bwang008

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
bwang008/README.md

Greetings!πŸ‘‹

πŸ‘©β€πŸ’» About Me

Machine Learning Engineer | Platform & Infrastructure with 10+ years architecting high-availability data systems and ML pipelines for mission-critical utilities. Recently completed M.S. Computer Science (Machine Learning), Georgia Tech. Expert in productionizing ML from ETL to deployment across complex domains like grid operations and financial modeling.

  • πŸ”­ Building production ML pipelines and GenAI tooling (RAG, RL agents)
  • 🌱 Deep expertise: PyTorch, Transformers, Lakehouse (Databricks/PySpark), Docker/K8s
  • ⚑ Passion: Turning messy real-world data into reliable, scalable intelligence

πŸ›  Tech Stack

python pytorch docker k8s linux git aws gcp

πŸš€ Production Experience & Projects

PG&E | Senior Network Model & Automation Lead (2022-2024)
Architected ADMS graph database for millions of grid assets; built Python automation reducing manual validation 40% across 20+ engineers; deployed anomaly detection on live network models. [file:1]

SCE | Lead Network Model Engineer (2012-2022)
Productionized ML prototypes (Random Forest wildfire prediction, genetic algorithms for circuit phasing); led data standardization cutting redundancy 30%; mission-critical Bash automation for grid control systems.

CL Futures ML Pipeline | github.com/bwang008/CL_Analyst
End-to-end MLOps: ETL β†’ feature engineering (100+ indicators w/ Numba) β†’ LightGBM β†’ walk-forward validation. 15% edge over random on skewed breakout prediction.

GenAI RAG Tool | LangChain, OpenAI API, ChromaDB, Docker
Production-ready internal doc search using retrieval-augmented generation for engineering teams.

RL Agent Pipeline | PyTorch, Gymnasium
Modular continuous-control RL with hyperparameter benchmarking (Adam vs Genetic Algorithms).

Streaming QoS Lakehouse | Databricks, PySpark, Delta Lake
Medallion pipeline for high-throughput telemetry; schema enforcement, PII masking, optimized partitioning.

Pinned Loading

  1. LifeSimulator LifeSimulator Public

    Simple example of life forming and interacting between plants, herbivores, and carnivores

    HTML 1

  2. Antigravity_AutoClick Antigravity_AutoClick Public

    Tool to auto-proceed steps which mimics actions outside of the IDE and does not rely on extensions.

    Python

  3. genai-knowledge-retrieval genai-knowledge-retrieval Public

    A containerized RAG pipeline for querying internal documentation using LangChain, Gemini, and ChromaDB.

    Python

  4. KnightWalk KnightWalk Public

    JavaScript

  5. PrisonerProblem PrisonerProblem Public

    HTML