YB96/README.md

Profile Views GitHub followers


╔══════════════════════════════════════════════════════╗
║                                                      ║
║        > Yash Prakash Bhairao                        ║
║        > Role     : Data Engineer                    ║
║        > Focus    : Pipelines · dbt · Airflow        ║
║        > Cloud    : AWS                              ║
║        > Stack    : Docker · Python · SQL            ║
║        > Status   : Building in production 🚀        ║
║                                                      ║
╚══════════════════════════════════════════════════════╝

⚡ GitHub Activity

Yash's GitHub Stats

GitHub Streak

Top Languages


πŸ—οΈ Data Engineering Stack

Apache Airflow dbt Docker

AWS AWS S3 AWS Redshift AWS Glue

Python SQL R

Pandas NumPy Scikit-learn TensorFlow

Tableau Power BI Plotly


🚀 Featured Projects

🛒 DBT Data Pipeline

Docker ETL Workflow

End-to-end pipeline built on dbt: raw data is extracted from Postgres, transformed with dbt, orchestrated with Airflow DAGs, and containerised in Docker.

Airflow dbt AWS Docker

🧬 ML Classification Pipeline

Breast Cancer Analysis

ML pipeline with feature engineering and model training. Containerised with Docker, deployable on AWS EC2, validated with statistical EDA.

Scikit-learn Docker AWS

🛍️ Scraper + Ingestion Pipeline

Flipkart Automation

Automated scraper loading iPhone listings into AWS RDS via a scheduled Airflow DAG. Downstream reporting in Power BI.

Airflow AWS Power BI

📐 Pipeline Architecture

Sources → Ingest  (Python)
        → Store   (S3 / RDS)
        → Model   (dbt)
        → Run     (Airflow)
        → Serve   (Redshift)
        → Viz     (Power BI)
🐳 Docker everywhere
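
The stage order in the diagram can also be written down as data. The snippet below is only an illustrative sketch: the `Stage` class and the `downstream_of` and `render` helpers are hypothetical names that do not come from any of the repositories, and treating the pipeline as a strictly linear list is a simplifying assumption (Airflow orchestrates the other stages rather than running as a step between them).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Stage:
    """One step of the pipeline and the tool that handles it."""
    name: str
    tool: str


# Stage names and tools are copied from the architecture diagram;
# the linear ordering is an assumption made for illustration.
PIPELINE = [
    Stage("Ingest", "Python"),
    Stage("Store", "S3 / RDS"),
    Stage("Model", "dbt"),
    Stage("Run", "Airflow"),
    Stage("Serve", "Redshift"),
    Stage("Viz", "Power BI"),
]


def downstream_of(stage_name: str) -> list[str]:
    """Return the names of every stage listed after `stage_name`."""
    names = [stage.name for stage in PIPELINE]
    return names[names.index(stage_name) + 1:]


def render(pipeline: list[Stage] = PIPELINE) -> str:
    """Render the pipeline as a single-line summary."""
    steps = " -> ".join(f"{stage.name} ({stage.tool})" for stage in pipeline)
    return f"Sources -> {steps}"


if __name__ == "__main__":
    print(render())
    print(downstream_of("Model"))
```

A helper like `downstream_of` is the kind of lookup that helps when deciding what needs a rerun after a change to a dbt model.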

📊 Contribution Graph

Activity Graph


"Data holds the key to unlocking knowledge. Let's explore its secrets together." - Yash Prakash Bhairao

⭐ Star a repo if you find it useful!

Pinned

  1. EDA-on-Brazilian-E-Commerce-Olist (Public): Jupyter Notebook
  2. Portfolio (Public template): HTML
  3. automated-flipkart-spreadsheet (Public): Python
  4. web-scraping (Public): Jupyter Notebook