Welcome to my GitHub profile! I am an M.S. Data Science student at the University of Colorado Boulder (expected May 2026). I focus on building reproducible data and ML workflows, evaluation artifacts, and validation gates so results are reviewable and deployment-ready.
π Vista, CA
π LinkedIn: https://www.linkedin.com/in/rohan-jain11
π« Email: [email protected]
-
Master of Science in Data Science
University of Colorado Boulder
Expected May 2026 -
Bachelor of Engineering in Computer Engineering
Rajiv Gandhi Institute of Technology, Mumbai
Jul 2020 to May 2024
-
Data Science and Engineering Intern
Nexus Weather and Climate (May 2025 to Present)- Built and benchmarked ML pipelines for ensemble forecasting post-processing
- Created evaluation artifacts and metric-driven comparisons (MAE, MSE, CRPS)
- Implemented inference quality gates and reproducible run artifacts to improve reliability
-
Graduate Research Assistant
University of Colorado Boulder (Kopf Lab) (Dec 2025 to Present)- Reverse-engineering legacy binary formats and serialization patterns (MFC CArchive style)
- Building deterministic R readers with validation checks to make older data accessible again
-
AI PDF Chatbot for Research Papers
- Local-first project that lets users upload a PDF and chat to extract information efficiently
-
Diabetes Analytics Website (ML Models and Visualizations)
- Implemented multiple models (Naive Bayes, Logistic Regression, Decision Trees) with evaluation outputs and structured explainers
-
Yoga Posture Detection and Correction (Research Project)
- Developed a neural network approach for posture classification and correction, achieving 96.5% accuracy
-
Denver International Airport Data Integration Dashboard (Hackathon, 3rd Place)
- Designed a dashboard solution to streamline operational insights and reporting
-
Netflix Dashboard (Tableau)
- Built a Tableau dashboard to analyze engagement and content performance
- Programming and Databases: Python, R, SQL (MySQL, PostgreSQL), MongoDB
- ML and Analytics: model selection and tuning, feature engineering, time-series forecasting, probabilistic evaluation (CRPS)
- Libraries: scikit-learn, TensorFlow, PyTorch, pandas, NumPy, Matplotlib, Seaborn, Plotly
- Tools and Cloud: FastAPI, REST APIs, Git/GitHub, Tableau, Power BI, AWS, Azure, Apache Spark
- Data engineering foundations (pipelines, testing, reproducibility)
- Advanced SQL (MySQL and PostgreSQL)
- ML evaluation and reliability for forecasting workflows
- LinkedIn: https://www.linkedin.com/in/rohan-jain11
- GitHub: https://github.com/rohanjain11
Did you know that the first computer programmer was Ada Lovelace? She wrote an algorithm for Charles Babbageβs Analytical Engine in the mid-1800s.
Feel free to reach out for collaboration, internships, or a chat about data science and data engineering.
π§ [email protected]
