Senior Data Scientist at Autodesk with 6+ years of experience building production ML systems that deliver measurable business impact. I specialize in the full ML lifecycle โ from feature engineering and model development to deployment, monitoring, and stakeholder dashboards.
- ๐ข Autodesk โ ML for cloud optimization, anomaly detection, LLM-powered tooling
- ๐ Expedia โ Large-scale propensity modeling for 200M+ customers
- ๐ค Tata Elxsi โ CNN-based health monitoring & deep learning R&D
- ๐ Bay Area, CA ย |ย ๐ MS Computer Science, Penn State
๐ฐ ~$4M annual cloud compute savings via cost-aware ML model at Autodesk
๐ 12% reduction in customer opt-out rates at Affine Analytics
๐ฅ 200M+ customers served through personalized recommendation systems
๐จ Real-time API anomaly detection preventing production incidents
๐ค Text-to-GraphQL interface via LLMs for streamlined developer experience
Languages
ML / AI
Data & Cloud
MLOps & Tooling
Chat with Documents โ Advanced RAG
A local RAG web app with two production-grade pipelines built on top of a swappable LLM/embedding/vectorstore backend.
| Feature | Details |
|---|---|
| ๐ Document Q&A | Upload PDFs โ persistent vector store โ grounded answers with citations (filename, page, excerpt) |
| ๐ฏ Resume Tailor | 6-agent pipeline: Resume Understanding โ Job Analysis โ Gap Analysis โ Suggestions โ Tailoring โ Judge |
| ๐ Runtime config | Swap LLM, embeddings & vectorstore from the UI without restarting |
| ๐ค Model support | Ollama (local) ยท OpenAI ยท Anthropic |
Python LangChain RAG Vector Store Agentic AI Ollama OpenAI Anthropic
Snowflake MCP Server โ Pure Async
A production-ready MCP (Model Context Protocol) server for Snowflake using the low-level async API โ giving full control over server lifecycle, tool registration, and async execution.
| Feature | Details |
|---|---|
| ๐ Tools exposed | execute_query, list_databases, list_schemas, list_tables, describe_table, check_database_exists |
| ๐ก๏ธ Query safety | Read-only validation (SELECT, WITH, SHOW, DESCRIBE, EXPLAIN only) |
| โ๏ธ Production features | Persistent connection ยท health checks ยท timeout control ยท query tagging ยท cache control ยท row limiting |
| ๐ Config | Pydantic models for type-safe, validated configuration with clear startup errors |
Python MCP Snowflake Async LLM Tooling Pydantic
| Project | Description | Stack |
|---|---|---|
| ๐ซ COVID-19 Chest X-ray Detection | Transfer learning with VGG16 for binary medical image classification | PyTorch ยท VGG16 ยท OpenCV |
| Project | Description | Stack |
|---|---|---|
| ๐ท๏ธ StackOverflow Tag Predictor | Multi-label classifier on 6M+ posts | scikit-learn ยท TF-IDF ยท Linear Models |
| ๐งฎ Neural Network from Scratch | Two-layer NN with backprop โ no frameworks | Python ยท NumPy |
current_focus = {
"LLMs": ["Fine-tuning", "RAG systems", "Agentic workflows"],
"MLOps": ["Feature stores", "Model monitoring", "SageMaker pipelines"],
"GenAI": ["LangChain", "LangGraph", "Prompt engineering"],
}
