Welcome to my Github page!
I build things with data π οΈ. From distributed pipelines that ingest massive, diverse datasets to training language and vision models for practical machine learning systems, I work across the full ML stack, transforming messy, real-world problems into production-ready solutions.
Currently researching uncertainty quantification for medical image registration 𧬠at UCSD's BIAG Lab. On the side, I've been exploring LLM-based reasoning, agentic AI architectures π€, and low-resource NLP for underrepresented languages, staying curious about where the boundaries of reliable machine reasoning lie.
My industry experience spans MLOps and healthcare at a blood diagnostics biotech π¬, where I rebuilt ML training infrastructure for time series classification, and large-scale data engineering at a supply chain fintech π, where I built distributed pipelines, graph-based analytics tools, and NLP systems processing hundreds of thousands of company records.
Expanding my ML expertise at UC San Diego π through coursework spanning Statistical NLP, Computer Vision, AI Agents, and ML Systems. Before that, I did my undergrad at Ashoka University, where I got my first taste of research in privacy-preserving ML, adversarial attacks on ML models, and applied cryptography π.
Outside of code, I follow Chelsea FC β½, cricket π, and F1 ποΈ closely, and enjoy sci-fi, mystery, and thriller films π₯. I also enjoy swimming and hiking when I'm not staring at a screen π.
Things I code with:
Let's Connect π

