8+ years shipping software. 5 years deep in backends and full-stack (Python, Java, Node.js) β then LLMs hit, and I went all-in. No looking back.
3 years in, I design and ship production multi-agent systems β agentic RAG pipelines that retrieve with precision, reason across tools, evaluate themselves, fail gracefully, and don't bankrupt the company on API calls. If a system can't show you why it gave that answer, it shouldn't be in production.
- 13K+ users on SciWeave β multi-agent RAG across 250M+ papers, handling 10K+ monthly queries with cited answers in <6 seconds
- 10x cost reduction ($90 β $9/month) via hybrid DeBERTa + LLM classification across 275 intent labels, semantic caching & tiered routing
- 60% latency reduction on multi-agent pipelines with parallel execution, 5-layer caching & dual-provider failover
π SciWeave Β· π RepoScout Β· π« [email protected] Β· LinkedIn
π RepoScout β AI-Powered Open Source Intelligence Engine
5-stage agentic pipeline across 85K+ Python packages Β· hybrid Mistral + OpenAI model selection optimized per stage Β· autonomous tool-calling with up to 8 reasoning iterations Β· 85K+ semantic embeddings on Qdrant Cloud Β· Supabase over 2.1M+ dependency signals Β· SSE streaming with conversation follow-ups.
π View Project
𧬠Clinical Evidence IQ β Multi-Agent Clinical Research & Safety Engine (under NDA)
LangGraph multi-agent workflow: query_analyzer β researcher β critic β safety_checker β synthesizer with conditional routing and retry loops Β· Qdrant-backed retrieval over 5K+ domain papers Β· dual-model inference Β· inline citations + confidence scoring.
- Advanced RAG (Self-RAG, Hierarchical, Adaptive) with 13+ DSPy modules across a 4-phase parallel pipeline
- Multi-agent orchestration with dual-provider failover β 60% latency reduction, 30% fewer LLM calls
- NL-to-SQL pipelines with hallucination guardrails
- Hybrid retrieval: BM25 + dense embeddings + cross-encoder reranking
- Qdrant, FAISS, Chroma, Pinecone, Elasticsearch, Supabase, PostgreSQL
- Multimodal document systems: layout analysis, figure extraction, table parsing, vision models
- 4-tier query routing β 15% retrieval precision improvement
- Hybrid DeBERTa + LLM classification for 275 intent labels (83% accuracy at 95% confidence)
- 5-layer caching, semantic caching, tiered routing
- 10x cost reduction ($90 β $9/month)
- RAG evaluation on QASA benchmark using RAGAS & LLM-as-judge β 6.3% context recall gain, 0% faithfulness loss
- Analyzed 40K+ queries across personas to drive complexity-aware routing
- Tool-use flows, function calling, structured outputs, custom MCP servers
- MCP servers for K8s tunneling, SQL safety, resource lifecycle management
- 8+ years across monoliths and microservices
- REST, GraphQL, async pipelines, distributed workflows
- AWS (Lambda, S3, SQS, DynamoDB, SageMaker, Bedrock), Docker, Kubernetes, MLFlow
- React, Next.js, TypeScript, shadcn/ui, Vercel
- Full-stack AI app (RepoScout) built end-to-end
- π€ Multi-agent systems β orchestration patterns, handoff protocols, memory architectures
- π§ Tool-use & function calling β making agents actually do things reliably
- πΈοΈ Graph RAG & knowledge graphs β structured reasoning over unstructured data
- π§© Claude Agent SDK & MCP β building with the next generation of agent infrastructure
- π― Google Certified TensorFlow Developer β scored 100%
- π President's Honor List β Post Graduate Certificate, Seneca College, Toronto
- π₯ Gold Medalist β B.E. Computer Engineering, VNSGU, India
Building AI systems that work in production, not just in notebooks.

