charu01smita28/README.md

Hi 👋, This is Charusmita Dhiman!

ai engineer

charu01smita28

8+ years of software engineering. 5 years building backends and full-stack systems in Python, Java, Node.js, then a hard pivot into AI/ML. Once I saw the power of LLMs and AI, I went all-in, no looking back.

Now 3 years deep in production LLM systems: I build multi-agent pipelines that retrieve with precision, evaluate themselves, fail gracefully, and don't bankrupt the company on API calls. If a system can't show you why it gave an answer, it shouldn't be in production.

  • 13K+ users on SciWeave — multi-agent RAG across 250M+ papers, handling 10K+ monthly queries with cited answers in <6 seconds
  • 10x cost reduction ($90 → $9/month) via hybrid DeBERTa + LLM classification across 275 intent labels, semantic caching & tiered routing
  • 60% latency reduction on multi-agent pipelines with parallel execution, 5-layer caching & dual-provider failover

🔗 SciWeave · 🔍 RepoScout · 📫 [email protected] · LinkedIn


🚀 What I'm building

🔍 RepoScout — AI-Powered Open Source Intelligence Engine

5-stage agentic pipeline across 85K+ Python packages · hybrid Mistral + OpenAI model selection optimized per stage · autonomous tool-calling with up to 8 reasoning iterations · 85K+ semantic embeddings on Qdrant Cloud · Supabase storing 2.1M+ dependency signals · SSE streaming with conversation follow-ups.

🔗 View Project


🛠️ Core Skills & Systems

🧠 LLM Systems & Reasoning

  • Advanced RAG (Self-RAG, Hierarchical, Adaptive) with 13+ DSPy modules across a 4-phase parallel pipeline
  • Multi-agent orchestration with dual-provider failover — 60% latency reduction, 30% fewer LLM calls
  • NL-to-SQL pipelines with hallucination guardrails
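
As a rough illustration of the dual-provider failover mentioned above, the sketch below tries providers in order and falls back when a call raises. The names `call_with_failover`, `flaky_primary`, and `stable_secondary`, along with the retry count, are hypothetical stand-ins and not the production implementation.

```python
from typing import Callable, Optional, Sequence


def call_with_failover(
    prompt: str,
    providers: Sequence[Callable[[str], str]],
    retries_per_provider: int = 1,
) -> str:
    """Try each provider in order and return the first successful response."""
    last_error: Optional[Exception] = None
    for provider in providers:
        for _ in range(retries_per_provider):
            try:
                return provider(prompt)
            except Exception as exc:  # a real system would catch narrower errors
                last_error = exc
    raise RuntimeError("all providers failed") from last_error


# Hypothetical stand-ins for two LLM provider clients (assumptions for the demo).
def flaky_primary(prompt: str) -> str:
    raise TimeoutError("primary provider timed out")


def stable_secondary(prompt: str) -> str:
    return f"secondary answered: {prompt}"


result = call_with_failover("What is RAG?", [flaky_primary, stable_secondary])
print(result)
```

Keeping the providers behind a plain callable interface means the ordering (and therefore the cost or latency preference) can be changed without touching the calling code.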

🗄️ Vector Search & Embedding Systems

  • Hybrid retrieval: BM25 + dense embeddings + cross-encoder reranking
  • Qdrant, FAISS, Chroma, Pinecone, Elasticsearch, Supabase, PostgreSQL
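
One common way to combine a BM25 ranking with a dense-embedding ranking is reciprocal rank fusion (RRF). The profile does not say which fusion method is actually used, so treat the following as a minimal sketch under that assumption; the document IDs and the constant `k = 60` are illustrative only.

```python
from collections import defaultdict
from typing import Dict, List


def reciprocal_rank_fusion(rankings: List[List[str]], k: int = 60) -> List[str]:
    """Fuse several ranked lists of document IDs into a single ranking."""
    scores: Dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            # Each list contributes 1 / (k + rank) for every document it contains.
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by ID so output is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


bm25_ranking = ["doc_a", "doc_b", "doc_c"]   # hypothetical lexical results
dense_ranking = ["doc_b", "doc_d", "doc_a"]  # hypothetical embedding results
fused = reciprocal_rank_fusion([bm25_ranking, dense_ranking])
print(fused)
```

In a full pipeline the fused candidates would then be passed to a cross-encoder for reranking; that step is omitted here because it depends on a specific model.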

📄 Document Intelligence & Multimodal QA

  • Multimodal document systems: layout analysis, figure extraction, table parsing, vision models
  • 4-tier query routing — 15% retrieval precision improvement

💸 LLM Cost Engineering

  • Hybrid DeBERTa + LLM classification for 275 intent labels (83% accuracy at 95% confidence)
  • 5-layer caching, semantic caching, tiered routing
  • 10x cost reduction ($90 → $9/month)
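
The tiered routing idea can be sketched as a confidence gate: a cheap classifier answers when it is confident, and only uncertain queries are escalated to an LLM. This is a simplified sketch, assuming the 95% figure above acts as the gating threshold; `classify_intent` and both stand-in classifiers are hypothetical, and the labels are made up for the demo.

```python
from typing import Callable, Tuple


def classify_intent(
    query: str,
    cheap_classifier: Callable[[str], Tuple[str, float]],
    llm_classifier: Callable[[str], str],
    threshold: float = 0.95,
) -> Tuple[str, str]:
    """Return (label, tier); the LLM is called only when the cheap model is unsure."""
    label, confidence = cheap_classifier(query)
    if confidence >= threshold:
        return label, "cheap"
    return llm_classifier(query), "llm"


# Hypothetical stand-ins: a small fine-tuned model and an LLM call.
def fake_small_model(query: str) -> Tuple[str, float]:
    if "refund" in query:
        return "billing.refund", 0.99
    return "unknown", 0.40


def fake_llm(query: str) -> str:
    return "general.question"


easy = classify_intent("I want a refund", fake_small_model, fake_llm)
hard = classify_intent("Tell me something odd", fake_small_model, fake_llm)
print(easy, hard)
```

The cost saving comes from how often the first branch is taken: every query resolved by the small model is one LLM call avoided.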

📊 Evaluation & Observability

  • RAG evaluation on QASA benchmark using RAGAS & LLM-as-judge — 6.3% context recall gain, 0% faithfulness loss
  • Analyzed 40K+ queries across personas to drive complexity-aware routing

🤖 Product-Driven AI

  • Tool-use flows, function calling, structured outputs, custom MCP servers
  • MCP servers for K8s tunneling, SQL safety, resource lifecycle management

⚙️ Backend & Distributed Systems

  • 8+ years across monoliths and microservices
  • REST, GraphQL, async pipelines, distributed workflows

☁️ Cloud & MLOps

  • AWS (Lambda, S3, SQS, DynamoDB, SageMaker, Bedrock), Docker, Kubernetes
  • MLflow, Terraform, Vercel

🎨 Frontend & Full-Stack

  • React, Next.js, TypeScript, shadcn/ui
  • Full-stack AI app (RepoScout) built end-to-end

🧭 Where my head is at

  • 🤖 Multi-agent systems — orchestration patterns, handoff protocols, memory architectures
  • 🔧 Tool-use & function calling — making agents actually do things reliably
  • 🕸️ Graph RAG & knowledge graphs — structured reasoning over unstructured data
  • 🧩 Claude Agent SDK & MCP — building with the next generation of agent infrastructure

🏆 Accolades

  • 🎯 Google Certified TensorFlow Developer — scored 100%
  • 🎓 President's Honor List — Post Graduate Certificate, Seneca College, Toronto
  • 🥇 Gold Medalist — B.E. Computer Engineering, VNSGU, India

Connect with me:

charusmitadhiman


Building AI systems that work in production, not just in notebooks.

Pinned

  1. reposcout (Public)

    AI-powered Open Source Intelligence Engine. Query 85K+ PyPI packages with semantic search, health scores, dependency graphs, and code analysis.

    TypeScript

  2. fastapi-elasticsearch-semantic-search (Public)

    Project demonstrating an Advanced Semantic Search Engine Microservice for products using FastAPI, Elasticsearch and NLP Sentence Transformers.

    Python 1

  3. pdf-chat-agent (Public)

    An AI agent for PDF-based question answering using OpenAI LLMs, LangChain, and the Chroma DB vector database with RAG.

    Python

  4. Multimodal_RAG (Public)

    A Grand Theft Auto Multimodal Retrieval-Augmented Generation (RAG) Application that leverages LanceDB as a vector database, OpenAI’s “ViT-L/14” for multimodal embedding, and the GTA-Image-Captionin…

    Python 1

  5. Investment-Genie (Public)

    Time series analysis and prediction: an investment algorithm aimed at optimizing profitability, currently using MSFT data from yfinance.

    Python

  6. Recruiting-Talent-Finder (Public)

    Designed an advanced ML workflow focused on optimizing talent acquisition costs through the efficient identification and screening of highly qualified candidates. Created an impactful ranking algor…

    Jupyter Notebook