Muaaz Ahmad muaazdev

Hi, I'm Muaaz Ahmad 👋

Senior AI/ML Engineer with 4+ years of production experience building end-to-end AI systems — from RAG pipelines and multi-agent architectures to real-time voice agents and GPU-optimized inference services.

Currently based in Dammam, Saudi Arabia.

💼 What I Do

I build AI systems that actually ship to production — not just prototypes. My work spans the full lifecycle: architecture, training, optimization, deployment, and monitoring.

LLMs & Generative AI — RAG pipelines, multi-agent systems (LangGraph, CrewAI), LLM fine-tuning (LoRA, QLoRA), prompt engineering
MLOps & Infrastructure — CI/CD for ML (GitHub Actions), Docker, AWS (ECS, EC2, Lambda, SageMaker), model serving, auto-scaling
Computer Vision — Object detection & tracking (YOLO, ByteTrack), segmentation (SAM), pose estimation, video analytics
Voice AI — Real-time voice agents with Whisper (STT), custom TTS, VAPI, Twilio/Telnyx integration
Backend — FastAPI async services, Redis caching, request batching, <200ms p95 latency at scale

🏗️ Production Experience Highlights

No-Code Multi-Agent Builder Platform — Built a platform where users create and deploy custom AI agents without coding. Deployed on AWS ECS with Docker, auto-scaling, and load balancing handling concurrent users with sub-second latency.

High-Throughput LLM Inference Service — Designed FastAPI async endpoints with request batching and Redis caching achieving 10x latency reduction and <200ms p95 latency under load.

LegalMind — Modular RAG system for legal document Q&A featuring hybrid search (vector + BM25), Cohere cross-encoder reranking, mandatory citation validation, and AI evaluation agents (adversarial test generation, hallucination detection, citation verification) with CI/CD pipeline.

Real-Time Sports Video Analytics — Custom YOLO training for player/puck tracking in ice hockey. Optimized from 8 FPS to 25+ FPS via TensorRT quantization and GPU optimization.

Real-Time Voice Agent — Built calling agent using Twilio/Telnyx with Whisper for STT and custom TTS. Sub-1s end-to-end latency with VAD-based turn-taking and interruption handling.

Stable Diffusion Production APIs — Fine-tuned on custom datasets. Built production serving with request queuing and horizontal scaling for face restoration, AI aging, and image editing.

🚀 Open Source Projects

Project	What It Does	Stack	Status
DocMind	Multi-tenant RAG platform with hybrid search, reranking & evaluation	LangChain, Pinecone, FastAPI, Docker	✅ Live
LegalMind	Legal document Q&A with hybrid search, citation validation & eval agents	LangChain, Cohere, FastAPI, CI/CD	✅ Live
AgentForge	Multi-agent task orchestration with LangGraph state machines	LangGraph, CrewAI, MCP, Python	🔨 Coming Soon
CallBot AI	AI voice agent — answers calls, books appointments, syncs CRM	VAPI, Twilio, OpenAI, FastAPI	🔨 Coming Soon
FlowPilot	AI-powered workflow automation suite	n8n, Python, OpenAI API	🔨 Coming Soon
MCP Toolkit	Production-ready MCP servers for LLM tool integration	FastMCP, Python, Docker	🔨 Coming Soon
SmartServe	AI customer support system with RAG + agent handoff	LangGraph, FastAPI, React	🔨 Coming Soon

🛠️ Tech Stack

LLMs & GenAI:     LangChain, LangGraph, CrewAI, MCP, LlamaIndex, OpenAI, Claude, Gemini
                  Fine-tuning: LoRA, QLoRA, PEFT | Models: Llama, Mistral, Gemma
RAG:              Pinecone, ChromaDB, Qdrant, Weaviate, FAISS, Cohere Reranking
Voice AI:         VAPI, Twilio, Telnyx, Whisper, Deepgram, ElevenLabs, Coqui TTS
Computer Vision:  YOLO, ByteTrack, SAM, TensorRT, OpenCV, Stable Diffusion
MLOps:            Docker, GitHub Actions CI/CD, AWS ECS, Model Serving, Auto-scaling
                  vLLM, ONNX, TensorRT, Prefect
Cloud:            AWS (ECS, EC2, S3, Lambda, SageMaker, CloudWatch), GCP
Backend:          Python, FastAPI, Node.js, REST APIs, WebSockets, Async Programming
Data:             PostgreSQL, MongoDB, Redis, Vector DBs, Pandas, NumPy
ML/DL:            PyTorch, TensorFlow, scikit-learn, Hugging Face Transformers

📫 Let's Connect

🌐 Website: muaazdev.com
💬 LinkedIn: muaazdev
📧 Email: [email protected]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Muaaz Ahmad muaazdev

Achievements

Achievements

Highlights

Block or report muaazdev

Hi, I'm Muaaz Ahmad 👋

💼 What I Do

🏗️ Production Experience Highlights

🚀 Open Source Projects

🛠️ Tech Stack

📫 Let's Connect

📊 GitHub Stats:

Pinned Loading

Uh oh!