Build your own witty, tool-using, Indic-language-capable AI agent inspired by Grok — using only Python, LangChain, and llama.cpp.
Features:
- ReAct-style agent with tools (web search + Python REPL)
- Runs efficiently on CPU (or GPU) via llama.cpp GGUF models
- Fine-tuning support for Indic languages (Hindi, Marathi, etc.)
- Deployable on AWS EC2 Mumbai region (low-latency for Indian users)
- Total core logic < 400 lines
Perfect for Mumbai/Bengaluru devs who want local-first AI without spending ₹50k/month on cloud GPUs.
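The ReAct tool loop at the heart of the agent can be sketched in plain Python. The stub model and toy tools below are illustrative only — the repo wires a llama.cpp-backed model and real web-search/REPL tools into this same Thought → Action → Observation pattern:

```python
import re

# Toy tools standing in for the repo's web-search and Python-REPL tools.
def web_search(query: str) -> str:
    return f"Top result for '{query}' (stubbed)"

def python_repl(code: str) -> str:
    return repr(eval(code))  # the real agent sandboxes this, of course

TOOLS = {"web_search": web_search, "python_repl": python_repl}

def stub_llm(prompt: str) -> str:
    # Stand-in for the llama.cpp-backed model: asks for one tool call,
    # then answers. A real model generates these lines token by token.
    if "Observation:" in prompt:
        return "Final Answer: 4"
    return "Thought: I should compute this.\nAction: python_repl[2 + 2]"

def react_agent(question: str, llm=stub_llm, max_steps: int = 5) -> str:
    prompt = f"Question: {question}\n"
    for _ in range(max_steps):
        reply = llm(prompt)
        if "Final Answer:" in reply:
            return reply.split("Final Answer:", 1)[1].strip()
        match = re.search(r"Action: (\w+)\[(.*)\]", reply)
        if not match:
            break
        tool, arg = match.group(1), match.group(2)
        # Feed the tool result back so the model can reason over it.
        prompt += f"{reply}\nObservation: {TOOLS[tool](arg)}\n"
    return "Agent gave up."

print(react_agent("What is 2 + 2?"))  # -> 4
```

Swapping `stub_llm` for a real completion function (e.g. one backed by a GGUF model) is all that changes in the full agent; the loop itself stays this small.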
Quick Start

- Install dependencies:

  ```bash
  pip install -r requirements.txt
  ```

- Download Llama 3.1 8B GGUF (Q5_K_M recommended):

  ```bash
  huggingface-cli download TheBloke/Llama-3.1-8B-GGUF llama-3.1-8b.Q5_K_M.gguf --local-dir ./models
  ```

- Run a simple chat test:

  ```bash
  python inference_test.py
  ```

- Run the full FastAPI server:

  ```bash
  uvicorn app:app --reload --port 8000
  ```

Then visit http://localhost:8000/docs and try the /chat endpoint.
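From Python, the /chat endpoint can be called with nothing beyond the standard library. The JSON field names below (`message`, `session_id`, `reply`) are assumptions for illustration — check the generated schema at /docs for the actual shape:

```python
import json
import urllib.request

# Hypothetical request body -- field names are assumptions; confirm at /docs.
payload = {"message": "Namaste! What's the weather in Mumbai?", "session_id": "demo-1"}

req = urllib.request.Request(
    "http://localhost:8000/chat",
    data=json.dumps(payload).encode("utf-8"),
    headers={"Content-Type": "application/json"},
    method="POST",
)

# To actually send it (server must be running):
# with urllib.request.urlopen(req) as resp:
#     print(json.load(resp)["reply"])
```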
Indic Fine-Tuning (Optional)

```bash
python fine_tune_indic.py
```

Requires a GPU, or patience (takes ~4–12 hours on a t4g.medium EC2 instance).

Deploy on AWS Mumbai (ap-south-1)

```bash
bash deploy_ec2.sh
```

See deploy_ec2.sh for full instructions.
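To sanity-check a fine-tuning run time before launching one, it helps to count optimizer steps. The helper and all numbers below are illustrative assumptions (dataset size, batch settings, epochs), not the repo's actual configuration:

```python
def training_steps(dataset_size: int, per_device_batch: int,
                   grad_accum: int, epochs: int) -> int:
    """Optimizer steps for a full fine-tune run (illustrative helper,
    not part of the repo)."""
    effective_batch = per_device_batch * grad_accum
    return (dataset_size // effective_batch) * epochs

# E.g. a hypothetical 50k-example Hindi/Marathi instruction set:
steps = training_steps(50_000, per_device_batch=4, grad_accum=8, epochs=3)
print(steps)  # -> 4686
```

At a few seconds per step on CPU-class hardware, a run of this size lands in the same hours-long ballpark as the estimate above; fewer epochs or a smaller dataset shortens it proportionally.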