Skip to content

Venkat2811/Venkat2811

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

99 Commits
 
 
 
 

Repository files navigation

Mechanical Sympathy is All You Need

Hi 👋, I'm Venkat !

Twitter LinkedIn GitHub

Built and scaled systems that can handle 5k->250k RPS w/o breaking a sweat.

Got into model serving and inference, enjoyed solving cold start, intelligent routing and optimizing GPU cluster utilization. Did a bit of RAG & Agents infra. Currently ML Infra - training, inference, comms collectives, storage, compiler backends, custom kernels optimizations & researching novel techniques.

High Agency individual deep in agentic-engineering mode. AI tools have enabled me to touch end-to-end infra from user facing APIs & Infra to tensors to metal. Always looking to maximize my learning curve 📈

ʕ•ᴥ•ʔ venkat.systems


Highlights


Projects

  • 🐘 YALI - Ultra-low-latency GPU comms collective. Outperforms NVIDIA NCCL P2P by 1.2 - 2.4x.
  • ⏲️ Metered Compute - 5 reference architectures for reliably metering sync and async compute.
  • 🔍 Inference Assayer - Compiler driven models <> HWs inference perf analyzing deterministic fast simulator lab.
  • WIP / TBA

Technologies

HomeCodex Claude CLI macOS pi.dev Tailscale tmux AutoResearch
LanguagesRust Go Python Java CUDA English Markdown does it matter anymore?
InferencevLLM SGLang HuggingFace TensorRT-LLM Transformers
InfraK8s Helm Argo Docker NVIDIA Dynamo vLLM AIBrix
AcceleratorsPyTorch Triton CUTLASS CuBLAS Mojo ThunderKittens
StorageMySQL PostgreSQL Redis S3 SlateDB
MiddlewareKafka Apache Iggy NATS Redpanda ZeroMQ RabbitMQ
CloudAWS GCP Terraform Ansible
BuildEarthly Makefile Bash Bazel

Writings

Hashnode Medium Blogger

Acknowledgements

Inspired by

  • GitHub
  • GitHub
  • GitHub

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors