Lingling Fan (lfopensource)

🎓 Ph.D. in Electrical Engineering from Stanford University.
👀 I’m interested in LLM Inference & Serving, with a focus on Quantization and Parallelism (e.g., Parallel Decoding, Speculative Decoding).
🌱 Currently focused on:
- CUDA Kernel Optimization
- Model Deployment & Serving Infrastructure (Paged KV Cache, Continuous Batching)
- Post-training (RLHF, Distillation, Flow-matching)
📫 How to reach me: [email protected]
😄 Pronouns: She/Her
