I am a CS Ph.D. candidate at the University of Pennsylvania (defending 2026). I build agentic RL post-training and inference systems for LLMs.
I am seeking full-time industry roles starting in 2026.
At Alibaba, I shipped Partial Overlapping, a high-priority feature in ROLL for production RL training of models with hundreds of billions of parameters on thousands of GPUs. I also open-sourced RLix and contributed to the ROME technical report. My work spans vLLM, Megatron-LM, and Ray.
- My personal website: taoluo.net
- My CV: taoluo.net/cv