zpqiu

Follow

💭

Vibe coding

alexchiu zpqiu

💭

Vibe coding

Follow

LLM RL @NVIDIA

32 followers · 36 following

NVIDIA
Beijing, China
zpqiu.github.io
https://scholar.google.com/citations?hl=zh-CN&user=xxDgpBYAAAAJ

Achievements

Achievements

Pinned Loading

NVIDIA-NeMo/RL NVIDIA-NeMo/RL Public

Scalable toolkit for efficient model reinforcement

Python 1.5k 300
nvidia-china-sae/mair-hub nvidia-china-sae/mair-hub Public

Jupyter Notebook 84 17
WLiK/LLM4Rec-Awesome-Papers WLiK/LLM4Rec-Awesome-Papers Public

A list of awesome papers and resources of recommender system on large language model (LLM).

2.3k 163
rl-infra-notes rl-infra-notes Public

Source-code level analysis of LLM RL training infra: async RL, weight sync, FP8, MoE routing | LLM RL 训练基础设施源码级分析

6