💭
Vibe coding
LLM RL @NVIDIA
-
NVIDIA
- Beijing, China
- zpqiu.github.io
- https://scholar.google.com/citations?hl=zh-CN&user=xxDgpBYAAAAJ
Pinned Loading
-
NVIDIA-NeMo/RL
NVIDIA-NeMo/RL PublicScalable toolkit for efficient model reinforcement
-
-
WLiK/LLM4Rec-Awesome-Papers
WLiK/LLM4Rec-Awesome-Papers PublicA list of awesome papers and resources of recommender system on large language model (LLM).
-
rl-infra-notes
rl-infra-notes PublicSource-code level analysis of LLM RL training infra: async RL, weight sync, FP8, MoE routing | LLM RL 训练基础设施源码级分析
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.


