A Framework for LLM-based Multi-Agent Reinforced Training and Inference
RL training environments with verifiable rewards for coding agents. Works with TRL, Unsloth, verl, OpenRLHF.
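None of the listed repos prescribe this exact interface, but the idea behind "verifiable rewards" is concrete enough to sketch: instead of a learned reward model, the environment scores a coding agent's output by actually executing it against unit tests. Below is a minimal Python sketch, assuming pytest is installed; the function name, the binary scoring rule, and the `solution.py`/`test_solution.py` layout are illustrative assumptions, not the API of any repo above.

```python
import subprocess
import tempfile
from pathlib import Path

def verifiable_reward(solution_code: str, test_code: str, timeout: int = 10) -> float:
    """Binary reward: 1.0 if the generated solution passes the tests, else 0.0.

    Generic sketch, not the reward API of any specific framework.
    """
    with tempfile.TemporaryDirectory() as tmp:
        # Write the model's candidate solution and the hidden tests to disk.
        Path(tmp, "solution.py").write_text(solution_code)
        # The tests are expected to import the candidate, e.g. `from solution import f`.
        Path(tmp, "test_solution.py").write_text(test_code)
        try:
            result = subprocess.run(
                ["python", "-m", "pytest", "-q", "test_solution.py"],
                cwd=tmp,
                capture_output=True,
                timeout=timeout,
            )
        except subprocess.TimeoutExpired:
            return 0.0  # treat a hanging solution as a failure
        return 1.0 if result.returncode == 0 else 0.0
```

A function of this shape can typically be wired into the reward hook of trainers such as TRL or verl, though each framework defines its own callback signature.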
A list of uv environment templates for LLM development.
RLHF Annotation Studio: a web-based tool for collecting human preference data to train LLMs via Reinforcement Learning from Human Feedback (RLHF). Compare responses side by side, capture preferences, and export JSONL for reward model training.
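To make the JSONL export concrete, here is a small Python sketch of writing and reading preference pairs. The field names `prompt`/`chosen`/`rejected` are an assumption following the common convention used by reward-model trainers such as TRL; the Studio's actual export schema may differ.

```python
import json

# One preference pair per line. The "prompt"/"chosen"/"rejected" keys are the
# usual reward-modeling convention, assumed here rather than taken from the tool.
records = [
    {
        "prompt": "Explain what RLHF is in one sentence.",
        "chosen": "RLHF fine-tunes a language model with reinforcement learning, "
                  "using a reward model trained on human preference comparisons.",
        "rejected": "RLHF is a type of database.",
    },
]

with open("preferences.jsonl", "w", encoding="utf-8") as f:
    for rec in records:
        f.write(json.dumps(rec, ensure_ascii=False) + "\n")

# Read it back, e.g. to build a reward-model training dataset.
with open("preferences.jsonl", encoding="utf-8") as f:
    pairs = [json.loads(line) for line in f]
print(pairs[0]["chosen"])
```

JSONL keeps each comparison self-contained on its own line, which makes the file easy to stream, shard, and append to as annotation proceeds.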
🌐 Streamline LLM development with ready-to-use environment templates for efficient setup and deployment.