Skip to content

Latest commit

 

History

History

README.md

RLLaVA Examples

Comprehensive examples for training and evaluating vision-language models with reinforcement learning.

Documentation

Directory Structure

examples/
├── algorithms/       # RL algorithm scripts (GRPO, RLOO, DAPO, etc.)
├── tasks/            # Task-specific training scripts
├── eval/             # Evaluation scripts
├── format_prompt/    # Prompt templates
├── reward_function/  # Reward functions
└── config.yaml       # Base configuration