LuLim14/Alignment_project


Implementation of the WARP algorithm (https://arxiv.org/pdf/2406.16768)
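For orientation, here is a minimal sketch of the three weight-averaging steps described in the WARP paper: an exponential moving average (EMA) anchor, spherical interpolation (SLERP) of two fine-tuned policies, and linear interpolation back towards the initialization. It is not taken from this repository's code. The function names, the flat lists of floats standing in for model weights, and the values of `mu`, `lam` and `eta` are all assumptions made for illustration.

```python
import math
from typing import List

Vector = List[float]  # toy stand-in for a flattened set of model weights


def ema_update(theta_ema: Vector, theta: Vector, mu: float) -> Vector:
    """Move the EMA anchor a fraction `mu` towards the current policy weights."""
    return [(1.0 - mu) * e + mu * t for e, t in zip(theta_ema, theta)]


def slerp(theta_init: Vector, theta_a: Vector, theta_b: Vector, lam: float) -> Vector:
    """Spherically interpolate the two task vectors (theta - theta_init)."""
    delta_a = [a - i for a, i in zip(theta_a, theta_init)]
    delta_b = [b - i for b, i in zip(theta_b, theta_init)]
    norm_a = math.sqrt(sum(x * x for x in delta_a))
    norm_b = math.sqrt(sum(x * x for x in delta_b))

    # Default to plain linear interpolation; used when the angle is degenerate.
    coef_a, coef_b = 1.0 - lam, lam
    if norm_a > 0.0 and norm_b > 0.0:
        dot = sum(x * y for x, y in zip(delta_a, delta_b))
        cos_omega = max(-1.0, min(1.0, dot / (norm_a * norm_b)))
        omega = math.acos(cos_omega)
        sin_omega = math.sin(omega)
        if abs(sin_omega) > 1e-8:
            coef_a = math.sin((1.0 - lam) * omega) / sin_omega
            coef_b = math.sin(lam * omega) / sin_omega

    return [i + coef_a * da + coef_b * db
            for i, da, db in zip(theta_init, delta_a, delta_b)]


def liti(theta_init: Vector, theta_merged: Vector, eta: float) -> Vector:
    """Linearly interpolate from the initialization towards the merged weights."""
    return [(1.0 - eta) * i + eta * m for i, m in zip(theta_init, theta_merged)]


# Toy example with 2-dimensional "weights" (hypothetical values).
theta_init = [0.0, 0.0]
theta_a = [1.0, 0.0]  # policy fine-tuned in one run
theta_b = [0.0, 1.0]  # policy fine-tuned in another run

anchor = ema_update(theta_init, theta_a, mu=0.1)
merged = slerp(theta_init, theta_a, theta_b, lam=0.5)
final = liti(theta_init, merged, eta=0.5)
```

In the paper these steps are applied iteratively, with the interpolated weights serving as the initialization for the next round. A real implementation would apply the same arithmetic to each parameter tensor of the model rather than to Python lists.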

Clone

git clone https://github.com/LuLim14/Alignment_project.git
cd ./Alignment_project/warp_algorithm

Prerequisites

Install the dependencies from 'requirements.txt':

pip install -U -r requirements.txt

Running the pipeline

python main.py --use_wandb '[False|True]' --path_to_checkpoints_reward_model [path to reward_model checkpoints directory] --checkpoint_theta_dir [path to train_checkpoints_theta directory] --checkpoint_final_dir [path to checkpoints final directory] --checkpoint_ema_dir [path to checkpoints ema directory]
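The flags in the command above could be parsed along the following lines. This is a sketch under assumptions, not the actual contents of `main.py`: the `str2bool` helper, the `build_parser` function, the defaults, and the example paths are hypothetical. Only the flag names come from the command itself. One detail worth noting is that `argparse` with `type=bool` would treat the string `'False'` as true (any non-empty string is truthy), so an explicit converter is used for `--use_wandb`.

```python
import argparse


def str2bool(value: str) -> bool:
    """Convert the strings 'True'/'False' (case-insensitive) to a bool."""
    lowered = value.strip().lower()
    if lowered in {"true", "1", "yes"}:
        return True
    if lowered in {"false", "0", "no"}:
        return False
    raise argparse.ArgumentTypeError(f"expected True or False, got {value!r}")


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the flags shown in the command."""
    parser = argparse.ArgumentParser(description="Run the WARP pipeline (sketch)")
    parser.add_argument("--use_wandb", type=str2bool, default=False,
                        help="log metrics to Weights & Biases")
    parser.add_argument("--path_to_checkpoints_reward_model", type=str, required=True,
                        help="directory with reward model checkpoints")
    parser.add_argument("--checkpoint_theta_dir", type=str, required=True,
                        help="directory for policy training checkpoints")
    parser.add_argument("--checkpoint_final_dir", type=str, required=True,
                        help="directory for final checkpoints")
    parser.add_argument("--checkpoint_ema_dir", type=str, required=True,
                        help="directory for EMA checkpoints")
    return parser


# Example invocation with placeholder paths (hypothetical values).
args = build_parser().parse_args([
    "--use_wandb", "True",
    "--path_to_checkpoints_reward_model", "./reward_model",
    "--checkpoint_theta_dir", "./train_checkpoints_theta",
    "--checkpoint_final_dir", "./checkpoints_final",
    "--checkpoint_ema_dir", "./checkpoints_ema",
])
```

Passing an explicit list to `parse_args` keeps the example runnable on its own; a script would normally call `parse_args()` with no arguments to read `sys.argv`.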

Link to the report

https://api.wandb.ai/links/lulim/apf1bg9x

About

Implementation of the WARP algorithm for LLM alignment
