Skip to content

ritzz-ai/OPRM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 

Repository files navigation

Learning Ordinal Probabilistic Reward from Preferences

Paper Github Models

πŸ₯³ News

🧐 Getting Started

πŸ‘€ Coming soon!

😘 Citation

If you find this work helpful, please cite us.

@article{chen2026learning,
    title={Learning Ordinal Probabilistic Reward from Preferences},
    author={Chen, Longze and Wang, Lu and Shan, Renke and Gong, Ze and Luo, Run and Li, Jiaming and Luo, Jing and Wang, Qiyao and Yang, Min},
    journal={arXiv preprint arXiv:2602.12660},
    year={2026},
    url={https://arxiv.org/abs/2602.12660}
}

🫑 Attribution

Our implementation is based on a recent version of LlamaFactory.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors