This is the official repository of Fake it till You Make it: Reward Modeling as Discriminative Prediction (arXiv).
The code and checkpoints will be released soon.
This is the official repository of Fake it till You Make it: Reward Modeling as Discriminative Prediction (arXiv).
The code and checkpoints will be released soon.