Highlights
- Pro
Pinned Loading
-
sjelassi/ebft_openrlhf
sjelassi/ebft_openrlhf PublicCode for "Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models".
Python 13
-
optimizers-llm
optimizers-llm PublicCode for "Deconstructing What Makes a Good Optimizer for Language Models"
Python 3
-
openrlhf-pretrain
openrlhf-pretrain PublicCode for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
-
McGillAISociety/mcgillaiwebsite
McGillAISociety/mcgillaiwebsite PublicThe McGill AI Society Website
JavaScript 7
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.



