PhD Student @cambridgeltl | NLP
- University of Cambridge
- Cambridge, England
- sharanmaiya.com
- @_maiush
- in/sharanmaiya
Highlights
- Pro
Popular repositories
- LP-as-a-Judge (Public)
Experiments on the use of linear classifier heads for LLM-as-a-judge tasks.
Jupyter Notebook 2
- repeng (Public, forked from vgel/repeng)
A library for making RepE control vectors
Jupyter Notebook 1
- OpenRLHF (Public, forked from OpenRLHF/OpenRLHF)
An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & RFT & Dynamic Sampling & Async Agent RL)
Python

