ZhaolinGao

Pinned

  1. REBEL Public

    Reinforcement Learning via Regressing Relative Rewards

    Python, 40 stars, 11 forks

  2. REFUEL Public

    Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

    Python, 24 stars, 2 forks

  3. Reviewer2 Public

    Optimizing Review Generation Through Prompt Generation

    Python, 15 stars

  4. TD-VAE-CF Public

    Mitigating the Filter Bubble while Maintaining Relevance: Targeted Diversification with VAE-based Recommender Systems

    Python, 10 stars

  5. LangPTune Public

    End-to-end Training for Recommendation with Language-based User Profiles

    Python, 11 stars, 1 fork

  6. A-PO Public

    Accelerating RL for LLM Reasoning with Optimal Advantage Regression

    Python, 40 stars, 1 fork