Skip to content
Change the repository type filter

All

    Repositories list

    • creating-agents

      Public
      Agents for science platform
      Python
      5101Updated Apr 22, 2026Apr 22, 2026
    • mcgill-nlp.github.io

      Public
      Python
      26153Updated Apr 21, 2026Apr 21, 2026
    • agent-as-annotators

      Public
      Agent-as-Annotators: Structured Distillation of Web Agent Capabilities
      Python
      0300Updated Apr 14, 2026Apr 14, 2026
    • llm2vec-gen

      Public
      Code for `LLM2VEC-GEN: Generative Embeddings from Large Language Models`
      Python
      MIT License
      26100Updated Apr 5, 2026Apr 5, 2026
    • llm2vec

      Public
      Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
      Python
      MIT License
      1341.7k404Updated Apr 4, 2026Apr 4, 2026
    • latentlens

      Public
      Code and data for the paper "LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs"
      Python
      Other
      44100Updated Mar 31, 2026Mar 31, 2026
    • barbados-workshop-2026

      Public
      Workshop on AI for Science
      HTML
      0000Updated Mar 18, 2026Mar 18, 2026
    • the-markovian-thinker

      Public
      Code for paper "The Markovian Thinker: Architecture-Agnostic Linear Scaling of Reasoning"
      Python
      Apache License 2.0
      2734520Updated Mar 16, 2026Mar 16, 2026
    • probabilistic-reasoning

      Public
      Data and code for the paper "Humans and LLMs Diverge on Probabilistic Inferences"
      Jupyter Notebook
      MIT License
      1210Updated Mar 4, 2026Mar 4, 2026
    • BRIDGE

      Public
      BRIDGE: Predicting Human Task Completion Time From Model Performance
      HTML
      2300Updated Feb 10, 2026Feb 10, 2026
    • Code for `Exploiting Instruction-Following Retrievers for Malicious Information Retrieval`
      Python
      MIT License
      1600Updated Jan 8, 2026Jan 8, 2026
    • llm-logic-conflict

      Public
      Python
      0000Updated Jan 5, 2026Jan 5, 2026
    • thoughtology

      Public
      Jupyter Notebook
      MIT License
      21300Updated Dec 8, 2025Dec 8, 2025
    • TypeScript
      0000Updated Nov 29, 2025Nov 29, 2025
    • mila-speech-recorder-webapp

      Public
      TypeScript
      0000Updated Nov 29, 2025Nov 29, 2025
    • aural-portal

      Public
      TypeScript
      0000Updated Nov 18, 2025Nov 18, 2025
    • mSTEB

      Public
      Jupyter Notebook
      1100Updated Nov 14, 2025Nov 14, 2025
    • MAGNIFICo

      Public
      EMNLP 2023: MAGNIFICo: Evaluating the In-Context Learning Ability of Large Language Models to Generalize to Novel Interpretations
      Python
      0200Updated Nov 7, 2025Nov 7, 2025
    • nano-aha-moment

      Public
      Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"
      Jupyter Notebook
      MIT License
      5560741Updated Oct 7, 2025Oct 7, 2025
    • Python
      0810Updated Oct 3, 2025Oct 3, 2025
    • llmsafety

      Public
      A fork of JailbreakBench: An Open Robustness Benchmark for Jailbreaking Language Models [NeurIPS 2024 Datasets and Benchmarks Track]
      Python
      MIT License
      69000Updated Oct 1, 2025Oct 1, 2025
    • DIVERS-Bench

      Public
      0000Updated Sep 22, 2025Sep 22, 2025
    • weblinx-browsergym

      Public
      Python
      3200Updated Sep 10, 2025Sep 10, 2025
    • bias-bench

      Public
      ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
      Python
      4115500Updated Aug 18, 2025Aug 18, 2025
    • AdversarialTriggers

      Public
      TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models
      Python
      MIT License
      21900Updated Aug 17, 2025Aug 17, 2025
    • agent-reward-bench

      Public
      AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
      Python
      24310Updated Aug 7, 2025Aug 7, 2025
    • meaning-change

      Public
      R
      MIT License
      0200Updated Jul 30, 2025Jul 30, 2025
    • AfroBench

      Public
      Large Scale Benchmark of Large Language Models on African Languages
      Python
      31900Updated Jul 28, 2025Jul 28, 2025
    • AURORA

      Public
      Code and data for the paper: Learning Action and Reasoning-Centric Image Editing from Videos and Simulation
      Python
      MIT License
      23500Updated Jun 30, 2025Jun 30, 2025
    • ast-lrl-speech

      Public
      Python
      0000Updated Jun 22, 2025Jun 22, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.