    Repositories list

    • XToM

      Public
      Data and Code for paper “X-ToM: Exploring the Multilingual Theory of Mind for Large Language Models”
      MIT License
      0200 Updated Apr 20, 2026
    • The official repository of the paper "Do Reasoning Models Enhance Embedding Models?"
      Python
      MIT License
      32800 Updated Apr 17, 2026
    • GrandGuard

      Public
      Python
      0000 Updated Apr 16, 2026
    • [ACL 2026 Findings] OmniCompliance-100K: A Multi-Domain, Rule-Grounded, Real-World Safety Compliance Dataset
      Python
      0000 Updated Apr 16, 2026
    • [ACL 2026 Main] InferenceDynamics: Adaptive LLM Routing through Structured Capability and Knowledge Profiling
      PDDL
      0000 Updated Apr 16, 2026
    • 66000 Updated Apr 14, 2026
    • Python
      MIT License
      0100 Updated Apr 14, 2026
    • MarConf

      Public
      [ACL 2025] Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?
      Python
      1801 Updated Apr 14, 2026
    • MarPT

      Public
      Code for Prospect Theory Fails for LLMs: Instability of Decision-Making under Epistemic Uncertainty
      Python
      0410 Updated Mar 24, 2026
    • Robust-Rule-Induction

      Public
      Source code and data for paper "Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations".
      Python
      Apache License 2.0
      0100 Updated Mar 19, 2026
    • NAACL

      Public
      The official codebase for our paper "NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems"
      Python
      MIT License
      12410 Updated Feb 28, 2026
    • NewtonBench

      Public
      [ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
      Python
      MIT License
      2014610 Updated Feb 27, 2026
    • NGDBench

      Public
      Python
      0400 Updated Feb 25, 2026
    • DARK

      Public
      Code for DARK: Unifying Deductive and Abductive Reasoning in Knowledge Graphs with Masked Diffusion Model
      Python
      MIT License
      0400 Updated Feb 11, 2026
    • CtrlHGen

      Public
      Python
      0410 Updated Feb 11, 2026
    • AtlasKV

      Public
      [ICLR'26] AtlasKV: A scalable, effective, and general way to augment LLMs with billion-scale knowledge graphs using very little GPU memory cost.
      Python
      MIT License
      32110 Updated Jan 27, 2026
    • Python
      0200 Updated Jan 18, 2026
    • This repository contains the implementation of AutoSchemaKG, a novel framework for automatic knowledge graph construction that combines schema generation via co…
      Python
      MIT License
      9772980 Updated Jan 14, 2026
    • Python
      MIT License
      33010 Updated Nov 17, 2025
    • privacy

      Public
      HTML
      0200 Updated Nov 17, 2025
    • CritiCal

      Public
      Code for CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?
      Python
      0510 Updated Nov 15, 2025
    • MARS

      Public
      Code and dataset for the paper: MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset (https://arxiv.o…
      Python
      MIT License
      0600 Updated Nov 10, 2025
    • [EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery
      MIT License
      4133004 Updated Nov 5, 2025
    • [ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation
      Python
      11500 Updated Oct 9, 2025
    • [EMNLP 2025 Wordplay] LLM-Hanabi: Evaluating Multi-Agent Gameplays with Theory-of-Mind and Rationale Inference in Imperfect Information Collaboration Game
      Python
      0200 Updated Oct 4, 2025
    • Official Repository for MASLegalBench.
      Python
      MIT License
      0000 Updated Sep 30, 2025
    • MCIP

      Public
      Python
      MIT License
      21210 Updated Sep 29, 2025
    • InteGround

      Public
      Python
      0000 Updated Sep 20, 2025
    • [EMNLP 2025 Main] Official Repository for Context Reasoner.
      Python
      MIT License
      0900 Updated Sep 1, 2025
    • CEQA

      Public
      Official Implementation of paper: Complex Query Answering on Eventuality Knowledge Graph with Implicit Logical Constraints
      Python
      MIT License
      11220 Updated Jul 15, 2025