All

175 repositories

XToM
Public
Data and Code for paper “X-ToM: Exploring the Multilingual Theory of Mind for Large Language Models”
MIT License
•0•2•0•0•Updated Apr 20, 2026Apr 20, 2026
Reasoning-Embedding
Public
The official repository of the paper "Do Reasoning Models Enhance Embedding Models?"
representation-learning manifold embedding
representation-learning manifold embedding reasoning rlvr
Python
•
MIT License
•3•28•0•0•Updated Apr 17, 2026Apr 17, 2026
GrandGuard
Public
Python
•0•0•0•0•Updated Apr 16, 2026Apr 16, 2026
OmniCompliance-100K
Public
[ACL 2026 Findings] OmniCompliance-100K: A Multi-Domain, Rule-Grounded, Real-World Safety Compliance Dataset
Python
•0•0•0•0•Updated Apr 16, 2026Apr 16, 2026
InferenceDynamics
Public
[ACL 2026 Main] InferenceDynamics: Adaptive LLM Routing through Structured Capability and Knowledge Profiling
PDDL
•0•0•0•0•Updated Apr 16, 2026Apr 16, 2026
Awesome-Agent-Harness
Public
6•60•0•0•Updated Apr 14, 2026Apr 14, 2026
ContextLens
Public
Python
•
MIT License
•0•1•0•0•Updated Apr 14, 2026Apr 14, 2026
MarConf
Public
[ACL 2025] Revisiting Epistemic Markers in Confidence Estimation: Can Markers Accurately Reflect Large Language Models' Uncertainty?.
uncertainty-estimation confidence-estimation epistemic-uncertainty
uncertainty-estimation confidence-estimation epistemic-uncertainty confidence-calibration
Python
•1•8•0•1•Updated Apr 14, 2026Apr 14, 2026
MarPT
Public
Code for Prospect Theory Fails for LLMs: Instability of Decision-Making under Epistemic Uncertainty
Python
•0•4•1•0•Updated Mar 24, 2026Mar 24, 2026
Robust-Rule-Induction
Public
Source code and data for paper "Patterns Over Principles: The Fragility of Inductive Reasoning in LLMs under Noisy Observations".
Python
•
Apache License 2.0
•0•1•0•0•Updated Mar 19, 2026Mar 19, 2026
NAACL
Public
The official codebase for our paper "NAACL: Noise-AwAre Verbal Confidence Calibration for LLMs in RAG Systems"
Python
•
MIT License
•1•24•1•0•Updated Feb 28, 2026Feb 28, 2026
NewtonBench
Public
[ICLR2026] NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents
Python
•
MIT License
•20•146•1•0•Updated Feb 27, 2026Feb 27, 2026
NGDBench
Public
Python
•0•4•0•0•Updated Feb 25, 2026Feb 25, 2026
DARK
Public
Code for DARK: Unifying Deductive and Abductive Reasoning in Knowledge Graphs with Masked Diffusion Model
Python
•
MIT License
•0•4•0•0•Updated Feb 11, 2026Feb 11, 2026
CtrlHGen
Public
Python
•0•4•1•0•Updated Feb 11, 2026Feb 11, 2026
AtlasKV
Public
[ICLR'26] AtlasKV: A scalable, effective, and general way to augment LLMs with billion-scale knowledge graphs using very little GPU memory cost.
knowledge-graph kv-cache large-language-models
knowledge-graph kv-cache large-language-models retrieval-augmented-generation
Python
•
MIT License
•3•21•1•0•Updated Jan 27, 2026Jan 27, 2026
RelationalIntentionGraph
Public
Python
•0•2•0•0•Updated Jan 18, 2026Jan 18, 2026
AutoSchemaKG
Public
This repository contains the implementation of AutoSchemaKG, a novel framework for automatic knowledge graph construction that combines schema generation via co…
knowledge-graph graph-construction rag
knowledge-graph graph-construction rag
Python
•
MIT License
•97•729•8•0•Updated Jan 14, 2026Jan 14, 2026
AutoGraph-R1
Public
Python
•
MIT License
•3•30•1•0•Updated Nov 17, 2025Nov 17, 2025
privacy
Public
HTML
•0•2•0•0•Updated Nov 17, 2025Nov 17, 2025
CritiCal
Public
Code for CritiCal: Can Critique Help LLM Uncertainty or Confidence Calibration?
Python
•0•5•1•0•Updated Nov 15, 2025Nov 15, 2025
MARS
Public
Code and dataset for the paper: MARS: Benchmarking the Metaphysical Reasoning Abilities of Language Models with a Multi-task Evaluation Dataset (https://arxiv.o…
Python
•
MIT License
•0•6•0•0•Updated Nov 10, 2025Nov 10, 2025
Awesome-LLM-Scientific-Discovery
Public
[EMNLP2025] From Automation to Autonomy: A Survey on Large Language Models in Scientific Discovery
MIT License
•41•330•0•4•Updated Nov 5, 2025Nov 5, 2025
AbductiveKGR
Public
[ACL 2024] Implementation for Advancing Abductive Reasoning in Knowledge Graphs through Complex Logical Hypothesis Generation
Python
•1•15•0•0•Updated Oct 9, 2025Oct 9, 2025
LLM-Hanabi
Public
[EMNLP 2025 Wordplay] LLM-Hanabi Evaluating Multi-Agent Gameplays with Theory-of-Mind and Rationale Inference in Imperfect Information Collaboration Game
Python
•0•2•0•0•Updated Oct 4, 2025Oct 4, 2025
MASLegalBench
Public
Official Repository for MASLegalBench.
Python
•
MIT License
•0•0•0•0•Updated Sep 30, 2025Sep 30, 2025
MCIP
Public
Python
•
MIT License
•2•12•1•0•Updated Sep 29, 2025Sep 29, 2025
InteGround
Public
Python
•0•0•0•0•Updated Sep 20, 2025Sep 20, 2025
ContextReasoner
Public
[EMNLP 2025 Main] Official Repository for Context Reasoner.
Python
•
MIT License
•0•9•0•0•Updated Sep 1, 2025Sep 1, 2025
CEQA
Public
Official Implementation of paper: Complex Query Answering on Eventuality Knowledge Graph with Implicit Logical Constraints
Python
•
MIT License
•1•12•2•0•Updated Jul 15, 2025Jul 15, 2025

ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

HKUST-KnowComp

All

All

175 repositories

XToM

Reasoning-Embedding

GrandGuard

OmniCompliance-100K

InferenceDynamics

Awesome-Agent-Harness

ContextLens

MarConf

MarPT

Robust-Rule-Induction

NAACL

NewtonBench

NGDBench

DARK

CtrlHGen

AtlasKV

RelationalIntentionGraph

AutoSchemaKG

AutoGraph-R1

privacy

CritiCal

MARS

Awesome-LLM-Scientific-Discovery

AbductiveKGR

LLM-Hanabi

MASLegalBench

MCIP

InteGround

ContextReasoner

CEQA

All

All

Repositories list

175 repositories