    Repositories list

    • wellbeing

      Public
      Measuring and improving the functional pleasure and pain of AIs
      MIT License
      1000 Updated Apr 27, 2026
    • HPC cluster code and configurations for running on OCI
      Python
      Universal Permissive License v1.0
      29592 Updated Mar 26, 2026
    • Python
      MIT License
      0000 Updated Mar 24, 2026
    • ccc-docs

      Public
      CAIS Compute Cluster (CCC) documentation
      MIT License
      0370 Updated Feb 20, 2026
    • hle

      Public
      Humanity's Last Exam
      Python
      MIT License
      971.5k43 Updated Feb 20, 2026
    • HTML
      MIT License
      0300 Updated Jan 28, 2026
    • Simple evaluation scripts for AI benchmarks with minimal dependencies.
      Python
      MIT License
      11300 Updated Dec 17, 2025
    • 0000 Updated Dec 4, 2025
    • ZAP
      MIT License
      51813 Updated Dec 2, 2025
    • Public repository for the Remote Labor Index (RLI)
      TypeScript
      46610 Updated Nov 3, 2025
    • Forecasting.
      TypeScript
      MIT License
      113720 Updated Aug 2, 2025
    • Prometheus exporter for performance metrics from Slurm.
      Go
      GNU General Public License v3.0
      182351 Updated Jun 16, 2025
    • wmdp

      Public
      WMDP is an LLM proxy benchmark for hazardous knowledge in bio, cyber, and chemical security. We also release code for RMU, an unlearning method which reduces LLM…
      Jupyter Notebook
      MIT License
      43167123 Updated May 29, 2025
    • AISES

      Public
      CSS
      3200 Updated Apr 24, 2025
    • cluster-docs-old

      Public archive
      CSS
      MIT License
      2140 Updated Apr 10, 2025
    • Prometheus exporter for the stats in the cgroup accounting with Slurm. This will also collect stats of a job using NVIDIA GPUs.
      Python
      Apache License 2.0
      23100 Updated Apr 10, 2025
    • mask

      Public
      Code for evaluating AI systems on the MASK honesty benchmark.
      Python
      MIT License
      142010 Updated Mar 6, 2025
    • Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"
      Jupyter Notebook
      MIT License
      239020 Updated Feb 27, 2025
    • Measuring correlations between safety benchmarks and general AI capabilities benchmarks.
      Python
      MIT License
      31100 Updated Oct 2, 2024
    • HarmBench

      Public
      HarmBench: A Standardized Evaluation Framework for Automated Red Teaming and Robust Refusal
      Jupyter Notebook
      MIT License
      139936305 Updated Aug 16, 2024
    • This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
      Python
      MIT License
      279100 Updated May 19, 2024
    • HTML
      MIT License
      0000 Updated Mar 28, 2024
    • JavaScript
      MIT License
      0110 Updated Mar 6, 2024
    • Jupyter Notebook
      0400 Updated Oct 30, 2023
    • reading

      Public
      1100 Updated Oct 26, 2023
    • Cost-effectiveness models, tools, and results for various AI safety field-building programs.
      Python
      MIT License
      4602 Updated Aug 15, 2023
    • Website for the Trojan Detection Challenge NeurIPS 2022 competition
      JavaScript
      MIT License
      0000 Updated Jul 28, 2023
    • GoSlurmMailer - drop-in replacement for the default Slurm MailProg. Delivers Slurm job messages to various destinations.
      Go
      6000 Updated Jun 21, 2023
    • 277700 Updated May 31, 2023