Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.4k 726

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.5k 182

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.7k 281

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 17.1k 1.4k

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 992 103

Repositories

Showing 10 of 569 repositories
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 68 Apache-2.0 17 1 32 Updated Mar 28, 2026
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,659 Apache-2.0 517 14 (1 issue needs help) 65 Updated Mar 28, 2026
  • S2AND Public

    Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

    allenai/S2AND’s past year of commit activity
    Python 104 20 4 1 Updated Mar 28, 2026
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 1,002 Apache-2.0 197 15 51 Updated Mar 28, 2026
  • skiff2-actions Public

    GitHub actions for skiff2 repositories.

    allenai/skiff2-actions’s past year of commit activity
    TypeScript 1 Apache-2.0 0 0 1 Updated Mar 28, 2026
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    allenai/rslearn’s past year of commit activity
    Python 77 Apache-2.0 14 20 9 Updated Mar 28, 2026
  • olmoearth_pretrain Public

    Earth system foundation model data, training, and eval

    allenai/olmoearth_pretrain’s past year of commit activity
    Python 161 32 7 18 Updated Mar 27, 2026
  • prescience Public

    PreScience: A Benchmark for Forecasting Scientific Contributions

    allenai/prescience’s past year of commit activity
    Python 25 Apache-2.0 4 0 0 Updated Mar 27, 2026
  • beaker-gantry Public

    Gantry provides an API that streamlines running experiments in Beaker

    allenai/beaker-gantry’s past year of commit activity
    Python 33 Apache-2.0 7 4 3 Updated Mar 27, 2026
  • asta-bench Public
    allenai/asta-bench’s past year of commit activity
    Python 88 Apache-2.0 15 1 14 Updated Mar 27, 2026