Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.3k 702

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.4k 164

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.7k 269

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 16.9k 1.3k

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 965 92

Repositories

Showing 10 of 549 repositories
  • olmoearth_pretrain Public

    Earth system foundation model data, training, and eval

    allenai/olmoearth_pretrain’s past year of commit activity
    Python 134 26 5 23 Updated Feb 5, 2026
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    allenai/rslearn’s past year of commit activity
    Python 71 Apache-2.0 12 25 7 Updated Feb 5, 2026
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    allenai/olmo-cookbook’s past year of commit activity
    Python 64 Apache-2.0 11 1 32 Updated Feb 5, 2026
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 777 Apache-2.0 138 8 51 Updated Feb 5, 2026
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,563 Apache-2.0 493 13 (1 issue needs help) 55 Updated Feb 5, 2026
  • datamap-rs Public

    Data mapping framework for rust stuff

    allenai/datamap-rs’s past year of commit activity
    Rust 44 Apache-2.0 4 0 2 Updated Feb 5, 2026
  • olmo-api Public

    HTTP API for https://olmo.allen.ai

    allenai/olmo-api’s past year of commit activity
    Python 0 Apache-2.0 0 15 13 Updated Feb 5, 2026
  • olmo-bonepick Public

    Tools to build fast quality classifiers for Olmo data filtering

    allenai/olmo-bonepick’s past year of commit activity
    Python 0 Apache-2.0 0 0 0 Updated Feb 5, 2026
  • ai2-scholarqa-lib Public

    Repo housing the open sourced code for the ai2 scholar qa app and also the corresponding library

    allenai/ai2-scholarqa-lib’s past year of commit activity
    Python 254 Apache-2.0 47 6 2 Updated Feb 4, 2026
  • duplodocus Public

    Tooling for exact and MinHash deduplication of large-scale text datasets

    allenai/duplodocus’s past year of commit activity
    Rust 66 Apache-2.0 5 1 1 Updated Feb 4, 2026