Skip to content

Pinned Loading

  1. OLMo OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 6.4k 722

  2. dolma dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.5k 180

  3. ai2thor ai2thor Public

    An open-source platform for Visual AI.

    C# 1.7k 278

  4. olmocr olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 17.1k 1.4k

  5. OLMoE OLMoE Public

    OLMoE: Open Mixture-of-Experts Language Models

    Jupyter Notebook 990 102

Repositories

Showing 10 of 564 repositories
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    allenai/OLMo-core’s past year of commit activity
    Python 986 Apache-2.0 193 15 53 Updated Mar 24, 2026
  • olmoearth_pretrain Public

    Earth system foundation model data, training, and eval

    allenai/olmoearth_pretrain’s past year of commit activity
    Python 160 32 7 19 Updated Mar 24, 2026
  • datamap-rs Public

    Data mapping framework for rust stuff

    allenai/datamap-rs’s past year of commit activity
    Rust 49 Apache-2.0 4 1 2 Updated Mar 24, 2026
  • open-instruct Public

    AllenAI's post-training codebase

    allenai/open-instruct’s past year of commit activity
    Python 3,649 Apache-2.0 515 14 (1 issue needs help) 65 Updated Mar 24, 2026
  • olmo-api Public

    HTTP API for https://olmo.allen.ai

    allenai/olmo-api’s past year of commit activity
    Python 2 Apache-2.0 0 15 7 Updated Mar 23, 2026
  • allenai/agent-baselines’s past year of commit activity
    Python 120 Apache-2.0 14 1 1 Updated Mar 23, 2026
  • asta-bench Public
    allenai/asta-bench’s past year of commit activity
    Python 88 Apache-2.0 15 2 14 Updated Mar 24, 2026
  • S2AND Public

    Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

    allenai/S2AND’s past year of commit activity
    Python 104 20 4 1 Updated Mar 24, 2026
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    allenai/rslearn’s past year of commit activity
    Python 76 Apache-2.0 14 19 12 Updated Mar 23, 2026
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    allenai/olmocr’s past year of commit activity
    Python 17,055 Apache-2.0 1,363 43 18 Updated Mar 23, 2026