Pinned

  1. OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5.2k 554

  2. dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1.1k 124

  3. scispacy Public

    A full spaCy pipeline and models for scientific/biomedical documents.

    Python 1.8k 232

  4. ai2thor Public

    An open-source platform for Visual AI.

    C# 1.3k 227

Repositories

Showing 10 of 493 repositories
  • OLMo-core Public

    PyTorch building blocks for the OLMo ecosystem

    Python 62 Apache-2.0 15 2 21 Updated Feb 27, 2025
  • ai2thor Public

    An open-source platform for Visual AI.

    C# 1,270 Apache-2.0 227 242 4 Updated Feb 27, 2025
  • reward-bench Public

    RewardBench: the first evaluation tool for reward models.

    Python 515 Apache-2.0 59 5 3 Updated Feb 27, 2025
  • olmo-cookbook Public

    OLMost every training recipe you need to perform data interventions with the OLMo family of models.

    Python 8 Apache-2.0 3 0 3 Updated Feb 27, 2025
  • OLMo Public

    Modeling, training, eval, and inference code for OLMo

    Python 5,242 Apache-2.0 554 48 56 Updated Feb 27, 2025
  • rslearn Public

    A tool for developing remote sensing datasets and models.

    Python 28 Apache-2.0 2 7 2 Updated Feb 27, 2025
  • open-instruct Public

    AllenAI's post-training codebase

    Python 2,712 Apache-2.0 343 14 12 Updated Feb 27, 2025
  • olmocr Public

    Toolkit for linearizing PDFs for LLM datasets/training

    Python 1,720 Apache-2.0 121 19 16 Updated Feb 26, 2025
  • dolma Public

    Data and tools for generating and inspecting OLMo pre-training data.

    Python 1,095 Apache-2.0 124 24 20 Updated Feb 26, 2025
  • rslearn_projects
    Python 6 Apache-2.0 2 6 3 Updated Feb 26, 2025