    Repositories list

    • OLMo-core

      Public
      PyTorch building blocks for the OLMo ecosystem
      Python
      92522740 Updated Dec 10, 2025
    • olmoearth_projects

      Public
      OlmoEarth projects
      Python
      54512 Updated Dec 10, 2025
    • rslearn

      Public
      A tool for developing remote sensing datasets and models.
      Python
      1059203 Updated Dec 10, 2025
    • open-instruct

      Public
      AllenAI's post-training codebase
      Python
      4713.4k1144 Updated Dec 10, 2025
    • OLMost every training recipe you need to perform data interventions with the OLMo family of models.
      Python
      1157131 Updated Dec 10, 2025
    • olmocr

      Public
      Toolkit for linearizing PDFs for LLM datasets/training
      Python
      1.2k16k3113 Updated Dec 9, 2025
    • PeerRead

      Public
      Data and code for Kang et al., NAACL 2018's paper titled "A Dataset of Peer Reviews (PeerRead): Collection, Insights and NLP Applications"
      Python
      10842453 Updated Dec 9, 2025
    • dnw

      Public
      Discovering Neural Wirings (https://arxiv.org/abs/1906.00586)
      Python
      1713626 Updated Dec 9, 2025
    • Python
      5419325 Updated Dec 9, 2025
    • savn

      Public
      Learning to Learn how to Learn: Self-Adaptive Visual Navigation using Meta-Learning (https://arxiv.org/abs/1812.00971)
      Python
      55193195 Updated Dec 9, 2025
    • TOPICAL

      Public
      🪄📄 TOPICAL: TOPIC pages AutomagicaLly
      Python
      3921 Updated Dec 9, 2025
    • Earth system foundation model data, training, and eval
      Python
      17113213 Updated Dec 9, 2025
    • Python
      616156 Updated Dec 9, 2025
    • Tooling for exact and MinHash deduplication of large-scale text datasets
      Rust
      34001 Updated Dec 9, 2025
    • Set up your GitHub Actions workflow with the Beaker command-line client
      Python
      3608 Updated Dec 9, 2025
    • scirepeval

      Public
      SciRepEval benchmark training and evaluation scripts
      Python
      137831 Updated Dec 8, 2025
    • Python
      129810 Updated Dec 8, 2025
    • infinigram-api

      Public
      Python
      118761 Updated Dec 6, 2025
    • molmoact

      Public
      Official Repository for MolmoAct
      Python
      2826971 Updated Dec 6, 2025
    • S2AFF

      Public
      Link raw affiliations to ROR IDs
      Jupyter Notebook
      53060 Updated Dec 5, 2025
    • FlexOlmo

      Public
      Code and training scripts for FlexOlmo
      Python
      16117511 Updated Dec 5, 2025
    • atlantes

      Public
      Efficient and low latency real-time global-scale GPS trajectory modeling
      Python
      146016 Updated Dec 5, 2025
    • scispacy

      Public
      A full spaCy pipeline and models for scientific/biomedical documents.
      Python
      2491.9k353 Updated Dec 4, 2025
    • ScienceWorld

      Public
      ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.
      Scala
      32316150 Updated Dec 3, 2025
    • decon

      Public
      decontamination
      Rust
      01700 Updated Dec 3, 2025
    • Official code for NeurIPS 2025 paper "AutoDiscovery: Open-ended Scientific Discovery via Bayesian Surprise"
      Python
      1510900 Updated Dec 2, 2025
    • dolma3

      Public
      Jupyter Notebook
      42901 Updated Nov 30, 2025
    • IFBench

      Public
      Python
      169300 Updated Nov 27, 2025
    • CodeScientist: An automated scientific discovery system for code-based experiments
      Python
      4030310 Updated Nov 27, 2025
    • panda

      Public
      Panda ("plan-and-act") agent for Autonomous Scientific Discovery
      Python
      2400 Updated Nov 26, 2025