18 projects
bonepick
CLI tool for training efficient CPU-based text quality classifiers and annotating data for classifier distillation.
ai2-olmo-eval
In-loop evaluation tasks for language modeling
cached-path
A file utility for accessing both local and remote files through a unified interface
beaker-gantry
Gantry streamlines running Python experiments in Beaker by managing containers and boilerplate for you
poormanray
A minimal alternative to Ray for distributed data processing on EC2 instances
beaker-py
A Python Beaker client
ai2-olmo-core
Core training module for the Open Language Model (OLMo)
dolma-rust-components
Rust components for Dolma, a toolkit for pre-processing LLM training data.
dolma
Toolkit for pre-processing LLM training data.
ai2-olmo
Open Language Model (OLMo)
ai2-catwalk
A library for evaluating language models.
ai2-tango
A library for choreographing your machine learning research.
bettermap
Parallelized drop-in replacements for Python's map function
allennlp-models
Officially supported models for the AllenNLP framework
allennlp
An open-source NLP research library, built on PyTorch.
oocmap
A file-backed dictionary for Python
allennlp-semparse
A framework for building semantic parsers (including neural module networks) with AllenNLP, built by the AllenNLP authors
allennlp-server
Simple demo server for AllenNLP models, plus a training config builder.