oumi · PyPI

Oumi - Modeling Platform

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

oumi

These details have not been verified by PyPI

Project description

Oumi Logo

Everything you need to build state-of-the-art foundation models, end-to-end

🔥 News

[2026/03] Upgraded to Transformers v5, TRL v0.30, vLLM v0.19, and veRL v0.7 compatibility
[2026/03] MCP Integration Phase 1: package scaffold and dependencies for MCP server support
[2026/03] New: oumi deploy command for deploying oumi models dedicated inference endpoints on fireworks.ai and parasail
[2026/03] Added support for Qwen3.5 model family
[2026/03] Inference engines received multiple improvements: list_models api, improved error reporting
[2026/02] Preview of using the Oumi Platform and Lambda to fine-tune and deploy a 4B model for user intent classification
[2026/02] Lambda and Oumi partner for end-to-end custom model development
[2025/12] Oumi v0.6.0 released with Python 3.13 support, oumi analyze CLI command, TRL 0.26+ support, and more
[2025/12] WeMakeDevs AI Agents Assemble Hackathon: Oumi webinar on Finetuning for Text-to-SQL
[2025/12] Oumi co-sponsors WeMakeDevs AI Agents Assemble Hackathon with over 2000 project submissions
[2025/11] Oumi v0.5.0 released with advanced data synthesis, hyperparameter tuning automation, support for OpenEnv, and more
[2025/11] Example notebook to perform RLVF fine-tuning with OpenEnv, an open source library from the Meta PyTorch team for creating, deploying, and distributing agentic RL environments
[2025/10] Oumi v0.4.1 and v0.4.2 released] with support for Qwen3-VL and Transformers v4.56, data synthesis documentation and examples, and many bug fixes

Older updates

[2025/09] Oumi v0.4.0 released with DeepSpeed support, a Hugging Face Hub cache management tool, KTO/Vision DPO trainer support
[2025/08] Training and inference support for OpenAI's gpt-oss-20b and gpt-oss-120b: recipes here
[2025/08] Aug 14 Webinar - OpenAI's gpt-oss: Separating the Substance from the Hype.
[2025/08] Oumi v0.3.0 released with model quantization (AWQ), an improved LLM-as-a-Judge API, and Adaptive Inference
[2025/07] Recipe for Qwen3 235B
[2025/07] July 24 webinar: "Training a State-of-the-art Agent LLM with Oumi + Lambda"
[2025/06] Oumi v0.2.0 released with support for GRPO fine-tuning, a plethora of new model support, and much more
[2025/06] Announcement of Data Curation for Vision Language Models (DCVLR) competition at NeurIPS2025
[2025/06] Recipes for training, inference, and eval with the newly released Falcon-H1 and Falcon-E models
[2025/05] Support and recipes for InternVL3 1B
[2025/04] Added support for training and inference with Llama 4 models: Scout (17B activated, 109B total) and Maverick (17B activated, 400B total) variants, including full fine-tuning, LoRA, and QLoRA configurations
[2025/04] Recipes for Qwen3 model family
[2025/04] Introducing HallOumi: a State-of-the-Art Claim-Verification Model (technical overview)
[2025/04] Oumi now supports two new Vision-Language models: Phi4 and Qwen 2.5

🔎 About

Oumi is a fully open-source platform that streamlines the entire lifecycle of foundation models - from data preparation and training to evaluation and deployment. Whether you're developing on a laptop, launching large scale experiments on a cluster, or deploying models in production, Oumi provides the tools and workflows you need.

With Oumi, you can:

🚀 Train and fine-tune models from 10M to 405B parameters using state-of-the-art techniques (SFT, LoRA, QLoRA, GRPO, and more)
🤖 Work with both text and multimodal models (Llama, DeepSeek, Qwen, Phi, and others)
🔄 Synthesize and curate training data with LLM judges
⚡️ Deploy models efficiently with popular inference engines (vLLM, SGLang)
📊 Evaluate models comprehensively across standard benchmarks
🌎 Run anywhere - from laptops to clusters to clouds (AWS, Azure, GCP, Lambda, and more)
🔌 Integrate with both open models and commercial APIs (OpenAI, Anthropic, Vertex AI, Together, Parasail, ...)

All with one consistent API, production-grade reliability, and all the flexibility you need for research.

Learn more at oumi.ai, or jump right in with the quickstart guide.

🚀 Getting Started

Notebook	Try in Colab	Goal
🎯 Getting Started: A Tour		Quick tour of core features: training, evaluation, inference, and job management
🔧 Model Finetuning Guide		End-to-end guide to LoRA tuning with data prep, training, and evaluation
📚 Model Distillation		Guide to distilling large models into smaller, efficient ones
📋 Model Evaluation		Comprehensive model evaluation using Oumi's evaluation framework
☁️ Remote Training		Launch and monitor training jobs on cloud (AWS, Azure, GCP, Lambda, etc.) platforms
📈 LLM-as-a-Judge		Filter and curate training data with built-in judges

🔧 Usage

Installation

Choose the installation method that works best for you:

Using pip (Recommended)

# Basic installation
uv pip install oumi

# With GPU support
uv pip install 'oumi[gpu]'

# Latest development version
uv pip install git+https://github.com/oumi-ai/oumi.git

Don't have uv? Install it or use pip instead.

Using Docker

# Pull the latest image
docker pull ghcr.io/oumi-ai/oumi:latest

# Run oumi commands
docker run --gpus all -it ghcr.io/oumi-ai/oumi:latest oumi --help

# Train with a mounted config
docker run --gpus all -v $(pwd):/workspace -it ghcr.io/oumi-ai/oumi:latest \
    oumi train --config /workspace/my_config.yaml

Quick Install Script (Experimental)

Try Oumi without setting up a Python environment. This installs Oumi in an isolated environment:

curl -LsSf https://oumi.ai/install.sh | bash

For more advanced installation options, see the installation guide.

Oumi CLI

You can quickly use the oumi command to train, evaluate, and infer models using one of the existing recipes:

# Training
oumi train -c configs/recipes/smollm/sft/135m/quickstart_train.yaml

# Evaluation
oumi evaluate -c configs/recipes/smollm/evaluation/135m/quickstart_eval.yaml

# Inference
oumi infer -c configs/recipes/smollm/inference/135m_infer.yaml --interactive

For more advanced options, see the training, evaluation, inference, and llm-as-a-judge guides.

Running Jobs Remotely

You can run jobs remotely on cloud platforms (AWS, Azure, GCP, Lambda, etc.) using the oumi launch command:

# GCP
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml

# AWS
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud aws

# Azure
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud azure

# Lambda
oumi launch up -c configs/recipes/smollm/sft/135m/quickstart_gcp_job.yaml --resources.cloud lambda

Note: Oumi is in beta and under active development. The core features are stable, but some advanced features might change as the platform improves.

💻 Why use Oumi?

If you need a comprehensive platform for training, evaluating, or deploying models, Oumi is a great choice.

Here are some of the key features that make Oumi stand out:

🔧 Zero Boilerplate: Get started in minutes with ready-to-use recipes for popular models and workflows. No need to write training loops or data pipelines.
🏢 Enterprise-Grade: Built and validated by teams training models at scale
🎯 Research Ready: Perfect for ML research with easily reproducible experiments, and flexible interfaces for customizing each component.
🌐 Broad Model Support: Works with most popular model architectures - from tiny models to the largest ones, text-only to multimodal.
🚀 SOTA Performance: Native support for distributed training techniques (FSDP, DeepSpeed, DDP) and optimized inference engines (vLLM, SGLang).
🤝 Community First: 100% open source with an active community. No vendor lock-in, no strings attached.

📚 Examples & Recipes

Explore the growing collection of ready-to-use configurations for state-of-the-art models and training workflows:

Note: These configurations are not an exhaustive list of what's supported, simply examples to get you started. You can find a more exhaustive list of supported models, and datasets (supervised fine-tuning, pre-training, preference tuning, and vision-language finetuning) in the oumi documentation.

Qwen Family

Model	Example Configurations
Qwen3-Next 80B A3B	LoRA • Inference • Inference (Instruct) • Evaluation
Qwen3 30B A3B	LoRA • Inference • Evaluation
Qwen3 32B	LoRA • Inference • Evaluation
Qwen3 14B	LoRA • Inference • Evaluation
Qwen3 8B	FFT • Inference • Evaluation
Qwen3 4B	FFT • Inference • Evaluation
Qwen3 1.7B	FFT • Inference • Evaluation
Qwen3 0.6B	FFT • Inference • Evaluation
QwQ 32B	FFT • LoRA • QLoRA • Inference • Evaluation
Qwen2.5-VL 3B	SFT • LoRA• Inference (vLLM) • Inference
Qwen2-VL 2B	SFT • LoRA • Inference (vLLM) • Inference (SGLang) • Inference • Evaluation

🐋 DeepSeek R1 Family

Model	Example Configurations
DeepSeek R1 671B	Inference (Together AI)
Distilled Llama 8B	FFT • LoRA • QLoRA • Inference • Evaluation
Distilled Llama 70B	FFT • LoRA • QLoRA • Inference • Evaluation
Distilled Qwen 1.5B	FFT • LoRA • Inference • Evaluation
Distilled Qwen 32B	LoRA • Inference • Evaluation

🦙 Llama Family

Model	Example Configurations
Llama 4 Scout Instruct 17B	FFT • LoRA • QLoRA • Inference (vLLM) • Inference • Inference (Together.ai)
Llama 4 Scout 17B	FFT
Llama 3.1 8B	FFT • LoRA • QLoRA • Pre-training • Inference (vLLM) • Inference • Evaluation
Llama 3.1 70B	FFT • LoRA • QLoRA • Inference • Evaluation
Llama 3.1 405B	FFT • LoRA • QLoRA
Llama 3.2 1B	FFT • LoRA • QLoRA • Inference (vLLM) • Inference (SGLang) • Inference • Evaluation
Llama 3.2 3B	FFT • LoRA • QLoRA • Inference (vLLM) • Inference (SGLang) • Inference • Evaluation
Llama 3.3 70B	FFT • LoRA • QLoRA • Inference (vLLM) • Inference • Evaluation
Llama 3.2 Vision 11B	SFT • Inference (vLLM) • Inference (SGLang) • Evaluation

🦅 Falcon family

Model	Example Configurations
Falcon-H1	FFT • Inference • Evaluation
Falcon-E (BitNet)	FFT • DPO • Evaluation

💎 Gemma 3 Family

Model	Example Configurations
Gemma 3 4B Instruct	FFT • Inference • Evaluation
Gemma 3 12B Instruct	LoRA • Inference • Evaluation
Gemma 3 27B Instruct	LoRA • Inference • Evaluation

🦉 OLMo 3 Family

Model	Example Configurations
OLMo 3 7B Instruct	FFT • Inference • Evaluation
OLMo 3 32B Instruct	LoRA • Inference • Evaluation

🎨 Vision Models

Model	Example Configurations
Llama 3.2 Vision 11B	SFT • LoRA • Inference (vLLM) • Inference (SGLang) • Evaluation
LLaVA 7B	SFT • Inference (vLLM) • Inference
Phi3 Vision 4.2B	SFT • LoRA • Inference (vLLM)
Phi4 Vision 5.6B	SFT • LoRA • Inference (vLLM) • Inference
Qwen2-VL 2B	SFT • LoRA • Inference (vLLM) • Inference (SGLang) • Inference • Evaluation
Qwen3-VL 2B	Inference
Qwen3-VL 4B	Inference
Qwen3-VL 8B	Inference
Qwen2.5-VL 3B	SFT • LoRA• Inference (vLLM) • Inference
SmolVLM-Instruct 2B	SFT • LoRA

🔍 Even more options

This section lists all the language models that can be used with Oumi. Thanks to the integration with the 🤗 Transformers library, you can easily use any of these models for training, evaluation, or inference.

Models prefixed with a checkmark (✅) have been thoroughly tested and validated by the Oumi community, with ready-to-use recipes available in the configs/recipes directory.

📋 Click to see more supported models

Instruct Models

Model	Size	Paper	HF Hub	License	Open [^1]
✅ SmolLM-Instruct	135M/360M/1.7B	Blog	Hub	Apache 2.0	✅
✅ DeepSeek R1 Family	1.5B/8B/32B/70B/671B	Blog	Hub	MIT	❌
✅ Llama 3.1 Instruct	8B/70B/405B	Paper	Hub	License	❌
✅ Llama 3.2 Instruct	1B/3B	Paper	Hub	License	❌
✅ Llama 3.3 Instruct	70B	Paper	Hub	License	❌
✅ Phi-3.5-Instruct	4B/14B	Paper	Hub	License	❌
✅ Qwen3	0.6B-32B	Paper	Hub	License	❌
Qwen2.5-Instruct	0.5B-70B	Paper	Hub	License	❌
OLMo 2 Instruct	7B	Paper	Hub	Apache 2.0	✅
✅ OLMo 3 Instruct	7B/32B	Paper	Hub	Apache 2.0	✅
MPT-Instruct	7B	Blog	Hub	Apache 2.0	✅
Command R	35B/104B	Blog	Hub	License	❌
Granite-3.1-Instruct	2B/8B	Paper	Hub	Apache 2.0	❌
Gemma 2 Instruct	2B/9B	Blog	Hub	License	❌
✅ Gemma 3 Instruct	4B/12B/27B	Blog	Hub	License	❌
DBRX-Instruct	130B MoE	Blog	Hub	Apache 2.0	❌
Falcon-Instruct	7B/40B	Paper	Hub	Apache 2.0	❌
✅ Llama 4 Scout Instruct	17B (Activated) 109B (Total)	Paper	Hub	License	❌
✅ Llama 4 Maverick Instruct	17B (Activated) 400B (Total)	Paper	Hub	License	❌

Vision-Language Models

Model	Size	Paper	HF Hub	License	Open
✅ Llama 3.2 Vision	11B	Paper	Hub	License	❌
✅ LLaVA-1.5	7B	Paper	Hub	License	❌
✅ Phi-3 Vision	4.2B	Paper	Hub	License	❌
✅ BLIP-2	3.6B	Paper	Hub	MIT	❌
✅ Qwen2-VL	2B	Blog	Hub	License	❌
✅ Qwen3-VL	2B/4B/8B	Blog	Hub	License	❌
✅ SmolVLM-Instruct	2B	Blog	Hub	Apache 2.0	✅

Base Models

Model	Size	Paper	HF Hub	License	Open
✅ SmolLM2	135M/360M/1.7B	Blog	Hub	Apache 2.0	✅
✅ Llama 3.2	1B/3B	Paper	Hub	License	❌
✅ Llama 3.1	8B/70B/405B	Paper	Hub	License	❌
✅ GPT-2	124M-1.5B	Paper	Hub	MIT	✅
DeepSeek V2	7B/13B	Blog	Hub	License	❌
Gemma2	2B/9B	Blog	Hub	License	❌
GPT-J	6B	Blog	Hub	Apache 2.0	✅
GPT-NeoX	20B	Paper	Hub	Apache 2.0	✅
Mistral	7B	Paper	Hub	Apache 2.0	❌
Mixtral	8x7B/8x22B	Blog	Hub	Apache 2.0	❌
MPT	7B	Blog	Hub	Apache 2.0	✅
OLMo	1B/7B	Paper	Hub	Apache 2.0	✅
✅ Llama 4 Scout	17B (Activated) 109B (Total)	Paper	Hub	License	❌

Reasoning Models

Model	Size	Paper	HF Hub	License	Open
✅ gpt-oss	20B/120B	Paper	Hub	Apache 2.0	❌
✅ Qwen3	0.6B-32B	Paper	Hub	License	❌
✅ Qwen3-Next	80B-A3B	Blog	Hub	License	❌
Qwen QwQ	32B	Blog	Hub	License	❌

Code Models

Model	Size	Paper	HF Hub	License	Open
✅ Qwen2.5 Coder	0.5B-32B	Blog	Hub	License	❌
DeepSeek Coder	1.3B-33B	Paper	Hub	License	❌
StarCoder 2	3B/7B/15B	Paper	Hub	License	✅

Math Models

Model	Size	Paper	HF Hub	License	Open
DeepSeek Math	7B	Paper	Hub	License	❌

📖 Documentation

To learn more about all the platform's capabilities, see the Oumi documentation.

🤝 Join the Community

Oumi is a community-first effort. Whether you are a developer, a researcher, or a non-technical user, all contributions are very welcome!

To contribute to the oumi repository, please check the CONTRIBUTING.md for guidance on how to contribute to send your first Pull Request.
Make sure to join our Discord community to get help, share your experiences, and contribute to the project!
If you are interested in joining one of the community's open-science efforts, check out our open collaboration page.

🙏 Acknowledgements

Oumi makes use of several libraries and tools from the open-source community. We would like to acknowledge and deeply thank the contributors of these projects! ✨ 🌟 💫

📝 Citation

If you find Oumi useful in your research, please consider citing it:

@software{oumi2025,
  author = {Oumi Community},
  title = {Oumi: an Open, End-to-end Platform for Building Large Foundation Models},
  month = {January},
  year = {2025},
  url = {https://github.com/oumi-ai/oumi}
}

📜 License

This project is licensed under the Apache License 2.0. See the LICENSE file for details.

[^1]: Open models are defined as models with fully open weights, training code, and data, and a permissive license. See Open Source Definitions for more information.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

oumi

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.8

May 7, 2026

0.7

Jan 29, 2026

0.6.0

Dec 17, 2025

0.5.0

Nov 18, 2025

0.4.2

Oct 20, 2025

0.4.1

Oct 14, 2025

0.4.0

Sep 2, 2025

0.3.0

Aug 5, 2025

0.2.1

Jul 11, 2025

0.2.0

Jun 23, 2025

0.1.14

Jun 10, 2025

0.1.13

May 29, 2025

0.1.12

Apr 16, 2025

0.1.11

Apr 6, 2025

0.1.10

Mar 25, 2025

0.1.9

Mar 24, 2025

0.1.8

Mar 10, 2025

0.1.7

Feb 25, 2025

0.1.6

Feb 22, 2025

0.1.5

Feb 20, 2025

0.1.4

Feb 18, 2025

0.1.3

Jan 28, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

oumi-0.8.tar.gz (3.4 MB view details)

Uploaded May 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

oumi-0.8-py3-none-any.whl (1.0 MB view details)

Uploaded May 7, 2026 Python 3

File details

Details for the file oumi-0.8.tar.gz.

File metadata

Download URL: oumi-0.8.tar.gz
Upload date: May 7, 2026
Size: 3.4 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for oumi-0.8.tar.gz
Algorithm	Hash digest
SHA256	`61265b6420a0f4c7a81114b5e1d71c3e3a03da44922b0f5ca8aa39041a4320c3`
MD5	`6c0b00a45ad64b9640719c402cd84310`
BLAKE2b-256	`7cceeb870a801da94a138384e8d97fcd9d05ff0a423a529a7da75cf5baa08731`

See more details on using hashes here.

Provenance

The following attestation bundles were made for oumi-0.8.tar.gz:

Publisher: release_pypi.yaml on oumi-ai/oumi

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: oumi-0.8.tar.gz
- Subject digest: 61265b6420a0f4c7a81114b5e1d71c3e3a03da44922b0f5ca8aa39041a4320c3
- Sigstore transparency entry: 1462899019
- Sigstore integration time: May 7, 2026
Source repository:
- Permalink: oumi-ai/oumi@f6f852545e204d0399fdfcf8445e7840e1672865
- Branch / Tag: refs/tags/v0.8
- Owner: https://github.com/oumi-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release_pypi.yaml@f6f852545e204d0399fdfcf8445e7840e1672865
- Trigger Event: workflow_dispatch

File details

Details for the file oumi-0.8-py3-none-any.whl.

File metadata

Download URL: oumi-0.8-py3-none-any.whl
Upload date: May 7, 2026
Size: 1.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for oumi-0.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a2d3e8947f19d2c639ec79072b1cdc26a90556371ac8db45b275e362bd874823`
MD5	`bb0e786d2f72506d97e557e5f2cb470a`
BLAKE2b-256	`4c1699d7585cb0160ded0f19e77fe72f45fb1f9545fbb74c3478b1532f6b24d5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for oumi-0.8-py3-none-any.whl:

Publisher: release_pypi.yaml on oumi-ai/oumi

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: oumi-0.8-py3-none-any.whl
- Subject digest: a2d3e8947f19d2c639ec79072b1cdc26a90556371ac8db45b275e362bd874823
- Sigstore transparency entry: 1462899063
- Sigstore integration time: May 7, 2026
Source repository:
- Permalink: oumi-ai/oumi@f6f852545e204d0399fdfcf8445e7840e1672865
- Branch / Tag: refs/tags/v0.8
- Owner: https://github.com/oumi-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release_pypi.yaml@f6f852545e204d0399fdfcf8445e7840e1672865
- Trigger Event: workflow_dispatch

oumi 0.8

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Everything you need to build state-of-the-art foundation models, end-to-end

🔥 News

🔎 About

🚀 Getting Started

🔧 Usage

Installation

Oumi CLI

Running Jobs Remotely

💻 Why use Oumi?

📚 Examples & Recipes

Qwen Family

🐋 DeepSeek R1 Family

🦙 Llama Family

🦅 Falcon family

💎 Gemma 3 Family

🦉 OLMo 3 Family

🎨 Vision Models

🔍 Even more options

Instruct Models

Vision-Language Models

Base Models

Reasoning Models

Code Models

Math Models

📖 Documentation

🤝 Join the Community

🙏 Acknowledgements

📝 Citation

📜 License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance