Skip to main content

VeOmni: Scaling any Modality Model Training to any Accelerators with PyTorch native Training Framework

Project description

VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo

GitHub Repo stars Paper Documentation WeChat

🍪 Overview

VeOmni is a versatile framework for both single- and multi-modal pre-training and post-training. It empowers users to seamlessly scale models of any modality across various accelerators, offering both flexibility and user-friendliness.

Our guiding principles when building VeOmni are:

  • Flexibility and Modularity: VeOmni is built with a modular design, allowing users to decouple most components and replace them with their own implementations as needed.

  • Trainer-free: VeOmni supports linear training scripts that avoid rigid, structured trainer classes (e.g., PyTorch-Lightning or HuggingFace Trainer). These training scripts expose the entire training logic to users for maximum transparency and control. Besides, VeOmni supports a basic trainer for text-only or vlm/omni models training and a rl trainer as a trainer backend in reinforcement learning.

  • Omni model native: VeOmni enables users to effortlessly scale any omni-model across devices and accelerators.

  • Torch native: VeOmni is designed to leverage PyTorch’s native functions to the fullest extent, ensuring maximum compatibility and performance.

🔥 Latest News

📚 Key Features

  • FSDP, FSDP2 backend for training.
  • Sequence Parallelism with Deepspeed Ulysess, support with non-async and async mode.
  • Experts Parallelism support large MOE model training, like Qwen3-Moe.
  • Efficient GroupGemm kernel for Moe model, Liger-Kernel.
  • Compatible with HuggingFace Transformers models. Qwen3, Qwen3-VL, Qwen3-Moe, etc
  • Dynamic batching strategy, Omnidata processing
  • Torch Distributed Checkpoint for checkpoint.
  • Support for both Nvidia-GPU and Ascend-NPU training.
  • Experiment tracking with wandb

📝 Upcoming Features and Changes

🚀 Getting Started

Documentation

Quick Start

✏️ Supported Models

Model Model size Example config File
DeepSeek2.5/3/R1 236B/671B deepseek.yaml
Llama3-3.3 1B/3B/8B/70B llama3.yaml
Qwen2-3 0.5B/1.5B/3B/7B/14B/32B/72B/ qwen2_5.yaml
Qwen2-3 VL/QVQ 2B/3B/7B/32B/72B qwen3_vl_dense.yaml
Qwen3-VL MoE 30BA3B/235BA22B qwen3_vl_moe.yaml
Qwen3-MoE 30BA3B/235BA22B qwen3-moe.yaml
Qwen2-3 Omni 7B/30BA3B qwen25_omni.yaml
Wan Wan2.1-I2V-14B-480P wan_sft.yaml
Omni Model Any Modality Training seed_omni.yaml

Support new models to VeOmni see Support New Models

⛰️ Performance

For more details, please refer to our paper.

💡 Awesome work using VeOmni

🎨 Contributing

Contributions from the community are welcome! Please check out CONTRIBUTING.md our project roadmap(To be updated),

📝 Citation and Acknowledgement

If you find VeOmni useful for your research and applications, feel free to give us a star ⭐ or cite us using:

@article{ma2025veomni,
  title={VeOmni: Scaling Any Modality Model Training with Model-Centric Distributed Recipe Zoo},
  author={Ma, Qianli and Zheng, Yaowei and Shi, Zhelun and Zhao, Zhongkai and Jia, Bin and Huang, Ziyue and Lin, Zhiqi and Li, Youjie and Yang, Jiacheng and Peng, Yanghua and others},
  journal={arXiv preprint arXiv:2508.02317},
  year={2025}
}

Thanks to the following projects for their excellent work:

Star History

Star History Chart

🌱 About ByteDance Seed Team

Founded in 2023, ByteDance Seed Team is dedicated to crafting the industry's most advanced AI foundation models. The team aspires to become a world-class research team and make significant contributions to the advancement of science and society. You can get to know Bytedance Seed better through the following channels👇

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

veomni-0.1.9a2.tar.gz (867.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

veomni-0.1.9a2-py3-none-any.whl (1.1 MB view details)

Uploaded Python 3

File details

Details for the file veomni-0.1.9a2.tar.gz.

File metadata

  • Download URL: veomni-0.1.9a2.tar.gz
  • Upload date:
  • Size: 867.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for veomni-0.1.9a2.tar.gz
Algorithm Hash digest
SHA256 437ff883b9ad94968f4891134fabe32ab71056518453f6d7a190dc9aa248624b
MD5 c45acd9033b00456631370bf3eaa16fe
BLAKE2b-256 09e5ed0210abe6fab5586efea6851103570b9822ef04d91dba6456400c4f6537

See more details on using hashes here.

Provenance

The following attestation bundles were made for veomni-0.1.9a2.tar.gz:

Publisher: publish.yml on ByteDance-Seed/VeOmni

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file veomni-0.1.9a2-py3-none-any.whl.

File metadata

  • Download URL: veomni-0.1.9a2-py3-none-any.whl
  • Upload date:
  • Size: 1.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for veomni-0.1.9a2-py3-none-any.whl
Algorithm Hash digest
SHA256 da733990b1f12f2fd445ab854801e456f4c7d494e5c667c175d9bd22ecee70b9
MD5 ee1bb1144402d3b269fc17db0494f22b
BLAKE2b-256 2370f337ab7f16f4da1197bbbeb43896ae84b91c1d4a81a2ab8caa02b8c85e1a

See more details on using hashes here.

Provenance

The following attestation bundles were made for veomni-0.1.9a2-py3-none-any.whl:

Publisher: publish.yml on ByteDance-Seed/VeOmni

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page