Skip to main content

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Project description

NVIDIA Cosmos Header

Getting Started

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Documentation.

System Architecture

Cosmos-RL provides toolchain to enable large scale RL training workload with following features:

  1. Parallelism
    • Tensor Parallelism
    • Sequence Parallelism
    • Context Parallelism
    • FSDP Parallelism
    • Pipeline Parallelism
  2. Fully asynchronous (replicas specialization)
    • Policy (Consumer): Replicas of training instances
    • Rollout (Producer): Replicas of generation engines
    • Low-precision training (FP8) and rollout (FP8 & FP4) support
  3. Single-Controller Architecture
    • Efficient messaging system (e.g., weight-sync, rollout, evaluate) to coordinate policy and rollout replicas
    • Dynamic NCCL Process Groups for on-the-fly GPU [un]registration to enable fault-tolerant and elastic large-scale RL training

Policy-Rollout-Controller Decoupled Architecture

License and Contact

This project will download and install additional third-party open source software projects. Review the license terms of these open source projects before use.

NVIDIA Cosmos source code is released under the Apache 2 License.

NVIDIA Cosmos models are released under the NVIDIA Open Model License. For a custom license, please contact cosmos-license@nvidia.com.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cosmos_rl-0.4.3.tar.gz (5.6 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cosmos_rl-0.4.3-py3-none-any.whl (1.6 MB view details)

Uploaded Python 3

File details

Details for the file cosmos_rl-0.4.3.tar.gz.

File metadata

  • Download URL: cosmos_rl-0.4.3.tar.gz
  • Upload date:
  • Size: 5.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for cosmos_rl-0.4.3.tar.gz
Algorithm Hash digest
SHA256 67ab8d080b245ae5daf58ef2691a611f418a42e36b44cf10cb380d6adbc521e3
MD5 ed359280f1cf6f4fd109acff1701bb1b
BLAKE2b-256 6c7025b3fb0cb504d493b18a15f310bfd6e901c613ecdeee3eefd1bc93a4123f

See more details on using hashes here.

File details

Details for the file cosmos_rl-0.4.3-py3-none-any.whl.

File metadata

  • Download URL: cosmos_rl-0.4.3-py3-none-any.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for cosmos_rl-0.4.3-py3-none-any.whl
Algorithm Hash digest
SHA256 b667773f60b0a49135b57f614697e5bea336daddc57b60acb6dfdc0a396a68aa
MD5 0d483c0100c1d577b06cce6f1aa72f19
BLAKE2b-256 b32ebef42d96460348fb17b5e2b0a5e799b684f56ffdb307b15369ef067dad64

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page