Skip to main content

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Project description

NVIDIA Cosmos Header

Getting Started

Cosmos-RL is a flexible and scalable Reinforcement Learning framework specialized for Physical AI applications.

Documentation.

System Architecture

Cosmos-RL provides toolchain to enable large scale RL training workload with following features:

  1. Parallelism
    • Tensor Parallelism
    • Sequence Parallelism
    • Context Parallelism
    • FSDP Parallelism
    • Pipeline Parallelism
  2. Fully asynchronous (replicas specialization)
    • Policy (Consumer): Replicas of training instances
    • Rollout (Producer): Replicas of generation engines
    • Low-precision training (FP8) and rollout (FP8 & FP4) support
  3. Single-Controller Architecture
    • Efficient messaging system (e.g., weight-sync, rollout, evaluate) to coordinate policy and rollout replicas
    • Dynamic NCCL Process Groups for on-the-fly GPU [un]registration to enable fault-tolerant and elastic large-scale RL training

Policy-Rollout-Controller Decoupled Architecture

License and Contact

This project will download and install additional third-party open source software projects. Review the license terms of these open source projects before use.

NVIDIA Cosmos source code is released under the Apache 2 License.

NVIDIA Cosmos models are released under the NVIDIA Open Model License. For a custom license, please contact cosmos-license@nvidia.com.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cosmos_rl-0.4.2.tar.gz (5.5 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cosmos_rl-0.4.2-py3-none-any.whl (1.6 MB view details)

Uploaded Python 3

File details

Details for the file cosmos_rl-0.4.2.tar.gz.

File metadata

  • Download URL: cosmos_rl-0.4.2.tar.gz
  • Upload date:
  • Size: 5.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for cosmos_rl-0.4.2.tar.gz
Algorithm Hash digest
SHA256 31025e04791b3dfa3b08c600a11fcd42257998d86fa61cc48ea92b0d7ef53211
MD5 6c116bf1ab595cadf796c8c8c6d0fc74
BLAKE2b-256 eb6c188be3cfc84af6f7744767737311a6532af5d10bc4b760b8a419329fc94f

See more details on using hashes here.

File details

Details for the file cosmos_rl-0.4.2-py3-none-any.whl.

File metadata

  • Download URL: cosmos_rl-0.4.2-py3-none-any.whl
  • Upload date:
  • Size: 1.6 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for cosmos_rl-0.4.2-py3-none-any.whl
Algorithm Hash digest
SHA256 774c8f4822e31649b1f660ef8c5970feefd07035e7c107d00e1662ba6fd613ac
MD5 848bf727611fbd7c9a27a644375c04b0
BLAKE2b-256 8a7d5d27cf9edf62b149e4befd1fe59217c2f7a37d08827aa1c8a2e13eecf4c7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page