Skip to main content

eXtensible Orchestration for Reinforcement Learning

Project description

xorl

eXtensible Orchestration for Reinforcement Learning — a lightweight infrastructure for RL training.

Features

  • TrainConfig — dataclass-based configuration for RL hyperparameters (PPO-style: gamma, GAE, clipping, etc.)
  • Trainer — base trainer with optimizer setup, checkpointing, and training loop
  • Utilities — seed management, gradient norm computation, parameter counting

Installation

pip install xorl

Quick Start

from turl import TrainConfig, Trainer

config = TrainConfig(
    lr=3e-4,
    batch_size=64,
    gamma=0.99,
    clip_range=0.2,
)

License

Apache-2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xorl-0.1.1.tar.gz (3.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xorl-0.1.1-py3-none-any.whl (3.9 kB view details)

Uploaded Python 3

File details

Details for the file xorl-0.1.1.tar.gz.

File metadata

  • Download URL: xorl-0.1.1.tar.gz
  • Upload date:
  • Size: 3.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for xorl-0.1.1.tar.gz
Algorithm Hash digest
SHA256 d139b4e1f03243f8e350da4bd92b5405de373722ab613725a34ce69081896f59
MD5 49a09e837ab197b49d0e291add4750ab
BLAKE2b-256 0cd95823cfca4b642a2cd090846f170add7097858fa07535153dc96657edf9ec

See more details on using hashes here.

File details

Details for the file xorl-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: xorl-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 3.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for xorl-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bdb49b00bf4424caf6d2505880de79b6fae354ae540cead1ec1d74ccff232af7
MD5 65ee84026d178c380cca304ee2716d9a
BLAKE2b-256 2748bcd3b9eefed6ae0303d440a2264510600e53680041c9a354e701c1a12c0a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page