Deep Reinforcement Learning with JAX and Equinox.

Project description

Lerax

This is a work-in-progress implementation of a JAX-based reinforcement learning library using Equinox. The main feature is Neural Differential Equation (NDE) based models. NDEs can be extraordinarily computationally intensive, so this library is intended to provide an optimised implementation of NDEs and other RL algorithms using just-in-time (JIT) compilation. Paired with environments that support JIT, high performance is possible using the Anakin architecture for fully GPU-based RL.
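As a rough illustration of the Anakin idea (environment, policy, and update all inside one JIT-compiled program), here is a minimal sketch in plain JAX. It does not use Lerax's API: the toy point-mass environment, `env_reset`, `env_step`, `policy`, and `train_step` are all hypothetical names made up for this example, and the update simply differentiates through the toy environment rather than implementing a real RL algorithm.

```python
import functools

import jax
import jax.numpy as jnp


def env_reset(key):
    # Hypothetical toy environment: state is [position, velocity].
    return jax.random.uniform(key, (2,), minval=-1.0, maxval=1.0)


def env_step(state, action):
    # Pure function of (state, action), so it can be traced by jit/vmap/scan.
    pos, vel = state[0], state[1]
    vel = vel + 0.1 * action
    pos = pos + 0.1 * vel
    return jnp.stack([pos, vel]), -(pos**2)  # reward: stay near the origin


def policy(params, obs):
    # Hypothetical linear policy producing a scalar action in (-1, 1).
    return jnp.tanh(jnp.dot(params, obs))


def episode_return(params, key, num_steps):
    def body(state, _):
        next_state, reward = env_step(state, policy(params, state))
        return next_state, reward

    # lax.scan keeps the whole rollout on the accelerator.
    _, rewards = jax.lax.scan(body, env_reset(key), None, length=num_steps)
    return rewards.sum()


@functools.partial(jax.jit, static_argnames=("num_envs", "num_steps"))
def train_step(params, key, num_envs=64, num_steps=32, lr=1e-2):
    keys = jax.random.split(key, num_envs)

    def loss_fn(p):
        # vmap runs many environments in parallel inside the same program.
        returns = jax.vmap(lambda k: episode_return(p, k, num_steps))(keys)
        return -returns.mean()

    loss, grads = jax.value_and_grad(loss_fn)(params)
    return params - lr * grads, loss


params = jnp.zeros(2)
params, loss = train_step(params, jax.random.PRNGKey(0))
```

Because acting and updating happen in a single compiled function, no data crosses between host and device during a step, which is the property the Anakin architecture relies on.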

I'm working on this in my free time, so it may take a while to get to a usable state. I'm also mainly developing this for personal research, so it may not be suitable for all use cases.

Code Style

This code is written to follow Equinox's abstract/final pattern for code structure and uses Black formatting. This is intended to make the code more readable and maintainable, and to ensure that it is consistent with the Equinox library. If you want to contribute, please follow these conventions.

Credit

A ton of the code is a slight translation of the code found in the Stable Baselines 3 and Gymnasium libraries, both of which are under the MIT license. The developers of these excellent libraries have done a great job of creating a solid foundation for reinforcement learning in Python, and I have learned a lot from their code.

In addition, the NDE code is heavily inspired by the work of Patrick Kidger, and the entire library is based on his excellent Equinox library, along with some use of Diffrax and jaxtyping.

TODO

- Expand support beyond Box and Discrete spaces
- Logging
  - Code flow logging
  - Training logging
  - Migrate from tensorboard to aim
- Documentation
  - Standardize docstring formats
  - Write documentation for all public APIs
  - Publish docs
- Testing
  - More thorough unit testing
  - Integration testing
  - Runtime jaxtyping
- Use it
  - Personal research
- Optimise for performance under JIT compilation
  - Good vectorization support
  - Sharding support for distributed training
- Round out features
  - Rendering support
  - Expand RL variants to include more algorithms
  - Create a more comprehensive set of environments

Project details

Release history

- 0.0.5: Jan 25, 2026
- 0.0.4: Dec 16, 2025
- 0.0.3: Dec 7, 2025
- 0.0.2: Nov 16, 2025
- 0.0.1: Oct 12, 2025 (this version)
- 0.0.1a1 (pre-release): Nov 16, 2025
- 0.0.1a0 (pre-release): Oct 12, 2025

Download files

Source Distribution

lerax-0.0.1.tar.gz (45.1 kB)

Uploaded Oct 12, 2025 Source

Built Distribution

lerax-0.0.1-py3-none-any.whl (67.5 kB)

Uploaded Oct 12, 2025 Python 3

File details

Details for the file lerax-0.0.1.tar.gz.

File metadata

Download URL: lerax-0.0.1.tar.gz
Upload date: Oct 12, 2025
Size: 45.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for lerax-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`19748d9b3999c88947a170823c6c79af63cc353b73a8aa7d05ff0ea674de40e4`
MD5	`9a0f47077db30082cea8bb6f9c9e6518`
BLAKE2b-256	`d55af6c3e4784ac9f278cc678db511e1fd124e872726164b9fff8ce1520f1d2b`

File details

Details for the file lerax-0.0.1-py3-none-any.whl.

File metadata

Download URL: lerax-0.0.1-py3-none-any.whl
Upload date: Oct 12, 2025
Size: 67.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.7

File hashes

Hashes for lerax-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`04b746e0b0224771562718ddd259451d43ba35a7f5c36bf38bb3e1b4d69e9579`
MD5	`bf209e126de897af64c94487396aae6a`
BLAKE2b-256	`45e6e6aed541be3110d8376e34b207e685b0e495594c6ebdde5a57e116f673f0`
