Fully differentiable reinforcement learning environments, written in Ivy.
Project description
What is Ivy Gym?
Ivy Gym opens the door for intersectional research between supervised learning (SL), reinforcement learning (RL), and trajectory optimization (TO), by implementing RL environments in a fully differentiable manner.
Specifically, Ivy gym provides differentiable implementations of the classic control tasks from OpenAI Gym, as well
as a new Swimmer task, which illustrates the simplicity of creating new tasks using Ivy. The differentiable nature
of the environments means that the cumulative reward can be directly optimized for in a supervised manner, without
need for reinforcement learning, which is the de facto approach for optimizing cumulative rewards. Ivy currently
supports Jax, TensorFlow, PyTorch, MXNet and Numpy. Check out the [docs](https://ivy-dl.org/gym) for more info!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ivy-gym-1.1.5.tar.gz
(11.1 kB
view details)
Built Distribution
ivy_gym-1.1.5-py3-none-any.whl
(17.1 kB
view details)
File details
Details for the file ivy-gym-1.1.5.tar.gz
.
File metadata
- Download URL: ivy-gym-1.1.5.tar.gz
- Upload date:
- Size: 11.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b12e399158cf485f51f4087052918174cb53820543562b176748dbd444c468fb |
|
MD5 | ca996285afb2fff1ce4f9df472b3eabd |
|
BLAKE2b-256 | f56d327c6219041beb3e59f72f8e2a8eeb80d5bda9439073995e2f5022633304 |
File details
Details for the file ivy_gym-1.1.5-py3-none-any.whl
.
File metadata
- Download URL: ivy_gym-1.1.5-py3-none-any.whl
- Upload date:
- Size: 17.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 71ae2c4e9cd601a719aaac4b0fd9332d52d5ee18209a0e814754d2f952806fad |
|
MD5 | 4ab0473f46f68a8d0114c6640ba84e64 |
|
BLAKE2b-256 | 36b7254740131bcbf446611bd83ec64cb644fdc758dd43250d62765de1074ad3 |