These details have not been verified by PyPI

Project links

Homepage

Project description

TorchRL

TorchRL is an open-source Reinforcement Learning (RL) library for PyTorch.

Key features

🐍 Python-first: Designed with Python as the primary language for ease of use and flexibility
⏱️ Efficient: Optimized for performance to support demanding RL research applications
🧮 Modular, customizable, extensible: Highly modular architecture allows for easy swapping, transformation, or creation of new components
📚 Documented: Thorough documentation ensures that users can quickly understand and utilize the library
✅ Tested: Rigorously tested to ensure reliability and stability
⚙️ Reusable functionals: Provides a set of highly reusable functions for cost functions, returns, and data processing

Design Principles

🔥 Aligns with PyTorch ecosystem: Follows the structure and conventions of popular PyTorch libraries (e.g., dataset pillar, transforms, models, data utilities)
➖ Minimal dependencies: Only requires Python standard library, NumPy, and PyTorch; optional dependencies for common environment libraries (e.g., OpenAI Gym) and datasets (D4RL, OpenX...)

Read the full paper for a more curated description of the library.

Getting started

Check our Getting Started tutorials to quickly ramp up with the basic features of the library!

Documentation and knowledge base

The TorchRL documentation can be found here. It contains tutorials and the API reference.

TorchRL also provides an RL knowledge base to help you debug your code, or simply learn the basics of RL. Check it out here.

We have some introductory videos for you to get to know the library better, check them out:

Spotlight publications

Because TorchRL is domain-agnostic, you can use it across many different fields. Here are a few examples:

ACEGEN: Reinforcement Learning of Generative Chemical Agents for Drug Discovery
BenchMARL: Benchmarking Multi-Agent Reinforcement Learning
BricksRL: A Platform for Democratizing Robotics and Reinforcement Learning Research and Education with LEGO
OmniDrones: An Efficient and Flexible Platform for Reinforcement Learning in Drone Control
RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark
Robohive: A unified framework for robot learning

Writing simplified and portable RL codebase with `TensorDict`

RL algorithms are very heterogeneous, and it can be hard to recycle a codebase across settings (e.g. from online to offline, from state-based to pixel-based learning). TorchRL solves this problem through TensorDict, a convenient data structure⁽¹⁾ that can be used to streamline one's RL codebase. With this tool, one can write a complete PPO training script in fewer than 100 lines of code!

import torch
from tensordict.nn import TensorDictModule
from tensordict.nn.distributions import NormalParamExtractor
from torch import nn

from torchrl.collectors import SyncDataCollector
from torchrl.data.replay_buffers import TensorDictReplayBuffer, \
  LazyTensorStorage, SamplerWithoutReplacement
from torchrl.envs.libs.gym import GymEnv
from torchrl.modules import ProbabilisticActor, ValueOperator, TanhNormal
from torchrl.objectives import ClipPPOLoss
from torchrl.objectives.value import GAE

env = GymEnv("Pendulum-v1") 
model = TensorDictModule(
  nn.Sequential(
      nn.Linear(3, 128), nn.Tanh(),
      nn.Linear(128, 128), nn.Tanh(),
      nn.Linear(128, 128), nn.Tanh(),
      nn.Linear(128, 2),
      NormalParamExtractor()
  ),
  in_keys=["observation"],
  out_keys=["loc", "scale"]
)
critic = ValueOperator(
  nn.Sequential(
      nn.Linear(3, 128), nn.Tanh(),
      nn.Linear(128, 128), nn.Tanh(),
      nn.Linear(128, 128), nn.Tanh(),
      nn.Linear(128, 1),
  ),
  in_keys=["observation"],
)
actor = ProbabilisticActor(
  model,
  in_keys=["loc", "scale"],
  distribution_class=TanhNormal,
  distribution_kwargs={"low": -1.0, "high": 1.0},
  return_log_prob=True
  )
buffer = TensorDictReplayBuffer(
  storage=LazyTensorStorage(1000),
  sampler=SamplerWithoutReplacement(),
  batch_size=50,
  )
collector = SyncDataCollector(
  env,
  actor,
  frames_per_batch=1000,
  total_frames=1_000_000,
)
loss_fn = ClipPPOLoss(actor, critic)
adv_fn = GAE(value_network=critic, average_gae=True, gamma=0.99, lmbda=0.95)
optim = torch.optim.Adam(loss_fn.parameters(), lr=2e-4)

for data in collector:  # collect data
  for epoch in range(10):
      adv_fn(data)  # compute advantage
      buffer.extend(data)
      for sample in buffer:  # consume data
          loss_vals = loss_fn(sample)
          loss_val = sum(
              value for key, value in loss_vals.items() if
              key.startswith("loss")
              )
          loss_val.backward()
          optim.step()
          optim.zero_grad()
  print(f"avg reward: {data['next', 'reward'].mean().item(): 4.4f}")
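
The `GAE` module in the script above computes generalized advantage estimates. As a rough, library-independent illustration of the recursion behind that idea, here is a plain-Python sketch for a single trajectory. The function name and argument layout are assumptions made for this example; it is not TorchRL's implementation, and it omits the standardization that `average_gae=True` requests.

```python
def gae_advantages(rewards, values, next_values, dones, gamma=0.99, lmbda=0.95):
    """Generalized advantage estimation over one trajectory (plain lists)."""
    advantages = [0.0] * len(rewards)
    running = 0.0
    # Walk the trajectory backwards so each step can reuse the next one.
    for t in reversed(range(len(rewards))):
        not_done = 0.0 if dones[t] else 1.0
        # One-step TD error; no bootstrapping past the end of an episode.
        delta = rewards[t] + gamma * next_values[t] * not_done - values[t]
        running = delta + gamma * lmbda * not_done * running
        advantages[t] = running
    return advantages


adv = gae_advantages(
    rewards=[1.0, 1.0, 1.0],
    values=[0.0, 0.0, 0.0],
    next_values=[0.0, 0.0, 0.0],
    dones=[False, False, True],
    gamma=0.5,
    lmbda=1.0,
)
print(adv)
```

With all value estimates set to zero and `lmbda=1.0`, the advantages reduce to discounted reward sums, which makes the result easy to check by hand.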

During a rollout, the environment API relies on TensorDict to carry data from one function to the next.

TensorDict makes it easy to re-use pieces of code across environments, models and algorithms.

For instance, here's how to code a rollout in TorchRL:

- obs, done = env.reset()
+ tensordict = env.reset()
policy = SafeModule(
    model,
    in_keys=["observation_pixels", "observation_vector"],
    out_keys=["action"],
)
out = []
for i in range(n_steps):
-     action, log_prob = policy(obs)
-     next_obs, reward, done, info = env.step(action)
-     out.append((obs, next_obs, action, log_prob, reward, done))
-     obs = next_obs
+     tensordict = policy(tensordict)
+     tensordict = env.step(tensordict)
+     out.append(tensordict)
+     tensordict = step_mdp(tensordict)  # renames next_observation_* keys to observation_*
- obs, next_obs, action, log_prob, reward, done = [torch.stack(vals, 0) for vals in zip(*out)]
+ out = torch.stack(out, 0)  # TensorDict supports multiple tensor operations

Using this, TorchRL abstracts away the input / output signatures of the modules, env, collectors, replay buffers and losses of the library, allowing all primitives to be easily recycled across settings.
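
To make the idea concrete without depending on TorchRL, here is a toy sketch in plain Python. `KeyedModule` is a hypothetical stand-in for the key-based wrappers used above, not the library's class. It reads its inputs from a dictionary under `in_keys` and writes its results under `out_keys`, so callers never need to know a component's positional signature.

```python
class KeyedModule:
    """Hypothetical wrapper: reads in_keys from a dict, writes out_keys back."""

    def __init__(self, fn, in_keys, out_keys):
        self.fn = fn
        self.in_keys = list(in_keys)
        self.out_keys = list(out_keys)

    def __call__(self, data):
        outputs = self.fn(*(data[key] for key in self.in_keys))
        if len(self.out_keys) == 1:
            outputs = (outputs,)  # normalize single outputs to a tuple
        for key, value in zip(self.out_keys, outputs):
            data[key] = value
        return data


# Two components with different signatures share one calling convention.
policy = KeyedModule(lambda obs: -obs, in_keys=["observation"], out_keys=["action"])
dynamics = KeyedModule(
    lambda obs, act: (obs + act, float(obs + act == 0)),
    in_keys=["observation", "action"],
    out_keys=["next_observation", "reward"],
)

data = {"observation": 3}
for module in (policy, dynamics):
    data = module(data)
print(data)
```

Because every component takes and returns the same container, the loop that chains them does not change when a component gains or loses an input.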

Here's another example of an off-policy training loop in TorchRL (assuming that a data collector, a replay buffer, a loss and an optimizer have been instantiated):

- for i, (obs, next_obs, action, hidden_state, reward, done) in enumerate(collector):
+ for i, tensordict in enumerate(collector):
-     replay_buffer.add((obs, next_obs, action, log_prob, reward, done))
+     replay_buffer.add(tensordict)
    for j in range(num_optim_steps):
-         obs, next_obs, action, hidden_state, reward, done = replay_buffer.sample(batch_size)
-         loss = loss_fn(obs, next_obs, action, hidden_state, reward, done)
+         tensordict = replay_buffer.sample(batch_size)
+         loss = loss_fn(tensordict)
        loss.backward()
        optim.step()
        optim.zero_grad()

This training loop can be re-used across algorithms as it makes a minimal number of assumptions about the structure of the data.

TensorDict supports many tensor operations on its device and shape. The shape of a TensorDict, or its batch size, is the first N dimensions shared by all the tensors it contains (N is arbitrary):

# stack and cat
tensordict = torch.stack(list_of_tensordicts, 0)
tensordict = torch.cat(list_of_tensordicts, 0)
# reshape
tensordict = tensordict.view(-1)
tensordict = tensordict.permute(0, 2, 1)
tensordict = tensordict.unsqueeze(-1)
tensordict = tensordict.squeeze(-1)
# indexing
tensordict = tensordict[:2]
tensordict[:, 2] = sub_tensordict
# device and memory location
tensordict.cuda()
tensordict.to("cuda:1")
tensordict.share_memory_()
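
As a rough illustration of the batch-size rule, the largest batch size a set of tensors can share is the longest common prefix of their shapes. The helper below is a plain-Python sketch; `common_batch_size` is a hypothetical name and is not part of the tensordict API.

```python
def common_batch_size(shapes):
    """Return the longest prefix of dimensions shared by every shape."""
    shapes = [tuple(shape) for shape in shapes]
    prefix = []
    # zip stops at the shortest shape, so the prefix never overruns any tensor.
    for dims in zip(*shapes):
        if len(set(dims)) != 1:
            break
        prefix.append(dims[0])
    return tuple(prefix)


# Observation, reward and done flags collected over 4 envs and 20 steps.
rollout_batch = common_batch_size([(4, 20, 3), (4, 20, 1), (4, 20)])
# Sequences of different lengths share no leading dimension at all.
mixed_batch = common_batch_size([(10, 32, 512), (20, 32, 512)])
print(rollout_batch, mixed_batch)
```

Any shorter prefix of that result is also a consistent batch size; the trailing dimensions are then treated as feature dimensions.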

TensorDict comes with a dedicated tensordict.nn module that contains everything you might need to write your model with it. And it is functorch and torch.compile compatible!

transformer_model = nn.Transformer(nhead=16, num_encoder_layers=12)
+ td_module = SafeModule(transformer_model, in_keys=["src", "tgt"], out_keys=["out"])
src = torch.rand((10, 32, 512))
tgt = torch.rand((20, 32, 512))
+ tensordict = TensorDict({"src": src, "tgt": tgt}, batch_size=[])  # src and tgt share no leading dimension
- out = transformer_model(src, tgt)
+ td_module(tensordict)
+ out = tensordict["out"]

The TensorDictSequential class lets you chain nn.Module instances in a highly modular way. For instance, here is an implementation of a transformer using the encoder and decoder blocks:

encoder_module = TransformerEncoder(...)
encoder = TensorDictModule(encoder_module, in_keys=["src", "src_mask"], out_keys=["memory"])
decoder_module = TransformerDecoder(...)
decoder = TensorDictModule(decoder_module, in_keys=["tgt", "memory"], out_keys=["output"])
transformer = TensorDictSequential(encoder, decoder)
assert transformer.in_keys == ["src", "src_mask", "tgt"]
assert transformer.out_keys == ["memory", "output"]

TensorDictSequential can isolate subgraphs by querying a set of desired input / output keys:

transformer.select_subsequence(out_keys=["memory"])  # returns the encoder
transformer.select_subsequence(in_keys=["tgt", "memory"])  # returns the decoder
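
The way a sequence derives its own input and output keys can be sketched in a few lines of plain Python. This is an illustrative model of the idea only: `sequence_keys` is a hypothetical helper, and each module is reduced to a pair of key lists.

```python
def sequence_keys(modules):
    """Infer the external inputs and the outputs of a chain of keyed modules.

    `modules` is a list of (in_keys, out_keys) pairs, applied in order.
    """
    in_keys, out_keys = [], []
    for module_in, module_out in modules:
        for key in module_in:
            # A key is an external input unless an earlier module produced it.
            if key not in out_keys and key not in in_keys:
                in_keys.append(key)
        for key in module_out:
            if key not in out_keys:
                out_keys.append(key)
    return in_keys, out_keys


encoder = (["src", "src_mask"], ["memory"])
decoder = (["tgt", "memory"], ["output"])
in_keys, out_keys = sequence_keys([encoder, decoder])
print(in_keys, out_keys)
```

Here `memory` does not appear among the inputs because the encoder produces it before the decoder asks for it, which mirrors the assertions shown for the transformer example.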

Check TensorDict tutorials to learn more!

Features

A common interface for environments that supports common libraries (OpenAI Gym, DeepMind Control Suite, etc.)⁽¹⁾ and stateless execution (e.g. model-based environments). The batched environment containers allow parallel execution⁽²⁾. A common PyTorch-first tensor-specification class is also provided. TorchRL's environment API is simple but stringent and specific. Check the documentation and tutorial to learn more!
```
env_make = lambda: GymEnv("Pendulum-v1", from_pixels=True)
env_parallel = ParallelEnv(4, env_make)  # creates 4 envs in parallel
tensordict = env_parallel.rollout(max_steps=20, policy=None)  # random rollout (no policy given)
assert tensordict.shape == [4, 20]  # 4 envs, 20 steps rollout
env_parallel.action_spec.is_in(tensordict["action"])  # spec check returns True
```

multiprocess and distributed data collectors⁽²⁾ that work synchronously or asynchronously. Through the use of TensorDict, TorchRL's training loops are made very similar to regular training loops in supervised learning (although the "dataloader" -- read data collector -- is modified on-the-fly):

env_make = lambda: GymEnv("Pendulum-v1", from_pixels=True)
collector = MultiaSyncDataCollector(
    [env_make, env_make],
    policy=policy,
    devices=["cuda:0", "cuda:0"],
    total_frames=10000,
    frames_per_batch=50,
    ...
)
for i, tensordict_data in enumerate(collector):
    loss = loss_module(tensordict_data)
    loss.backward()
    optim.step()
    optim.zero_grad()
    collector.update_policy_weights_()

Check our distributed collector examples to learn more about ultra-fast data collection with TorchRL.

efficient⁽²⁾ and generic⁽¹⁾ replay buffers with modularized storage:

storage = LazyMemmapStorage(  # memory-mapped (physical) storage
    cfg.buffer_size,
    scratch_dir="/tmp/"
)
buffer = TensorDictPrioritizedReplayBuffer(
    alpha=0.7,
    beta=0.5,
    collate_fn=lambda x: x,
    pin_memory=device != torch.device("cpu"),
    prefetch=10,  # multi-threaded sampling
    storage=storage
)
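
For readers unfamiliar with the `alpha` and `beta` arguments, the sketch below shows proportional prioritized sampling as it is commonly described: `alpha` sharpens or flattens the sampling distribution, and `beta` controls the importance-sampling correction. It is a plain-Python illustration under those assumptions. The function name is hypothetical, and no claim is made that TorchRL's buffer matches it term for term.

```python
import random


def prioritized_sample(priorities, batch_size, alpha=0.7, beta=0.5, rng=random):
    """Sample indices with probability proportional to priority ** alpha.

    Returns the sampled indices and their importance-sampling weights,
    normalized so that the largest possible weight is 1.
    Priorities are assumed to be strictly positive.
    """
    scaled = [p ** alpha for p in priorities]
    total = sum(scaled)
    probs = [s / total for s in scaled]
    indices = rng.choices(range(len(priorities)), weights=probs, k=batch_size)
    n = len(priorities)
    # The rarest item gets the largest correction; use it as the normalizer.
    max_weight = (n * min(probs)) ** (-beta)
    weights = [(n * probs[i]) ** (-beta) / max_weight for i in indices]
    return indices, weights


indices, weights = prioritized_sample(
    [1.0, 2.0, 4.0, 8.0], batch_size=5, rng=random.Random(0)
)
print(indices, weights)
```

With `alpha=0` every item is equally likely, and with `beta=0` all weights equal 1, so the two parameters interpolate between uniform replay and fully corrected prioritized replay.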

Replay buffers are also offered as wrappers around common datasets for offline RL:

from torchrl.data.replay_buffers import SamplerWithoutReplacement
from torchrl.data.datasets.d4rl import D4RLExperienceReplay
data = D4RLExperienceReplay(
    "maze2d-open-v0",
    split_trajs=True,
    batch_size=128,
    sampler=SamplerWithoutReplacement(drop_last=True),
)
for sample in data:  # or alternatively sample = data.sample()
    fun(sample)

cross-library environment transforms⁽¹⁾, executed on device and in a vectorized fashion⁽²⁾, which process and prepare the data coming out of the environments to be used by the agent:

env_make = lambda: GymEnv("Pendulum-v1", from_pixels=True)
env_base = ParallelEnv(4, env_make, device="cuda:0")  # creates 4 envs in parallel
env = TransformedEnv(
    env_base,
    Compose(
        ToTensorImage(),
        ObservationNorm(loc=0.5, scale=1.0)),  # executes the transforms once and on device
)
tensordict = env.reset()
assert tensordict.device == torch.device("cuda:0")

Other transforms include: reward scaling (RewardScaling), shape operations (concatenation of tensors, unsqueezing etc.), concatenation of successive frames (CatFrames), resizing (Resize) and many more.

Unlike other libraries, the transforms are stacked as a list (and not wrapped in each other), which makes it easy to add and remove them at will:

env.insert_transform(0, NoopResetEnv())  # inserts the NoopResetEnv transform at the index 0

Nevertheless, transforms can access and execute operations on the parent environment:

transform = env.transform[1]  # gathers the second transform of the list
parent_env = transform.parent  # returns the base environment of the second transform, i.e. the base env + the first transform
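
The benefit of keeping transforms in a flat list, rather than wrapping them around each other, can be seen in a small plain-Python sketch. `TransformPipeline` is a hypothetical class written for illustration; it is not TorchRL's `TransformedEnv` or `Compose`.

```python
class TransformPipeline:
    """Hypothetical container applying a flat list of transforms in order."""

    def __init__(self, *transforms):
        self.transforms = list(transforms)

    def insert_transform(self, index, transform):
        self.transforms.insert(index, transform)

    def __call__(self, value):
        for transform in self.transforms:
            value = transform(value)
        return value


def scale(x):
    return x / 255.0


def center(x):
    return x - 0.5


pipeline = TransformPipeline(scale, center)
before = pipeline(255.0)           # (255 / 255) - 0.5
pipeline.insert_transform(0, abs)  # new first step, nothing is re-wrapped
after = pipeline(-255.0)           # abs, then scale, then center
del pipeline.transforms[0]         # removal is just as direct
print(before, after, len(pipeline.transforms))
```

With nested wrappers, inserting a step in the middle would mean unwrapping and rebuilding the outer layers; with a list it is a single `insert` or `del`.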

various tools for distributed learning (e.g. memory mapped tensors)⁽²⁾;

various architectures and models (e.g. actor-critic)⁽¹⁾:

# create an nn.Module
common_module = ConvNet(
    bias_last_layer=True,
    depth=None,
    num_cells=[32, 64, 64],
    kernel_sizes=[8, 4, 3],
    strides=[4, 2, 1],
)
# Wrap it in a SafeModule, indicating what key to read in and where to
# write out the output
common_module = SafeModule(
    common_module,
    in_keys=["pixels"],
    out_keys=["hidden"],
)
# Wrap the policy module in NormalParamsWrapper, such that the output
# tensor is split in loc and scale, and scale is mapped onto a positive space
policy_module = SafeModule(
    NormalParamsWrapper(
        MLP(num_cells=[64, 64], out_features=32, activation=nn.ELU)
    ),
    in_keys=["hidden"],
    out_keys=["loc", "scale"],
)
# Use a SafeProbabilisticTensorDictSequential to combine the SafeModule with a
# SafeProbabilisticModule, indicating how to build the
# torch.distribution.Distribution object and what to do with it
policy_module = SafeProbabilisticTensorDictSequential(  # stochastic policy
    policy_module,
    SafeProbabilisticModule(
        in_keys=["loc", "scale"],
        out_keys="action",
        distribution_class=TanhNormal,
    ),
)
value_module = MLP(
    num_cells=[64, 64],
    out_features=1,
    activation=nn.ELU,
)
# Wrap the policy and value function in a common module
actor_value = ActorValueOperator(common_module, policy_module, value_module)
# standalone policy from this
standalone_policy = actor_value.get_policy_operator()

exploration wrappers and modules to easily swap between exploration and exploitation⁽¹⁾:

policy_explore = EGreedyWrapper(policy)
with set_exploration_type(ExplorationType.RANDOM):
    tensordict = policy_explore(tensordict)  # will use eps-greedy
with set_exploration_type(ExplorationType.DETERMINISTIC):
    tensordict = policy_explore(tensordict)  # will not use eps-greedy
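
The idea behind switching exploration on and off can be sketched without the library. In the snippet below, `epsilon_greedy` and its `explore` flag are hypothetical names standing in for the wrapper and the exploration-type context manager. The sketch only shows the decision rule, not how TorchRL implements it.

```python
import random


def epsilon_greedy(q_values, eps, explore=True, rng=random):
    """Pick an action index from a list of action values."""
    if explore and rng.random() < eps:
        # Exploration: uniform random action.
        return rng.randrange(len(q_values))
    # Exploitation: action with the highest value.
    return max(range(len(q_values)), key=q_values.__getitem__)


q_values = [0.1, 0.9, 0.3]
# With exploration disabled, eps is ignored and the best action is returned.
greedy_action = epsilon_greedy(q_values, eps=1.0, explore=False)
# With exploration enabled and eps=1.0, every action is drawn at random.
rng = random.Random(0)
explored = [epsilon_greedy(q_values, eps=1.0, rng=rng) for _ in range(100)]
print(greedy_action, sorted(set(explored)))
```

Keeping the switch outside the policy means the same policy object can be used for data collection and for evaluation.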

A series of efficient loss modules and highly vectorized functional return and advantage computation.

Loss modules

from torchrl.objectives import DQNLoss
loss_module = DQNLoss(value_network=value_network, gamma=0.99)
tensordict = replay_buffer.sample(batch_size)
loss = loss_module(tensordict)

Advantage computation

from torchrl.objectives.value.functional import vec_td_lambda_return_estimate
advantage = vec_td_lambda_return_estimate(gamma, lmbda, next_state_value, reward, done, terminated)
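
The recursion behind a TD(λ) return can be sketched in plain Python as follows. This is an illustrative, unvectorized version under simplifying assumptions: it handles one trajectory, uses a single `dones` flag per step instead of separate `done` and `terminated` signals, and is not the library's implementation.

```python
def td_lambda_returns(gamma, lmbda, next_state_values, rewards, dones):
    """TD(lambda) return for each step of a single trajectory."""
    returns = [0.0] * len(rewards)
    # Beyond the last step, bootstrap from the last next-state value.
    next_return = next_state_values[-1]
    for t in reversed(range(len(rewards))):
        if dones[t]:
            # No bootstrapping past a terminal step.
            returns[t] = rewards[t]
        else:
            # Blend the one-step bootstrap with the longer lambda-return.
            blended = (1 - lmbda) * next_state_values[t] + lmbda * next_return
            returns[t] = rewards[t] + gamma * blended
        next_return = returns[t]
    return returns


returns = td_lambda_returns(
    gamma=0.5,
    lmbda=0.5,
    next_state_values=[2.0, 2.0, 2.0],
    rewards=[1.0, 1.0, 1.0],
    dones=[False, False, True],
)
print(returns)
```

Subtracting the state value from such a return gives an advantage estimate. Setting `lmbda=0` recovers the one-step TD target, and `lmbda=1` recovers the discounted sum of rewards up to the bootstrap.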

a generic trainer class⁽¹⁾ that executes the aforementioned training loop. Through a hooking mechanism, it also supports any logging or data transformation operation at any given time.
various recipes to build models that correspond to the environment being deployed.

If you feel a feature is missing from the library, please submit an issue! If you would like to contribute to new features, check our call for contributions and our contribution page.

Examples, tutorials and demos

A series of examples are provided with an illustrative purpose:

and many more to come!

Check the examples directory for more details about handling the various configuration settings.

We also provide tutorials and demos that give a sense of what the library can do.

Citation

If you're using TorchRL, please refer to this BibTeX entry to cite this work:

@misc{bou2023torchrl,
      title={TorchRL: A data-driven decision-making library for PyTorch}, 
      author={Albert Bou and Matteo Bettini and Sebastian Dittert and Vikash Kumar and Shagun Sodhani and Xiaomeng Yang and Gianni De Fabritiis and Vincent Moens},
      year={2023},
      eprint={2306.00577},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Installation

Create a conda environment where the packages will be installed.

conda create --name torch_rl python=3.9
conda activate torch_rl

PyTorch

Depending on the use of functorch that you want to make, you may want to install the latest (nightly) PyTorch release or the latest stable version of PyTorch. See here for a detailed list of commands, including pip3 or other special installation instructions.

TorchRL

You can install the latest stable release by using

pip3 install torchrl

This should work on Linux, Windows 10 and macOS (Intel or Apple Silicon chips). On certain Windows machines (Windows 11), one should install the library locally (see below).

The nightly build can be installed via

pip3 install torchrl-nightly

which we currently only ship for Linux and macOS (Intel) machines. Importantly, the nightly builds require the nightly builds of PyTorch too.

To install extra dependencies, call

pip3 install "torchrl[atari,dm_control,gym_continuous,rendering,tests,utils,marl,checkpointing]"

or a subset of these.

One may also desire to install the library locally. Three main reasons can motivate this:

the nightly/stable release isn't available for one's platform (e.g., Windows 11, nightlies for Apple Silicon, etc.);
contributing to the code;
installing torchrl with a previous version of PyTorch (any version >= 2.0). Note that this should also be doable via a regular install followed by a downgrade to a previous PyTorch version, but the C++ binaries will not be available, so some features, such as prioritized replay buffers, will not work.

To install the library locally, start by cloning the repo:

git clone https://github.com/pytorch/rl

and don't forget to check out the branch or tag you want to use for the build:

git checkout v0.4.0

Go to the directory where you have cloned the torchrl repo and install it (after installing ninja)

cd /path/to/torchrl/
pip3 install ninja -U
python setup.py develop

One can also build the wheels to distribute to co-workers using

python setup.py bdist_wheel

Your wheels will be stored in ./dist/torchrl<name>.whl and can be installed via

pip install torchrl<name>.whl

Warning: Unfortunately, pip3 install -e . does not currently work. Contributions to help fix this are welcome!

On M1 machines, this should work out-of-the-box with the nightly build of PyTorch. If building this artifact on an M1 Mac fails, or if the message (mach-o file, but is an incompatible architecture (have 'x86_64', need 'arm64e')) appears at execution time, try

ARCHFLAGS="-arch arm64" python setup.py develop

To run a quick sanity check, leave that directory (e.g. by executing cd ~/) and try to import the library.

python -c "import torchrl"

This should not return any warning or error.
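
If the import behaves unexpectedly, it can help to see which copy of the package Python would pick up from the current directory. The sketch below uses only the standard library and does not import the package itself. `package_origin` is a hypothetical helper name introduced for this example.

```python
import importlib.util


def package_origin(name):
    """Return the file a top-level package would be loaded from, or None."""
    spec = importlib.util.find_spec(name)
    return None if spec is None else spec.origin


origin = package_origin("torchrl")
if origin is None:
    print("torchrl was not found on the import path")
else:
    print(f"torchrl would be imported from {origin}")
```

A path that points inside the cloned repository rather than the installed location suggests that the working directory is shadowing the installed package.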

Optional dependencies

The following libraries can be installed depending on the usage one wants to make of torchrl:

# diverse
pip3 install tqdm tensorboard "hydra-core>=1.1" hydra-submitit-launcher

# rendering
pip3 install moviepy

# deepmind control suite
pip3 install dm_control

# gym, atari games
pip3 install "gym[atari]" "gym[accept-rom-license]" pygame

# tests
pip3 install pytest pyyaml pytest-instafail

# tensorboard
pip3 install tensorboard

# wandb
pip3 install wandb

Troubleshooting

If a ModuleNotFoundError: No module named 'torchrl._torchrl' error occurs (or a warning indicating that the C++ binaries could not be loaded), it means that the C++ extensions were not installed or not found.

One common reason might be that you are trying to import torchrl from within the git repo location. The following code snippet should return an error if torchrl has not been installed in develop mode:
```
cd ~/path/to/rl/repo
python -c 'from torchrl.envs.libs.gym import GymEnv'
```
If this is the case, consider executing torchrl from another location.
If you're not importing torchrl from within its repo location, it could be caused by a problem during the local installation. Check the log after the python setup.py develop. One common cause is a g++/C++ version discrepancy and/or a problem with the ninja library.
If the problem persists, feel free to open an issue on the topic in the repo; we'll do our best to help!
On macOS, we recommend installing Xcode first. With Apple Silicon M1 chips, make sure you are using the arm64-built python (e.g. here). Running the following lines of code
```
wget https://raw.githubusercontent.com/pytorch/pytorch/master/torch/utils/collect_env.py
python collect_env.py
```
should display
```
OS: macOS *** (arm64)
```
and not
```
OS: macOS **** (x86_64)
```

Versioning issues can cause error messages such as undefined symbol. For these, refer to the versioning issues document for a complete explanation and proposed workarounds.

Asking a question

If you spot a bug in the library, please raise an issue in this repo.

If you have a more generic question regarding RL in PyTorch, post it on the PyTorch forum.

Contributing

Internal collaborations to torchrl are welcome! Feel free to fork, submit issues and PRs. You can check out the detailed contribution guide here. As mentioned above, a list of open contributions can be found here.

Contributors are encouraged to install pre-commit hooks (using pre-commit install). pre-commit will check for linting-related issues when the code is committed locally. You can disable the check by appending -n to your commit command: git commit -m <commit message> -n

Disclaimer

This library is released as a PyTorch beta feature. BC-breaking changes are likely to happen but they will be introduced with a deprecation warranty after a few release cycles.

License

TorchRL is licensed under the MIT License. See LICENSE for details.

Project details

Release history

2024.11.19

Nov 19, 2024

2024.11.18

Nov 18, 2024

2024.11.17

Nov 17, 2024

2024.11.16

Nov 16, 2024

2024.11.15

Nov 15, 2024

2024.11.14

Nov 14, 2024

2024.11.13

Nov 13, 2024

2024.11.12

Nov 12, 2024

2024.11.11

Nov 11, 2024

2024.11.10

Nov 10, 2024

2024.11.9

Nov 9, 2024

2024.11.8

Nov 8, 2024

2024.11.7

Nov 7, 2024

2024.11.6

Nov 6, 2024

2024.11.5

Nov 5, 2024

2024.11.4

Nov 4, 2024

2024.11.3

Nov 3, 2024

2024.11.2

Nov 2, 2024

2024.11.1

Nov 1, 2024

2024.10.31

Oct 31, 2024

2024.10.30

Oct 30, 2024

2024.10.29

Oct 29, 2024

2024.10.28

Oct 28, 2024

2024.10.27

Oct 27, 2024

2024.10.26

Oct 26, 2024

2024.10.25

Oct 25, 2024

2024.10.24

Oct 24, 2024

2024.10.23

Oct 23, 2024

2024.10.22

Oct 22, 2024

2024.10.21

Oct 21, 2024

2024.10.20

Oct 20, 2024

2024.10.19

Oct 19, 2024

2024.10.18

Oct 18, 2024

2024.10.17

Oct 17, 2024

2024.10.16

Oct 16, 2024

2024.10.15

Oct 15, 2024

2024.10.14

Oct 14, 2024

2024.10.13

Oct 13, 2024

2024.10.12

Oct 12, 2024

2024.10.11

Oct 11, 2024

2024.10.10

Oct 10, 2024

2024.10.9

Oct 9, 2024

2024.10.8

Oct 8, 2024

2024.10.7

Oct 7, 2024

2024.10.6

Oct 6, 2024

2024.10.5

Oct 5, 2024

2024.10.4

Oct 4, 2024

2024.10.3

Oct 3, 2024

2024.10.2

Oct 2, 2024

2024.10.1

Oct 1, 2024

2024.9.30

Sep 30, 2024

2024.9.29

Sep 29, 2024

2024.9.28

Sep 28, 2024

2024.9.27

Sep 27, 2024

2024.9.26

Sep 26, 2024

2024.9.25

Sep 25, 2024

2024.9.24

Sep 24, 2024

2024.9.23

Sep 23, 2024

2024.9.22

Sep 22, 2024

2024.9.21

Sep 21, 2024

2024.9.20

Sep 20, 2024

2024.9.19

Sep 19, 2024

2024.9.16

Sep 16, 2024

2024.9.15

Sep 15, 2024

2024.9.14

Sep 14, 2024

2024.9.13

Sep 13, 2024

2024.9.12

Sep 12, 2024

2024.9.11

Sep 11, 2024

2024.9.10

Sep 10, 2024

2024.9.9

Sep 9, 2024

2024.9.8

Sep 8, 2024

2024.9.7

Sep 7, 2024

2024.9.6

Sep 6, 2024

2024.9.5

Sep 5, 2024

2024.9.4

Sep 4, 2024

2024.9.3

Sep 3, 2024

2024.9.2

Sep 2, 2024

2024.9.1

Sep 1, 2024

2024.8.31

Aug 31, 2024

This version

2024.8.30

Aug 30, 2024

2024.8.29

Aug 29, 2024

2024.8.28

Aug 28, 2024

2024.8.27

Aug 27, 2024

2024.8.26

Aug 26, 2024

2024.8.25

Aug 25, 2024

2024.8.24

Aug 24, 2024

2024.8.23

Aug 23, 2024

2024.8.22

Aug 22, 2024

2024.8.21

Aug 21, 2024

2024.8.20

Aug 20, 2024

2024.8.19

Aug 19, 2024

2024.8.18

Aug 18, 2024

2024.8.17

Aug 17, 2024

2024.8.16

Aug 16, 2024

2024.8.15

Aug 15, 2024

2024.8.14

Aug 14, 2024

2024.8.13

Aug 13, 2024

2024.8.12

Aug 12, 2024

2024.8.11

Aug 11, 2024

2024.8.10

Aug 10, 2024

2024.8.9

Aug 9, 2024

2024.8.8

Aug 8, 2024

2024.8.7

Aug 7, 2024

2024.8.6

Aug 6, 2024

2024.8.5

Aug 5, 2024

2024.8.4

Aug 4, 2024

2024.8.3

Aug 3, 2024

2024.8.2

Aug 2, 2024

2024.8.1

Aug 1, 2024

2024.7.31

Jul 31, 2024

2024.7.30

Jul 30, 2024

2024.7.29

Jul 29, 2024

2024.7.28

Jul 28, 2024

2024.7.27

Jul 27, 2024

2024.7.26

Jul 26, 2024

2024.7.25

Jul 25, 2024

2024.7.24

Jul 24, 2024

2024.7.23

Jul 23, 2024

2024.7.22

Jul 22, 2024

2024.7.21

Jul 21, 2024

2024.7.20

Jul 20, 2024

2024.7.19

Jul 19, 2024

2024.7.18

Jul 18, 2024

2024.7.17

Jul 17, 2024

2024.7.16

Jul 16, 2024

2024.7.15

Jul 15, 2024

2024.7.14

Jul 14, 2024

2024.7.13

Jul 13, 2024

2024.7.12

Jul 12, 2024

2024.7.11

Jul 11, 2024

2024.7.10

Jul 10, 2024

2024.7.9

Jul 9, 2024

2024.7.3

Jul 3, 2024

2024.6.28

Jun 28, 2024

2024.6.27

Jun 27, 2024

2024.6.26

Jun 26, 2024

2024.6.25

Jun 25, 2024

2024.6.24

Jun 24, 2024

2024.6.23

Jun 23, 2024

2024.6.22

Jun 22, 2024

2024.6.21

Jun 21, 2024

2024.6.20

Jun 20, 2024

2024.6.19

Jun 19, 2024

2024.6.18

Jun 18, 2024

2024.6.17

Jun 17, 2024

2024.6.16

Jun 16, 2024

2024.6.15

Jun 15, 2024

2024.6.14

Jun 14, 2024

2024.6.13

Jun 13, 2024

2024.6.12

Jun 12, 2024

2024.6.11

Jun 11, 2024

2024.6.10

Jun 10, 2024

2024.6.9

Jun 9, 2024

2024.6.3

Jun 3, 2024

2024.6.2

Jun 2, 2024

2024.6.1

Jun 1, 2024

2024.5.31

May 31, 2024

2024.5.30

May 30, 2024

2024.5.29

May 29, 2024

2024.5.28

May 28, 2024

2024.5.27

May 27, 2024

2024.5.26

May 26, 2024

2024.5.25

May 25, 2024

2024.5.24

May 24, 2024

2024.5.23

May 23, 2024

2024.5.22

May 22, 2024

2024.5.21

May 21, 2024

2024.5.20

May 20, 2024

2024.5.19

May 19, 2024

2024.5.18

May 18, 2024

2024.5.17

May 17, 2024

2024.5.16

May 16, 2024

2024.5.15

May 15, 2024

2024.5.14

May 14, 2024

2024.5.13

May 13, 2024

2024.5.12

May 12, 2024

2024.5.11

May 11, 2024

2024.5.10

May 10, 2024

2024.5.9

May 9, 2024

2024.5.8

May 8, 2024

2024.5.7

May 7, 2024

2024.5.6

May 6, 2024

2024.5.5

May 5, 2024

2024.5.4

May 4, 2024

2024.5.3

May 3, 2024

2024.5.2

May 2, 2024

2024.5.1

May 1, 2024

2024.4.30

Apr 30, 2024

2024.4.29

Apr 29, 2024

2024.4.28

Apr 28, 2024

2024.4.27

Apr 27, 2024

2024.4.26

Apr 26, 2024

2024.4.25

Apr 25, 2024

2024.4.24

Apr 24, 2024

2024.4.3

Apr 3, 2024

2024.4.2

Apr 2, 2024

2024.4.1

Apr 1, 2024

2024.3.31

Mar 31, 2024

2024.3.30

Mar 30, 2024

2024.3.29

Mar 29, 2024

2024.3.27

Mar 27, 2024

2024.3.26

Mar 26, 2024

2024.3.25

Mar 25, 2024

2024.3.24

Mar 24, 2024

2024.3.23

Mar 23, 2024

2024.3.22

Mar 22, 2024

2024.3.21

Mar 21, 2024

2024.3.20

Mar 20, 2024

2024.3.19

Mar 19, 2024

2024.3.18

Mar 18, 2024

2024.3.17

Mar 17, 2024

2024.3.16

Mar 16, 2024

2024.3.15

Mar 15, 2024

2024.3.14

Mar 14, 2024

2024.3.13

Mar 13, 2024

2024.3.12

Mar 12, 2024

2024.3.11

Mar 11, 2024

2024.3.10

Mar 10, 2024

2024.3.9

Mar 9, 2024

2024.3.8

Mar 8, 2024

2024.3.7

Mar 7, 2024

2024.3.6

Mar 6, 2024

2024.3.5

Mar 5, 2024

2024.3.4

Mar 4, 2024

2024.3.3

Mar 3, 2024

2024.3.2

Mar 2, 2024

2024.3.1

Mar 1, 2024

2024.2.29

Feb 29, 2024

2024.2.28

Feb 28, 2024

2024.2.27

Feb 27, 2024

2024.2.26

Feb 26, 2024

2024.2.25

Feb 25, 2024

2024.2.24

Feb 24, 2024

2024.2.23

Feb 23, 2024

2024.2.22

Feb 22, 2024

2024.2.21

Feb 21, 2024

2024.2.20

Feb 20, 2024

2024.2.19

Feb 19, 2024

2024.2.18

Feb 18, 2024

2024.2.17

Feb 17, 2024

2024.2.16

Feb 16, 2024

2024.2.15

Feb 15, 2024

2024.2.14

Feb 14, 2024

2024.2.13

Feb 13, 2024

2024.2.12

Feb 12, 2024

2024.2.11

Feb 11, 2024

2024.2.10

Feb 10, 2024

2024.2.9

Feb 9, 2024

2024.2.8

Feb 8, 2024

2024.2.7

Feb 7, 2024

2024.2.6

Feb 6, 2024

2024.2.5

Feb 5, 2024

2024.2.4

Feb 4, 2024

2024.2.3

Feb 3, 2024

2024.2.2

Feb 2, 2024

2024.2.1

Feb 1, 2024

2024.1.31

Jan 31, 2024

2024.1.30

Jan 30, 2024

2024.1.29

Jan 29, 2024

2024.1.28

Jan 28, 2024

2024.1.27

Jan 27, 2024

2024.1.25

Jan 25, 2024

2024.1.24

Jan 24, 2024

2024.1.23

Jan 23, 2024

2024.1.22

Jan 22, 2024

2024.1.21

Jan 21, 2024

2024.1.20

Jan 20, 2024

2024.1.19

Jan 19, 2024

2024.1.18

Jan 18, 2024

2024.1.17

Jan 17, 2024

2024.1.16

Jan 16, 2024

2024.1.15

Jan 15, 2024

2024.1.14

Jan 14, 2024

2024.1.13

Jan 13, 2024

2024.1.12

Jan 12, 2024

2024.1.11

Jan 11, 2024

2024.1.10

Jan 10, 2024

2024.1.9

Jan 9, 2024

2024.1.8

Jan 8, 2024

2024.1.7

Jan 7, 2024

2024.1.6

Jan 6, 2024

2024.1.5

Jan 5, 2024

2024.1.4

Jan 4, 2024

2024.1.3

Jan 3, 2024

2024.1.2

Jan 2, 2024

2024.1.1

Jan 1, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See tutorial on generating distribution archives.

Built Distributions

torchrl_nightly-2024.8.30-cp312-cp312-win_amd64.whl (976.4 kB view details)

Uploaded Aug 30, 2024 CPython 3.12 Windows x86-64

torchrl_nightly-2024.8.30-cp312-cp312-manylinux1_x86_64.whl (8.3 MB view details)

Uploaded Aug 30, 2024 CPython 3.12

torchrl_nightly-2024.8.30-cp311-cp311-win_amd64.whl (976.4 kB view details)

Uploaded Aug 30, 2024 CPython 3.11 Windows x86-64

torchrl_nightly-2024.8.30-cp311-cp311-manylinux1_x86_64.whl (8.3 MB view details)

Uploaded Aug 30, 2024 CPython 3.11

torchrl_nightly-2024.8.30-cp310-cp310-win_amd64.whl (977.7 kB view details)

Uploaded Aug 30, 2024 CPython 3.10 Windows x86-64

torchrl_nightly-2024.8.30-cp310-cp310-manylinux1_x86_64.whl (8.3 MB view details)

Uploaded Aug 30, 2024 CPython 3.10

torchrl_nightly-2024.8.30-cp39-cp39-win_amd64.whl (975.3 kB view details)

Uploaded Aug 30, 2024 CPython 3.9 Windows x86-64

torchrl_nightly-2024.8.30-cp39-cp39-manylinux1_x86_64.whl (8.3 MB view details)

Uploaded Aug 30, 2024 CPython 3.9

torchrl_nightly-2024.8.30-cp38-cp38-win_amd64.whl (977.4 kB view details)

Uploaded Aug 30, 2024 CPython 3.8 Windows x86-64

torchrl_nightly-2024.8.30-cp38-cp38-manylinux1_x86_64.whl (8.3 MB view details)

Uploaded Aug 30, 2024 CPython 3.8

File details

Details for the file torchrl_nightly-2024.8.30-cp312-cp312-win_amd64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp312-cp312-win_amd64.whl
Upload date: Aug 30, 2024
Size: 976.4 kB
Tags: CPython 3.12, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for torchrl_nightly-2024.8.30-cp312-cp312-win_amd64.whl
Algorithm	Hash digest
SHA256	`f6492ba23491c016a1de0ef1599f8cf68ff10258cf6b0d1749b50b5ee7f2bb7b`
MD5	`3abfb91f8d0fbff50adb75fa18a6ce1f`
BLAKE2b-256	`2f9bf16d0471bc84baa682d07ee4f2a2e5bbc0dfafed133a0225766e19850360`

See more details on using hashes here.

File details

Details for the file torchrl_nightly-2024.8.30-cp312-cp312-manylinux1_x86_64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp312-cp312-manylinux1_x86_64.whl
Upload date: Aug 30, 2024
Size: 8.3 MB
Tags: CPython 3.12
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.0

File hashes

Hashes for torchrl_nightly-2024.8.30-cp312-cp312-manylinux1_x86_64.whl
Algorithm	Hash digest
SHA256	`2d8ca7591bc5d745034e85553537c25e7a76491f4f44e82b61cb6ce45ec0561a`
MD5	`e549bbb3132963784b8daf97aa65c7c4`
BLAKE2b-256	`54da0758607565c929564fb4f8cfcc5ce604af2614b30e5183460c11d36d3ab6`

File details

Details for the file torchrl_nightly-2024.8.30-cp311-cp311-win_amd64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp311-cp311-win_amd64.whl
Upload date: Aug 30, 2024
Size: 976.4 kB
Tags: CPython 3.11, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for torchrl_nightly-2024.8.30-cp311-cp311-win_amd64.whl
Algorithm	Hash digest
SHA256	`c8d25745e0546228f933024dbd739c6905a99aef92dabfef8b3fabcb7a4523b5`
MD5	`c851b22f1ccacff1e2b92bf364930cbc`
BLAKE2b-256	`41b93b6f71c78a06f4ca1e8a80186b301622c9ab4ff2fd0a5268667dc4a8b41e`

File details

Details for the file torchrl_nightly-2024.8.30-cp311-cp311-manylinux1_x86_64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp311-cp311-manylinux1_x86_64.whl
Upload date: Aug 30, 2024
Size: 8.3 MB
Tags: CPython 3.11
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.11.0

File hashes

Hashes for torchrl_nightly-2024.8.30-cp311-cp311-manylinux1_x86_64.whl
Algorithm	Hash digest
SHA256	`914bc2e7d3aac1998fab08a54b72e0b0778bca3d42db799ae5a484966b9ca707`
MD5	`98fd18dd3718d71ca8a4621327c11b77`
BLAKE2b-256	`7d44262fcc5a2fb9b1307106859386c64bb94649f20a9820ab26f27e55f571c9`

File details

Details for the file torchrl_nightly-2024.8.30-cp310-cp310-win_amd64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp310-cp310-win_amd64.whl
Upload date: Aug 30, 2024
Size: 977.7 kB
Tags: CPython 3.10, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for torchrl_nightly-2024.8.30-cp310-cp310-win_amd64.whl
Algorithm	Hash digest
SHA256	`3e9fdb9981eb88d3c42893b8e71d96094f64620a4770719243f0675d5e4aefbf`
MD5	`59787ce5797d60cb97cb35e2af005d8b`
BLAKE2b-256	`b70859192a5ed0b720cc656d1e94c767b100ec64483591d01e33e4adb5ed3e3f`

File details

Details for the file torchrl_nightly-2024.8.30-cp310-cp310-manylinux1_x86_64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp310-cp310-manylinux1_x86_64.whl
Upload date: Aug 30, 2024
Size: 8.3 MB
Tags: CPython 3.10
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.10.1

File hashes

Hashes for torchrl_nightly-2024.8.30-cp310-cp310-manylinux1_x86_64.whl
Algorithm	Hash digest
SHA256	`d2f565822affb94f8e35ec8876833f6682110bf999ab23a09b8da53086d365f6`
MD5	`87f868e9c29862198a9bb6dda15e6942`
BLAKE2b-256	`7228ebdab6c2170698c6ebc40f900babc9947aaf056ed95c15b1c2708ef10fbd`

File details

Details for the file torchrl_nightly-2024.8.30-cp39-cp39-win_amd64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp39-cp39-win_amd64.whl
Upload date: Aug 30, 2024
Size: 975.3 kB
Tags: CPython 3.9, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for torchrl_nightly-2024.8.30-cp39-cp39-win_amd64.whl
Algorithm	Hash digest
SHA256	`b312c71fc10422d3bd174ad9aebd972210d595398ccba06c5fd8492ad0d96d06`
MD5	`da6f29cdd7473cf79f164b0fa2c80bb2`
BLAKE2b-256	`70d53505254472576ba7fc64a1e164115ab09d3c89dea7d7b8a74e9dd4ff5585`

File details

Details for the file torchrl_nightly-2024.8.30-cp39-cp39-manylinux1_x86_64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp39-cp39-manylinux1_x86_64.whl
Upload date: Aug 30, 2024
Size: 8.3 MB
Tags: CPython 3.9
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.0

File hashes

Hashes for torchrl_nightly-2024.8.30-cp39-cp39-manylinux1_x86_64.whl
Algorithm	Hash digest
SHA256	`67e0052d441251990bf4108502c65ecc4368cb77cbc0eed9c41a1ccd8331297d`
MD5	`056718ef93d099e03d6ef05623c0f248`
BLAKE2b-256	`ed3bfecfddf1d8d942778881bebcdcda6f7ee73494ed5ae9280ce1f38ffd342a`

File details

Details for the file torchrl_nightly-2024.8.30-cp38-cp38-win_amd64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp38-cp38-win_amd64.whl
Upload date: Aug 30, 2024
Size: 977.4 kB
Tags: CPython 3.8, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.13

File hashes

Hashes for torchrl_nightly-2024.8.30-cp38-cp38-win_amd64.whl
Algorithm	Hash digest
SHA256	`6866edbccc1e4d8f6f02d9b12f95acb8c5f0f9e398117b86ba3160c32db67ce7`
MD5	`5c945502888a2cc09856e5c7b6b170eb`
BLAKE2b-256	`9c0e41b1e9eec712d71480278e79fe681960beac40a8d4a1d9e56dc5587094cd`

File details

Details for the file torchrl_nightly-2024.8.30-cp38-cp38-manylinux1_x86_64.whl.

File metadata

Download URL: torchrl_nightly-2024.8.30-cp38-cp38-manylinux1_x86_64.whl
Upload date: Aug 30, 2024
Size: 8.3 MB
Tags: CPython 3.8
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.8.1

File hashes

Hashes for torchrl_nightly-2024.8.30-cp38-cp38-manylinux1_x86_64.whl
Algorithm	Hash digest
SHA256	`92afdb64e51a8217fe4a37c7c0234c3c1ec74b9e7782796167bc71efc155f880`
MD5	`ee97f610f5243f73287c158d81794d07`
BLAKE2b-256	`35d7b0a744ef589d8086a30300e73ab08176c9d27bfd64bf702bc014128e930b`

torchrl-nightly 2024.8.30

Writing simplified and portable RL codebase with `TensorDict`