No project description provided
Project description
📖 TensorDict
TensorDict is a dictionary-like class that inherits properties from tensors, making it easy to work with collections of tensors in PyTorch. It provides a simple and intuitive way to manipulate and process tensors, allowing you to focus on building and training your models.
Key Features | Examples | Installation | Citation | License
Key Features
TensorDict makes your code-bases more readable, compact, modular and fast. It abstracts away tailored operations, making your code less error-prone as it takes care of dispatching the operation on the leaves for you.
The key features are:
- 🧮 Composability:
TensorDict
generalizestorch.Tensor
operations to collection of tensors. - ⚡️ Speed: asynchronous transfer to device, fast node-to-node communication through
consolidate
, compatible withtorch.compile
. - ✂️ Shape operations: Perform tensor-like operations on TensorDict instances, such as indexing, slicing or concatenation.
- 🌐 Distributed / multiprocessed capabilities: Easily distribute TensorDict instances across multiple workers, devices and machines.
- 💾 Serialization and memory-mapping
- λ Functional programming and compatibility with
torch.vmap
- 📦 Nesting: Nest TensorDict instances to create hierarchical structures.
- ⏰ Lazy preallocation: Preallocate memory for TensorDict instances without initializing the tensors.
- 📝 Specialized dataclass for torch.Tensor (
@tensorclass
)
Examples
This section presents a couple of stand-out applications of the library. Check our Getting Started guide for an overview of TensorDict's features!
Fast copy on device
TensorDict
optimizes transfers from/to device to make them safe and fast.
By default, data transfers will be made asynchronously and synchronizations will be called whenever needed.
# Fast and safe asynchronous copy to 'cuda'
td_cuda = TensorDict(**dict_of_tensor, device="cuda")
# Fast and safe asynchronous copy to 'cpu'
td_cpu = td_cuda.to("cpu")
# Force synchronous copy
td_cpu = td_cuda.to("cpu", non_blocking=False)
Coding an optimizer
For instance, using TensorDict
you can code the Adam optimizer as you would for a single torch.Tensor
and apply
that to a TensorDict
input as well. On cuda
, these operations will rely on fused kernels, making it very fast to
execute:
class Adam:
def __init__(self, weights: TensorDict, alpha: float=1e-3,
beta1: float=0.9, beta2: float=0.999,
eps: float = 1e-6):
# Lock for efficiency
weights = weights.lock_()
self.weights = weights
self.t = 0
self._mu = weights.data.clone()
self._sigma = weights.data.mul(0.0)
self.beta1 = beta1
self.beta2 = beta2
self.alpha = alpha
self.eps = eps
def step(self):
self._mu.mul_(self.beta1).add_(self.weights.grad, 1 - self.beta1)
self._sigma.mul_(self.beta2).add_(self.weights.grad.pow(2), 1 - self.beta2)
self.t += 1
mu = self._mu.div_(1-self.beta1**self.t)
sigma = self._sigma.div_(1 - self.beta2 ** self.t)
self.weights.data.add_(mu.div_(sigma.sqrt_().add_(self.eps)).mul_(-self.alpha))
Training a model
Using tensordict primitives, most supervised training loops can be rewritten in a generic way:
for i, data in enumerate(dataset):
# the model reads and writes tensordicts
data = model(data)
loss = loss_module(data)
loss.backward()
optimizer.step()
optimizer.zero_grad()
With this level of abstraction, one can recycle a training loop for highly heterogeneous task. Each individual step of the training loop (data collection and transform, model prediction, loss computation etc.) can be tailored to the use case at hand without impacting the others. For instance, the above example can be easily used across classification and segmentation tasks, among many others.
Installation
With Pip:
To install the latest stable version of tensordict, simply run
pip install tensordict
This will work with Python 3.7 and upward as well as PyTorch 1.12 and upward.
To enjoy the latest features, one can use
pip install tensordict-nightly
With Conda:
Install tensordict
from conda-forge
channel.
conda install -c conda-forge tensordict
Citation
If you're using TensorDict, please refer to this BibTeX entry to cite this work:
@misc{bou2023torchrl,
title={TorchRL: A data-driven decision-making library for PyTorch},
author={Albert Bou and Matteo Bettini and Sebastian Dittert and Vikash Kumar and Shagun Sodhani and Xiaomeng Yang and Gianni De Fabritiis and Vincent Moens},
year={2023},
eprint={2306.00577},
archivePrefix={arXiv},
primaryClass={cs.LG}
}
Disclaimer
TensorDict is at the beta-stage, meaning that there may be bc-breaking changes introduced, but they should come with a warranty. Hopefully these should not happen too often, as the current roadmap mostly involves adding new features and building compatibility with the broader PyTorch ecosystem.
License
TensorDict is licensed under the MIT License. See LICENSE for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
File details
Details for the file tensordict_nightly-2024.11.18-cp312-cp312-win_amd64.whl
.
File metadata
- Download URL: tensordict_nightly-2024.11.18-cp312-cp312-win_amd64.whl
- Upload date:
- Size: 360.3 kB
- Tags: CPython 3.12, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7438adf0ae8b5cf0306891429af9f16252854c957ca814c143dae7cde041343f |
|
MD5 | 458e8489365a532196b6e40933faccfd |
|
BLAKE2b-256 | 638bc35b9fe135a027b17371ec6c1ad557a302caba833789f89848a6ab27bdec |
File details
Details for the file tensordict_nightly-2024.11.18-cp311-cp311-win_amd64.whl
.
File metadata
- Download URL: tensordict_nightly-2024.11.18-cp311-cp311-win_amd64.whl
- Upload date:
- Size: 354.5 kB
- Tags: CPython 3.11, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7350b7f256abf66fa7ca080e59ddba6dfe968fe4e8fe0df25f92991d3d8194f0 |
|
MD5 | 9b0420c6e3c1ed5743362cc9fecd07da |
|
BLAKE2b-256 | 74dd17703adc473a517fdca86be1c046e2580988e90df94f963bcb89b2b80c45 |
File details
Details for the file tensordict_nightly-2024.11.18-cp310-cp310-win_amd64.whl
.
File metadata
- Download URL: tensordict_nightly-2024.11.18-cp310-cp310-win_amd64.whl
- Upload date:
- Size: 353.5 kB
- Tags: CPython 3.10, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ddca2d36f17f16614be6194e0575594791ed359cc4608e3bd11a9f7d1b0aa885 |
|
MD5 | 2c7275ce003c30855114bf5094005f5f |
|
BLAKE2b-256 | ce2adc112bbc28b040af348ebd5d2f820dd102092433af1946c8e00e082e3851 |
File details
Details for the file tensordict_nightly-2024.11.18-cp39-cp39-win_amd64.whl
.
File metadata
- Download URL: tensordict_nightly-2024.11.18-cp39-cp39-win_amd64.whl
- Upload date:
- Size: 353.4 kB
- Tags: CPython 3.9, Windows x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 16bc276df9edacabcc7c56c4059e302028a4611bb923174842ecbb807ea94867 |
|
MD5 | b26c0418f7dc7911cd441be73dd5e4c6 |
|
BLAKE2b-256 | c3c0eb3adb64ba06283826ba964f2bd8b3d60e4135cc4eea07d5bb772f81a4cf |