Skip to main content

No project description provided

Project description

Docs - GitHub.io Discord Shield Benchmarks Python version GitHub license pypi version pypi nightly version Downloads Downloads codecov circleci Conda - Platform Conda (channel only)

📖 TensorDict

TensorDict is a dictionary-like class that inherits properties from tensors, making it easy to work with collections of tensors in PyTorch. It provides a simple and intuitive way to manipulate and process tensors, allowing you to focus on building and training your models.

Key Features | Examples | Installation | Citation | License

Key Features

TensorDict makes your code-bases more readable, compact, modular and fast. It abstracts away tailored operations, making your code less error-prone as it takes care of dispatching the operation on the leaves for you.

The key features are:

  • 🧮 Composability: TensorDict generalizes torch.Tensor operations to collection of tensors.
  • ⚡️ Speed: asynchronous transfer to device, fast node-to-node communication through consolidate, compatible with torch.compile.
  • ✂️ Shape operations: Perform tensor-like operations on TensorDict instances, such as indexing, slicing or concatenation.
  • 🌐 Distributed / multiprocessed capabilities: Easily distribute TensorDict instances across multiple workers, devices and machines.
  • 💾 Serialization and memory-mapping
  • λ Functional programming and compatibility with torch.vmap
  • 📦 Nesting: Nest TensorDict instances to create hierarchical structures.
  • Lazy preallocation: Preallocate memory for TensorDict instances without initializing the tensors.
  • 📝 Specialized dataclass for torch.Tensor (@tensorclass)

tensordict.png

Examples

This section presents a couple of stand-out applications of the library. Check our Getting Started guide for an overview of TensorDict's features!

Fast copy on device

TensorDict optimizes transfers from/to device to make them safe and fast. By default, data transfers will be made asynchronously and synchronizations will be called whenever needed.

# Fast and safe asynchronous copy to 'cuda'
td_cuda = TensorDict(**dict_of_tensor, device="cuda")
# Fast and safe asynchronous copy to 'cpu'
td_cpu = td_cuda.to("cpu")
# Force synchronous copy
td_cpu = td_cuda.to("cpu", non_blocking=False)

Coding an optimizer

For instance, using TensorDict you can code the Adam optimizer as you would for a single torch.Tensor and apply that to a TensorDict input as well. On cuda, these operations will rely on fused kernels, making it very fast to execute:

class Adam:
    def __init__(self, weights: TensorDict, alpha: float=1e-3,
                 beta1: float=0.9, beta2: float=0.999,
                 eps: float = 1e-6):
        # Lock for efficiency
        weights = weights.lock_()
        self.weights = weights
        self.t = 0

        self._mu = weights.data.clone()
        self._sigma = weights.data.mul(0.0)
        self.beta1 = beta1
        self.beta2 = beta2
        self.alpha = alpha
        self.eps = eps

    def step(self):
        self._mu.mul_(self.beta1).add_(self.weights.grad, 1 - self.beta1)
        self._sigma.mul_(self.beta2).add_(self.weights.grad.pow(2), 1 - self.beta2)
        self.t += 1
        mu = self._mu.div_(1-self.beta1**self.t)
        sigma = self._sigma.div_(1 - self.beta2 ** self.t)
        self.weights.data.add_(mu.div_(sigma.sqrt_().add_(self.eps)).mul_(-self.alpha))

Training a model

Using tensordict primitives, most supervised training loops can be rewritten in a generic way:

for i, data in enumerate(dataset):
    # the model reads and writes tensordicts
    data = model(data)
    loss = loss_module(data)
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

With this level of abstraction, one can recycle a training loop for highly heterogeneous task. Each individual step of the training loop (data collection and transform, model prediction, loss computation etc.) can be tailored to the use case at hand without impacting the others. For instance, the above example can be easily used across classification and segmentation tasks, among many others.

Installation

With Pip:

To install the latest stable version of tensordict, simply run

pip install tensordict

This will work with Python 3.7 and upward as well as PyTorch 1.12 and upward.

To enjoy the latest features, one can use

pip install tensordict-nightly

With Conda:

Install tensordict from conda-forge channel.

conda install -c conda-forge tensordict

Citation

If you're using TensorDict, please refer to this BibTeX entry to cite this work:

@misc{bou2023torchrl,
      title={TorchRL: A data-driven decision-making library for PyTorch},
      author={Albert Bou and Matteo Bettini and Sebastian Dittert and Vikash Kumar and Shagun Sodhani and Xiaomeng Yang and Gianni De Fabritiis and Vincent Moens},
      year={2023},
      eprint={2306.00577},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Disclaimer

TensorDict is at the beta-stage, meaning that there may be bc-breaking changes introduced, but they should come with a warranty. Hopefully these should not happen too often, as the current roadmap mostly involves adding new features and building compatibility with the broader PyTorch ecosystem.

License

TensorDict is licensed under the MIT License. See LICENSE for details.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

tensordict_nightly-2024.10.2-cp312-cp312-win_amd64.whl (349.4 kB view details)

Uploaded CPython 3.12 Windows x86-64

tensordict_nightly-2024.10.2-cp311-cp311-win_amd64.whl (348.7 kB view details)

Uploaded CPython 3.11 Windows x86-64

tensordict_nightly-2024.10.2-cp310-cp310-win_amd64.whl (347.8 kB view details)

Uploaded CPython 3.10 Windows x86-64

tensordict_nightly-2024.10.2-cp39-cp39-win_amd64.whl (347.7 kB view details)

Uploaded CPython 3.9 Windows x86-64

File details

Details for the file tensordict_nightly-2024.10.2-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.10.2-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 a9dac69101a63a8e45334f63e5a7715533e77f85f1ea5fc03eddc96332ade448
MD5 f1d2469702744734a6f52a6aa5f7f717
BLAKE2b-256 78601adba298eb81baf3e67691cd1e71feadc9ab9a062eba960826a5006b609a

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.10.2-cp312-cp312-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.10.2-cp312-cp312-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 fef354499bd2c1043a83bbc397ab7133762e8a6486d5a17638e70e403b3bc82a
MD5 15364a40d9c60b005801f8d3b1963e1d
BLAKE2b-256 26a1776bda6ac2812e82509850e2b4162bf2718abdba78dec5697603d947c23a

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.10.2-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.10.2-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 5000778184ea6478737ea4b4bd909a062a8c6bc3968eb7c4c38c67ec45989ad6
MD5 8c8451deba97cecac4ce034a04de2b7c
BLAKE2b-256 179f9af0777c084ba7bb62f2f21a050b9c41136a99e46d4cde597f6a9d6ce09d

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.10.2-cp311-cp311-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.10.2-cp311-cp311-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 5e3250b4fb1bbd6662cfc6b986064a8193d2894049c4e72339253d9628905cb4
MD5 b4f8b3a41a9af2771fc345e4849adf2f
BLAKE2b-256 0f309b7e24363bbaf092d7a398f0f1d01b215ce60f8ad67d327f5c1f75f0a19a

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.10.2-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.10.2-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 28089845023659e6608e616b7acc8733c62d40341a6cceb82dbbddb746034f06
MD5 1d3c6e34aaec4e5b1dc15a0bc247038b
BLAKE2b-256 2304c7cbf87ee5c79f8aa090fed52680b1199adbc8f4ee0925ab8ad99060be75

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.10.2-cp310-cp310-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.10.2-cp310-cp310-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 aaf5ffbbbf1a9a0a1882eb090b73b536ec2bed20a6c801e524b0d160589db0cd
MD5 9936ccb93b907750d778addd05593a81
BLAKE2b-256 1b9855392f4d0f9b9b805cfe3526e949c22e739be4d2eae05bc8b183e9b8ef38

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.10.2-cp39-cp39-win_amd64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.10.2-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 fb9e6ef2e599a088a35275ee1eb6a910941d48d853e68ff5a884efbd4cc1affd
MD5 f0b5ff088a2083162055420cb10a84c1
BLAKE2b-256 a212d19a246b23b9d9bf73415ce92969064da15dd3257bb39d305e0bf89af92d

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.10.2-cp39-cp39-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.10.2-cp39-cp39-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 160bdbbcb919688675413a37f060784b7f2399ceb9afcd08ed056d4b03efa80c
MD5 9fe9b821ba59c785ecae41bc248a4980
BLAKE2b-256 8a08bfe6292bf56a16f2263dc2b6603590f9839b9888b0f74c8b5e0578109a50

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page