Skip to main content

No project description provided

Project description

Docs - GitHub.io Discord Shield Benchmarks Python version GitHub license pypi version pypi nightly version Downloads Downloads codecov circleci Conda - Platform Conda (channel only)

📖 TensorDict

TensorDict is a dictionary-like class that inherits properties from tensors, making it easy to work with collections of tensors in PyTorch. It provides a simple and intuitive way to manipulate and process tensors, allowing you to focus on building and training your models.

Key Features | Examples | Installation | Citation | License

Key Features

TensorDict makes your code-bases more readable, compact, modular and fast. It abstracts away tailored operations, making your code less error-prone as it takes care of dispatching the operation on the leaves for you.

The key features are:

  • 🧮 Composability: TensorDict generalizes torch.Tensor operations to collection of tensors.
  • ⚡️ Speed: asynchronous transfer to device, fast node-to-node communication through consolidate, compatible with torch.compile.
  • ✂️ Shape operations: Perform tensor-like operations on TensorDict instances, such as indexing, slicing or concatenation.
  • 🌐 Distributed / multiprocessed capabilities: Easily distribute TensorDict instances across multiple workers, devices and machines.
  • 💾 Serialization and memory-mapping
  • λ Functional programming and compatibility with torch.vmap
  • 📦 Nesting: Nest TensorDict instances to create hierarchical structures.
  • Lazy preallocation: Preallocate memory for TensorDict instances without initializing the tensors.
  • 📝 Specialized dataclass for torch.Tensor (@tensorclass)

tensordict.png

Examples

This section presents a couple of stand-out applications of the library. Check our Getting Started guide for an overview of TensorDict's features!

Fast copy on device

TensorDict optimizes transfers from/to device to make them safe and fast. By default, data transfers will be made asynchronously and synchronizations will be called whenever needed.

# Fast and safe asynchronous copy to 'cuda'
td_cuda = TensorDict(**dict_of_tensor, device="cuda")
# Fast and safe asynchronous copy to 'cpu'
td_cpu = td_cuda.to("cpu")
# Force synchronous copy
td_cpu = td_cuda.to("cpu", non_blocking=False)

Coding an optimizer

For instance, using TensorDict you can code the Adam optimizer as you would for a single torch.Tensor and apply that to a TensorDict input as well. On cuda, these operations will rely on fused kernels, making it very fast to execute:

class Adam:
    def __init__(self, weights: TensorDict, alpha: float=1e-3,
                 beta1: float=0.9, beta2: float=0.999,
                 eps: float = 1e-6):
        # Lock for efficiency
        weights = weights.lock_()
        self.weights = weights
        self.t = 0

        self._mu = weights.data.clone()
        self._sigma = weights.data.mul(0.0)
        self.beta1 = beta1
        self.beta2 = beta2
        self.alpha = alpha
        self.eps = eps

    def step(self):
        self._mu.mul_(self.beta1).add_(self.weights.grad, 1 - self.beta1)
        self._sigma.mul_(self.beta2).add_(self.weights.grad.pow(2), 1 - self.beta2)
        self.t += 1
        mu = self._mu.div_(1-self.beta1**self.t)
        sigma = self._sigma.div_(1 - self.beta2 ** self.t)
        self.weights.data.add_(mu.div_(sigma.sqrt_().add_(self.eps)).mul_(-self.alpha))

Training a model

Using tensordict primitives, most supervised training loops can be rewritten in a generic way:

for i, data in enumerate(dataset):
    # the model reads and writes tensordicts
    data = model(data)
    loss = loss_module(data)
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()

With this level of abstraction, one can recycle a training loop for highly heterogeneous task. Each individual step of the training loop (data collection and transform, model prediction, loss computation etc.) can be tailored to the use case at hand without impacting the others. For instance, the above example can be easily used across classification and segmentation tasks, among many others.

Installation

With Pip:

To install the latest stable version of tensordict, simply run

pip install tensordict

This will work with Python 3.7 and upward as well as PyTorch 1.12 and upward.

To enjoy the latest features, one can use

pip install tensordict-nightly

With Conda:

Install tensordict from conda-forge channel.

conda install -c conda-forge tensordict

Citation

If you're using TensorDict, please refer to this BibTeX entry to cite this work:

@misc{bou2023torchrl,
      title={TorchRL: A data-driven decision-making library for PyTorch},
      author={Albert Bou and Matteo Bettini and Sebastian Dittert and Vikash Kumar and Shagun Sodhani and Xiaomeng Yang and Gianni De Fabritiis and Vincent Moens},
      year={2023},
      eprint={2306.00577},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Disclaimer

TensorDict is at the beta-stage, meaning that there may be bc-breaking changes introduced, but they should come with a warranty. Hopefully these should not happen too often, as the current roadmap mostly involves adding new features and building compatibility with the broader PyTorch ecosystem.

License

TensorDict is licensed under the MIT License. See LICENSE for details.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

tensordict_nightly-2024.9.30-cp312-cp312-win_amd64.whl (348.4 kB view details)

Uploaded CPython 3.12 Windows x86-64

tensordict_nightly-2024.9.30-cp311-cp311-win_amd64.whl (347.8 kB view details)

Uploaded CPython 3.11 Windows x86-64

tensordict_nightly-2024.9.30-cp310-cp310-win_amd64.whl (346.8 kB view details)

Uploaded CPython 3.10 Windows x86-64

tensordict_nightly-2024.9.30-cp39-cp39-win_amd64.whl (346.8 kB view details)

Uploaded CPython 3.9 Windows x86-64

File details

Details for the file tensordict_nightly-2024.9.30-cp312-cp312-win_amd64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.9.30-cp312-cp312-win_amd64.whl
Algorithm Hash digest
SHA256 94b1652d26be4b8cc2d5ed6f91a10233d90ec5f56a0b64ac3f585a9fb1faab94
MD5 e1a69e38f87f69e58a321c80e4ebd7c8
BLAKE2b-256 1333d278626bced5c7bd5225e5197c40e0f37e8183de891c4c6878a857cfa2f1

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.9.30-cp312-cp312-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.9.30-cp312-cp312-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 d2b5d632742987c9a785cac09bdead8c81ffc2a8518cb914b39900880d2cc870
MD5 72a91c30d8f5af348b35c0784d519594
BLAKE2b-256 523f6631ee24ca094c9cf704631f3b07b9188d70bdd7a28a9fd98178571408b0

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.9.30-cp311-cp311-win_amd64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.9.30-cp311-cp311-win_amd64.whl
Algorithm Hash digest
SHA256 87176a69f2854dcab7752de9f4a772ee7f2c29f8fa5f87658226201294cd9e9e
MD5 8c0ffd62166e255bcaabbaa9981dddaf
BLAKE2b-256 771d2d6d84c298d4edcb063788c07a0aa493f572347e374180d9aa06643303a1

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.9.30-cp311-cp311-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.9.30-cp311-cp311-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 d07f855487bf2c34e95a3c3a10f3588ab126da9cc665c3e274094ddc201a2bd0
MD5 ac92233887bdd6895cbba7579d79afc0
BLAKE2b-256 39f4720a363978852e6825f34db47c8e294461cad50487b9a5159c0fac94cfef

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.9.30-cp310-cp310-win_amd64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.9.30-cp310-cp310-win_amd64.whl
Algorithm Hash digest
SHA256 f84322db1d8c3d5017cccf908af6af054ee390ad7579deb2778c4312132ab95e
MD5 d68beb2a6d8f1f855d0125a02b30d004
BLAKE2b-256 c742c6a8140e5f1c73346d1bdd9765c3a5a25eb734c3eda1f7582f06b911877f

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.9.30-cp310-cp310-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.9.30-cp310-cp310-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 fcb1644e4db4b096ea76b5e32da2706646d2dcdff225956c24b90d9aab994c29
MD5 35508e7b39180235029ba581ddf628c4
BLAKE2b-256 e0b918221ef1b2955da122bddf4e72c497fd815ed452ab12ff45ba4f8852ed0c

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.9.30-cp39-cp39-win_amd64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.9.30-cp39-cp39-win_amd64.whl
Algorithm Hash digest
SHA256 ed371f98337e02b328e34a3a83e7ae29fff384ec14917b7d32fef8014b74ccd3
MD5 76f55779afc2ce1fed8f22f85724063f
BLAKE2b-256 1d7a05033fc0c0c75de5ff8a8f42459c3df53a9c6c4a7c4d762dd24fed347634

See more details on using hashes here.

File details

Details for the file tensordict_nightly-2024.9.30-cp39-cp39-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for tensordict_nightly-2024.9.30-cp39-cp39-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 fe780d6f95c8424d3e1d876cc93d9d8916ab8969f25c81cc3af98d4fef387fe7
MD5 c4bc58990544cfe2dec45bb6daa60b6a
BLAKE2b-256 6d39dfddfe0190ef11eb1c54e884f1eacb41dd5f121fb72bba73f41c3f49ba83

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page