torchcache

Effortlessly cache PyTorch module outputs on-the-fly with torchcache.

The documentation is available at torchcache.readthedocs.io.

Features

  • Cache PyTorch module outputs either in-memory or persistently to disk.
  • Simple decorator-based interface for easy usage.
  • Uses an MRU (most-recently-used) cache to limit memory/disk usage.

Installation

pip install torchcache

Usage

Quickly cache the output of your PyTorch module with a single decorator:

import torch.nn as nn

from torchcache import torchcache

@torchcache()
class MyModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(10, 10)

    def forward(self, x):
        # This output will be cached
        return self.linear(x)
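
For illustration, a second call with the same input is then served from the cache instead of recomputing the forward pass (a minimal usage sketch of our own; the tensor shape is chosen to match the module above):

import torch

model = MyModule()
x = torch.rand(8, 10)
out = model(x)        # computed and cached
out_again = model(x)  # served from the in-memory cache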

Assumptions

To ensure seamless operation, torchcache assumes the following:

  • Your module is a subclass of nn.Module.
  • The module's forward method accepts any number of positional arguments with shapes (B, *), where B is the batch size and * represents any number of dimensions. All tensors should be on the same device and have the same dtype.
  • The forward method returns a single tensor of shape (B, *); see the example below.
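
For example, a module that takes several positional tensor inputs and still satisfies these assumptions could look like the following (a minimal sketch of our own, not taken from the library's examples):

import torch
import torch.nn as nn

from torchcache import torchcache

@torchcache()
class TwoInputModule(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(20, 10)

    def forward(self, x, y):
        # x and y both have shape (B, 10), share a device and dtype,
        # and the output is a single tensor of shape (B, 10).
        return self.linear(torch.cat([x, y], dim=-1))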

Use cases

A common use case is caching the outputs of frozen, pre-trained model backbones to accelerate training:

import torch
import torch.nn as nn
from torchcache import torchcache

@torchcache(persistent=True)
class MyBackbone(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(10, 10)
        self.eval()
        self.requires_grad_(False)

    def forward(self, x):
        # Cached to disk
        return self.linear(x)

class MyModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.backbone = MyBackbone()
        self.head = nn.Linear(10, 10)

    def forward(self, x):
        x = self.backbone(x)  # Cached output
        x = self.head(x)      # Not cached
        return x

model = MyModel()
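
With the persistent cache enabled, later epochs read the backbone features from disk instead of recomputing them. A toy training loop might look like this (a sketch of our own; the optimizer, loss, and data are placeholders):

import torch

model = MyModel()
optimizer = torch.optim.SGD(model.head.parameters(), lr=1e-2)
data = [torch.rand(32, 10) for _ in range(4)]  # toy dataset, reused every epoch

for epoch in range(3):
    for batch in data:
        out = model(batch)        # backbone output is cached after the first epoch
        loss = out.pow(2).mean()  # placeholder loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()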

How it works

torchcache emerged from the need to cache the projected output of a large vision backbone, as it was taking up the majority of the training time. However, as with any cache, I had to be careful with cache size management, memory usage, and cache invalidation.

Here's an overview of how torchcache addresses these challenges:

Automatic cache management

torchcache automatically manages the cache by hashing both:

  1. The decorated module (including its source code obtained through inspect.getsource) and its args/kwargs.
  2. The inputs provided to the module's forward method.

This hash serves as the cache key for the forward method's output per item in a batch. When our MRU (most-recently-used) cache fills up for the given session, the system continues running the forward method and simply discards the newest outputs instead of caching them. This MRU strategy streamlines cache invalidation, aligning with the iterative nature of neural network training, without requiring any auxiliary record-keeping.
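
As a rough illustration of the first ingredient (not torchcache's actual internals; the helper below is hypothetical), a module-level key can be derived by hashing the class source together with its constructor arguments:

import hashlib
import inspect

import torch.nn as nn

def module_cache_key(module: nn.Module, *args, **kwargs) -> str:
    # Hypothetical sketch: fold the module's source code and its args/kwargs
    # into a single digest, mirroring the description above.
    payload = inspect.getsource(type(module)) + repr(args) + repr(sorted(kwargs.items()))
    return hashlib.sha256(payload.encode()).hexdigest()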

⚠️ Warning: To avoid having to calculate the directory size on every forward pass, torchcache measures and limits the size of the persistent data created only for the given session. To prevent the persistent cache from growing indefinitely, you should periodically clear the cache directory. Note that if you let torchcache create a temporary directory, it will be automatically deleted when the session ends.
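
If you point torchcache at a cache directory that you manage yourself, a periodic cleanup can be as simple as removing and recreating that directory between training runs (a generic sketch; the path below is hypothetical):

import shutil
from pathlib import Path

cache_dir = Path("/tmp/my_torchcache_dir")  # hypothetical location used for the persistent cache
if cache_dir.exists():
    shutil.rmtree(cache_dir)                # drop entries accumulated in earlier sessions
cache_dir.mkdir(parents=True, exist_ok=True)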

Tensor hashing

Creating an effective hashing mechanism for torch tensors involved addressing several criteria:

  • Deterministic Hashing: Consistent inputs should invariably yield the same hash.
  • Speed: Given its execution on every forward pass—regardless of caching status—the hashing process needs to be rapid.
  • Size Constraints: Given the frequent use of mixed precision in backbone models, it was crucial to prevent overflow scenarios.
  • Batch Sensitivity: The cache shouldn't invalidate with every new batch due to fluctuating batch sizes or sequences.

torchcache achieves these via the steps outlined below:

  1. Coefficient Generation: A coefficient tensor is built from powers of 2 (i.e. [1, 2, 4, 8, ...]). After reaching 2^15 the sequence wraps around, to sidestep overflow situations, particularly when using mixed precision.
  2. Tensor Flattening & Subsampling: The input tensor is flattened and subsample_count elements (10000 by default) are subsampled from it, to avoid feeding the whole batch into the hash. The subsampling is deterministic, taking every (tensor.shape[0] // subsample_count)-th element.
  3. Hashing Process: The subsampled tensor is multiplied by the coefficient tensor, and the final hash is obtained by summing along the non-batch dimension, yielding one hash value per item in the batch (see the sketch below).
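
Putting these steps together, a simplified per-item version of the hash might look like the following (an illustrative sketch rather than torchcache's exact implementation; the flattening and stride details are assumptions):

import torch

def hash_tensor_sketch(x: torch.Tensor, subsample_count: int = 10000) -> torch.Tensor:
    # Keep the batch dimension so that every item receives its own hash value.
    flat = x.flatten(start_dim=1)                        # shape (B, N)
    stride = max(1, flat.shape[1] // subsample_count)    # deterministic subsampling
    subsampled = flat[:, ::stride]                       # roughly subsample_count elements per item
    # Coefficients: powers of 2 that wrap around after 2^15 to avoid overflow.
    exponents = torch.arange(subsampled.shape[1], device=x.device) % 16
    coefficients = torch.pow(2.0, exponents.float()).to(subsampled.dtype)
    # Multiply element-wise and sum along the non-batch dimension: shape (B,).
    return (subsampled * coefficients).sum(dim=-1)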

Environment variables

Customize torchcache logging behavior using the following environment variables:

  • TORCHCACHE_LOG_LEVEL - logging level, defaults to WARN
  • TORCHCACHE_LOG_FMT - logging format, defaults to [torchcache] - %(asctime)s - %(name)s - %(levelname)s - %(message)s
  • TORCHCACHE_LOG_DATEFMT - logging date format, defaults to %Y-%m-%d %H:%M:%S
  • TORCHCACHE_LOG_FILE - path to the log file, defaults to None. Opened in append mode.
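
These variables can be exported in the shell before launching your script. From Python, one option is to set them before importing torchcache, assuming the logger is configured when the package is first imported (the file path below is a placeholder):

import os

# Configure torchcache logging before the package is imported.
os.environ["TORCHCACHE_LOG_LEVEL"] = "DEBUG"
os.environ["TORCHCACHE_LOG_FILE"] = "torchcache.log"  # placeholder path, opened in append mode

from torchcache import torchcache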

Contribution

  1. Ensure you have Python installed.
  2. Install poetry.
  3. Run poetry install to set up dependencies.
  4. Run poetry run pre-commit install to install pre-commit hooks.
  5. Create a branch, make your changes, and open a pull request.
