
Unofficial Implementation of Titans: Learning to Memorize at Test Time

Python 3.10+ · MIT License

This is an unofficial PyTorch implementation of the paper "Titans: Learning to Memorize at Test Time" by Ali Behrouz, Peilin Zhong, and Vahab Mirrokni.

Overview

Titans is a novel neural architecture that combines attention-based short-term memory with a neural long-term memory module. The architecture addresses the limitations of both recurrent models (which compress data into fixed-size memory) and attention mechanisms (which have quadratic complexity).

Key Features

  • Neural Long-term Memory: A module that learns to memorize historical context
  • Persistent Memory: Learnable tokens that encode task-specific knowledge
  • Three Architectural Variants:
    • MAC (Memory as Context): Uses memory as context for attention
    • MAG (Memory as Gate): Combines memory with core branch using gating
    • MAL (Memory as Layer): Integrates memory as a separate layer

Installation

From PyPI (Recommended)

# Install core package (minimal dependencies)
pip install titans-unofficial

# Install with examples dependencies (for running example scripts)
pip install "titans-unofficial[examples]"

# Install with development dependencies (for contributing)
pip install "titans-unofficial[dev]"

From Source (Development)

# Clone the repository
git clone https://github.com/Shehryar718/titans-unofficial.git
cd titans-unofficial

# Install in development mode with all extras
pip install -e ".[dev,examples]"

Project Structure

titans-unofficial/
├── titans/
│   ├── __init__.py
│   ├── models/
│   │   ├── titans_base.py    # Base class for all variants
│   │   ├── titans_mac.py     # Memory as Context implementation
│   │   ├── titans_mag.py     # Memory as Gate implementation
│   │   └── titans_mal.py     # Memory as Layer implementation
│   └── utils/
│       ├── memory.py         # Neural Memory Module
│       ├── attention.py      # Attention mechanisms
│       └── persistent_memory.py  # Persistent Memory implementation
├── examples/
│   ├── text_classification.py  # Text classification example
│   ├── language_modeling.py    # Language modeling example
│   └── fine_tuning.py          # Fine-tuning example
├── pytests/
│   └── test_memory.py         # Tests for memory module
├── requirements.txt
├── LICENSE
└── README.md

Usage

Text Classification

from titans import TitansMAC, TitansMAG, TitansMAL
from examples.text_classification import TitansForClassification

# Initialize model
model = TitansForClassification(
    vocab_size=30000,
    d_model=128,
    n_layers=2,
    n_heads=4,
    num_classes=2,
    memory_depth=2,
    persistent_tokens=8,
    window_size=16,
    model_type="mal"  # Choose from: "mac", "mag", "mal"
)
# Train and evaluate (run from the command line):
#   python examples/text_classification.py

Language Modeling

from titans import TitansMAC, TitansMAG, TitansMAL
from examples.language_modeling import TitansForLanguageModeling

# Initialize model
model = TitansForLanguageModeling(
    vocab_size=30000,
    d_model=128,
    n_layers=2,
    n_heads=4,
    memory_depth=2,
    persistent_tokens=16,
    window_size=128,
    model_type="mac"
)
# Train and generate text (run from the command line):
#   python examples/language_modeling.py

Architecture Details

Neural Memory Module

The neural memory module consists of:

  • Key/Value/Query projections for memory access
  • Multi-layer perceptron for memory processing
  • Momentum-based update mechanism with configurable parameters
  • Weight decay for forgetting mechanism
  • Gradient scaling for numerical stability
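The momentum and weight-decay components listed above can be sketched in scalar form. This is only an illustration of the update rule from the paper (surprise `S_t = eta * S_{t-1} - theta * grad`, memory `M_t = (1 - alpha) * M_{t-1} + S_t`); the symbols, default values, and function name below are illustrative, not the package's actual API or hyperparameters:

```python
# Toy scalar sketch of the momentum-based memory update with weight decay.
# S_t = eta * S_{t-1} - theta * grad   (momentum over the "surprise" signal)
# M_t = (1 - alpha) * M_{t-1} + S_t    (alpha acts as a forgetting gate)

def memory_update(m_prev, s_prev, grad, eta=0.9, theta=0.1, alpha=0.01):
    """One step of the momentum + weight-decay memory update (toy scalar form)."""
    s_t = eta * s_prev - theta * grad      # momentum-smoothed surprise
    m_t = (1.0 - alpha) * m_prev + s_t     # weight decay gradually forgets old memory
    return m_t, s_t

m, s = 0.0, 0.0
for grad in [0.5, -0.2, 0.1]:              # stand-in per-step gradients of the memory loss
    m, s = memory_update(m, s, grad)
```

In the real module these updates are applied to the parameters of an MLP memory rather than a scalar, and the coefficients can be data-dependent.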

Variants

  1. MAC (Memory as Context)

    • Memory output serves as additional context
    • Efficient for tasks requiring long-range dependencies
    • Parallel processing with chunked computation
    • Configurable chunk size and parallel processing
  2. MAG (Memory as Gate)

    • Gating mechanism to combine memory with core processing
    • Adaptive balance between short and long-term memory
    • Enhanced numerical stability
    • Improved gradient flow through gating
  3. MAL (Memory as Layer)

    • Memory integrated as a separate layer
    • Direct memory access at each layer
    • Sliding window attention for efficiency
    • Layer-wise memory updates
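The MAG-style gated combination can be sketched as follows. The actual implementation operates on tensors and may parameterize the gate differently; this is a minimal list-based illustration of the idea, with `mag_combine` as a hypothetical name:

```python
import math

def mag_combine(attn_out, mem_out, gate_logit):
    """MAG-style combination: a sigmoid gate blends the short-term (attention)
    branch with the long-term (memory) branch elementwise."""
    g = 1.0 / (1.0 + math.exp(-gate_logit))   # gate value in (0, 1)
    return [g * a + (1.0 - g) * m for a, m in zip(attn_out, mem_out)]

# With a zero logit the gate is 0.5, i.e. an even blend of the two branches.
blended = mag_combine([1.0, 2.0], [3.0, 4.0], gate_logit=0.0)
```

Pushing the gate toward 1 favors the attention branch (short-term memory); pushing it toward 0 favors the neural memory branch (long-term memory).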

Example Tasks

The repository includes implementations for:

  • Text Classification (Binary and multi-class)
  • Language Modeling with test-time adaptation
  • Fine-tuning with early stopping

Each example demonstrates different aspects of the Titans architecture:

  • Memory reset between epochs for fresh adaptation
  • Efficient batch processing with dynamic batching
  • Gradient scaling for numerical stability
  • Early stopping and model checkpointing
  • Proper memory state management
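The memory-reset pattern mentioned above can be sketched with a toy stand-in object; `ToyMemory`, `update`, and `reset` are hypothetical names for illustration, not the package's real memory API:

```python
class ToyMemory:
    """Minimal stand-in for a memory state that is adapted at test time."""
    def __init__(self):
        self.state = 0.0

    def update(self, x):
        self.state += x

    def reset(self):
        self.state = 0.0

def run_epochs(memory, epochs):
    """Reset the memory before each epoch so adaptation starts fresh."""
    final_states = []
    for batch in epochs:
        memory.reset()              # fresh adaptation each epoch
        for x in batch:
            memory.update(x)
        final_states.append(memory.state)
    return final_states

states = run_epochs(ToyMemory(), [[1.0, 2.0], [3.0]])
```

Without the `reset()` call, state accumulated in one epoch would leak into the next, which is exactly what the examples guard against.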

Testing

# Run all tests
pytest pytests/

# Run specific test file
pytest pytests/test_memory.py

Citation

This repository provides an unofficial implementation of the Titans architecture.
If you reference this work, please cite the original paper:

@article{behrouz2024titans,
  title={Titans: Learning to Memorize at Test Time},
  author={Behrouz, Ali and Zhong, Peilin and Mirrokni, Vahab},
  journal={arXiv preprint arXiv:2501.00663},
  year={2024}
}

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.
