Robust PyTorch modules and research tooling for Memory Caching (arXiv:2602.24281)

These details have not been verified by PyPI

Project links

Project description

Memory Caching Reproduction (PyTorch)

Community reproduction of Memory Caching: RNNs with Growing Memory (arXiv:2602.24281, Feb 27, 2026).

Status

This repository is not an official release from the paper authors.
Current work targets mechanism-faithful implementation of the Memory Caching wrapper before paper-scale metric parity.
We do not currently claim exact reproduction of published numbers.

Current scope

Core MC wrapper with Residual / GRM / Soup / SSC.
Segmentation modes: constant and logarithmic.
Backends: linear, DLA, Titans, SWLA(c=2).
Smoke harness for train/eval across all backends.
Benchmark harnesses: NIAH, MQAR, LongBench scaffold, retrieval scaffold.
Benchmark scoring follows explicit task-aligned policies (exact_match, token_f1, rouge_l_f1) with per-row metric labels.
Optional JSONL dataset-file ingestion path for LongBench/retrieval runners.
Included sample dataset files: examples/longbench_subset.jsonl, examples/retrieval_subset.jsonl.
Deterministic tokenizer/data/train/eval pipeline with real checkpoint artifacts.
Artifact bundles: metrics + rows + csv + report + manifest.
Phase3 reports include trend, smoke-target dashboard, statistical summary, and artifact checksums.
Default benchmark adapters are rule-based compatibility adapters; benchmark scores from these adapters are harness checks, not model-quality evidence.
Deep-memory backends (DLA/Titans) and SWLA(c=2) are reference implementations and are not yet validated against paper-reported training dynamics or metrics.
Titans convention note: paper-recursion faithfulness claims require titans_update_convention="paper"; gradient_descent is provided as an explicit alternative convention.

Quickstart

uv flow (recommended):

uv sync --extra dev
./scripts/checks/no_large_artifacts.sh
./scripts/checks/phase2.sh
./scripts/checks/bench_smoke.sh
./scripts/checks/pipeline_smoke.sh
./scripts/checks/resume_consistency.sh
uv run python scripts/reports/release_gate_v1.py --mode repo --out outputs/reports/release_gate_repo_v1.json

pip editable flow:

python -m venv .venv
source .venv/bin/activate
python -m pip install --upgrade pip
python -m pip install -e ".[dev]"
mc list-variants

When to use:

Use uv for reproducible local development workflows in this repository.
Use pip install -e . / pip install -e ".[dev]" when integrating with an existing Python environment.

Torch/CUDA note:

CPU-only example:
- python -m pip install torch --index-url https://download.pytorch.org/whl/cpu
CUDA example (for CUDA 12.1 builds):
- python -m pip install torch --index-url https://download.pytorch.org/whl/cu121
Install a torch build that matches your local CUDA runtime/driver stack before CUDA workflows.

Install verification:

mc list-variants
mc smoke-eval --backend linear --device cpu --warmup-steps 1 --batch-size 1 --seq-len 8 --vocab-size 16 --d-model 8 --num-heads 2

Debug trace example:

uv run mc debug-layer --backend linear --aggregation grm --seq-len 8 --d-model 8 --num-heads 2 --out-json outputs/debug/debug_layer.json

Onboarding acceptance criteria:

A new contributor should be able to complete the one-hour onboarding path and open a docs-only PR without changing internal scripts.
See docs/CONTRIBUTOR_DRY_RUN.md for the dry-run record and expected outcomes.

Key docs

docs/reproduction_report.md
docs/CONTRIBUTOR_ONBOARDING.md
docs/CONTRIBUTOR_DRY_RUN.md
docs/CONTRIBUTING.md
docs/ARCHITECTURE.md
docs/ENV_COMPAT_MATRIX.md
docs/CLAIM_TO_EVIDENCE_MATRIX.md
docs/CLAIM_BOUNDARY.md
docs/RELEASE_GATE_CHECKLIST_V1.md
docs/PYPI_RELEASE_RUNBOOK.md
docs/CONSUMER_SUPPORT_MATRIX.md
docs/BACKEND_CAPABILITY_MATRIX.md
docs/PAPER_TO_CODE.md
docs/BACKEND_API_CONTRACT.md
docs/BACKEND_REPRODUCIBILITY.md
docs/BENCHMARK_COMMAND_MATRIX.md
docs/BENCHMARK_SWEEP_RUNBOOK.md
docs/BENCHMARK_EVAL_CONTRACT.md
docs/TRAINING_PARITY_TABLE.md
docs/TRAINING_PARITY_TABLE_FULL.md
docs/TRAINING_BOOTSTRAP.md
docs/PROGRESS_LEDGER.md

Notes

docs_tmp/ and outputs/ are intentionally gitignored.

Stable package boundary

The stable import surface is intentionally narrow:

memory_caching.MCConfig
memory_caching.MemoryCachingLayer
memory_caching.SegmentCache
memory_caching.LinearMemoryBackend
memory_caching.DLABackend
memory_caching.TitansBackend
memory_caching.SWLABackend

The CLI, smoke helpers, benchmark runners, report generators, and release scripts are repo tooling. They are useful for reproduction work, but they are not the stable public package contract.

Canonical terminology used in this repository:

engineering scaffold: code quality, reproducibility, packaging, and report-generation integrity
scientific evidence: model-backed artifacts with non-smoke targets and truthful manifests
paper parity: faithful reproduction of the paper's reported baselines, metrics, and missing comparison rows

scientific evidence is stricter than the engineering scaffold, but it is still not the same as paper parity.

For runtime use, prefer the explicit layer methods:

layer(x) for the normal forward path
layer.forward_with_cache(x) when you need cached segment checkpoints
layer.inspect(x) when you need per-token routing/debug rows

Backend claim boundary

linear is an unnormalized matrix-memory backend used as the wrapper's linear reference path. It should not be read as a full normalized linear-attention baseline.
dla, titans, and swla are mechanism-oriented reference implementations. They are useful for wrapper-faithfulness work, but they are not yet validated against paper-reported training dynamics or metric parity.
titans and swla currently use constant scalar coefficients where the paper presents time-indexed coefficients.
soup is only true state-space mixing for backends that implement state mixing. Non-mixable backends use an explicit output-mixture fallback when that compatibility path is enabled.

Release surfaces

Engineering gate:
- uv run python scripts/reports/release_gate_v1.py --mode repo --out outputs/reports/release_gate_repo_v1.json
Scientific gate:
- uv run python scripts/reports/release_gate_v1.py --mode scientific --out outputs/reports/release_gate_scientific_v1.json

The engineering gate covers repository integrity and public-package mechanics. The scientific gate remains stricter and blocks parity claims unless model-backed evidence and non-smoke targets are present.

What a green scientific gate still does not prove:

it does not prove full paper parity
it does not prove missing paper baselines such as Log-Linear++
it does not prove throughput parity or unpublished-author-internal equivalence

Public API stability

Stable runtime imports:

memory_caching.MCConfig
memory_caching.MemoryCachingLayer
memory_caching.SegmentCache
memory_caching.LinearMemoryBackend
memory_caching.DLABackend
memory_caching.TitansBackend
memory_caching.SWLABackend

Internal or repo-only surfaces:

memory_caching.smoke
benchmark adapters/runners
report-generation scripts
release-gate scripts

Install from PyPI / wheel / source

From source:

python -m pip install -e .

From source with dev extras:

python -m pip install -e ".[dev]"

From a built wheel:

python -m pip install dist/*.whl

Minimal examples

examples/minimal_layer.py
examples/inspect_layer.py

Both examples are part of the stable public package surface.

Remaining paper-parity blocker

Full paper parity is still blocked by missing paper baselines, most notably configs/train/log_linear_pp.placeholder.yaml.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Mar 6, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

memory_caching-0.1.0.tar.gz (53.2 kB view details)

Uploaded Mar 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

memory_caching-0.1.0-py3-none-any.whl (43.7 kB view details)

Uploaded Mar 6, 2026 Python 3

File details

Details for the file memory_caching-0.1.0.tar.gz.

File metadata

Download URL: memory_caching-0.1.0.tar.gz
Upload date: Mar 6, 2026
Size: 53.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for memory_caching-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`8f36ffbab738a6133752875002fae2b69f2c843235d0dad63927f1fb90c75210`
MD5	`6621b8c1573fe0373d5d306d7703a023`
BLAKE2b-256	`74cbaf7c235786ab3195d009dc874897e49c3cf649371c2403e3e5029f270b28`

See more details on using hashes here.

File details

Details for the file memory_caching-0.1.0-py3-none-any.whl.

File metadata

Download URL: memory_caching-0.1.0-py3-none-any.whl
Upload date: Mar 6, 2026
Size: 43.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for memory_caching-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8571e1336f4e75fb08a9565babe35fad1c2ffd632b57425fe871c4c56fd921e7`
MD5	`2069bc744b292e4835715678381751a3`
BLAKE2b-256	`eebb11f8be9b0cdffa657872ecbb1066a0f46d90ef682063ae3448387707115a`

See more details on using hashes here.

memory-caching 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Memory Caching Reproduction (PyTorch)

Status

Current scope

Quickstart

Key docs

Notes

Stable package boundary

Backend claim boundary

Release surfaces

Public API stability

Install from PyPI / wheel / source

Minimal examples

Remaining paper-parity blocker

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes