File-based persistent memory for AI agents. Zero dependencies.

Antaris Memory

Production-ready file-based persistent memory for AI agents. Zero dependencies (core).

Store, search, decay, and consolidate agent memories using only the Python standard library. Sharded storage for scalability, fast search indexes, automatic schema migration. No vector databases, no infrastructure, no API keys.

What's New in v1.0.0

BM25-inspired search — proper relevance ranking with IDF weighting. No more wall of 0.50 scores.
File locking — cross-platform os.mkdir()-based locks prevent concurrent writer data loss
Optimistic conflict detection — mtime/hash tracking catches stale read-modify-write patterns
78 tests — comprehensive coverage across search, locking, versioning, and all core features

See CHANGELOG.md for full version history.

What It Does

Sharded storage for production scalability (10,000+ memories, sub-second search)
Fast search indexes (full-text, tags, dates) stored as transparent JSON files
Automatic schema migration from single-file to sharded format with rollback
Multi-agent shared memory pools with namespace isolation and access controls
Retrieval weighted by recency × importance × access frequency (Ebbinghaus-inspired decay)
Input gating classifies incoming content by priority (P0–P3) and drops ephemeral noise at intake
Detects contradictions between stored memories using deterministic rule-based comparison
Runs fully offline — zero network calls, zero tokens, zero API keys

What It Doesn't Do

Not a vector database: no embeddings. Search uses BM25-inspired keyword matching on an inverted index, not semantic similarity. If you need "find memories similar in meaning," this isn't the right tool yet.
Not a knowledge graph — flat memory store with metadata indexing. No entity relationships or graph traversal.
Not semantic — contradiction detection compares normalized statements using explicit conflict rules (negation, numeric disagreement), not inference. It will not catch contradictions phrased differently.
Not LLM-dependent — all operations are deterministic. No model calls, no prompt engineering.
Not infinitely scalable — JSON file storage works well up to ~50,000 memories per workspace. Beyond that, you'll want a database. We're honest about this because we'd rather you succeed than discover limits in production.

Design Goals

Goal	Rationale
Deterministic	Same input → same output. No model variance.
Offline	No network, no API keys, no phoning home.
Minimal surface area	One class (`MemorySystem`), obvious method names.
No hidden processes	Consolidation and synthesis run only when called.
Transparent storage	Plain JSON files. Inspect with any text editor.

Install

pip install antaris-memory

Quick Start

from antaris_memory import MemorySystem

mem = MemorySystem("./workspace", half_life=7.0)
mem.load()  # Load existing state (no-op if first run)

# Store memories
mem.ingest("Decided to use PostgreSQL for the database.",
           source="meeting-notes", category="strategic")
mem.ingest("The API costs $500/month — too expensive.",
           source="review", category="operational")

# Search (BM25-inspired ranking; confidence varies with relevance)
for r in mem.search("database decision"):
    print(f"[{r.confidence:.2f}] {r.content}")
# Illustrative output (exact scores depend on what is stored):
# → [1.00] Decided to use PostgreSQL for the database.

# Detailed search with score explanations
for r in mem.search("database decision", explain=True):
    print(f"[{r.relevance:.2f}] {r.content[:60]}  ({r.explanation})")

# Temporal queries
mem.on_date("2026-02-14")
mem.narrative(topic="database migration")

# Selective deletion
mem.forget(entity="John Doe")       # GDPR-ready, with audit trail
mem.forget(before_date="2025-01-01")

# Consolidation (runs only when called, never in the background)
report = mem.consolidate()

mem.save()

Search Quality (v1.0)

Search uses BM25-inspired ranking with IDF weighting, field boosting, and length normalization. Scores are normalized to 0.0–1.0:

results = mem.search("PostgreSQL database migration", explain=True)

Score	Content	Why
1.00	PostgreSQL migration completed successfully.	Matched: postgresql, migration
0.88	Decided to use PostgreSQL for the database.	Matched: postgresql, database
0.33	Database backup strategy needs review.	Matched: database (single term, lower IDF)

Previous versions returned flat 0.50 confidence for all results. v1.0 differentiates by term rarity (IDF), exact phrase matching (1.5x boost), tag matches (1.2x), and decay weighting.
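
To make the ranking idea concrete, here is a minimal sketch of plain BM25 with IDF weighting and length normalization, using only the standard library. It is not the library's implementation: the function names, the `k1` and `b` defaults, and the "divide by the top score" normalization are assumptions for illustration, and the phrase boost, tag boost, and decay weighting described above are left out.

```python
import math
from collections import Counter

def tokenize(text):
    """Lowercase and strip basic punctuation (a deliberately naive tokenizer)."""
    tokens = (t.strip(".,!?:;").lower() for t in text.split())
    return [t for t in tokens if t]

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Return one BM25 score per document, scaled so the best match is 1.0."""
    tokenized = [tokenize(d) for d in docs]
    n = len(docs)
    avg_len = sum(len(toks) for toks in tokenized) / n
    # Document frequency: how many documents contain each term.
    df = Counter(term for toks in tokenized for term in set(toks))
    raw = []
    for toks in tokenized:
        tf = Counter(toks)
        score = 0.0
        for term in tokenize(query):
            if term not in tf:
                continue
            # Rare terms get a higher IDF than common ones.
            idf = math.log(1 + (n - df[term] + 0.5) / (df[term] + 0.5))
            # Longer-than-average documents are penalized via b.
            norm = tf[term] + k1 * (1 - b + b * len(toks) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        raw.append(score)
    top = max(raw)
    return [s / top if top > 0 else 0.0 for s in raw]

docs = [
    "PostgreSQL migration completed successfully.",
    "Decided to use PostgreSQL for the database.",
    "Database backup strategy needs review.",
]
scores = bm25_scores("PostgreSQL database migration", docs)
```

With these three documents, the first one ranks highest because it matches the rarest query term ("migration"), and the third ranks lowest because it matches a single, more common term. The absolute values will not match the table above, since the boosts are omitted.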

More examples in the examples/ directory:

quickstart.py — basic usage
openclaw_integration.py — OpenClaw agent integration
langchain_integration.py — LangChain memory backend

Input Gating (P0–P3)

Input gating classifies content at intake by priority level. Low-value data (greetings, filler, acknowledgments) never enters storage, keeping memory clean without manual curation.

mem.ingest_with_gating("CRITICAL: API key compromised", source="alerts")
# → P0 (critical) → stored in strategic tier

mem.ingest_with_gating("Decided to switch to PostgreSQL", source="meeting")
# → P1 (operational) → stored in operational tier

mem.ingest_with_gating("thanks for the update!", source="chat")
# → P3 (ephemeral) → dropped, not stored

Level	Category	Stored	Examples
P0	Strategic	✅	Security alerts, errors, deadlines, financial commitments
P1	Operational	✅	Decisions, assignments, technical choices
P2	Tactical	✅	Background info, research, general discussion
P3	—	❌	Greetings, acknowledgments, filler

Classification uses keyword and pattern matching — no LLM calls. It's fast (0.177ms avg) but not perfect. Edge cases exist; when in doubt, it errs toward storing.
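
As a rough illustration of keyword and pattern classification, the sketch below assigns a priority level with a few regular expressions. The pattern lists, the six-word cutoff, and the `classify` function are hypothetical; the library's real rules are not shown in this README.

```python
import re

# Hypothetical rule tables, chosen only to cover the examples above.
P0_PATTERNS = [r"\bcritical\b", r"\bcompromised\b", r"\bdeadline\b", r"\berror\b"]
P1_PATTERNS = [r"\bdecided\b", r"\bassigned\b", r"\bswitch to\b"]
P3_PATTERNS = [r"^(hi|hello|hey|thanks|thank you|ok|okay|got it)\b[\s\w]*[.!]*$"]

def classify(text):
    """Return "P0".."P3"; anything unmatched defaults to P2 (stored)."""
    lowered = text.strip().lower()
    if any(re.search(p, lowered) for p in P0_PATTERNS):
        return "P0"
    if any(re.search(p, lowered) for p in P1_PATTERNS):
        return "P1"
    # Only short messages are eligible to be dropped as filler.
    is_short = len(lowered.split()) <= 6
    if is_short and any(re.search(p, lowered) for p in P3_PATTERNS):
        return "P3"
    return "P2"

levels = [
    classify("CRITICAL: API key compromised"),
    classify("Decided to switch to PostgreSQL"),
    classify("thanks for the update!"),
    classify("Background research on vector databases"),
]
```

Checking the high-priority patterns first and defaulting to P2 mirrors the stated behavior of erring toward storing when in doubt.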

Knowledge Synthesis

Knowledge synthesis identifies gaps in stored knowledge and integrates new research. It scans existing memories for topics mentioned frequently but lacking detail, then suggests targeted research.

# What does the agent not know enough about?
suggestions = mem.research_suggestions(limit=5)
# → [{"topic": "token optimization", "reason": "mentioned 3x, no details", "priority": "P1"}, ...]

# Integrate external findings
report = mem.synthesize(research_results={
    "token optimization": "Context window management techniques..."
})

Memory Decay

Memories fade over time unless reinforced by access:

score = importance × 2^(-age / half_life) + reinforcement

Fresh memories score high
Unused memories decay toward zero
Accessed memories are automatically reinforced (each search hit boosts the score)
Below-threshold memories are candidates for compression
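
The formula can be evaluated directly. This sketch assumes `age` and `half_life` are in the same unit (days here, which the README does not state explicitly) and treats `reinforcement` as a plain additive bonus; the function name is illustrative.

```python
def decay_score(importance, age_days, half_life=7.0, reinforcement=0.0):
    """score = importance * 2^(-age / half_life) + reinforcement"""
    return importance * 2 ** (-age_days / half_life) + reinforcement

fresh = decay_score(1.0, age_days=0)            # no decay yet
one_half_life = decay_score(1.0, age_days=7)    # half of the importance remains
two_half_lives = decay_score(1.0, age_days=14)  # a quarter remains
reinforced = decay_score(1.0, age_days=7, reinforcement=0.1)
```

Each elapsed half-life halves the importance term, so with `half_life=7.0` an untouched memory of importance 1.0 scores 0.5 after a week and 0.25 after two.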

Consolidation

Run periodically to maintain memory health:

report = mem.consolidate()

Sample output (10 memories, 3 topic clusters, no duplicates or contradictions detected):

{
  "timestamp": "2026-02-16T02:23:58",
  "total": 10,
  "active": 10,
  "archive_candidates": 0,
  "duplicates": 0,
  "clusters": 3,
  "contradictions": 0,
  "top_clusters": {
    "postgresql": ["4d8c1f76", "9178bfd3"],
    "cost": ["a0811e1b", "5b42672b"],
    "$500": ["a0811e1b", "5b42672b"]
  }
}

Finds and merges near-duplicate memories (e.g., "Chose PostgreSQL" and "PostgreSQL selected as database")
Discovers topic clusters (memories that reference the same subjects)
Flags contradictions (e.g., "API costs are reasonable" vs "API costs too much" — when phrased with explicit negation)
Suggests memories for archival (old, low-importance, rarely accessed)
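
The rule-based contradiction check can be sketched as follows. This is an assumption-laden illustration, not the library's code: the negation word list, the `analyze` and `contradicts` helpers, and the rule that two statements must share the same wording apart from negations and numbers are all hypothetical.

```python
import re

NEGATIONS = {"not", "no", "never"}

def analyze(statement):
    """Split a statement into (skeleton, negated, numbers)."""
    text = statement.lower().replace("n't", " not")
    tokens = re.findall(r"\d+(?:\.\d+)?|[a-z]+", text)
    # An odd number of negation words flips the meaning.
    negated = sum(t in NEGATIONS for t in tokens) % 2 == 1
    numbers = tuple(t for t in tokens if t[0].isdigit())
    # The skeleton ignores negations and replaces numbers with a placeholder.
    skeleton = tuple("#" if t[0].isdigit() else t
                     for t in tokens if t not in NEGATIONS)
    return skeleton, negated, numbers

def contradicts(a, b):
    skel_a, neg_a, nums_a = analyze(a)
    skel_b, neg_b, nums_b = analyze(b)
    if skel_a != skel_b:
        return False  # differently phrased statements are never compared
    return neg_a != neg_b or nums_a != nums_b

negation_conflict = contradicts("API costs are reasonable",
                                "API costs are not reasonable")
numeric_conflict = contradicts("The API costs $500/month",
                               "The API costs $300/month")
rephrased = contradicts("API costs are reasonable", "API costs too much")
```

The last pair is not flagged by this sketch, which illustrates the limitation noted earlier: without explicit negation or a numeric mismatch on otherwise matching wording, a deterministic rule has nothing to compare.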

Concurrency

Multiple processes can safely read and write to the same memory workspace. Concurrency guarantees apply to writers using Antaris Memory's locking utilities. If external tools modify workspace files without respecting locks, correctness is not guaranteed.

File Locking

from antaris_memory import FileLock

# Exclusive access to a resource.
# load(), modify(), and save() are placeholders for your own I/O helpers.
shard = "/path/to/shard.json"
with FileLock(shard, timeout=10.0):
    data = load(shard)
    modify(data)
    save(shard, data)

# Non-blocking try
lock = FileLock("/path/to/shard.json")
if lock.acquire(blocking=False):
    try:
        ...
    finally:
        lock.release()

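Locks use os.mkdir(), which either creates the lock directory or fails because it already exists, so exactly one process wins the race. The approach needs no dependencies and is designed to work across platforms, including on network filesystems. Stale locks from crashed processes are automatically detected and broken (by age or dead PID).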
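
For readers curious how a directory-based lock works, here is a minimal standard-library sketch. It is not the `FileLock` class: the `dir_lock` name, the `.lock` suffix, and the polling interval are assumptions, and stale-lock detection is omitted entirely.

```python
import os
import tempfile
import time
from contextlib import contextmanager

@contextmanager
def dir_lock(path, timeout=10.0, poll=0.05):
    """Hold an exclusive lock on `path` by creating `<path>.lock` as a directory."""
    lock_dir = path + ".lock"
    deadline = time.monotonic() + timeout
    while True:
        try:
            os.mkdir(lock_dir)  # raises FileExistsError if someone else holds it
            break
        except FileExistsError:
            if time.monotonic() >= deadline:
                raise TimeoutError(f"could not lock {path} within {timeout}s")
            time.sleep(poll)
    try:
        yield
    finally:
        os.rmdir(lock_dir)  # release the lock even if the body raised

# Demo: the lock directory exists only while the block is running.
target = os.path.join(tempfile.mkdtemp(), "shard.json")
with dir_lock(target):
    held = os.path.isdir(target + ".lock")
released = not os.path.exists(target + ".lock")
```

A production lock also needs a way to recover from a holder that crashed before releasing, which is what the age and PID checks mentioned above address.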

Optimistic Conflict Detection

For read-heavy workloads where locking overhead isn't worth it:

from antaris_memory import VersionTracker

tracker = VersionTracker()

# Snapshot before reading.
# load(), modify(), and save() are placeholders for your own I/O helpers.
data_path = "/path/to/data.json"
version = tracker.snapshot(data_path)
data = load(data_path)
modify(data)

# Check before writing; raises ConflictError if another process modified the file
tracker.check(version)
save(data_path, data)

# Or use the retry helper:
tracker.safe_update("/path/to/data.json", lambda d: {**d, "count": d["count"] + 1})
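
The underlying idea can be sketched with the standard library alone. The `snapshot`, `check`, and `ConflictError` names below are stand-ins defined locally for illustration; they are not the library's `VersionTracker` and may differ from it in what they record and compare.

```python
import hashlib
import os
import tempfile

class ConflictError(Exception):
    """Raised when a file changed between snapshot and write."""

def snapshot(path):
    """Record the file's modification time and content hash."""
    with open(path, "rb") as f:
        digest = hashlib.sha256(f.read()).hexdigest()
    return (path, os.stat(path).st_mtime_ns, digest)

def check(version):
    """Raise ConflictError if the file no longer matches the snapshot."""
    path, mtime_ns, digest = version
    _, current_mtime, current_digest = snapshot(path)
    if (current_mtime, current_digest) != (mtime_ns, digest):
        raise ConflictError(f"{path} was modified since it was read")

# Demo: an unchanged file passes the check.
path = os.path.join(tempfile.mkdtemp(), "data.json")
with open(path, "w") as f:
    f.write('{"count": 1}')
version = snapshot(path)
check(version)  # no exception: nothing has touched the file
```

Note that the check and the subsequent write are not atomic together, so a writer can still slip in between them; optimistic detection narrows that window rather than closing it.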

Safety Stack

All JSON writes use atomic_write_json() which combines:

Atomic writes (tmpfile → fsync → os.replace) — prevents torn files
File locks (os.mkdir) — prevents lost updates from concurrent writers
Directory fsync (POSIX) — crash-consistent renames

To opt out of locking for single-process workloads: atomic_write_json(path, data, lock=False).
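
The atomic-write portion of that stack follows a well-known pattern, sketched below without the locking step. The function name `atomic_write_json_sketch` is hypothetical and chosen to avoid suggesting this is the library's `atomic_write_json()`.

```python
import json
import os
import tempfile

def atomic_write_json_sketch(path, data):
    """Write JSON so readers see either the old file or the new one, never a torn mix."""
    directory = os.path.dirname(os.path.abspath(path))
    # The temp file must live in the same directory so os.replace stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as f:
            json.dump(data, f)
            f.flush()
            os.fsync(f.fileno())  # push file contents to disk before renaming
        os.replace(tmp_path, path)  # atomic swap into place
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
    if os.name == "posix":
        # Sync the directory so the rename itself survives a crash.
        dir_fd = os.open(directory, os.O_RDONLY)
        try:
            os.fsync(dir_fd)
        finally:
            os.close(dir_fd)

target = os.path.join(tempfile.mkdtemp(), "shard.json")
atomic_write_json_sketch(target, [{"hash": "a1b2c3d4e5f6"}])
```

Atomic replacement alone prevents torn files but not lost updates: two writers can each replace the file cleanly and one set of changes still disappears, which is why the stack pairs it with locks.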

Benchmarks

Measured on Apple M4, Python 3.14 (beta). Results on Python 3.9–3.13 should be comparable, since no version-specific optimizations are used. Reproducible via scripts/ollama_benchmark.py.

Memories	Ingest	Search (avg)	Search (p99)	Consolidate	Disk
100	5.3ms (0.053ms/entry)	0.40ms	0.65ms	4.2ms	117KB
500	16.8ms (0.034ms/entry)	1.70ms	2.51ms	84.3ms	575KB
1,000	33.2ms (0.033ms/entry)	3.43ms	5.14ms	343.3ms	1.1MB
5,000	173.7ms (0.035ms/entry)	17.10ms	25.70ms	4.3s	5.6MB

v1.0 search uses BM25-inspired scoring with IDF weighting, field boosting, and length normalization.

Input gating (P0–P3 classification): 0.177ms avg per input.

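Scaling notes: JSON file storage is practical up to ~50,000 memories per workspace. At that scale, expect roughly 50-100ms search and ~50MB on disk; the table above stops at 5,000 memories, so verify these figures on your own data. Consolidation time grows faster than linearly (343.3ms at 1,000 memories, 4.3s at 5,000), so schedule it accordingly. Beyond ~50,000 memories, consider splitting data across multiple workspaces or migrating to a database. We chose this limit deliberately: most agent workloads generate hundreds to low thousands of memories, not millions.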

Storage Format

v0.4 and later, including v1.0 (sharded): memories are split across multiple files by date and topic:

workspace/
├── shards/
│   ├── 2026-02-strategic.json    # Strategic memories from Feb 2026
│   ├── 2026-02-operational.json  # Operational memories from Feb 2026
│   └── 2026-01-tactical.json     # Tactical memories from Jan 2026
├── indexes/
│   ├── search_index.json         # Full-text inverted index
│   ├── tag_index.json            # Tag → memory hash lookup
│   └── date_index.json           # Date range index
├── migrations/
│   └── history.json              # Applied migration log
└── memory_audit.json             # Deletion audit trail (GDPR)

Each shard is a plain JSON file containing an array of memory entries. A single entry looks like this:

{
  "hash": "a1b2c3d4e5f6",
  "content": "Decided to use PostgreSQL",
  "source": "meeting-notes",
  "category": "strategic",
  "created": "2026-02-15T10:00:00",
  "importance": 1.0,
  "confidence": 0.8,
  "sentiment": {"strategic": 0.6},
  "tags": ["postgresql", "deployment"]
}
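
Because the files are plain JSON, the layout is easy to reason about in code. The sketch below shows how an entry could map to a shard filename and how a tag index could be derived from entries. Both helpers are hypothetical, and the naming rule (month of `created` plus `category`) is inferred from the example filenames above rather than confirmed by the library.

```python
from collections import defaultdict

def shard_name(entry):
    """Map an entry to a shard file, e.g. "2026-02-strategic.json"."""
    month = entry["created"][:7]  # "2026-02-15T10:00:00" -> "2026-02"
    return f"{month}-{entry['category']}.json"

def build_tag_index(entries):
    """Build a tag -> [memory hash, ...] lookup from a list of entries."""
    index = defaultdict(list)
    for entry in entries:
        for tag in entry.get("tags", []):
            index[tag].append(entry["hash"])
    return dict(index)

entry = {
    "hash": "a1b2c3d4e5f6",
    "content": "Decided to use PostgreSQL",
    "category": "strategic",
    "created": "2026-02-15T10:00:00",
    "tags": ["postgresql", "deployment"],
}
target_shard = shard_name(entry)
tag_index = build_tag_index([entry])
```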

v0.2/v0.3 (legacy): a single memory_metadata.json file. It is automatically migrated to the sharded format the first time the workspace is loaded by v0.4 or later, with backup and rollback support.

Storage format may evolve between versions. Breaking changes will increment MAJOR version. See CHANGELOG.

Architecture

MemorySystem (v1.0)
├── ShardManager       — Distributes memories across date/topic shards
├── IndexManager       — Full-text, tag, and date indexes for fast lookup
│   ├── SearchIndex    — Inverted index for text search
│   ├── TagIndex       — Tag → memory hash mapping
│   └── DateIndex      — Date range queries
├── MigrationManager   — Schema versioning with backup and rollback
├── SearchEngine       — BM25-inspired ranking with IDF, phrase boost, field boost
├── FileLock           — Cross-platform directory-based file locking
├── VersionTracker     — Optimistic conflict detection (mtime/hash)
├── InputGate          — P0-P3 classification at intake
├── DecayEngine        — Ebbinghaus forgetting curves
├── SentimentTagger    — Rule-based keyword tone tagging
├── TemporalEngine     — Date queries and narrative building
├── ConfidenceEngine   — Reliability scoring
├── CompressionEngine  — Old file summarization
├── ForgettingEngine   — Selective deletion with audit
├── ConsolidationEngine — Dedup, clustering, contradiction detection
└── KnowledgeSynthesizer — Gap identification and research integration

Data flow: ingest → classify (P0-P3) → normalize → shard-route → index → persist → search (index lookup) → decay-weight → return

Module notes: core_v4.py (imported as MemorySystem) is the production path — sharded storage, indexes, BM25 search. core.py is the legacy single-file implementation, kept for backward compatibility (from antaris_memory.core import MemorySystem as LegacyMemorySystem). New code should always use the default import.

Works With Local Models (Ollama)

All memory operations are local and deterministic — no tokens consumed, no API calls. Pair with Ollama for a fully local agent stack at zero marginal cost.

mem = MemorySystem("./workspace")
mem.load()
mem.ingest_with_gating("Meeting notes from standup", source="daily")
results = mem.search("standup decisions")

On a Mac Mini (32GB) running Ollama for inference and antaris-memory for persistence, your entire agent stack runs locally. On a Mac Studio (256GB), you can run 70B+ models alongside thousands of indexed memories with millisecond-scale lookups (see Benchmarks).

Running Tests

git clone https://github.com/Antaris-Analytics/antaris-memory.git
cd antaris-memory
python -m pytest tests/ -v

All 78 tests pass with zero external dependencies. No test fixtures, no mocking libraries, no network access.

Migrating from v0.x

Upgrading is automatic. Your existing workspace loads without changes:

# Same API — just upgrade the package
pip install antaris-memory==1.0.0

# Existing workspaces load automatically
mem = MemorySystem("./existing_workspace")
mem.load()  # Auto-detects v0.2/v0.3/v0.4 format, migrates if needed, rebuilds search index

v0.2/v0.3 → v1.0: Single-file format auto-migrates to sharded storage on first load (with backup)
v0.4 → v1.0: No migration needed. Search index rebuilds automatically.
Search results: confidence now reflects actual relevance (0.0-1.0) instead of static 0.50. Code using r.confidence will see better values, not different types.
New explain=True: Optional — returns SearchResult objects instead of MemoryEntry. Existing code unaffected.

Zero Dependencies (Core)

The core package uses only the Python standard library — no install-time dependencies. An optional [embeddings] extra (pip install antaris-memory[embeddings]) installs openai for future embedding-based search (not yet implemented in core — planned for v1.1). All current operations (ingest, search, decay, consolidation) are fully deterministic with no external calls.

Comparison

Feature	Antaris Memory	LangChain Memory	Mem0	Zep
Search ranking	✅ BM25 with IDF	❌ Exact match	✅ Embeddings	✅ Embeddings
Input gating	✅ P0-P3	❌	❌	❌
Knowledge synthesis	✅ Gap detection	❌	❌	❌
No database required	✅	❌	❌	❌
Memory decay	✅ Ebbinghaus	❌	❌	⚠️ Temporal graphs
Tone tagging	✅ Rule-based keywords	❌	❌	✅ NLP
Temporal queries	✅	❌	❌	✅
Contradiction detection	✅ Rule-based	❌	❌	⚠️ Fact evolution
Selective forgetting	✅ With audit	❌	⚠️ Invalidation	⚠️ Invalidation
Infrastructure needed	None	Redis/PG	Vector + KV + Graph	PostgreSQL + Vector

Honest caveat: LangChain, Mem0, and Zep offer features we don't — embeddings-based semantic search, graph relationships, real-time sync. They require more infrastructure but may be the right choice if you need those capabilities. Antaris Memory is for teams that want a simple, transparent, offline-first memory primitive.

Part of the Antaris Analytics Suite

antaris-memory — Persistent memory for AI agents (this package)
antaris-router — Adaptive model routing with outcome learning
antaris-guard — Security and prompt injection detection
antaris-context — Context window optimization

License

Licensed under the Apache License 2.0. See LICENSE for details.
