File-based persistent memory for AI agents. Zero dependencies.
🧠 antaris-memory
Persistent, intelligent memory for AI agents. The flagship package of the Antaris Analytics suite.
What Is This?
AI agents are stateless by default. Every spawn is a cold start with no knowledge of what happened before. antaris-memory solves this by giving agents a persistent, searchable, intelligent memory store that:
- Remembers what happened across sessions, spawns, and agent restarts
- Retrieves the right memories when they're needed, using an 11-layer search engine
- Decays old memories gracefully so signal-to-noise stays high
- Learns from mistakes, facts, and procedures with specialized memory types
- Shares knowledge across multi-agent teams
- Enriches itself via LLM hooks to dramatically improve recall
This is not a vector database wrapper. It is a zero-dependency, pure-Python, file-backed memory system designed from first principles for agentic workloads.
⚡ Quick Start
pip install antaris-memory
from antaris_memory import MemorySystem
mem = MemorySystem(workspace="./memory", agent_name="my-agent")
mem.load()
# Store a memory
mem.ingest("Deployed v2.3.1 to production at 14:32 UTC. All checks green.",
source="deploy-log", session_id="session-123", channel_id="ops-channel")
# Cross-session recall - finds memories from other sessions
results = mem.search("production deployment", crossSessionRecall="semantic")
for r in results:
    print(f"[{r.session_id}] {r.content}")
mem.save()
That's it. No API keys required, no external services, no configuration files.
📦 Installation
pip install antaris-memory
Version: 5.0.1
Requirements: Python 3.8+ · Zero external dependencies · stdlib only
🗺️ Feature Matrix
| Feature | Available | Version |
|---|---|---|
| Core ingestion & search | ✅ | v1.0 |
| Memory types (episodic/fact/mistake/procedure/preference) | ✅ | v1.0 |
| Temporal decay | ✅ | v1.0 |
| Export / Import | ✅ | v4.2 |
| GCS cloud backend | ✅ | v4.2 |
| Web & data file ingestion | ✅ | v4.7 |
| Tiered storage (hot/warm/cold) | ✅ | v4.7 |
| LLM enrichment hooks | ✅ | v4.6.5 |
| 11-layer search architecture | ✅ | v4.x |
| Graph intelligence (entity/relationship) | ✅ | v4.8/v4.9 |
| Shared / team memory pools | ✅ | v4.8 |
| Context packets (cold-spawn solver) | ✅ | v1.1 |
| MCP server | ✅ | v4.9 |
| Hybrid BM25 + semantic embedding search | ✅ | v4.x |
| Co-occurrence / PPMI semantic tier | ✅ | v4.x |
| Input gating (P0–P3 priority) | ✅ | v4.x |
| Cross-session memory recall | ✅ | v5.0.1 |
| Auto memory type classification | ✅ | v5.0.1 |
| Session/channel provenance | ✅ | v5.0.1 |
| doc2query (search query generation) | ✅ | v5.0.2 |
| Recovery system | ✅ | v3.3 |
| CLI tooling | ✅ | v4.x |
📚 Table of Contents
- Core API
- Memory Types
- Ingestion Methods
- Search & Retrieval
- 11-Layer Search Architecture
- Tiered Storage
- LLM Enrichment
- Graph Intelligence
- Context Packets
- Shared / Team Memory Pools
- MCP Server
- GCS Backend
- Export & Import
- Recovery System
- Co-occurrence / PPMI Semantic Tier
- Input Gating
- Hybrid Semantic Search
- Maintenance & Operations
- Stats & Health
- CLI Reference
- Full API Reference
🧠 Core API
Constructor
from antaris_memory import MemorySystem
mem = MemorySystem(
workspace="./memory", # Root directory for all memory files (required)
agent_name="my-agent", # REQUIRED: agent scoping, omitting triggers UserWarning
half_life=7.0, # Decay half-life in days (default: 7.0)
tag_terms=None, # Custom auto-tag terms (list of strings)
use_sharding=True, # Enterprise sharding for large stores
use_indexing=True, # Pre-built search indexes (faster queries)
enable_read_cache=True, # LRU in-memory cache
cache_max_entries=1000, # Max LRU cache entries
enricher=None, # LLM enrichment callable (see LLM Enrichment)
tiered_storage=True, # Hot/warm/cold tier management
graph_intelligence=True, # Entity extraction + knowledge graph
quality_routing=True, # Follow-up pattern detection
semantic_expansion=True, # Word embedding query expansion
)
Why agent_name matters: Each agent gets a scoped memory namespace. Without it, memories from different agents bleed together, and per-agent search filtering stops working. Always set it.
Lifecycle
# Load memories from disk into memory
count = mem.load()
print(f"Loaded {count} memories")
# Save current state to disk
path = mem.save()
# Flush Write-Ahead Log (WAL) to shards without full save
result = mem.flush()
# → {"flushed_entries": 42, "wal_cleared": True}
# Graceful shutdown: flush + release all resources
mem.close()
WAL (Write-Ahead Log): Every ingest() call appends to a WAL first. This makes writes fast and crash-safe. The WAL is periodically compacted into shards. Use flush() frequently in long-running agents; use close() at shutdown.
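The WAL-then-compact pattern described above can be sketched in a few lines. This is an illustrative toy, not the library's internal file layout; `TinyWAL` is a hypothetical name:

```python
import json
import os
import tempfile  # used in the usage example below

class TinyWAL:
    """Append-only log that is periodically folded into a shard file."""

    def __init__(self, path: str):
        self.path = path

    def append(self, record: dict) -> None:
        # Fast, crash-safe write: one JSON line per ingest, fsynced.
        with open(self.path, "a", encoding="utf-8") as f:
            f.write(json.dumps(record) + "\n")
            f.flush()
            os.fsync(f.fileno())

    def compact(self, shard_path: str) -> int:
        # Fold all WAL entries into the shard, then truncate the WAL.
        if not os.path.exists(self.path):
            return 0
        with open(self.path, encoding="utf-8") as f:
            entries = [json.loads(line) for line in f if line.strip()]
        with open(shard_path, "a", encoding="utf-8") as f:
            for entry in entries:
                f.write(json.dumps(entry) + "\n")
        open(self.path, "w").close()  # clear the WAL after a successful flush
        return len(entries)
```

`compact()` returns the number of entries folded in, analogous to the `flushed_entries` count returned by `flush()`.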
🏷️ Memory Types
Memory types are not just labels – they change how memories decay, how they score in search, and how they surface in context packets.
| Type | Decay Rate | Importance Multiplier | Special Behavior |
|---|---|---|---|
| `episodic` | Normal (half_life days) | 1× | Default type. General events and observations. |
| `fact` | Normal | 1× (high recall priority) | Verified facts. Prioritized in recall. |
| `mistake` | 10× slower | 2× importance | Surfaces as Known Pitfalls in context packets. Never forget your mistakes. |
| `preference` | 3× slower | 1× | High context-matched recall. User/agent preferences persist longer. |
| `procedure` | 3× slower | 1× | High task-matched recall. How-to knowledge stays relevant. |
Why This Matters for Agents
Mistakes outlive everything. If an agent tried an approach that failed, that memory persists 10× longer and scores 2× higher. It will surface when context is relevant – preventing the agent from repeating the same mistake.
Procedures don't decay during active projects. A procedure stored on Monday is still highly relevant on Friday. Regular episodic memories would fade; procedures don't.
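The decay behavior above can be sketched numerically. A hedged, illustrative model assuming simple exponential half-life decay (the real scoring pipeline is richer); `DECAY_SLOWDOWN` and `IMPORTANCE` are hypothetical names mirroring the table:

```python
import math

# Slowdown factors from the memory-type table above (illustrative).
DECAY_SLOWDOWN = {"episodic": 1.0, "fact": 1.0, "mistake": 10.0,
                  "preference": 3.0, "procedure": 3.0}
IMPORTANCE = {"mistake": 2.0}  # mistakes score 2x

def decayed_score(base_score: float, age_days: float,
                  memory_type: str = "episodic", half_life: float = 7.0) -> float:
    # Type-specific slowdown stretches the effective half-life.
    effective_half_life = half_life * DECAY_SLOWDOWN[memory_type]
    decay = 0.5 ** (age_days / effective_half_life)
    return base_score * decay * IMPORTANCE.get(memory_type, 1.0)
```

With the default 7-day half-life, a week-old episodic memory scores half its base value, while a week-old mistake barely decays and still carries its 2× multiplier.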
# Store a verified fact
mem.ingest_fact(
"The production database is PostgreSQL 14.2, hosted on RDS us-east-1",
source="infra-docs",
tags=["database", "infrastructure"]
)
# Record a mistake with full context
entry = mem.ingest_mistake(
what_happened="Used DROP TABLE instead of TRUNCATE, lost test data",
correction="Always use TRUNCATE for clearing data; reserve DROP TABLE for schema removal",
root_cause="Confused SQL semantics under pressure",
severity="high",
tags=["sql", "database", "destructive-ops"]
)
# Store a user/agent preference
mem.ingest_preference(
"User prefers concise responses under 200 words unless explicitly asked for detail",
source="user-feedback"
)
# Store a repeatable procedure
mem.ingest_procedure(
"Deployment checklist: 1) Run tests, 2) Tag release, 3) Push to staging, 4) Monitor 10min, 5) Push to prod",
source="runbook"
)
📥 Ingestion Methods
Basic Ingestion
# Full control ingestion with v5.0.1 features
entry_id = mem.ingest(
content="Completed sprint 14. Delivered auth module, skipped rate-limiter due to scope.",
source="sprint-retro",
category="engineering",
memory_type="episodic", # Auto-classified (semantic/episodic) in v5.0.1
tags=["sprint", "auth"],
agent_id="agent-007", # Override agent scoping
session_id="session-abc", # Session provenance (v5.0.1)
channel_id="channel-main", # Channel provenance (v5.0.1)
source_url="https://example.com/doc", # Source URL tracking (v5.0.1)
content_hash="sha256:abc123", # Content hash for deduplication (v5.0.1)
)
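The `content_hash` field enables deduplication. A minimal sketch of how such a hash can be computed and used to skip duplicates, assuming the `sha256:<hex>` shape shown above (`ingest_once` is a hypothetical helper, not the library API):

```python
import hashlib

def content_hash(content: str) -> str:
    # Matches the "sha256:<hex>" shape used in the ingest example.
    return "sha256:" + hashlib.sha256(content.encode("utf-8")).hexdigest()

_seen = set()

def ingest_once(content: str) -> bool:
    """Return True if the content is new, False if it is a duplicate."""
    h = content_hash(content)
    if h in _seen:
        return False
    _seen.add(h)
    return True
```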
Typed Ingestion Shortcuts
# Facts → verified knowledge
mem.ingest_fact("API rate limit is 1000 req/min", source="api-docs", tags=["api"])
# Preferences → persist 3× longer
mem.ingest_preference("Prefers dark mode and compact layouts", source="user-settings")
# Procedures → task-matched recall
mem.ingest_procedure("To reset 2FA: go to /settings → Security → Reset authenticator", source="support-kb")
# Mistakes → 2× importance, 10× half-life
entry = mem.ingest_mistake(
what_happened="Sent email to wrong recipient list",
correction="Always preview recipient list before sending bulk mail",
root_cause="Copy-paste error in distribution list",
severity="medium",
tags=["email", "communication"]
)
File & Directory Ingestion
# Ingest a single file
count = mem.ingest_file("./notes/meeting-2024-03.md", category="meetings")
# Ingest all matching files in a directory
count = mem.ingest_directory(
"./docs",
category="documentation",
pattern="*.md" # Glob pattern, default: *
)
print(f"Ingested {count} documents")
Web Ingestion
# Ingest a web page (and optionally crawl linked pages)
result = mem.ingest_url(
"https://docs.example.com/api",
depth=2, # How many levels of links to follow (default: 1)
incremental=True # Skip pages already ingested (default: True)
)
# → {"ingested": 14, "skipped_duplicates": 3, "source_url": "https://docs.example.com/api"}
# Remove all memories from a source URL
result = mem.delete_source("https://docs.example.com/api")
# → {"deleted": 14, "source_url": "https://docs.example.com/api"}
Structured Data Ingestion
# CSV file
result = mem.ingest_data_file("./data/customers.csv", format="csv")
# → {"ingested": 512, "source": "customers.csv", "format": "csv"}
# JSON file
result = mem.ingest_data_file("./data/events.json", format="json")
# SQLite database
result = mem.ingest_data_file("./data/app.db", format="sqlite")
# Auto-detect format
result = mem.ingest_data_file("./data/report.csv", format="auto")
# Ingest from a SQL query
result = mem.ingest_sql(
db_path="./data/app.db",
query="SELECT id, title, body, created_at FROM articles WHERE published = 1"
)
Input Gating
See the Input Gating section for P0–P3 priority filtering.
🔍 Search & Retrieval
Core Search
results = mem.search(
query="production deployment failure",
limit=10, # Max results (default: 10)
tags=["production"], # Filter by tags
tag_mode="any", # "any" or "all" (default: "any")
date_range=("2024-01-01", "2024-03-31"), # ISO date strings
use_decay=True, # Apply temporal decay scoring (default: True)
category="engineering", # Filter by category
min_confidence=0.3, # Minimum relevance score (0.0–1.0)
sentiment_filter="negative", # Filter by sentiment
memory_type="mistake", # Filter by memory type
explain=True, # Include score breakdown in results
session_id="session-abc", # Filter to specific session
agent_id="agent-007", # Filter to specific agent
channel_id="ops-channel", # Filter by channel (v5.0.1)
crossSessionRecall="semantic", # Cross-session semantic filtering (v5.0.1)
include_cold=False, # Include cold-tier memories (default: False)
)
for result in results:
    print(f"[{result.score:.3f}] {result.content[:100]}")
    if hasattr(result, 'score_breakdown'):
        print(f"  BM25={result.score_breakdown.get('bm25', 0.0):.3f}")
Search With Context (Instrumented)
results, ctx = mem.search_with_context(
query="API authentication patterns",
limit=10,
instrumentation_context={"session": "my-session"},
cooccurrence_boost=True # Apply PPMI co-occurrence reranking
)
# ctx is a SearchContext object
print(f"Expanded query: {ctx.expanded_query}")
print(f"Intent detected: {ctx.intent}")
print(f"Tiers searched: {ctx.tiers_searched}")
print(f"Search time: {ctx.search_time_ms:.1f}ms")
Recency-First Retrieval
# Get recent memories without keyword matching (pure recency)
recent = mem.recent(
limit=20,
agent_id="agent-007", # Filter to agent (optional)
include_shared=True # Include shared pool memories
)
Temporal Queries
# All memories from a specific date
entries = mem.on_date("2024-03-15")
# All memories in a date range
entries = mem.between("2024-03-01", "2024-03-31")
Narrative Generation
# Generate a prose narrative about a topic from memory
story = mem.narrative("deployment incidents in Q1")
print(story)
# → "In January, the team experienced three deployment incidents.
# The first occurred on January 8th when..."
Analysis & Synthesis
# Structured analysis of memories related to a topic
analysis = mem.analyze("user authentication", limit=20)
# → {
# "topic": "user authentication",
# "memory_count": 8,
# "themes": ["OAuth", "JWT", "session management"],
# "timeline": [...],
# "sentiment_distribution": {...}
# }
# Free-text knowledge synthesis
summary = mem.synthesize_knowledge("deployment best practices", limit=30)
print(summary)
# → "Based on accumulated knowledge: deployments succeed most often when..."
🏗️ 11-Layer Search Architecture
antaris-memory uses an 11-layer search pipeline. Each layer refines the ranked list before returning results. This is why recall quality dramatically exceeds naive keyword search.
Query Input
     │
     ▼
┌─ Layer 1: BM25+ TF-IDF
│    BM25_DELTA=1.0 floor ensures smooth scoring
├─ Layer 2: Exact Phrase Bonus
│    1.5× in content body · 1.3× in enriched summary
├─ Layer 3: Field Boosting
│    Tags 1.2× · Source 1.1× · Category 1.3×
├─ Layer 4: Rarity Boost + Proper Noun Boost
│    ≤1% corpus → 2.0× · 1–5% → 1.5× · 5–15% → 1.2×
│    Proper nouns (NNP detection) → 1.5×
├─ Layer 5: Sliding Window Context + Positional Salience
│    First/last window → 1.3× (intro/conclusion bias)
├─ Layer 6: Semantic Expansion
│    QueryExpander · PPMIBootstrap · CategoryTagger
├─ Layer 7: Intent Reranker
│    Detects: temporal · entity · event · decision ·
│    comparison · howto · quantity · location
├─ Layer 8: Qualifier & Negation Sensitivity
│    Handles: before/after · success/failure · negation
├─ Layer 9: Cross-Memory Clustering Boost
│    Post-normalization cluster coherence scoring
├─ Layer 10: MiniLM Embedding Reranker
│    .word_embeddings.json + .embeddings_minilm.json
└─ Layer 11: Pseudo-Relevance Feedback
     Top-term extraction from top-3 docs · 70/30 blend
     │
     ▼
Ranked Results
Layer Details
Layer 1 – BM25+ TF-IDF: The baseline relevance signal. BM25+ with a delta floor of 1.0 ensures no term scores as zero, preventing the "lucky keyword match" problem that plagues basic TF-IDF implementations.
Layer 2 – Exact Phrase Bonus: When the exact query phrase appears verbatim, the score jumps. Content hits get a bigger boost (1.5×) than hits in enriched summaries (1.3×) to reward genuine signal over derived metadata.
Layer 3 – Field Boosting: Memories indexed with a matching category (1.3×) or matching tags (1.2×) rank higher. This rewards structured ingestion.
Layer 4 – Rarity & Proper Noun Boost: Rare terms matter more. If a term appears in ≤1% of the corpus, matching it scores 2× over a common term. Proper nouns (detected via capitalization heuristics) get an additional 1.5× because they're almost always semantically important.
Layer 5 – Positional Salience: Not all text positions are equal. The first and last windows of a memory entry score 1.3× higher – mimicking human reading patterns where intro/conclusion carry the most signal.
Layer 6 – Semantic Expansion: The query is expanded with related terms using PPMIBootstrap co-occurrence statistics and the CategoryTagger. A search for "API failure" also matches "endpoint crash", "service down", "503 error" – without any synonym dictionary.
Layer 7 – Intent Reranker: Detects the semantic intent of the query and reranks accordingly. A "how to" query surfaces procedural memories first. A "when did" query surfaces episodic/temporal memories first.
Layer 8 – Qualifier & Negation Sensitivity: Understands before/after, success/failure, and negation. Searching for "failed deployment" does not match "successful deployment" – a distinction most search systems get wrong.
Layer 9 – Clustering Boost: Memories that cluster with other highly-relevant results get a bonus. This rewards coherent knowledge clusters over isolated matching documents.
Layer 10 – MiniLM Embedding Reranker: Pre-computed sentence embeddings (MiniLM-based) provide semantic similarity scoring. Works entirely from local files – no model inference at search time.
Layer 11 – Pseudo-Relevance Feedback: The top 3 results are analyzed for their most distinctive terms. Those terms are folded back into the query at a 70/30 blend. This is a classic IR technique (Rocchio) applied to agent memory – the search becomes smarter the more memories you have.
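The pseudo-relevance feedback idea from Layer 11 can be sketched in miniature. `expand_query` is a hypothetical helper, and the 0.7/0.3 weights are illustrative stand-ins for the 70/30 blend described above:

```python
from collections import Counter

def expand_query(query: str, top_docs: list, n_terms: int = 3) -> dict:
    """Blend distinctive terms from the top-ranked docs back into the query."""
    query_terms = query.lower().split()
    counts = Counter(
        term
        for doc in top_docs
        for term in doc.lower().split()
        if term not in query_terms
    )
    feedback_terms = [term for term, _ in counts.most_common(n_terms)]
    # 70% weight on the original terms, 30% on the feedback terms
    weights = {term: 0.7 for term in query_terms}
    weights.update({term: 0.3 for term in feedback_terms})
    return weights
```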
🗄️ Tiered Storage (Hot / Warm / Cold)
Large memory stores would be expensive to load entirely on startup. Tiered storage solves this by keeping recent memories fast and old memories accessible but lazy.
| Tier | Age | Behavior |
|---|---|---|
| Hot | 0–3 days | Loaded on startup. Always in memory. |
| Warm | 3–14 days | Loaded on-demand when hot search returns < 3 results. |
| Cold | 14+ days | Never auto-loaded. Requires include_cold=True. |
# Default search (hot + warm if needed)
results = mem.search("recent API changes")
# Explicitly search cold tier too
results = mem.search("API changes from last month", include_cold=True)
# Check tier distribution
stats = mem.get_stats()
print(f"Hot: {stats['hot_entries']}")
print(f"Warm: {stats['warm_entries']}")
print(f"Cold: {stats['cold_entries']}")
# Get the most-accessed hot entries
hot = mem.get_hot_entries(top_n=10)
Why tiers matter: An agent that's been running for months might have 50,000+ memories. Loading all of them on every spawn is expensive. Tiered storage means startup cost stays constant regardless of total memory size: the hot tier is small, the warm tier is medium, and the cold tier is archived.
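The tier boundaries and lazy-loading rule above reduce to a few lines. A sketch with hypothetical helper names, using the 3-day/14-day thresholds and the "< 3 hot hits loads warm" rule from the table:

```python
HOT_MAX_DAYS = 3     # hot = 0-3 days
WARM_MAX_DAYS = 14   # warm = 3-14 days

def tier_for(age_days: float) -> str:
    """Assign a memory to a tier by age."""
    if age_days < HOT_MAX_DAYS:
        return "hot"
    if age_days < WARM_MAX_DAYS:
        return "warm"
    return "cold"

def tiers_to_search(hot_hits: int, include_cold: bool = False) -> list:
    # Warm loads on demand when the hot tier returns < 3 results;
    # cold is only touched when explicitly requested.
    tiers = ["hot"]
    if hot_hits < 3:
        tiers.append("warm")
    if include_cold:
        tiers.append("cold")
    return tiers
```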
🤖 LLM Enrichment
Out-of-the-box keyword search only finds what's literally in the text. LLM enrichment dramatically improves recall by having a language model generate additional search signals at ingest time.
How It Works
- You provide an `enricher` callable when constructing `MemorySystem`
- On every `ingest()`, your callable receives the content and returns metadata
- That metadata is indexed with weighted boost factors:
  - `search_queries`: 3× TF weight – artificial query-document pairs
  - `enriched_summary`: 2× TF weight – search-optimized restatement
  - `search_keywords`: 2× TF weight – extra search terms
Enricher Interface
from typing import TypedDict, List
class EnrichmentResult(TypedDict):
    tags: List[str]             # Auto-generated tags
    summary: str                # Search-optimized restatement of the content
    keywords: List[str]         # Additional search terms
    search_queries: List[str]   # doc2query: LLM-generated search queries (v5.0.2)
Example: Anthropic Enricher
import anthropic
client = anthropic.Anthropic()
import json

def my_enricher(content: str) -> dict:
    response = client.messages.create(
        model="claude-3-5-haiku-20241022",
        max_tokens=500,
        messages=[{
            "role": "user",
            "content": f"""Analyze this memory and return JSON:
{{
  "tags": ["tag1", "tag2"],
  "summary": "one-sentence search-optimized restatement",
  "keywords": ["keyword1", "keyword2"],
  "search_queries": ["what query should find this?", "another natural query"]
}}
Memory: {content}"""
        }]
    )
    return json.loads(response.content[0].text)
mem = MemorySystem(
workspace="./memory",
agent_name="my-agent",
enricher=my_enricher
)
Batch Enrichment
Enrich memories that were ingested before an enricher was configured:
# Enrich all non-enriched entries in batches of 50
count = mem.re_enrich(
batch_size=50,
progress_fn=lambda i, total: print(f"Enriching {i}/{total}"),
overwrite=False # True to re-enrich already-enriched entries
)
print(f"Enriched {count} entries")
# Track enrichment costs
enrichment_count = mem.get_enrichment_count(reset=False)
stats = mem.get_stats()
print(f"Total enrichments: {stats['enrichment_count']}")
print(f"Estimated cost: ${stats['enrichment_cost_usd']:.4f}")
🕸️ Graph Intelligence
Beyond keyword search, antaris-memory builds a knowledge graph from ingested content. Entity extraction happens automatically – you don't need to annotate anything.
What It Does
- EntityExtractor: Identifies named entities (people, organizations, projects, locations, concepts) from memory content using zero-dependency heuristics
- MemoryGraph: An in-memory knowledge graph of entity relationships derived from co-occurrence patterns and explicit relationship signals
Graph Queries
# Search by relationship triple (subject, relation, object)
triples = mem.graph_search(
subject="PostgreSQL", # Filter by subject (None = any)
relation="used_by", # Filter by relation type (None = any)
obj=None # Filter by object (None = any)
)
for triple in triples:
    print(f"{triple.subject} --[{triple.relation}]--> {triple.object}")
# Find shortest path between two entities
path = mem.entity_path(
source="deployment-service",
target="production-database",
max_hops=3
)
print(" → ".join(path))
# → "deployment-service → RDS → production-database"
# Get full entity info
entity = mem.get_entity("PostgreSQL")
# → {"canonical": "PostgreSQL", "aliases": [...], "edge_count": 12, "memories": [...]}
# Graph statistics
stats = mem.get_graph_stats()
print(f"Nodes: {stats['nodes']}")
print(f"Edges: {stats['edges']}")
print(f"Density: {stats['density']:.4f}")
# Rebuild the graph from scratch (after bulk ingestion)
node_count = mem.rebuild_graph()
print(f"Graph rebuilt with {node_count} nodes")
# Rebuild topic clusters
cluster_count = mem.rebuild_clusters()
When to Use Graph Search
Graph search excels at questions keyword search struggles with:
- "What services depend on the auth module?" → `graph_search(subject=None, relation="depends_on", obj="auth-module")`
- "How is the payment service connected to the database?" → `entity_path("payment-service", "database")`
- "What do we know about the Stripe integration?" → `get_entity("Stripe")`
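The shortest-path lookup behind `entity_path` is, at its core, a breadth-first search. A self-contained sketch over a toy adjacency map (hypothetical data; the real graph structure is internal):

```python
from collections import deque

def shortest_entity_path(graph: dict, source: str, target: str, max_hops: int = 3):
    """Breadth-first shortest path; returns None when no path exists within max_hops."""
    queue = deque([[source]])
    visited = {source}
    while queue:
        path = queue.popleft()
        node = path[-1]
        if node == target:
            return path
        if len(path) - 1 >= max_hops:  # hop budget exhausted, do not extend
            continue
        for neighbor in graph.get(node, []):
            if neighbor not in visited:
                visited.add(neighbor)
                queue.append(path + [neighbor])
    return None
```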
📋 Context Packets
The cold spawn problem: when you launch a new sub-agent, it has zero context. You could dump 50 raw memories into its prompt, but that's token-inefficient and hard to parse.
Context packets solve this. They are structured, token-budgeted memory summaries that prime an agent with exactly what it needs for a specific task.
Building a Context Packet
packet = mem.build_context_packet(
task="Deploy the new auth service to production",
tags=["deployment", "auth"],
category="engineering",
environment="production",
instructions="Focus on known failure modes and deployment checklist",
max_memories=20,
max_tokens=3000,
min_relevance=0.3,
include_mistakes=True, # Adds "Known Pitfalls" section
max_pitfalls=5
)
# Render to markdown (for agent system prompt injection)
markdown = packet.render()
print(markdown)
# Trim to a strict token budget
packet.trim(max_tokens=2000)
# Serialize to dict (for JSON transport to sub-agents)
data = packet.to_dict()
Multi-Query Packets
When a task requires multiple knowledge domains:
packet = mem.build_context_packet_multi(
task="Migrate the database to the new schema",
queries=[
"database migration procedures",
"schema change failures",
"rollback procedures",
"maintenance window requirements"
],
max_tokens=4000,
include_mistakes=True
)
Example Rendered Output
# Context Packet: Deploy auth service to production
## Relevant Knowledge
1. **Deployment Checklist** (score: 0.89, procedure)
Run tests → Tag release → Push staging → Monitor 10min → Push prod
2. **Auth Service Architecture** (score: 0.82, fact)
JWT-based. Refresh tokens stored in Redis. Session expiry: 24h.
## Known Pitfalls ⚠️
1. **Failed auth deployment (2024-02-14)** [SEVERITY: HIGH]
- What happened: Deployed without running migration scripts first
- Correction: Always run `alembic upgrade head` before service restart
- Root cause: Skipped pre-deploy checklist under time pressure
Why This Is Critical
Context packets are the connective tissue of multi-agent systems. Without them, sub-agents are islands. With them, every spawned agent inherits the team's accumulated knowledge, including hard-won lessons from past failures.
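The token budgeting that `max_tokens` and `trim()` imply can be sketched as a greedy packer. `estimate_tokens` and `pack_memories` are hypothetical helpers, and the ~4-characters-per-token heuristic is an assumption, not the library's actual tokenizer:

```python
def estimate_tokens(text: str) -> int:
    # Rough heuristic: ~4 characters per token (illustrative only).
    return max(1, len(text) // 4)

def pack_memories(memories: list, max_tokens: int) -> list:
    """Greedily keep the highest-relevance memories that fit the budget."""
    packed, used = [], 0
    for memory in memories:  # assumed pre-sorted by relevance score
        cost = estimate_tokens(memory)
        if used + cost > max_tokens:
            break
        packed.append(memory)
        used += cost
    return packed
```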
👥 Shared / Team Memory Pools
Multi-agent systems need shared knowledge. A research agent should be able to write findings that a writing agent can later retrieve. Shared memory pools enable cross-agent knowledge sharing.
Setup
from antaris_memory import MemorySystem, AgentRole
# Agent 1: Coordinator
mem1 = MemorySystem(workspace="./agent1-memory", agent_name="coordinator")
pool = mem1.enable_shared_pool(
pool_dir="./shared-pool", # Shared filesystem location
pool_name="project-alpha",
agent_id="coordinator",
role=AgentRole.COORDINATOR, # COORDINATOR | WRITER | READER
load_existing=True
)
# Agent 2: Worker (separate process/instance)
mem2 = MemorySystem(workspace="./agent2-memory", agent_name="worker")
pool2 = mem2.enable_shared_pool(
pool_dir="./shared-pool",
pool_name="project-alpha",
agent_id="worker",
role=AgentRole.WRITER
)
Writing to the Shared Pool
# Write to the shared pool (available to all agents in the pool)
entry = mem1.shared_write(
content="Research complete: competitor uses GraphQL, not REST. Swagger docs at /api-docs.",
namespace="research", # Organize by namespace
category="competitive-intel",
metadata={"source": "api-analysis", "confidence": 0.9}
)
Reading from the Shared Pool
# Search shared pool
results = mem2.shared_search(
query="competitor API architecture",
namespace="research",
limit=5
)
for r in results:
    print(r.content)
Agent Roles
| Role | Can Read | Can Write | Can Admin |
|---|---|---|---|
| `COORDINATOR` | ✅ | ✅ | ✅ |
| `WRITER` | ✅ | ✅ | ❌ |
| `READER` | ✅ | ❌ | ❌ |
🔌 MCP Server
antaris-memory ships with a built-in MCP (Model Context Protocol) server, allowing any MCP-compatible client (Claude Desktop, etc.) to interact with memory directly.
Starting the MCP Server
Via CLI:
python -m antaris_memory serve \
--workspace ./memory \
--agent-name my-agent
Via Python:
from antaris_memory.mcp import AntarisMCPServer
server = AntarisMCPServer(
workspace="./memory",
agent_name="my-agent"
)
server.run_stdio()
MCP Tools Exposed
The MCP server exposes memory operations as tools that MCP clients can call:
- `memory_search` – search memories
- `memory_ingest` – store new memories
- `memory_recent` – get recent entries
- `memory_stats` – get memory statistics
- `memory_context_packet` – build a context packet for a task
Claude Desktop Integration
Add to your Claude Desktop config.json:
{
  "mcpServers": {
    "antaris-memory": {
      "command": "python",
      "args": ["-m", "antaris_memory", "serve", "--workspace", "/path/to/memory", "--agent-name", "claude"]
    }
  }
}
☁️ GCS Cloud Backend
For cloud-native deployments, antaris-memory supports Google Cloud Storage as a backend.
from antaris_memory.backends import GCSMemoryBackend
from antaris_memory import MemorySystem
backend = GCSMemoryBackend(
bucket="my-agent-memory-bucket",
prefix="agents/production/"
)
mem = MemorySystem(
workspace="./local-cache", # Local cache directory
agent_name="prod-agent",
backend=backend # GCS backend for persistence
)
Use cases:
- Persistent memory that survives container restarts
- Shared memory accessible from multiple cloud instances
- Memory backup and audit trail in GCS
Requirements: google-cloud-storage must be installed separately (pip install google-cloud-storage). The core antaris-memory package remains zero-dependency.
📤 Export & Import
Move memory stores between agents, environments, or archive them for later use.
Export
# Export all memories to a JSON file
count = mem.export(
output_path="./backup/memory-2024-03.json",
include_metadata=True # Include scores, timestamps, enrichment data
)
print(f"Exported {count} memories")
Export format:
{
  "version": "5.0.1",
  "exported_at": "2024-03-15T14:32:00Z",
  "workspace": "./memory",
  "entries": [
    {
      "id": "mem_abc123",
      "content": "...",
      "memory_type": "fact",
      "tags": ["api", "auth"],
      "created_at": "2024-03-10T09:15:00Z",
      "score": 0.92,
      "enriched_summary": "...",
      "search_queries": ["auth API", "JWT authentication"],
      "graph_entities": ["JWT", "OAuth"],
      "session_id": "session-abc",
      "channel_id": "ops-channel",
      "source_url": "https://docs.example.com/auth",
      "content_hash": "sha256:abc123def456"
    }
  ]
}
Import
# Import memories (merge with existing)
count = mem.import_from(
input_path="./backup/memory-2024-03.json",
merge=True # True = merge; False = replace
)
print(f"Imported {count} memories")
# Alias
count = mem.import_memories("./backup/memory-2024-03.json")
Use cases:
- Bootstrap a new agent with knowledge from an existing agent
- Restore from backup after data loss
- Migrate from staging to production
- Share domain knowledge between specialized agents
🔄 Recovery System
When an agent is spawned mid-task, it needs to reconstruct what happened before. The recovery system provides structured presets for this.
Presets
| Preset | Memories | Time Window | Approximate Tokens |
|---|---|---|---|
| `smart` (default) | 50 | 24 hours | ~5,000–10,000 |
| `minimal` | 10 | Current session | ~1,000–2,000 |
# Smart recovery – get full 24h context
recent = mem.recent(limit=50)
# Build a targeted recovery packet
packet = mem.build_context_packet(
task="Continue where we left off",
max_memories=50,
max_tokens=8000
)
recovery_context = packet.render()
Why recovery matters: An agent spawned to continue a long-running task needs to know: what was decided, what was tried, what failed, and what's pending. Without recovery context, it starts from scratch and may repeat mistakes or redo completed work.
📊 Co-occurrence / PPMI Semantic Tier
The PPMIBootstrap component builds a co-occurrence statistical model over the memory corpus. This enables semantic query expansion without any ML models or external dependencies.
How PPMI Works
PPMI (Positive Pointwise Mutual Information) measures how much more often two terms co-occur than expected by chance. Terms with high PPMI are semantically related.
PPMI(term_a, term_b) = max(0, log( P(a,b) / (P(a) × P(b)) ))
Over time, as you ingest memories, the PPMI matrix learns your domain's vocabulary automatically. "API" and "endpoint" will have high PPMI. "deployment" and "rollback" will have high PPMI.
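The formula above, computed directly from raw co-occurrence counts. A self-contained sketch; the counts are toy inputs, not real antaris-memory internals:

```python
import math

def ppmi(pair_count: int, count_a: int, count_b: int, total: int) -> float:
    """Positive PMI from raw counts; pair_count must be > 0."""
    p_ab = pair_count / total
    p_a = count_a / total
    p_b = count_b / total
    # Clamp negative PMI to zero: only positive associations count.
    return max(0.0, math.log(p_ab / (p_a * p_b)))
```

Two terms that always appear together score high; terms that co-occur less often than chance would predict clamp to 0.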
Practical Effect
A search for "API crash" will be expanded with high-PPMI neighbors like "endpoint failure", "service error", and "HTTP 500" – terms that appear in the same contexts in your memory store, not entries from a generic synonym dictionary.
# co-occurrence stats visible in stats output
stats = mem.get_stats()
print(f"Co-occurrence pairs: {stats['cooccurrence_pairs']}")
# Use cooccurrence boost explicitly
results, ctx = mem.search_with_context(
query="API problems",
cooccurrence_boost=True
)
print(f"Expanded query: {ctx.expanded_query}")
# → "API problems endpoint failure service error 503"
🚦 Input Gating
Not every piece of information is worth storing. Input gating classifies content by priority and drops low-value content before it enters the memory store.
Priority Levels
| Level | Label | Behavior |
|---|---|---|
| P0 | Critical | Always stored, highest importance weighting |
| P1 | Important | Stored |
| P2 | Standard | Stored |
| P3 | Ephemeral | Dropped – never stored |
# The context dict informs the gating decision
entry_id = mem.ingest_with_gating(
content="User said 'ok thanks'",
source="chat-log",
context={
"session_type": "casual",
"has_factual_claim": False,
"is_action": False,
"sentiment": "neutral"
}
)
# → 0 (dropped as P3 ephemeral content)
entry_id = mem.ingest_with_gating(
content="Production outage: auth service down. ETA 15 minutes. Cause: Redis connection pool exhausted.",
source="incident-log",
context={
"is_incident": True,
"severity": "high",
"has_factual_claim": True
}
)
# → memory_id (stored as P0 critical)
Why gating matters for agents: Agents that remember everything get noisy. An LLM-in-the-loop conversational agent might process thousands of utterances per day. Without gating, the memory store fills with "ok", "got it", and "sure", drowning out the signal.
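A hedged sketch of how a P0–P3 classifier over the context dict might look. These rules are illustrative stand-ins; the real gating logic is richer:

```python
def classify_priority(context: dict) -> str:
    """Map an ingest context dict to a P0-P3 priority (illustrative rules)."""
    if context.get("is_incident") or context.get("severity") == "high":
        return "P0"  # critical: always stored, highest weighting
    if context.get("has_factual_claim") or context.get("is_action"):
        return "P1"  # important
    if context.get("session_type") == "casual":
        return "P3"  # ephemeral: dropped, never stored
    return "P2"      # standard

def should_store(context: dict) -> bool:
    return classify_priority(context) != "P3"
```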
Hybrid Semantic Search
When you have embedding infrastructure available, antaris-memory can blend BM25 keyword search with cosine similarity semantic search.
Blend Ratio
Final Score = 0.40 × BM25_score + 0.60 × cosine_similarity
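The blend itself is a one-liner; this sketch assumes the BM25 score has already been normalized into a range comparable with cosine similarity:

```python
import math

# Sketch of the 40/60 blend above; assumes a normalized BM25 score.
def hybrid_score(bm25: float, query_vec, doc_vec) -> float:
    dot = sum(q * d for q, d in zip(query_vec, doc_vec))
    norms = math.sqrt(sum(q * q for q in query_vec)) * math.sqrt(sum(d * d for d in doc_vec))
    cosine = dot / norms if norms else 0.0
    return 0.40 * bm25 + 0.60 * cosine

# Identical vectors give cosine 1.0: 0.40 * 0.5 + 0.60 * 1.0 = 0.8
print(round(hybrid_score(0.5, [1.0, 2.0], [1.0, 2.0]), 2))  # 0.8
```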
Setup
# Provide any embedding function – OpenAI, a local model, whatever you have
def my_embed(text: str) -> list[float]:
# Returns a list of floats (embedding vector)
import openai
response = openai.embeddings.create(
model="text-embedding-3-small",
input=text
)
return response.data[0].embedding
mem.set_embedding_fn(my_embed)
# All subsequent searches now use hybrid BM25+semantic scoring
results = mem.search("machine learning pipeline optimization")
Without an embedding function, all 11 layers still run, including the Layer 10 MiniLM reranker with its pre-computed local embeddings. The set_embedding_fn hook layers the 60% cosine-similarity signal on top.
Maintenance & Operations
Compaction
Over time, memories accumulate duplicates, stale entries, and expired content. Compaction cleans them up.
result = mem.compact()
# → {
# "entries_before": 1240,
# "entries_after": 987,
# "removed_count": 253,
# "shards_before": 8,
# "shards_after": 6,
# "space_freed_mb": 12.4,
# "duration_ms": 340
# }
Forgetting
Selectively remove memories matching criteria:
result = mem.forget(
topic="sprint-12", # Remove by topic/keyword
entity="TempProject", # Remove by entity name
before_date="2023-12-31" # Remove entries older than date
)
# → {"forgotten": 42, "criteria": {...}}
Consolidation
Group similar memories and deduplicate:
result = mem.consolidate()
# → {"consolidated": 18, "duplicates_removed": 7}
Compression
Archive old memories to compressed format:
archived = mem.compress_old(days=60) # Compress entries older than 60 days
print(f"Archived {len(archived)} entries")
Reindexing
Rebuild search indexes after bulk imports or schema changes:
mem.reindex() # Rebuilds all search indexes
Relevance Management
# Mark memories as used (boosts their score for future recall)
count = mem.mark_used(
memory_ids=["mem_abc123", "mem_def456"],
context="used in deployment planning"
)
# Manually boost a specific memory
success = mem.boost_relevance(
memory_id="mem_abc123",
multiplier=1.5 # 1.5× score boost
)
Migration
# Migrate from older format to v4
result = mem.migrate_to_v4()
# → {"migrated": 840, "errors": 0, "duration_ms": 1200}
# Rollback if something goes wrong
result = mem.rollback_migration()
# Validate data integrity
report = mem.validate_data()
# → {"valid": True, "errors": [], "warnings": 3, "checked": 987}
Stats & Health
Comprehensive Stats
stats = mem.get_stats()
# Equivalent: mem.stats()
| Key | Description |
|---|---|
| `total_entries` | Total memories across all tiers |
| `hot_entries` | Entries in hot tier (0–3 days) |
| `warm_entries` | Entries in warm tier (3–14 days) |
| `cold_entries` | Entries in cold tier (14+ days) |
| `wal_size` | Write-Ahead Log entry count |
| `enrichment_count` | Total LLM enrichment calls made |
| `enrichment_cost_usd` | Estimated enrichment cost |
| `graph_enabled` | Whether graph intelligence is active |
| `graph_nodes` | Total entities in knowledge graph |
| `graph_edges` | Total relationships in knowledge graph |
| `cooccurrence_pairs` | PPMI co-occurrence term pairs |
| `cache_size` | Current LRU cache size |
| `avg_search_time_ms` | Average search latency |
| `cache_hit_rate` | LRU cache hit percentage |
| `disk_usage_mb` | Total disk usage |
| `version` | Library version |
| `workspace` | Workspace path |
| `agent_name` | Configured agent name |
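The hot/warm/cold cutoffs in the table above amount to simple age-based routing. A sketch (the library's actual TierManager may use different or configurable boundaries):

```python
# Age-based tier routing implied by the stats table (illustrative cutoffs).
def tier_for_age(age_days: float) -> str:
    if age_days < 3:
        return "hot"   # 0–3 days
    if age_days < 14:
        return "warm"  # 3–14 days
    return "cold"      # 14+ days

print(tier_for_age(1), tier_for_age(7), tier_for_age(30))  # hot warm cold
```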
Health Check
health = mem.get_health()
# → {
# "status": "ok", # "ok" or "degraded"
# "checks": {
# "workspace_accessible": True,
# "memories_loaded": True,
# "wal_ok": True,
# "search_ok": True,
# "graph_ok": True
# }
# }
if health["status"] != "ok":
failed = [k for k, v in health["checks"].items() if not v]
print(f"Health degraded: {failed}")
CLI Reference
The antaris-memory CLI provides 4 commands for workspace management.
Global Flags
All commands accept:
--workspace PATH Path to the memory workspace directory (default: ./memory)
init – Initialize a workspace
python -m antaris_memory init \
--workspace ./my-agent-memory \
--agent-name my-agent \
[--force]
| Flag | Description |
|---|---|
| `--workspace PATH` | Target directory to initialize |
| `--agent-name NAME` | Agent name to embed in config |
| `--force` | Overwrite existing workspace |
Example output:
✓ Workspace initialized: ./my-agent-memory
✓ Agent name: my-agent
✓ Created: shards/, wal/, indexes/
Ready to use.
status – Show workspace status
python -m antaris_memory status --workspace ./my-agent-memory
Example output:
antaris-memory v5.0.1
────────────────────────────────
Workspace: ./my-agent-memory
Agent: my-agent
────────────────────────────────
Total entries: 1,247
Hot (0-3d): 83
Warm (3-14d): 312
Cold (14d+): 852
WAL entries: 14
────────────────────────────────
Graph nodes: 428
Graph edges: 1,102
Cooccurrence: 8,940 pairs
────────────────────────────────
Disk usage: 47.2 MB
Cache hit rate: 82.4%
Avg search: 8.3 ms
────────────────────────────────
Health: ✓ ok
rebuild-graph – Rebuild knowledge graph
python -m antaris_memory rebuild-graph --workspace ./my-agent-memory
Example output:
Rebuilding knowledge graph...
Processed 1,247 entries
✓ Graph rebuilt: 428 nodes, 1,102 edges
Duration: 2.4s
Use after bulk imports or when graph data seems stale.
serve – Start MCP server
python -m antaris_memory serve \
--workspace ./my-agent-memory \
--agent-name my-agent
| Flag | Description |
|---|---|
| `--workspace PATH` | Memory workspace to serve |
| `--agent-name NAME` | Agent name for scoping |
The server communicates over stdio (MCP protocol). Connect via any MCP-compatible client.
Full API Reference
Constructor
| Parameter | Type | Default | Description |
|---|---|---|---|
| `workspace` | `str` | required | Root directory for memory files |
| `half_life` | `float` | `7.0` | Temporal decay half-life in days |
| `tag_terms` | `list` | `None` | Custom auto-tag terms |
| `use_sharding` | `bool` | `True` | Enterprise shard splitting |
| `use_indexing` | `bool` | `True` | Pre-built search indexes |
| `enable_read_cache` | `bool` | `True` | LRU read cache |
| `cache_max_entries` | `int` | `1000` | LRU cache size limit |
| `agent_name` | `str` | `None` | Agent scope (required) |
| `enricher` | `Callable` | `None` | LLM enrichment hook |
| `tiered_storage` | `bool` | `True` | Hot/warm/cold tier management |
| `graph_intelligence` | `bool` | `True` | Entity extraction + graph |
| `quality_routing` | `bool` | `True` | Follow-up pattern detection |
| `semantic_expansion` | `bool` | `True` | PPMI query expansion |
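The `half_life` parameter controls temporal decay. The library's exact curve isn't documented here, but a half-life weighting is conventionally exponential, as this sketch shows:

```python
# Conventional half-life decay: an entry's weight halves every `half_life`
# days. A sketch of the standard formulation, not the library's verified code.
def decay_weight(age_days: float, half_life: float = 7.0) -> float:
    return 0.5 ** (age_days / half_life)

print(decay_weight(0.0))   # 1.0
print(decay_weight(7.0))   # 0.5
print(decay_weight(14.0))  # 0.25
```

With the default `half_life=7.0`, a week-old memory carries half the weight of a fresh one; raising `half_life` makes memories fade more slowly.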
Lifecycle Methods
| Method | Returns | Description |
|---|---|---|
| `load()` | `int` | Load from disk; returns entry count |
| `save()` | `str` | Save to disk; returns path |
| `flush()` | `dict` | Compact WAL to shards |
| `close()` | `None` | Flush + release resources |
Ingestion Methods
| Method | Returns | Description |
|---|---|---|
| `ingest(content, source, category, memory_type, tags, agent_id, session_id, channel_id, source_url, content_hash)` | `int` | Store a memory entry |
| `ingest_fact(content, source, tags, category)` | `int` | Store a verified fact |
| `ingest_preference(content, source, tags, category)` | `int` | Store a preference |
| `ingest_procedure(content, source, tags, category)` | `int` | Store a procedure |
| `ingest_mistake(what_happened, correction, root_cause, severity, tags)` | `MemoryEntry` | Store a mistake with full context |
| `ingest_file(file_path, category)` | `int` | Ingest a file |
| `ingest_directory(dir_path, category, pattern)` | `int` | Ingest a directory of files |
| `ingest_url(url, depth, incremental)` | `dict` | Ingest web content |
| `ingest_data_file(path, format, **kwargs)` | `dict` | Ingest CSV/JSON/SQLite |
| `ingest_sql(db_path, query)` | `dict` | Ingest SQL query results |
| `ingest_with_gating(content, source, context)` | `int` | Ingest with P0–P3 priority gating |
Search & Retrieval Methods
| Method | Returns | Description |
|---|---|---|
| `search(query, limit, tags, tag_mode, date_range, use_decay, category, min_confidence, sentiment_filter, memory_type, explain, session_id, agent_id, include_cold)` | `list` | 11-layer search |
| `search_with_context(query, limit, instrumentation_context, cooccurrence_boost)` | `tuple[list, SearchContext]` | Search with instrumentation |
| `recent(limit, agent_id, include_shared)` | `list` | Recency-first retrieval |
| `on_date(date)` | `list` | All memories from a date |
| `between(start, end)` | `list` | All memories in a date range |
| `analyze(query, limit)` | `dict` | Structured topic analysis |
| `synthesize_knowledge(topic, limit)` | `str` | Free-text knowledge synthesis |
| `narrative(topic)` | `str` | Prose narrative from memories |
Graph Methods
| Method | Returns | Description |
|---|---|---|
| `graph_search(subject, relation, obj)` | `list` | Query relationship triples |
| `entity_path(source, target, max_hops)` | `list` | Find entity relationship path |
| `get_entity(canonical)` | `dict` | Get entity node info |
| `get_graph_stats()` | `dict` | Graph statistics |
| `rebuild_graph()` | `int` | Rebuild graph from all entries |
| `rebuild_clusters()` | `int` | Rebuild topic clusters |
Context Packet Methods
| Method | Returns | Description |
|---|---|---|
| `build_context_packet(task, tags, category, environment, instructions, max_memories, max_tokens, min_relevance, include_mistakes, max_pitfalls)` | `ContextPacket` | Build single-query context packet |
| `build_context_packet_multi(task, queries, ...)` | `ContextPacket` | Build multi-query context packet |

ContextPacket methods:
- `packet.render()` → `str` (markdown)
- `packet.to_dict()` → `dict` (serializable)
- `packet.trim(max_tokens)` → trims in place
Shared Pool Methods
| Method | Returns | Description |
|---|---|---|
| `enable_shared_pool(pool_dir, pool_name, agent_id, role, load_existing)` | `SharedMemoryPool` | Enable shared pool |
| `shared_write(content, namespace, category, metadata)` | `object` | Write to shared pool |
| `shared_search(query, namespace, limit)` | `list` | Search shared pool |
LLM Enrichment Methods
| Method | Returns | Description |
|---|---|---|
| `re_enrich(batch_size, progress_fn, overwrite)` | `int` | Batch-enrich existing entries |
| `get_enrichment_count(reset)` | `int` | Get enrichment call count |
| `set_embedding_fn(fn)` | `None` | Set embedding function for hybrid search |
Maintenance Methods
| Method | Returns | Description |
|---|---|---|
| `compact()` | `dict` | Remove duplicates, expire stale entries |
| `consolidate()` | `dict` | Group and deduplicate similar memories |
| `compress_old(days)` | `list` | Compress entries older than N days |
| `reindex()` | `None` | Rebuild search indexes |
| `forget(topic, entity, before_date)` | `dict` | Selectively remove memories |
| `delete_source(source_url)` | `dict` | Remove all memories from a source |
| `mark_used(memory_ids, context)` | `int` | Mark memories as used |
| `boost_relevance(memory_id, multiplier)` | `bool` | Boost a memory's score |
Stats & Health Methods
| Method | Returns | Description |
|---|---|---|
| `get_stats()` / `stats()` | `dict` | Comprehensive statistics |
| `get_health()` | `dict` | Health check |
| `get_hot_entries(top_n)` | `list` | Most-accessed hot entries |
Data Integrity Methods
| Method | Returns | Description |
|---|---|---|
| `export(output_path, include_metadata)` | `int` | Export to JSON |
| `import_from(input_path, merge)` | `int` | Import from JSON |
| `import_memories(path)` | `int` | Import (alias) |
| `validate_data()` | `dict` | Validate data integrity |
| `migrate_to_v4()` | `dict` | Migrate from older format |
| `rollback_migration()` | `dict` | Roll back a migration |
Full Example: Production Agent
import json
import anthropic
from antaris_memory import MemorySystem
# ── Enricher ─────────────────────────────────────────────────────────────
client = anthropic.Anthropic()
def enricher(content: str) -> dict:
response = client.messages.create(
model="claude-3-5-haiku-20241022",
max_tokens=400,
messages=[{"role": "user", "content": f"""
Return JSON only:
{{"tags": ["tag1"], "summary": "one-line restatement",
"keywords": ["kw1"], "search_queries": ["natural query that should find this"]}}
Content: {content[:500]}"""}]
)
try:
return json.loads(response.content[0].text)
except Exception:
return {"tags": [], "summary": content[:100], "keywords": [], "search_queries": []}
# ── Setup ────────────────────────────────────────────────────────────────
mem = MemorySystem(
workspace="./production-memory",
agent_name="prod-agent",
enricher=enricher,
tiered_storage=True,
graph_intelligence=True,
semantic_expansion=True,
)
count = mem.load()
print(f"Loaded {count} memories")
# ── Ingest various memory types ──────────────────────────────────────────
mem.ingest_fact(
"AWS us-east-1 is our primary region. Failover to us-west-2.",
source="infra-docs",
tags=["aws", "infrastructure"]
)
mem.ingest_procedure(
"Incident response: 1) Page on-call, 2) Open incident channel, 3) Assign incident commander, 4) Update status page",
source="runbook",
tags=["incident", "ops"]
)
mem.ingest_mistake(
what_happened="Deployed to production without a feature flag, caused 2h outage",
correction="All new features must ship behind a LaunchDarkly flag",
root_cause="Skipped pre-deploy checklist",
severity="critical",
tags=["deployment", "feature-flags", "outage"]
)
mem.ingest_url("https://docs.example.com/api/v2", depth=2, incremental=True)
# ── Build context packet for sub-agent ───────────────────────────────────
packet = mem.build_context_packet(
task="Deploy new payment service to production",
tags=["deployment", "payment"],
max_tokens=4000,
include_mistakes=True,
max_pitfalls=5
)
print(packet.render()) # Inject into sub-agent system prompt
# ── Search ───────────────────────────────────────────────────────────────
results = mem.search(
"production deployment failure",
limit=5,
memory_type="mistake",
explain=True
)
for r in results:
print(f"[{r.score:.3f}] {r.content[:80]}")
# ── Graph query ──────────────────────────────────────────────────────────
path = mem.entity_path("payment-service", "aws-rds", max_hops=3)
print(" → ".join(path))
# ── Maintenance ──────────────────────────────────────────────────────────
stats = mem.get_stats()
print(f"Entries: {stats['total_entries']} | Graph: {stats['graph_nodes']} nodes")
health = mem.get_health()
if health["status"] != "ok":
print(f"DEGRADED: {health}")
result = mem.compact()
print(f"Compacted: removed {result['removed_count']} entries")
mem.save()
mem.close()
Architecture Overview
antaris-memory/
├── Core
│   ├── MemorySystem (MemorySystemV4)
│   ├── MemoryEntry (typed entry schema)
│   └── WAL (Write-Ahead Log for crash safety)
│
├── Storage
│   ├── ShardManager (enterprise sharding)
│   ├── TierManager (hot/warm/cold routing)
│   └── GCSMemoryBackend (cloud backend)
│
├── Search
│   ├── BM25PlusIndex (Layer 1)
│   ├── PhraseMatcher (Layer 2)
│   ├── FieldBooster (Layer 3)
│   ├── RarityBooster (Layer 4)
│   ├── WindowScorer (Layer 5)
│   ├── QueryExpander + PPMIBootstrap (Layer 6)
│   ├── IntentReranker (Layer 7)
│   ├── QualifierFilter (Layer 8)
│   ├── ClusterBooster (Layer 9)
│   ├── EmbeddingReranker (Layer 10)
│   └── PseudoRelevanceFeedback (Layer 11)
│
├── Intelligence
│   ├── EntityExtractor
│   ├── MemoryGraph
│   ├── LLMEnricher
│   └── CategoryTagger
│
├── Multi-Agent
│   ├── SharedMemoryPool
│   ├── AgentRole (COORDINATOR/WRITER/READER)
│   └── AgentConfig
│
├── Context
│   ├── ContextPacketBuilder
│   └── ContextPacket
│
└── Server
    ├── AntarisMCPServer
    └── CLI (init/status/rebuild-graph/serve)
License
Apache License 2.0
Links
- PyPI: https://pypi.org/project/antaris-memory/
- Antaris Analytics LLC: https://antarisanalytics.ai
- Suite: antaris-suite
- Parsica: https://parsica.ai/ (Document Parsing using the same method)
antaris-memory is the flagship package of the Antaris Analytics LLC suite โ built for production AI agent deployments.