Unified Memory SDK for LLM applications with multi-tier storage, policy-driven lifecycle, and intelligent summarization

These details have not been verified by PyPI

Project links

Project description

Axon

Unified Memory SDK for LLM Applications

Documentation · Examples · API Reference · Changelog

What is Axon?

Axon is a production-ready memory management system for Large Language Model (LLM) applications. It provides intelligent multi-tier storage, policy-driven lifecycle management, and semantic recall with automatic compaction and summarization.

Think of it as a smart caching layer for your LLM's memory - automatically organizing memories by importance, managing token budgets, and ensuring compliance.

Features

Multi-Tier Architecture - Automatic routing across ephemeral, session, and persistent tiers
Policy-Driven Lifecycle - Configure TTL, capacity limits, promotion/demotion thresholds
Semantic Search - Vector-based similarity search with metadata filtering
Automatic Compaction - Summarize and compress memories to manage token budgets
Audit Logging - Complete audit trails for compliance (GDPR, HIPAA)
PII Detection - Automatic detection and classification of sensitive information
Transaction Support - Two-phase commit (2PC) for atomic multi-tier operations
Structured Logging - Production-grade JSON logging with correlation IDs
Framework Integration - First-class support for LangChain and LlamaIndex

Quick Start

Installation

pip install axon

Basic Usage

import asyncio
from axon import MemorySystem
from axon.core.templates import balanced

async def main():
    # Create memory system
    system = MemorySystem(config=balanced())

    # Store memories with automatic tier routing
    await system.store(
        "User prefers dark mode",
        importance=0.8,
        tags=["preference", "ui"]
    )

    # Recall memories semantically
    results = await system.recall("user preferences", k=5)

    for entry in results:
        print(f"{entry.text} (importance: {entry.metadata.importance})")

asyncio.run(main())

Architecture

graph TB
    A[Your LLM Application] --> B[MemorySystem API]
    B --> C{Router}
    C -->|importance < 0.3| D[Ephemeral Tier]
    C -->|importance 0.3-0.7| E[Session Tier]
    C -->|importance > 0.7| F[Persistent Tier]

    D --> G[In-Memory / Redis]
    E --> H[Redis / ChromaDB]
    F --> I[Qdrant / Pinecone / ChromaDB]

    J[PolicyEngine] -.->|promotion| C
    J -.->|demotion| C

    style B fill:#4051B5,color:#fff
    style C fill:#5C6BC0,color:#fff

Why Axon?

Problem	Axon Solution
Token Limits	Automatic summarization and compaction
Cost	Intelligent tier routing reduces expensive vector DB operations
Session Management	Built-in session isolation with TTL and lifecycle policies
PII & Privacy	Automatic PII detection with configurable privacy levels
Observability	Structured logging and audit trails for compliance

Use Cases

Chatbot with Persistent Memory

from axon.integrations.langchain import AxonChatMemory
from langchain_openai import ChatOpenAI
from langchain.chains import LLMChain

memory = AxonChatMemory(system=MemorySystem(...))
llm = ChatOpenAI(model="gpt-4")
chain = LLMChain(llm=llm, memory=memory)

# Conversations persist across sessions
response = await chain.arun("What did we discuss last week?")

RAG with Multi-Tier Storage

from axon.integrations.llamaindex import AxonVectorStore
from llama_index.core import VectorStoreIndex

vector_store = AxonVectorStore(system=MemorySystem(...))
index = VectorStoreIndex.from_vector_store(vector_store)

query_engine = index.as_query_engine()
response = await query_engine.aquery("Explain quantum computing")

Audit-Compliant Memory

from axon.core import AuditLogger

audit_logger = AuditLogger(max_events=10000, enable_rotation=True)
system = MemorySystem(config=config, audit_logger=audit_logger)

# All operations automatically logged
await system.store("Sensitive data", privacy_level=PrivacyLevel.RESTRICTED)

# Export audit trail
events = await system.export_audit_log(operation=OperationType.STORE)

Storage Adapters

Axon supports multiple backends:

In-Memory - Development and testing
Redis - Ephemeral caching with TTL
ChromaDB - Local vector storage
Qdrant - Production vector database
Pinecone - Managed vector database

Core Concepts

Memory Tiers

Ephemeral (importance < 0.3): Short-lived, high-volume data
Session (0.3 ≤ importance < 0.7): Session-scoped context
Persistent (importance ≥ 0.7): Long-term semantic storage

Policies

Define lifecycle rules for each tier:

from axon.core.policies import SessionPolicy

policy = SessionPolicy(
    ttl_minutes=60,           # Session expires after 1 hour
    max_items=100,            # Limit to 100 memories
    summarize_after=50,       # Summarize when reaching 50 items
    promote_threshold=0.8,    # Promote high-importance memories
)

Routing

Automatic tier selection based on:

Importance scores
Access patterns (recency, frequency)
Capacity constraints
Explicit tier hints

Advanced Features

Compaction Strategies

# Count-based compaction
await system.compact(tier="session", strategy="count", threshold=50)

# Semantic similarity compaction
await system.compact(tier="session", strategy="semantic", threshold=0.9)

# Hybrid strategy (combines multiple approaches)
await system.compact(tier="session", strategy="hybrid")

Privacy & PII Detection

# Automatic PII detection enabled by default
entry_id = await system.store("Contact: john@example.com, 555-1234")

# Check detected PII
tier, entry = await system._get_entry_by_id(entry_id)
print(entry.metadata.pii_detection.detected_types)
# Output: {'email', 'phone'}

print(entry.metadata.privacy_level)
# Output: PrivacyLevel.INTERNAL

Transactions (2PC)

from axon.core.transaction import TransactionManager, IsolationLevel

tx_manager = TransactionManager(registry, isolation_level=IsolationLevel.SERIALIZABLE)

async with tx_manager.transaction() as tx:
    await tx.store_in_tier("ephemeral", entry1)
    await tx.store_in_tier("persistent", entry2)
    # Atomic commit across both tiers

Documentation

Getting Started - 5-minute quickstart guide
Core Concepts - Understanding tiers, policies, and routing
API Reference - Complete API documentation
Storage Adapters - Backend configuration guides
Advanced Features - Audit, privacy, transactions
Integrations - LangChain and LlamaIndex
Deployment - Production deployment guide
Examples - Working code examples

Development

Prerequisites

Python 3.9+
Virtual environment (recommended)

Setup

# Clone repository
git clone https://github.com/yourusername/Axon.git
cd Axon

# Create virtual environment
python -m venv venv
source venv/bin/activate  # On Windows: .\venv\Scripts\activate

# Install with dev dependencies
pip install -e ".[dev]"

Running Tests

# Run all tests
pytest

# Run with coverage
pytest --cov=axon --cov-report=html

# Run specific test markers
pytest -m unit              # Unit tests only
pytest -m integration       # Integration tests

Code Quality

# Format code
black src/ tests/

# Lint
ruff check src/ tests/

# Type check
mypy src/axon

Examples

See the examples/ directory for working examples:

[01-10] - Storage adapter examples (Qdrant, Pinecone, ChromaDB, Redis)
[11-15] - Advanced features (compaction, audit, privacy)
[16-25] - Integration examples (transactions, scoring, policy)
[26-27] - Framework integrations (LangChain chatbot, LlamaIndex RAG)

Project Status

Version: 1.0.0-beta

Test Coverage: 97.8% passing (634/646 tests)

Production Readiness: 70%

✅ Core functionality complete
✅ LangChain/LlamaIndex integrations
✅ Audit logging and privacy features
✅ Transaction support (2PC)
⚠️ Documentation in progress
⚠️ Performance optimization ongoing

Roadmap

See ROADMAP.md for detailed sprint planning.

v1.0 (Current - Beta):

✅ Core memory system
✅ Multi-tier routing
✅ Storage adapters (5/6 complete)
✅ LangChain/LlamaIndex integrations
🚧 Documentation
🚧 Performance optimization

v1.1 (Planned):

SQLite adapter
CLI tools for backup/restore
Performance benchmarks
Extended monitoring

v2.0 (Future):

GraphQL API
Real-time sync
Multi-tenancy support
Advanced security features

Contributing

We welcome contributions! See CONTRIBUTING.md for development guidelines.

Development Workflow

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Make your changes
Run tests and linters
Commit (git commit -m 'Add amazing feature')
Push to branch (git push origin feature/amazing-feature)
Open a Pull Request

License

Axon is released under the MIT License.

Support

GitHub Issues: Report bugs
Discussions: Ask questions
Documentation: Read the docs

Acknowledgments

Built with:

Pydantic - Data validation
ChromaDB - Vector storage
Qdrant - Vector database
Redis - Caching layer

Made with ❤️ by the Axon Team

Website · GitHub · Twitter

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.0b2 pre-release

Nov 11, 2025

This version

1.0.0b1 pre-release

Nov 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

axon_sdk-1.0.0b1.tar.gz (281.7 kB view details)

Uploaded Nov 11, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

axon_sdk-1.0.0b1-py3-none-any.whl (124.9 kB view details)

Uploaded Nov 11, 2025 Python 3

File details

Details for the file axon_sdk-1.0.0b1.tar.gz.

File metadata

Download URL: axon_sdk-1.0.0b1.tar.gz
Upload date: Nov 11, 2025
Size: 281.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.4

File hashes

Hashes for axon_sdk-1.0.0b1.tar.gz
Algorithm	Hash digest
SHA256	`23b02aef668299363b895d175a58397683cec0b6b4d4ac5b549b6bb1eb06d2d6`
MD5	`142599f155babe2ec97e5c1a469d656b`
BLAKE2b-256	`7b3a9be1382d32d33107194aeb254a18bff1793af7e4d030daceafe2a60f9abd`

See more details on using hashes here.

File details

Details for the file axon_sdk-1.0.0b1-py3-none-any.whl.

File metadata

Download URL: axon_sdk-1.0.0b1-py3-none-any.whl
Upload date: Nov 11, 2025
Size: 124.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.4

File hashes

Hashes for axon_sdk-1.0.0b1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a4f78e421878ffddcb74ac8904586747c033e6603d0e8d8f8ea9160274100575`
MD5	`97054b8790b1f30d3fbcef79de9d3628`
BLAKE2b-256	`faaafb0c1be46ea3fb0a5ee22c4bc15dc3e4c5d26ec4cb5ead8c748053a65c10`

See more details on using hashes here.

axon-sdk 1.0.0b1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Axon

What is Axon?

Features

Quick Start

Installation

Basic Usage

Architecture

Why Axon?

Use Cases

Chatbot with Persistent Memory

RAG with Multi-Tier Storage

Audit-Compliant Memory

Storage Adapters

Core Concepts

Memory Tiers

Policies

Routing

Advanced Features

Compaction Strategies

Privacy & PII Detection

Transactions (2PC)

Documentation

Development

Prerequisites

Setup

Running Tests

Code Quality

Examples

Project Status

Roadmap

Contributing

Development Workflow

License

Support

Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes