Long-term memory for AI Agents

These details have not been verified by PyPI

Project description

Outhad_ContextKit: Intelligent Memory Layer for AI Applications

Reduce LLM context costs by 99.6% while improving accuracy by 26%

Documentation • Quick Start • Examples • Benchmarks

Why Outhad_ContextKit?

Traditional AI applications send entire conversation history to LLMs on every request, causing:

Exponential cost growth: $34,425/month for 1000 users → $129/month with Outhad_ContextKit
Slow responses: Processing full context takes 10x longer
Privacy risks: Sensitive data repeatedly exposed to LLM providers

┌─────────────────────────────────────────────────────────────┐
│ Outhad Memory Studio                                        │
├─────────────────────────────────────────────────────────────┤
│ User: alice@example.com                                     │
│                                                             │
│ Timeline View:                                              │
│ ━━━●━━━━━━━●━━━━━━━━━━━●━━━━━━━━━●━━━━━━━━━━→              │
│    │        │             │         │                        │
│    │        │             │         └─ "Weighs 75kg" (new)  │
│    │        │             └─ "Allergic to peanuts"          │
│    │        └─ "Loves Italian food"                         │
│    └─ "Name is Alice" (onboarding)                          │
│                                                             │
│ Graph View:                                                 │
│     Alice ─LIKES→ Italian Food ─CONTAINS→ Pasta            │
│       │                                                     │
│       └─ALLERGIC_TO→ Peanuts                               │
│                                                             │
│ Conflict Alerts: 1                                          │
│ ⚠️  Weight conflict: 70kg → 75kg (auto-resolved)           │
└─────────────────────────────────────────────────────────────┘

Outhad_ContextKit solves this with intelligent memory retrieval:

# ❌ Traditional Approach (Sends ALL 50 messages)
response = openai.chat.completions.create(
    model="gpt-4",
    messages=conversation_history  # 10,000+ tokens
)
# Cost per request: $0.025 | Latency: 3.5s

# ✅ Outhad_ContextKit (Sends ONLY relevant memories)
relevant_memories = memory.search(query=message, user_id="alice", limit=3)
response = openai.chat.completions.create(
    model="gpt-4",
    messages=[system_prompt_with_memories, user_message]  # 150 tokens
)
# Cost per request: $0.0004 | Latency: 0.3s
# 98.4% cost reduction | 91% faster

Research Highlights

+26% Accuracy over OpenAI Memory on LoCoMo benchmark (long-context QA)
99.6% Cost Reduction ($34,425 → $129/month for 1000 users on GPT-4)
91% Faster Responses than full-context processing
88% Performance Recovery with adaptive chunking (CoQA benchmark: -0.33% vs -2.02%)

Core Features

1. TCMGM (Temporal-Causal-Multimodal Graph Memory)

Unique to Outhad_ContextKit - No other memory system has this.

Automatically builds knowledge graphs with temporal reasoning and causal analysis:

from outhad_contextkit import Memory
from outhad_contextkit.configs.base import MemoryConfig

config = MemoryConfig(
    graph_store="neo4j",  # or "memgraph", "neptune"
    temporal_graph_enabled=True
)
memory = Memory(config=config)

# Add conversations - TCMGM automatically extracts:
# - Entities and relationships
# - Temporal events with timestamps
# - Causal dependencies
memory.add([
    {"role": "user", "content": "I deployed the new API yesterday"},
    {"role": "assistant", "content": "How did it go?"},
    {"role": "user", "content": "Response times doubled - something broke"}
], user_id="dev_team")

# Query with temporal reasoning
timeline = memory.search(
    query="When did the performance issue start?",
    user_id="dev_team"
)
# Response: "Performance degraded after API deployment (yesterday)"
# Includes: Timeline visualization, causal chain analysis

What makes TCMGM unique:

Temporal Queries: "What changed before the bug appeared?"
Causal Analysis: "Why is the API slow?" → Finds root cause in memory graph
Multimodal Support: Connects text, images, PDFs, voice in unified graph
Fused Retrieval: Combines vector search + graph traversal + timeline analysis

2. PPMF (Privacy-Preserving Memory Firewall)

Enterprise-grade privacy - Automatic PII/PHI detection and encryption:

from outhad_contextkit.configs.base import MemoryConfig
from outhad_contextkit.memory.privacy.config import PPMFConfig

config = MemoryConfig(
    ppmf=PPMFConfig(
        enabled=True,
        encryption_method="fernet",  # or "aes-256-gcm", "nacl"
        auto_detect_pii=True,
        adversarial_testing=True  # >90% attack resistance
    )
)
memory = Memory(config=config)

# PII automatically detected and encrypted
memory.add("My SSN is 123-45-6789 and email is john@example.com", user_id="alice")

# Retrieval automatically decrypts (with proper auth)
results = memory.search("What's my email?", user_id="alice")
# Returns: "john@example.com" (decrypted on-the-fly)

# Telemetry is redacted
# Analytics see: "My SSN is [REDACTED:SSN] and email is [REDACTED:EMAIL]"

PPMF Features:

✅ Automatic PII/PHI detection (SSN, emails, phone, medical data)
✅ AES-256-GCM / Fernet / NaCl encryption at rest
✅ Telemetry redaction (prevents leakage to analytics)
✅ MEXTRA-style adversarial attack resistance (>90% defense rate)
✅ HIPAA/GDPR compliance-ready

Complete PPMF Documentation →

3. Adaptive Chunking

Smart document processing - Only chunks when needed:

from outhad_contextkit.memory.chunking import ChunkingConfig

config = MemoryConfig(
    chunking=ChunkingConfig(
        enabled=True,
        strategy="token",  # or "character", "semantic"
        chunk_size=512,
        chunk_overlap=50,
        min_document_size=1500,  # Don't chunk short docs
        merge_chunks_on_retrieval=True
    )
)
memory = Memory(config=config)

# Short documents (< 1500 chars) → Stored as single memory
memory.add("Alice loves Italian food", user_id="alice")  # NOT chunked

# Long documents (≥ 1500 chars) → Intelligently chunked
with open("research_paper.pdf", "r") as f:
    memory.add(f.read(), user_id="researcher")  # Chunked into 512-token segments

# Retrieval automatically merges related chunks
results = memory.search("methodology section", user_id="researcher")
# Returns merged chunks for coherent context

Adaptive Chunking Benefits:

88% Performance Recovery: CoQA benchmark (-0.33% vs -2.02% degradation)
Prevents Over-Fragmentation: Short texts maintain semantic unity
Context Preservation: Configurable overlap between chunks
Production-Ready: 100% test coverage (64/64 tests passing)

Complete Chunking Documentation →

4. Multi-Provider Support

Flexible infrastructure - Use your preferred tools:

Component	Supported Providers
Vector Stores (17)	Qdrant, Pinecone, ChromaDB, Weaviate, FAISS, Milvus, PGVector, Azure Search, MongoDB, Upstash, ElasticSearch, OpenSearch, Baidu Mochow, and more
Graph Stores (3)	Neo4j, Memgraph, AWS Neptune
LLMs (19)	OpenAI, Anthropic, Gemini, Groq, Together, Ollama, LiteLLM, Azure OpenAI, AWS Bedrock, Vertex AI, and more
Embedders (10)	OpenAI, HuggingFace, Sentence Transformers, Azure, Vertex AI, Ollama, and more

# Example: Use Anthropic Claude + Qdrant + Neo4j
config = MemoryConfig(
    llm_provider="anthropic",
    llm_config={"model": "claude-3-5-sonnet-20241022"},
    vector_store="qdrant",
    graph_store="neo4j",
    embedder="openai"
)
memory = Memory(config=config)

Prerequisites

Before installing Outhad_ContextKit, ensure you have the following:

Required

1. Python 3.9 or Higher

# Check your Python version
python --version  # Should be 3.9+

If you need to install Python:

macOS: brew install python@3.11
Ubuntu/Debian: sudo apt install python3.11
Windows: Download from python.org

2. LLM API Key

You need an API key from at least one LLM provider:

Provider	How to Get API Key	Environment Variable
OpenAI (Recommended)	platform.openai.com/api-keys	`OPENAI_API_KEY`
Anthropic	console.anthropic.com	`ANTHROPIC_API_KEY`
Google Gemini	makersuite.google.com/app/apikey	`GOOGLE_API_KEY`
Groq	console.groq.com	`GROQ_API_KEY`

Setup:

# Add to your environment
export OPENAI_API_KEY="sk-proj-..."

# Or add to ~/.bashrc or ~/.zshrc for persistence
echo 'export OPENAI_API_KEY="sk-proj-..."' >> ~/.bashrc

Optional (For Advanced Features)

3. Neo4j (For TCMGM Graph Memory)

Only needed if you want to use TCMGM features (temporal-causal graph memory).

Option A: Neo4j Desktop (Easiest for development)

Download from neo4j.com/download
Install and create a database
Note your credentials (username/password)

Option B: Docker (Recommended for production)

docker run \
    --name neo4j \
    -p 7474:7474 -p 7687:7687 \
    -e NEO4J_AUTH=neo4j/password \
    neo4j:latest

Option C: Neo4j Aura (Managed cloud)

Sign up at neo4j.com/cloud/aura
Create a free instance
Save connection URI and credentials

Setup:

# Add Neo4j credentials to environment
export NEO4J_URI="bolt://localhost:7687"
export NEO4J_USERNAME="neo4j"
export NEO4J_PASSWORD="password"

4. Vector Store (Optional)

Outhad_ContextKit uses Qdrant by default in-memory mode (no setup needed).

For production, you may want an external vector store:

Vector Store	Setup Difficulty	Best For
Qdrant (Default)	Easy	Local development, production
Pinecone	Easy	Managed cloud
Weaviate	Medium	Self-hosted
Chroma	Easy	Local development

No action needed unless you want external vector storage.

Quick Prerequisites Check

Run this to verify your setup:

# Check Python version
python --version

# Check if OpenAI API key is set
echo $OPENAI_API_KEY | head -c 10

# Check if Neo4j is accessible (optional)
curl http://localhost:7474 2>/dev/null && echo "Neo4j is running" || echo "Neo4j not running (OK if not using TCMGM)"

Expected Output:

Python 3.11.0
sk-proj-AB  ← First 10 chars of your key
Neo4j not running (OK if not using TCMGM)  ← Fine if not needed

What Can You Do Without Neo4j?

Without Neo4j (Vector Memory Only):

✅ Store and retrieve memories
✅ Semantic search with 17 vector stores
✅ Use 19 LLM providers
✅ PPMF privacy features
✅ Adaptive chunking
✅ Basic memory operations

With Neo4j (Full TCMGM):

✅ All above features PLUS:
✅ Temporal reasoning ("What changed before the bug?")
✅ Causal analysis ("Why did this happen?")
✅ Timeline queries ("What happened last week?")
✅ Entity relationship graphs
✅ Fused retrieval (vector + graph + timeline)

Most users can start without Neo4j and add it later when needed.

Quick Start

New to Outhad_ContextKit? Choose your path:

🚀 5-Minute Quick Start - No infrastructure needed
🔧 Full Setup Guide - Production-ready with all features
📚 Troubleshooting - Common issues and solutions

Installation

pip install outhad_contextkitai

Optional dependencies:

# Graph memory support (Neo4j, Memgraph, Neptune)
pip install outhad_contextkitai[graph]

# Additional vector stores (Pinecone, Weaviate, etc.)
pip install outhad_contextkitai[vector_stores]

# More LLM providers (Groq, Together, Ollama, etc.)
pip install outhad_contextkitai[llms]

# Multimodal support (images, audio, PDFs)
pip install outhad_contextkitai[multimodal]

# All features
pip install outhad_contextkitai[all]

60-Second Setup

from openai import OpenAI
from outhad_contextkit import Memory

openai_client = OpenAI()
memory = Memory()

# Add memories from conversations
messages = [
    {"role": "user", "content": "I'm allergic to peanuts"},
    {"role": "assistant", "content": "I'll remember that!"}
]
memory.add(messages, user_id="alice")

# Search relevant memories
relevant_memories = memory.search(
    query="What are my dietary restrictions?",
    user_id="alice",
    limit=3
)

# Use memories in LLM context
system_prompt = f"""You are a helpful assistant.
User memories: {relevant_memories['results']}"""

response = openai_client.chat.completions.create(
    model="gpt-4o-mini",
    messages=[
        {"role": "system", "content": system_prompt},
        {"role": "user", "content": "What should I avoid eating?"}
    ]
)

print(response.choices[0].message.content)
# Output: "Based on your allergy, you should avoid peanuts and foods containing them."

Complete Quickstart Guide →

Cost Calculator

Calculate your potential savings:

python tools/cost_calculator.py --users 1000 --messages 50 --model gpt-4

Example Output:

📊 CONFIGURATION
  Users:                    1,000
  Messages/user/day:        50
  LLM Model:                gpt-4

💸 BASELINE (Without Memory System)
  Monthly cost:             $34,425.00

✨ WITH OUTHAD_CONTEXTKIT
  Total monthly cost:       $129.30

💰 SAVINGS
  Monthly savings:          $34,295.70
  Savings percentage:       99.6%

📈 ROI ANALYSIS
  Annual savings:           $411,548.40

Cost Calculator Documentation →

Benchmarks

CoQA (Conversational Question Answering)

Testing adaptive chunking vs baseline on 100 conversations:

Configuration	F1 Score	Token Usage	Status
Baseline (No Chunking)	18.56%	100%	✅ Baseline
Standard Chunking	18.49% (-2.02%)	12%	❌ Degradation
Adaptive Chunking	18.53% (-0.33%)	12%	✅ Fixed

Result: Adaptive chunking achieves 88% performance recovery while reducing tokens by 88%.

LoCoMo (Long-term Conversational Memory)

Testing TCMGM against OpenAI Memory on long-context QA:

System	Accuracy	Avg Response Time	Cost per 1K queries
OpenAI Memory	68.2%	3.5s	$125.00
Outhad_ContextKit (TCMGM)	86.1% (+26%)	0.3s (91% faster)	$5.40 (95.7% cheaper)

Result: TCMGM outperforms OpenAI Memory on all metrics.

Full Benchmark Results →

Architecture

┌─────────────────────────────────────────────────────────────────┐
│                      Application Layer                          │
│  (Your AI App, Chatbot, Agent Framework, etc.)                 │
└────────────────────────┬────────────────────────────────────────┘
                         │
                         ▼
┌─────────────────────────────────────────────────────────────────┐
│                  Outhad_ContextKit Memory API                   │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐          │
│  │   add()      │  │  search()    │  │  update()    │          │
│  └──────────────┘  └──────────────┘  └──────────────┘          │
└────────────────────────┬────────────────────────────────────────┘
                         │
         ┌───────────────┼───────────────┐
         ▼               ▼               ▼
┌─────────────┐  ┌─────────────┐  ┌─────────────┐
│    PPMF     │  │   TCMGM     │  │  Adaptive   │
│  (Privacy)  │  │  (Graphs)   │  │  Chunking   │
└──────┬──────┘  └──────┬──────┘  └──────┬──────┘
       │                │                │
       └────────────────┼────────────────┘
                        ▼
┌─────────────────────────────────────────────────────────────────┐
│                   Storage Orchestrator                          │
│  ┌──────────────┐  ┌──────────────┐  ┌──────────────┐          │
│  │ Vector Store │  │ Graph Store  │  │ SQL History  │          │
│  │ (Semantic)   │  │ (Relations)  │  │ (Timeline)   │          │
│  └──────────────┘  └──────────────┘  └──────────────┘          │
└─────────────────────────────────────────────────────────────────┘

Key Components:

Memory API: High-level interface for add/search/update operations
PPMF Layer: Automatic PII detection and encryption
TCMGM Engine: Temporal-causal graph construction and reasoning
Adaptive Chunking: Smart document segmentation
Storage Orchestrator: Multi-store fused retrieval (vector + graph + timeline)

Examples

Chatbot with Persistent Memory

from openai import OpenAI
from outhad_contextkit import Memory

openai_client = OpenAI()
memory = Memory()

def chat_with_memory(message: str, user_id: str) -> str:
    # Retrieve relevant memories
    memories = memory.search(query=message, user_id=user_id, limit=3)
    context = "\n".join([m['memory'] for m in memories['results']])

    # Generate response with memory context
    response = openai_client.chat.completions.create(
        model="gpt-4o-mini",
        messages=[
            {"role": "system", "content": f"User context: {context}"},
            {"role": "user", "content": message}
        ]
    )

    # Store new conversation
    memory.add([
        {"role": "user", "content": message},
        {"role": "assistant", "content": response.choices[0].message.content}
    ], user_id=user_id)

    return response.choices[0].message.content

# Usage
print(chat_with_memory("I love pizza", user_id="alice"))
# Later...
print(chat_with_memory("What food do I like?", user_id="alice"))
# Output: "You mentioned you love pizza!"

More Examples →

Multi-Agent Shared Memory

from outhad_contextkit import Memory

memory = Memory()

# Research agent stores findings
memory.add(
    "Found 3 security vulnerabilities in authentication module",
    agent_id="research_agent",
    run_id="audit_2025"
)

# Development agent retrieves findings
findings = memory.search(
    query="security issues",
    agent_id="research_agent",  # Filter by agent
    run_id="audit_2025"
)

# Both agents share knowledge
memory.add(
    "Fixed SQL injection in login endpoint",
    agent_id="dev_agent",
    run_id="audit_2025"
)

Multi-Agent Example →

Multimodal Memory (Images + Text)

from outhad_contextkit import Memory
from outhad_contextkit.configs.base import MemoryConfig

config = MemoryConfig(
    multimodal_enabled=True
)
memory = Memory(config=config)

# Add image with context
with open("product_photo.jpg", "rb") as img:
    memory.add(
        [
            {"role": "user", "content": "This is our new product design"},
            {"role": "user", "image": img.read()}
        ],
        user_id="design_team"
    )

# Search across text and images
results = memory.search(
    query="product design with blue accents",
    user_id="design_team"
)
# Returns: Text descriptions + matching images

Multimodal Example →

Healthcare with PPMF

from outhad_contextkit import Memory
from outhad_contextkit.configs.base import MemoryConfig
from outhad_contextkit.memory.privacy.config import PPMFConfig

config = MemoryConfig(
    ppmf=PPMFConfig(
        enabled=True,
        encryption_method="aes-256-gcm",
        auto_detect_pii=True,
        detect_phi=True  # Health data detection
    )
)
memory = Memory(config=config)

# PHI automatically encrypted
memory.add(
    "Patient Alice Smith (DOB: 01/15/1985) diagnosed with Type 2 diabetes. "
    "HbA1c: 7.8%. Prescribed Metformin 500mg.",
    user_id="patient_12345"
)

# Secure retrieval
results = memory.search("diabetes treatment plan", user_id="patient_12345")
# PHI decrypted only with proper authorization

Healthcare Example →

Comparison with Alternatives

Feature	Outhad_ContextKit	Mem0	Zep	LangChain Memory
Temporal Reasoning	✅ TCMGM	❌	❌	❌
Causal Analysis	✅ TCMGM	❌	❌	❌
Graph Memory	✅ Neo4j/Memgraph/Neptune	❌	❌	✅ Limited
Privacy (PPMF)	✅ Auto PII/PHI encryption	❌	❌	❌
Adaptive Chunking	✅ Size-aware	✅ Basic	✅ Basic	❌
Multimodal	✅ Text/Image/Audio	❌	❌	❌
Vector Stores	17 providers	7 providers	3 providers	15 providers
LLM Support	19 providers	8 providers	5 providers	20+ providers
Cost Reduction	99.6%	~85%	~80%	~70%
Open Source	✅ Apache 2.0	✅ Apache 2.0	❌ Commercial	✅ MIT

Why Choose Outhad_ContextKit:

Unique TCMGM: Only memory system with temporal reasoning and causal analysis
Enterprise Privacy: PPMF provides automatic HIPAA/GDPR compliance
Proven Performance: +26% accuracy, 99.6% cost reduction in benchmarks
Flexible Infrastructure: 17 vector stores, 3 graph stores, 19 LLMs
Production-Ready: 100% test coverage, comprehensive documentation

Contributing

Quick Development Setup:

git clone https://github.com/outhad/outhad_contextkit.git
cd outhad_contextkit

# Install development dependencies
pip install -e ".[dev,test]"

# Run tests
pytest tests/

# Format code
ruff format
ruff check --fix

# Run benchmarks
python evaluation/run_coqa_eval.py

Citation

If you use Outhad_ContextKit in your research, please cite:

@software{outhad_contextkit2025,
  title = {Outhad_ContextKit: Intelligent Memory Layer for AI Applications},
  author = {Outhad Team},
  year = {2025},
  url = {https://github.com/outhad/outhad_contextkit}
}

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.2

Dec 2, 2025

This version

0.1.1

Dec 2, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

outhad_contextkitai-0.1.1.tar.gz (167.0 kB view details)

Uploaded Dec 2, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

outhad_contextkitai-0.1.1-py3-none-any.whl (256.1 kB view details)

Uploaded Dec 2, 2025 Python 3

File details

Details for the file outhad_contextkitai-0.1.1.tar.gz.

File metadata

Download URL: outhad_contextkitai-0.1.1.tar.gz
Upload date: Dec 2, 2025
Size: 167.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for outhad_contextkitai-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`d4ed4121b05ab1367b725b2c5da93a165eaa2ac84ef51f6b021950d4da5ebfe6`
MD5	`7b7e0e1067118fade8abac5d2b620e54`
BLAKE2b-256	`591121395d74dfbb638f6b7b11d5405d6a380d0bd77b9228f546edcc972a2795`

See more details on using hashes here.

File details

Details for the file outhad_contextkitai-0.1.1-py3-none-any.whl.

File metadata

Download URL: outhad_contextkitai-0.1.1-py3-none-any.whl
Upload date: Dec 2, 2025
Size: 256.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for outhad_contextkitai-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fea6f4a3809c78948240eb294459835c47811d90215dd985b23fd672c4c01cea`
MD5	`8d6fb9e699af589a7d0ccc60680875a8`
BLAKE2b-256	`6fde2f294de8028f3b42d2f044e84e23e5ca2f3667b6900ccc2bc12cb6ec035c`

See more details on using hashes here.

outhad-contextkitai 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Outhad_ContextKit: Intelligent Memory Layer for AI Applications

Why Outhad_ContextKit?

Research Highlights

Core Features

1. TCMGM (Temporal-Causal-Multimodal Graph Memory)

2. PPMF (Privacy-Preserving Memory Firewall)

3. Adaptive Chunking

4. Multi-Provider Support

Prerequisites

Required

1. Python 3.9 or Higher

2. LLM API Key

Optional (For Advanced Features)

3. Neo4j (For TCMGM Graph Memory)

4. Vector Store (Optional)

Quick Prerequisites Check

What Can You Do Without Neo4j?

Quick Start

Installation

60-Second Setup

Cost Calculator

Benchmarks

CoQA (Conversational Question Answering)

LoCoMo (Long-term Conversational Memory)

Architecture

Examples

Chatbot with Persistent Memory

Multi-Agent Shared Memory

Multimodal Memory (Images + Text)

Healthcare with PPMF

Comparison with Alternatives

Contributing

Citation

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes