DeepContext

Hierarchical memory system for AI agents -- async, graph-aware, with hybrid retrieval and memory lifecycle management.

DeepContext gives AI agents persistent, structured memory. Conversations are automatically broken into semantic facts, stored with embeddings, linked in a knowledge graph, and retrieved using a hybrid pipeline that fuses vector similarity, keyword search, and graph traversal.

Features

Hierarchical Memory -- Working, short-term, and long-term tiers inspired by human cognition
Memory Types -- Semantic (facts), episodic (events), and procedural (how-to) memories
Knowledge Graph -- Entities and relationships extracted from conversations, stored in PostgreSQL (no Neo4j required)
Hybrid Retrieval -- Reciprocal Rank Fusion (RRF) across vector, keyword, and graph search
Memory Lifecycle -- Ebbinghaus forgetting curve decay, consolidation of short-term into long-term, automatic cleanup
Fully Async -- Built on SQLAlchemy async, asyncpg, and AsyncOpenAI
Multi-user -- All memories scoped by user_id
REST API -- FastAPI server with 7 endpoints
Pluggable LLM -- OpenAI and OpenRouter support out of the box
SQLite Fallback -- Works without PostgreSQL for development

Quick Start

Installation

git clone https://github.com/umairinayat/DeepContext.git
cd DeepContext
python -m venv .venv

# Windows
.venv\Scripts\activate
# Linux/macOS
source .venv/bin/activate

pip install -e ".[all]"

Configuration

Create a .env file in the project root:

DEEPCONTEXT_OPENAI_API_KEY=sk-your-key-here

# PostgreSQL (recommended for production)
# DEEPCONTEXT_DATABASE_URL=postgresql+asyncpg://user:pass@localhost:5432/deepcontext

# SQLite fallback (default, no setup needed)
# Automatically uses ~/.deepcontext/memory.db

All settings can be passed as environment variables with the DEEPCONTEXT_ prefix, or directly in code.
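As a rough illustration of the prefix convention, the sketch below collects `DEEPCONTEXT_`-prefixed variables from an environment mapping and lowercases the remainder to get setting names. It is not DeepContext's actual loader (the Architecture section attributes that to pydantic-settings); the helper name `read_prefixed_settings` is hypothetical and values are left as raw strings.

```python
import os
from typing import Dict, Mapping, Optional

PREFIX = "DEEPCONTEXT_"


def read_prefixed_settings(env: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Return {setting_name: raw_value} for every DEEPCONTEXT_-prefixed variable.

    Hypothetical helper for illustration only. Values stay as strings; a real
    settings loader would also validate them and convert types.
    """
    env = os.environ if env is None else env
    return {
        key[len(PREFIX):].lower(): value
        for key, value in env.items()
        if key.startswith(PREFIX)
    }


example_env = {
    "DEEPCONTEXT_OPENAI_API_KEY": "sk-your-key-here",
    "DEEPCONTEXT_LLM_MODEL": "gpt-4o-mini",
    "PATH": "/usr/bin",  # ignored: no DEEPCONTEXT_ prefix
}
settings = read_prefixed_settings(example_env)
print(settings)
```

The resulting names (`openai_api_key`, `llm_model`) match the setting names listed in the Configuration Reference.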

Basic Usage

import asyncio
from deepcontext import DeepContext

async def main():
    ctx = DeepContext(openai_api_key="sk-...")
    await ctx.init()

    # Store memories from a conversation
    response = await ctx.add(
        messages=[
            {"role": "user", "content": "I'm a Python developer working at Acme Corp"},
            {"role": "assistant", "content": "Nice to meet you!"},
        ],
        user_id="user_1",
        conversation_id="conv_1",
    )
    print(f"Stored {response.memories_added} memories, found {response.entities_found} entities")

    # Search memories
    results = await ctx.search("What does the user do for work?", user_id="user_1")
    for r in results.results:
        print(f"  [{r.tier.value}] {r.text} (score: {r.score:.3f})")

    # Explore the knowledge graph
    neighbors = await ctx.get_entity_graph("user_1", "Acme Corp", depth=2)
    for n in neighbors:
        print(f"  {n['entity']} --{n['relation']}--> (depth {n['depth']})")

    # Run lifecycle maintenance (decay + consolidation + cleanup)
    stats = await ctx.run_lifecycle("user_1")
    print(f"Decayed: {stats['memories_decayed']}, Consolidated: {stats['memories_consolidated']}")

    await ctx.close()

asyncio.run(main())

Interactive Demo

python examples/chat_demo.py

This launches an interactive chatbot that remembers conversations across turns. It accepts these special commands:

Command	Description
`memories`	Search stored memories
`graph <entity>`	Show knowledge graph for an entity
`lifecycle`	Run decay / consolidation / cleanup
`exit`	Quit

Architecture

deepcontext/
  __init__.py               DeepContext (alias), exports
  core/
    settings.py             Configuration (pydantic-settings, env vars, .env)
    types.py                Enums, Pydantic models (facts, entities, responses)
    clients.py              OpenAI/OpenRouter async client wrapper
  memory/
    engine.py               MemoryEngine -- main orchestrator
  extraction/
    extractor.py            LLM-based fact and entity extraction
    prompts.py              Prompt templates for extraction/classification
  retrieval/
    hybrid.py               HybridRetriever (vector + keyword + graph + RRF)
  graph/
    knowledge_graph.py      Entity/relationship CRUD, BFS traversal
  lifecycle/
    manager.py              Ebbinghaus decay, consolidation, cleanup
  vectorstore/
    base.py                 Abstract vector store interface
    pgvector_store.py       pgvector implementation (SQLite cosine fallback)
  db/
    database.py             Async SQLAlchemy engine manager
    models/
      base.py               Base ORM model
      memory.py             Memory table (embeddings, tiers, types, decay)
      graph.py              Entity, Relationship, ConversationSummary tables
  api/
    server.py               FastAPI REST API

How It Works

Memory Pipeline

When you call ctx.add(messages, user_id):

Conversation ──> LLM Extraction ──> Classification ──> Embedding ──> Storage
                      |                   |                             |
                      v                   v                             v
                 Facts, Entities    ADD / UPDATE /              Knowledge Graph
                 Relationships      REPLACE / NOOP                  Update

Extraction -- The LLM analyzes the conversation and extracts semantic facts, episodic events, entities, and relationships
Classification -- Each extracted fact is compared against existing memories. The LLM decides whether to ADD, UPDATE, REPLACE, or skip (NOOP)
Embedding -- New/updated facts are embedded using the configured embedding model
Storage -- Memories are stored with their embeddings, tier (short-term), type, importance, and confidence scores
Graph Update -- Extracted entities and relationships are upserted into the knowledge graph
Auto-consolidation -- If auto_consolidate is enabled and the short-term memory count reaches consolidation_threshold (default 20), consolidation is triggered
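The classification step above can be pictured with a toy in-memory store. This is a minimal sketch, not the engine's code: the `Store`, `Memory`, and `apply_decision` names are hypothetical, and the difference between UPDATE and REPLACE is an assumption (UPDATE rewrites the existing memory in place, REPLACE deactivates it and stores a new one), since the description does not define them.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import Dict, Optional


class Action(str, Enum):
    ADD = "ADD"
    UPDATE = "UPDATE"
    REPLACE = "REPLACE"
    NOOP = "NOOP"


@dataclass
class Memory:
    text: str
    active: bool = True


@dataclass
class Store:
    """Toy in-memory store keyed by integer id (hypothetical, not the real schema)."""
    memories: Dict[int, Memory] = field(default_factory=dict)
    next_id: int = 1

    def insert(self, text: str) -> int:
        memory_id = self.next_id
        self.memories[memory_id] = Memory(text)
        self.next_id += 1
        return memory_id


def apply_decision(store: Store, action: Action, fact: str,
                   target_id: Optional[int] = None) -> Optional[int]:
    """Apply one classification decision and return the affected memory id.

    Assumed semantics: UPDATE rewrites the target's text in place, while
    REPLACE deactivates the target and stores the fact as a new memory.
    """
    if action is Action.NOOP:
        return None
    if action is Action.ADD:
        return store.insert(fact)
    if target_id is None or target_id not in store.memories:
        raise ValueError(f"{action.value} needs an existing target memory")
    if action is Action.UPDATE:
        store.memories[target_id].text = fact
        return target_id
    # REPLACE: keep the old entry for history, but mark it inactive.
    store.memories[target_id].active = False
    return store.insert(fact)


store = Store()
first = apply_decision(store, Action.ADD, "User works at Acme Corp")
apply_decision(store, Action.UPDATE, "User is a Python developer at Acme Corp", first)
second = apply_decision(store, Action.REPLACE, "User works at Globex", first)
apply_decision(store, Action.NOOP, "User works at Globex")
```

In the real pipeline the LLM chooses the action and the target; here both are passed in by hand to keep the example deterministic.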

Hybrid Retrieval

When you call ctx.search(query, user_id):

Query ──> Embed ──> Vector Search (0.6) ──┐
  |                                        ├──> RRF Fusion ──> Scoring ──> Results
  ├─────> Keyword Search (0.25) ──────────┤
  |                                        |
  └─────> Graph Expansion (0.15) ─────────┘

Vector search -- Query is embedded and compared via cosine similarity (pgvector or Python fallback)
Keyword search -- PostgreSQL tsvector full-text search (ILIKE fallback on SQLite)
Graph expansion -- Entities mentioned in the query are found, their graph neighbors are traversed, and memories referencing those entities are boosted
RRF fusion -- Results from all three strategies are combined using Reciprocal Rank Fusion (weights: vector 0.6, keyword 0.25, graph 0.15)
Scoring -- Final score applies importance, recency decay, confidence, and access-count boost
Access tracking -- Each returned memory's access count and timestamp are updated
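The fusion step can be sketched as a weighted form of Reciprocal Rank Fusion, where each strategy contributes `weight / (k + rank)` for every memory it returns. The weights come from the description above; the smoothing constant `k = 60` and the exact way DeepContext applies the weights are assumptions, and `weighted_rrf` is a hypothetical function name.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Weights are taken from the description above; k = 60 is an assumed constant.
WEIGHTS = {"vector": 0.6, "keyword": 0.25, "graph": 0.15}
RRF_K = 60


def weighted_rrf(rankings: Dict[str, List[str]],
                 weights: Dict[str, float] = WEIGHTS,
                 k: int = RRF_K) -> List[Tuple[str, float]]:
    """Fuse ranked id lists: score(d) = sum_i w_i / (k + rank_i(d)), ranks start at 1."""
    scores: Dict[str, float] = defaultdict(float)
    for source, ranked_ids in rankings.items():
        weight = weights.get(source, 0.0)
        for rank, memory_id in enumerate(ranked_ids, start=1):
            scores[memory_id] += weight / (k + rank)
    # Highest fused score first; ties are broken by id for a deterministic order.
    return sorted(scores.items(), key=lambda item: (-item[1], item[0]))


fused = weighted_rrf({
    "vector": ["m1", "m2", "m3"],
    "keyword": ["m2", "m1"],
    "graph": ["m3"],
})
for memory_id, score in fused:
    print(f"{memory_id}: {score:.5f}")
```

A memory that appears in several lists accumulates score from each one, which is why `m1` and `m2` outrank `m3` here even though `m3` is the only graph hit.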

Memory Lifecycle

When you call ctx.run_lifecycle(user_id):

Short-term Memories ──> Decay (Ebbinghaus) ──> Consolidation (LLM merge) ──> Long-term
                              |                        |
                              v                        v
                        Deactivate               Group by entity
                       (importance < 0.05)     overlap (Union-Find)

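Decay -- Ebbinghaus forgetting curve: R = e^(-0.693 * days / effective_half_life). Since 0.693 is approximately ln 2, retention halves after each effective half-life. Frequently accessed memories decay more slowly. Memories whose importance falls below 0.05 are deactivated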
Consolidation -- When short-term memory count >= threshold (default 20), memories are grouped by entity overlap (Union-Find), each group is merged by the LLM into a long-term fact, and source memories are deactivated
Cleanup -- Remaining low-importance non-long-term memories are soft-deleted
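The decay formula and the Union-Find grouping above can be sketched in a few lines. Several details are assumptions made for illustration: how the access count stretches the half-life (`1 + log1p(access_count)` here), that importance is scaled by R, and the function names `retention` and `group_by_entity_overlap`. The LLM merge step is not shown.

```python
import math
from typing import Dict, List, Set

DEACTIVATION_THRESHOLD = 0.05  # from the text above


def retention(days: float, half_life_days: float = 7.0, access_count: int = 0) -> float:
    """R = e^(-0.693 * days / effective_half_life).

    The access-count scaling is an assumption; the description only says
    that frequently accessed memories decay more slowly.
    """
    effective_half_life = half_life_days * (1.0 + math.log1p(access_count))
    return math.exp(-0.693 * days / effective_half_life)


def group_by_entity_overlap(memory_entities: Dict[str, Set[str]]) -> List[Set[str]]:
    """Union-Find: memories that share any entity end up in the same group."""
    parent = {memory_id: memory_id for memory_id in memory_entities}

    def find(x: str) -> str:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving keeps trees shallow
            x = parent[x]
        return x

    first_seen: Dict[str, str] = {}  # entity -> first memory that mentioned it
    for memory_id, entities in memory_entities.items():
        for entity in entities:
            if entity in first_seen:
                parent[find(memory_id)] = find(first_seen[entity])
            else:
                first_seen[entity] = memory_id

    groups: Dict[str, Set[str]] = {}
    for memory_id in memory_entities:
        groups.setdefault(find(memory_id), set()).add(memory_id)
    return list(groups.values())


# One half-life with no accesses leaves roughly half the retention.
r = retention(days=7.0)
decayed = 0.08 * r  # assumed: importance is multiplied by R
should_deactivate = decayed < DEACTIVATION_THRESHOLD

groups = group_by_entity_overlap({
    "m1": {"Acme Corp"},
    "m2": {"Acme Corp", "Python"},
    "m3": {"Python"},
    "m4": {"Berlin"},
})
```

Note that `m1` and `m3` share no entity directly but still land in one group through `m2`, which is the transitive behavior Union-Find provides.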

REST API

Start the server:

uvicorn deepcontext.api.server:app --reload

Endpoints

Method	Path	Description
`GET`	`/health`	Health check
`POST`	`/memory/add`	Extract and store memories from messages
`POST`	`/memory/search`	Hybrid search across memories
`PUT`	`/memory/update`	Update a memory's text and re-embed
`DELETE`	`/memory/delete`	Soft-delete a memory
`POST`	`/graph/neighbors`	Get knowledge graph neighborhood
`POST`	`/lifecycle/run`	Run decay + consolidation + cleanup

Example: Add Memory

curl -X POST http://localhost:8000/memory/add \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {"role": "user", "content": "I prefer Python over JavaScript"},
      {"role": "assistant", "content": "Got it, Python is your go-to!"}
    ],
    "user_id": "user_1",
    "conversation_id": "conv_1"
  }'

Example: Search

curl -X POST http://localhost:8000/memory/search \
  -H "Content-Type: application/json" \
  -d '{
    "query": "programming languages",
    "user_id": "user_1",
    "limit": 5
  }'
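The same search call can be made from Python with only the standard library. This sketch builds the request shown in the curl example and leaves the actual send commented out, since it needs a running server; `build_search_request` is a hypothetical helper, and the shape of the JSON response is not documented here, so it is simply printed.

```python
import json
import urllib.request

BASE_URL = "http://localhost:8000"  # same address as in the curl examples


def build_search_request(query: str, user_id: str, limit: int = 5,
                         base_url: str = BASE_URL) -> urllib.request.Request:
    """Build (but do not send) a POST /memory/search request with a JSON body."""
    body = json.dumps({"query": query, "user_id": user_id, "limit": limit}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/memory/search",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_search_request("programming languages", "user_1")

# Sending requires a running server, so it is left commented out here:
# with urllib.request.urlopen(request, timeout=10) as response:
#     print(json.load(response))
```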

Configuration Reference

All settings use the DEEPCONTEXT_ env prefix. Set them in .env or pass directly to DeepContext().

Setting	Default	Description
`database_url`	SQLite fallback	PostgreSQL connection URL (`postgresql+asyncpg://...`)
`llm_provider`	`openai`	`openai` or `openrouter`
`openai_api_key`	--	Required for OpenAI provider
`openrouter_api_key`	--	Required for OpenRouter provider
`llm_model`	`gpt-4o-mini`	Model for fact extraction and classification
`embedding_model`	`text-embedding-3-small`	Embedding model
`embedding_dimensions`	`1536`	Embedding vector dimensions
`consolidation_threshold`	`20`	Short-term memories before auto-consolidation
`decay_half_life_days`	`7.0`	Ebbinghaus half-life for episodic decay
`connection_similarity_threshold`	`0.6`	Min cosine similarity for memory connections
`max_connections_per_memory`	`5`	Max connections per memory node
`debug`	`false`	Enable debug logging
`auto_consolidate`	`true`	Auto-consolidate on add

Development

Running Tests

pip install -e ".[dev]"
pytest tests/ -v

The suite contains 110 tests covering all subsystems. Tests use in-memory SQLite and mock LLM clients -- no API keys or database needed.
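To show the general idea of testing without API keys or a database server, here is a minimal standard-library sketch that pairs a mocked async client with an in-memory SQLite database. It does not reproduce the project's test suite: `extract_facts`, the `complete` method, and the `memories` table are invented for this example, and the real tests use pytest with SQLAlchemy rather than `sqlite3` directly.

```python
import asyncio
import sqlite3
from typing import List
from unittest.mock import AsyncMock


async def extract_facts(llm_client, text: str) -> List[str]:
    """Hypothetical stand-in for an extraction step that awaits an LLM client."""
    return await llm_client.complete(text)


def run_example() -> List[str]:
    # Mock LLM client: `complete` is an invented method name for this sketch.
    llm_client = AsyncMock()
    llm_client.complete.return_value = ["User prefers Python"]

    # In-memory SQLite database, discarded when the connection closes.
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE memories (id INTEGER PRIMARY KEY, text TEXT)")

    facts = asyncio.run(extract_facts(llm_client, "I prefer Python over JavaScript"))
    conn.executemany("INSERT INTO memories (text) VALUES (?)", [(fact,) for fact in facts])
    rows = [row[0] for row in conn.execute("SELECT text FROM memories")]
    conn.close()

    # The mock records calls, so the test can check the client was awaited once.
    llm_client.complete.assert_awaited_once()
    return rows


stored = run_example()
```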

PostgreSQL + pgvector Setup (Production)

DeepContext works with SQLite for development, but PostgreSQL with pgvector is recommended for production (native vector indexing, full-text search with tsvector, JSONB).

Windows

Install PostgreSQL 15+ from https://www.postgresql.org/download/windows/ (the installer includes pgAdmin)

Install pgvector -- after PostgreSQL is installed:

# Option A: Using pgvector installer (recommended)
# Download the latest release from https://github.com/pgvector/pgvector/releases
# Run the .exe installer matching your PostgreSQL version

# Option B: Build from source (requires Visual Studio Build Tools)
git clone https://github.com/pgvector/pgvector.git
cd pgvector
# Run from an x64 Native Tools Command Prompt for Visual Studio
# Point PGROOT at your PostgreSQL installation, e.g.:
set "PGROOT=C:\Program Files\PostgreSQL\16"
nmake /F Makefile.win
nmake /F Makefile.win install

Create the database and enable pgvector:

psql -U postgres -c "CREATE DATABASE deepcontext;"
psql -U postgres -d deepcontext -c "CREATE EXTENSION IF NOT EXISTS vector;"

Update .env:

DEEPCONTEXT_DATABASE_URL=postgresql+asyncpg://postgres:yourpassword@localhost:5432/deepcontext

Run Alembic migrations:
```
alembic upgrade head
```

Linux / macOS

# Ubuntu/Debian
sudo apt install postgresql-16 postgresql-16-pgvector

# macOS (Homebrew)
brew install postgresql@16 pgvector

# Create database
createdb deepcontext
psql deepcontext -c "CREATE EXTENSION IF NOT EXISTS vector;"

# Set connection URL
export DEEPCONTEXT_DATABASE_URL="postgresql+asyncpg://user:pass@localhost:5432/deepcontext"

# Run migrations
alembic upgrade head

Database Migrations (Alembic)

The project uses Alembic for schema versioning. Migration files are in alembic/versions/.

# Apply all migrations (requires PostgreSQL connection)
alembic upgrade head

# Check current migration status
alembic current

# Generate a new migration after model changes
alembic revision --autogenerate -m "description of changes"

# Rollback one migration
alembic downgrade -1

# Generate SQL without applying (offline mode)
alembic upgrade head --sql

Note: Alembic migrations target PostgreSQL. SQLite mode uses Base.metadata.create_all() at runtime and does not need Alembic.

Tech Stack

Component	Technology
Language	Python 3.11+ with full type annotations
ORM	SQLAlchemy 2.0 (async)
Vector Store	pgvector (PostgreSQL)
LLM	OpenAI API / OpenRouter
API	FastAPI + Uvicorn
Validation	Pydantic v2 + pydantic-settings
Math	NumPy
Migrations	Alembic
Testing	pytest + pytest-asyncio + httpx

License

MIT -- see LICENSE for details.

Built by @umairinayat

Version 0.1.0, released Feb 22, 2026
