A self-consolidating memory layer for AI agents with schema-first design, intelligent merging, and hybrid search capabilities

These details have not been verified by PyPI

Project description

🧠 OntoMem: The Self-Consolidating Memory

中文版本

OntoMem is built on the concept of Ontology Memory—structured, coherent knowledge representation for AI systems.

Give your AI agent a "coherent" memory, not just "fragmented" retrieval.

OntoMem Framework Diagram

Traditional RAG (Retrieval-Augmented Generation) systems retrieve text fragments. OntoMem maintains structured entities using Pydantic schemas and intelligent merging algorithms.

It excels at Time-Series Consolidation: effortlessly merging streaming observations (like logs or chat turns) into coherent "Daily Snapshots" or "Session Summaries" simply by defining a composite key (e.g., user_id + date).

It doesn't just store data—it continuously "digests" and "organizes" it.

📰 News

Details

[2026-01-21] v0.1.5 Released:
- 🎯 Production Safety: Added max_workers parameter to control LLM batch processing concurrency
- ⚡ Rate Limit Protection: Prevents hitting API rate limits from providers like OpenAI, preventing account throttling
- 🔧 Fine-Grained Control: Customize concurrency per merge strategy (default: 5 workers)
- Learn more →
[2026-01-19] v0.1.4 Released:
- API Improvement: Renamed merge_strategy parameter to strategy_or_merger for better clarity and flexibility
- Enhancement: Added **kwargs support to directly pass merger-specific parameters (like rule and dynamic_rule for CUSTOM_RULE) through OMem without pre-configuration
- Benefit: Cleaner API and more intuitive usage patterns for advanced merging scenarios
- Learn more →
[2026-01-19] v0.1.3 Released:
- New Feature: Added MergeStrategy.LLM.CUSTOM_RULE strategy for user-defined merge logic. Inject static rules and dynamic context (via functions) directly into the LLM merger!
- Breaking Change: Renamed legacy strategies for clarity:
  - KEEP_OLD → KEEP_EXISTING
  - KEEP_NEW → KEEP_INCOMING
  - FIELD_MERGE → MERGE_FIELD
- Learn more about Custom Rules

✨ Why OntoMem?

🧩 Schema-First & Type-Safe

Built on Pydantic. All memories are strongly-typed objects. Say goodbye to {"unknown": "dict"} hell and embrace IDE autocomplete and type checking.

⏱️ Temporal Consolidation (Time-Slicing)

OntoMem isn't just about ID deduplication. By using Composite Keys (e.g., lambda x: f"{x.user}_{x.date}"), you can automatically aggregate a day's worth of fragmented events into a Single Daily Record.

Input: 1,000 fragmented logs/observations throughout the day.
Output: 1 structured, LLM-synthesized "Daily Summary" object.

🔄 Auto-Evolution

When you insert new data about an existing entity, OntoMem doesn't create duplicates. It intelligently merges them into a Golden Record using configurable strategies (Conflict Resolution, List Appending, or LLM-powered Synthesis).

🔍 Hybrid Search

Key-Value Lookup: O(1) exact access (e.g., "Get me Alice's summary for 2024-01-01").
Vector Search: Semantic similarity search across your entire timeline (e.g., "When was Alice frustrated?").

💾 Stateful & Persistent

Save your complete memory state (structured data + vector indices) to disk and restore it in seconds on next startup.

🧠 OntoMem vs. Other Memory Systems

Most memory libraries store Raw Text or Chat History. OntoMem stores Consolidated Knowledge.

Feature	OntoMem 🧠	Mem0 / Zep	LangChain Memory	Vector DBs (Pinecone/Chroma)
Core Storage Unit	✅ Structured Objects (Pydantic)	Text Chunks + Metadata	Raw Chat Logs	Embedding Vectors
Data "Digestion"	✅ Auto-Consolidation & merging	Simple Extraction	❌ Append-only	❌ Append-only
Time Awareness	✅ Time-Slicing (Daily/Session Aggregation)	❌ Timestamp metadata only	❌ Sequential only	❌ Metadata filtering only
Conflict Resolution	✅ LLM Logic (Synthesize/Prioritize)	❌ Last-write-wins	❌ None	❌ None
Type Safety	✅ Strict Schema	⚠️ Loose JSON	❌ String only	❌ None
Ideal For	Long-term Agent Profiles, Knowledge Graphs	Simple RAG, Search	Chatbots, Context Window	Semantic Search

💡 The "Consolidation" Advantage

Traditional RAG: Stores 50 chunks of "Alice likes apples", "Alice likes bananas". Search returns 50 fragments.
OntoMem: Merges them into 1 object: User(name="Alice", likes=["apples", "bananas"]). Search returns one complete truth.

🚀 Quick Start

Build a structured memory store in 30 seconds.

1. Define & Initialize

from pydantic import BaseModel
from ontomem import OMem
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

# 1. Define your memory schema
class UserProfile(BaseModel):
    name: str
    skills: list[str]
    last_seen: str

# 2. Initialize with LLM merging and concurrency control (v0.1.5+)
memory = OMem(
    memory_schema=UserProfile,
    key_extractor=lambda x: x.name,
    llm_client=ChatOpenAI(model="gpt-4o"),
    embedder=OpenAIEmbeddings(),
    max_workers=3  # 🆕 Control LLM batch concurrency to prevent rate limits
)

2. Add & Merge (Auto-Consolidation)

OntoMem automatically merges data for the same ID.

# First observation
memory.add(UserProfile(name="Alice", skills=["Python"], last_seen="10:00"))

# Later observation (New skill added, time updated)
memory.add(UserProfile(name="Alice", skills=["Docker"], last_seen="11:00"))

# Retrieve the consolidated "Golden Record"
alice = memory.get("Alice")
print(alice.skills)     # ['Python', 'Docker'] (Lists merged!)
print(alice.last_seen)  # "11:00" (Updated!)

3. Search & Retrieve

# Exact retrieval
profile = memory.get("Alice")

# All keys in memory
all_keys = memory.keys

# Clear or remove
memory.remove("Alice")

💡 Advanced Examples

Example 1: The "Self-Improving" Debugger (Logic Evolution)

An AI agent that doesn't just store errors—it synthesizes debugging wisdom over time using LLM.BALANCED strategy.

from ontomem import OMem, MergeStrategy
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

class BugFixExperience(BaseModel):
    error_signature: str
    solutions: list[str]
    prevention_tips: str

memory = OMem(
    memory_schema=BugFixExperience,
    key_extractor=lambda x: x.error_signature,
    llm_client=ChatOpenAI(model="gpt-4o"),
    embedder=OpenAIEmbeddings(),
    strategy_or_merger=MergeStrategy.LLM.BALANCED
)

# Day 1: Pip install
memory.add(BugFixExperience(
    error_signature="ModuleNotFoundError: pandas",
    solutions=["pip install pandas"],
    prevention_tips="Check requirements.txt"
))

# Day 2: Docker container (Different solution!)
memory.add(BugFixExperience(
    error_signature="ModuleNotFoundError: pandas",
    solutions=["apt-get install python3-pandas"],  # Added to list!
    prevention_tips="Use system packages in containers"  # LLM merges both tips
))

# Result: Single record with merged solutions + synthesized advice
guidance = memory.get("ModuleNotFoundError: pandas")
print(guidance.prevention_tips)
# >>> "In standard environments, check requirements.txt. 
#      In containerized environments, prefer system packages..."

Example 2: Temporal Memory & Daily Consolidation (Time-Series)

Turn a stream of fragmented events into a single "Daily Summary" record using Composite Keys.

from ontomem import OMem, MergeStrategy

class DailyTrace(BaseModel):
    user: str
    date: str
    actions: list[str]  # Accumulates all day
    summary: str        # LLM synthesizes entire day

memory = OMem(
    memory_schema=DailyTrace,
    key_extractor=lambda x: f"{x.user}_{x.date}",  # <-- THE MAGIC KEY
    llm_client=ChatOpenAI(model="gpt-4o"),
    embedder=OpenAIEmbeddings(),
    strategy_or_merger=MergeStrategy.LLM.BALANCED
)

# 9:00 AM event
memory.add(DailyTrace(user="Alice", date="2024-01-01", actions=["Login"]))

# 5:00 PM event (Same day → Merges into SAME record)
memory.add(DailyTrace(user="Alice", date="2024-01-01", actions=["Logout"]))

# Next day (New date → NEW record)
memory.add(DailyTrace(user="Alice", date="2024-01-02", actions=["Login"]))

# Results:
# - alice_2024-01-01: actions=["Login", "Logout"], summary="Active trading day..."
# - alice_2024-01-02: actions=["Login"], summary="Brief session..."

# Semantic search across time
results = memory.search("When was Alice frustrated?", top_k=1)

For a complete working example, see examples/06_temporal_memory_consolidation.py

🔍 Semantic Search

Build an index and search by natural language:

# Build vector index
memory.build_index()

# Semantic search
results = memory.search("Find researchers working on transformer models and attention mechanisms")

for researcher in results:
    print(f"- {researcher.name}: {researcher.research_interests}")

🛠️ Merge Strategies

Choose how to handle conflicts:

Strategy	Behavior	Use Case
`MERGE_FIELD`	Non-null overwrites, lists append	Simple attribute collection
`KEEP_INCOMING`	Latest data wins	Status updates (current role, last seen)
`KEEP_EXISTING`	First observation stays	Historical records (first publication year)
`LLM.BALANCED`	LLM-driven semantic merging	Complex synthesis, contradiction resolution
`LLM.PREFER_INCOMING`	LLM merges semantically, prefers new data on conflict	New information should take priority when contradictions arise
`LLM.PREFER_EXISTING`	LLM merges semantically, prefers existing data on conflict	Existing data should take priority when contradictions arise
`LLM.CUSTOM_RULE`	User-defined merge logic with dynamic context	Domain-specific rules, context-aware merging

# Example: LLM intelligently merges conflicting information
memory = OMem(
    ...,
    strategy_or_merger=MergeStrategy.LLM.BALANCED  # or LLM.PREFER_INCOMING, LLM.PREFER_EXISTING, LLM.CUSTOM_RULE
)

🔧 Custom Merge Rules (Advanced)

Inject your own merge logic with static rules and dynamic context:

from datetime import datetime

# Define a dynamic rule function (evaluated at merge time)
def get_time_context():
    hour = datetime.now().hour
    if hour >= 9 and hour <= 17:
        return "Business hours: Prefer stable, verified data"
    else:
        return "After-hours: Prioritize recent updates"

# Use CUSTOM_RULE with static rule + dynamic context
memory = OMem(
    memory_schema=BugFixExperience,
    key_extractor=lambda x: x.error_signature,
    llm_client=ChatOpenAI(model="gpt-4o"),
    embedder=OpenAIEmbeddings(),
    strategy_or_merger=MergeStrategy.LLM.CUSTOM_RULE,
    rule="""
    Merge debugging experiences intelligently:
    - Combine all unique solutions into a list
    - Synthesize prevention tips, incorporating domain context
    - Keep the most recent solution as primary recommendation
    """,
    dynamic_rule=get_time_context  # Evaluated at merge time!
)

# Example use: Errors evolve over time
memory.add(BugFixExperience(
    error_signature="ModuleNotFoundError: pandas",
    solutions=["pip install pandas"],
    prevention_tips="Check requirements.txt first"
))

memory.add(BugFixExperience(
    error_signature="ModuleNotFoundError: pandas",
    solutions=["Use conda-forge mirror"],
    prevention_tips="In restricted networks, try conda"
))

# Result: LLM merges both, considering time context from dynamic_rule
result = memory.get("ModuleNotFoundError: pandas")
print(result.prevention_tips)
# >>> "Check requirements.txt first. For restricted networks, use conda-forge mirror..."

When to use CUSTOM_RULE:

Complex domain-specific merging logic
Time/context-aware decisions (e.g., "prefer old data at 2 AM, new data during day")
Environment-specific rules (e.g., "production mode: conservative, staging: aggressive")
Multi-factor decision making that LLM strategies don't cover

⚡ Controlling LLM Concurrency (v0.1.5+)

When using LLM-based merge strategies (LLM.BALANCED, LLM.PREFER_INCOMING, LLM.PREFER_EXISTING, LLM.CUSTOM_RULE), OntoMem makes batch API calls to your LLM provider. By default, these can run concurrently, which may hit rate limits or API throttling.

The `max_workers` Parameter

Control the maximum number of concurrent LLM requests using the max_workers parameter:

memory = OMem(
    memory_schema=UserProfile,
    key_extractor=lambda x: x.uid,
    llm_client=ChatOpenAI(model="gpt-4o"),
    embedder=OpenAIEmbeddings(),
    strategy_or_merger=MergeStrategy.LLM.BALANCED,
    max_workers=3  # Limit to 3 concurrent requests
)

Configuration Guidelines

Scenario	Recommended `max_workers`	Rationale
Development/Testing	`2-3`	Conservative, prevents API errors
Production (Small)	`3-5`	Default: 5. Balanced speed/safety
Production (Large)	`5-10+`	Depends on your LLM provider tier
Rate-Limited Accounts	`1-2`	Safest: processes serially or semi-serially

Tuning Tips

Start Conservative: Begin with max_workers=2 to ensure stability
Monitor Performance: Check merge times and error rates
Gradually Increase: If stable, try max_workers=5, then higher
Check Provider Limits: Verify your OpenAI tier's rate limits (requests per minute)
Observe Errors: If you see RateLimitError, reduce max_workers

Example: Production Setup

import os

# Read from environment
max_workers = int(os.getenv("ONTOMEM_MAX_WORKERS", 3))

memory = OMem(
    memory_schema=UserProfile,
    key_extractor=lambda x: x.uid,
    llm_client=ChatOpenAI(model="gpt-4o"),
    embedder=OpenAIEmbeddings(),
    strategy_or_merger=MergeStrategy.LLM.BALANCED,
    max_workers=max_workers  # Easy to adjust without code changes
)

Note: The max_workers parameter only affects LLM-based merge strategies. Classic strategies (MERGE_FIELD, KEEP_INCOMING, KEEP_EXISTING) do not use LLM and are not affected.

💾 Save & Load

Snapshot your entire memory state:

# Save (structured data → memory.json, vectors → FAISS indices)
memory.dump("./researcher_knowledge")

# Later, restore instantly
new_memory = OMem(...)
new_memory.load("./researcher_knowledge")

🔧 Installation & Setup

Basic Installation

pip install ontomem

Or with uv:

uv add ontomem

📦 For Developers

To set up the development environment with all testing and documentation tools:

uv sync --group dev

Core Requirements:

Python 3.11+
LangChain (for LLM integration)
Pydantic (for schema definition)
FAISS (for vector search)

👨‍💻 Author

Yifan Feng - evanfeng97@gmail.com

🤝 Contributing

We're building the next generation of AI memory standards. PRs and issues welcome!

📝 License

Licensed under the Apache License, Version 2.0 - See LICENSE file for details.

You are free to use, modify, and distribute this software under the terms of the Apache License 2.0.

Built with ❤️ for AI developers who believe memory is more than just search.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.2.3

Mar 18, 2026

0.2.2

Mar 18, 2026

0.2.1

Mar 18, 2026

0.2.0

Jan 28, 2026

0.1.9

Jan 23, 2026

This version

0.1.8

Jan 22, 2026

0.1.7

Jan 22, 2026

0.1.6

Jan 21, 2026

0.1.5

Jan 21, 2026

0.1.4

Jan 19, 2026

0.1.3

Jan 19, 2026

0.1.2

Jan 18, 2026

0.1.1

Jan 17, 2026

0.1.0

Jan 17, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ontomem-0.1.8.tar.gz (2.5 MB view details)

Uploaded Jan 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ontomem-0.1.8-py3-none-any.whl (36.9 kB view details)

Uploaded Jan 22, 2026 Python 3

File details

Details for the file ontomem-0.1.8.tar.gz.

File metadata

Download URL: ontomem-0.1.8.tar.gz
Upload date: Jan 22, 2026
Size: 2.5 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ontomem-0.1.8.tar.gz
Algorithm	Hash digest
SHA256	`562885f7c10991604bbacbe957a77af824e73a1ed7ee610e06c48d52f0ce0d49`
MD5	`43546005def5511b2fb480696b603633`
BLAKE2b-256	`ef56f62f9cdcce9a9132f9bd0a56ba1684f78269926dcf1bcee19894e8c3d9be`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ontomem-0.1.8.tar.gz:

Publisher: publish-pypi.yml on yifanfeng97/ontomem

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ontomem-0.1.8.tar.gz
- Subject digest: 562885f7c10991604bbacbe957a77af824e73a1ed7ee610e06c48d52f0ce0d49
- Sigstore transparency entry: 844597754
- Sigstore integration time: Jan 22, 2026
Source repository:
- Permalink: yifanfeng97/ontomem@6851eb05bf437ce41c031905ae155a7a4a45f828
- Branch / Tag: refs/tags/v0.1.8
- Owner: https://github.com/yifanfeng97
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@6851eb05bf437ce41c031905ae155a7a4a45f828
- Trigger Event: push

File details

Details for the file ontomem-0.1.8-py3-none-any.whl.

File metadata

Download URL: ontomem-0.1.8-py3-none-any.whl
Upload date: Jan 22, 2026
Size: 36.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ontomem-0.1.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`71ae701d70870ec2dc4b52ed8dd77404f3df1f8ddab37ae3a53addcf55cf8d9a`
MD5	`2814c9eaecf49ef7a66d4cee451f0110`
BLAKE2b-256	`cf27d2b1cabb64891a59de4917a903e3b02164de08b91853c1926c15e933e2ba`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ontomem-0.1.8-py3-none-any.whl:

Publisher: publish-pypi.yml on yifanfeng97/ontomem

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ontomem-0.1.8-py3-none-any.whl
- Subject digest: 71ae701d70870ec2dc4b52ed8dd77404f3df1f8ddab37ae3a53addcf55cf8d9a
- Sigstore transparency entry: 844597760
- Sigstore integration time: Jan 22, 2026
Source repository:
- Permalink: yifanfeng97/ontomem@6851eb05bf437ce41c031905ae155a7a4a45f828
- Branch / Tag: refs/tags/v0.1.8
- Owner: https://github.com/yifanfeng97
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@6851eb05bf437ce41c031905ae155a7a4a45f828
- Trigger Event: push

ontomem 0.1.8

Navigation

Verified details

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

🧠 OntoMem: The Self-Consolidating Memory

📰 News

✨ Why OntoMem?

🧩 Schema-First & Type-Safe

⏱️ Temporal Consolidation (Time-Slicing)

🔄 Auto-Evolution

🔍 Hybrid Search

💾 Stateful & Persistent

🧠 OntoMem vs. Other Memory Systems

💡 The "Consolidation" Advantage

🚀 Quick Start

1. Define & Initialize

2. Add & Merge (Auto-Consolidation)

3. Search & Retrieve

💡 Advanced Examples

🔍 Semantic Search

🛠️ Merge Strategies

The max_workers Parameter

Configuration Guidelines

Tuning Tips

💾 Save & Load

🔧 Installation & Setup

Basic Installation

👨‍💻 Author

🤝 Contributing

📝 License

Project details

Verified details

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

The `max_workers` Parameter