Lightweight memory system for AI agents with vector search and graph storage

Maintainers

genovo

💾 memg-core

The foundation of structured memory for AI agents.

memg-core is the deterministic, schema-driven memory engine at the heart of the larger MEMG system. It gives AI developers a fast, reliable, testable memory layer powered by:

YAML-based schema definition (for custom memory types)
Dual-store backend (Qdrant for vectors, Kuzu for graph queries)
Public Python API for all memory operations
Built-in support for auditability, structured workflows, and self-managed memory loops

It's designed for AI agents that build, debug, and improve themselves — and for humans who demand clean, explainable, memory-driven systems.

🧩 This is just the core. The full MEMG system builds on it to add multi-agent coordination, long-term memory policies, and deeper retrieval pipelines, all currently in progress.

Features

Vector Search: Fast semantic search with Qdrant
Graph Storage: Optional relationship analysis with Kuzu
Enhanced Search Control: Granular control over result detail levels (none, self, all)
Display Field Overrides: Custom display fields that override anchor fields for better UX
YAML-Based Datetime Formatting: Consistent datetime formatting across all operations
Force/Exclude Display: Fine-grained control over which fields are always shown or hidden
Offline-First: 100% local embeddings with FastEmbed - no API keys needed
Type-Agnostic: Configurable memory types via YAML schemas
See Also Discovery: Knowledge graph-style associative memory retrieval
Lightweight: Minimal dependencies, optimized for performance
Production Ready: Robust error handling, deterministic ID management, comprehensive testing

Quick Start

Python Package

pip install memg-core

# Set up environment variables for storage paths
export QDRANT_STORAGE_PATH="/path/to/qdrant"
export KUZU_DB_PATH="/path/to/kuzu/database"
export YAML_PATH="config/core.memo.yaml"

# Use the core library in your app
# Example usage shown below in the Usage section
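
If you prefer to set these paths from Python rather than the shell, a minimal sketch follows. The helper name `configure_memg_env` and the directory layout under `base_dir` are assumptions for illustration; only the three variable names come from the instructions above.

```python
import os
from pathlib import Path


def configure_memg_env(base_dir: str, yaml_path: str = "config/core.memo.yaml") -> dict[str, str]:
    """Set the three storage variables and create the storage directories.

    Hypothetical helper: memg-core does not ship this function.
    """
    base = Path(base_dir).expanduser()
    qdrant_dir = base / "qdrant"
    kuzu_db = base / "kuzu" / "memg"

    # Create the Qdrant directory and the parent directory of the Kuzu database.
    qdrant_dir.mkdir(parents=True, exist_ok=True)
    kuzu_db.parent.mkdir(parents=True, exist_ok=True)

    settings = {
        "QDRANT_STORAGE_PATH": str(qdrant_dir),
        "KUZU_DB_PATH": str(kuzu_db),
        "YAML_PATH": yaml_path,
    }
    os.environ.update(settings)
    return settings
```

Calling it before the first memory operation is the safe choice, since this page does not say when the library reads these variables.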

Development setup

# 1) Create virtualenv and install slim runtime deps for library usage
python3 -m venv .venv && source .venv/bin/activate
pip install -r requirements.txt

# 2) For running tests and linters locally, install dev deps
pip install -r requirements-dev.txt

# 3) Run tests
export YAML_PATH="config/core.test.yaml"
export QDRANT_STORAGE_PATH="$HOME/.local/share/qdrant"
export KUZU_DB_PATH="$HOME/.local/share/kuzu/memg"
mkdir -p "$QDRANT_STORAGE_PATH" "$HOME/.local/share/kuzu"
PYTHONPATH=$(pwd)/src pytest -q

Usage

from memg_core.api.public import add_memory, search, delete_memory

# Add a note
note_hrid = add_memory(
    memory_type="note",
    payload={
        "statement": "Set up Postgres with Docker for local development",
        "project": "backend-setup"
    },
    user_id="demo_user"
)
print(f"Created note: {note_hrid}")  # Returns HRID like "NOTE_AAA001"

# Add a document with more details
doc_hrid = add_memory(
    memory_type="document",
    payload={
        "statement": "Docker Postgres Configuration Guide",
        "details": "Complete setup guide for running PostgreSQL in Docker containers for local development",
        "project": "backend-setup"
    },
    user_id="demo_user"
)

# Search for memories
results = search(
    query="postgres docker setup",
    user_id="demo_user",
    limit=5
)
for r in results:
    print(f"[{r.memory.memory_type}] {r.memory.hrid}: {r.memory.payload['statement']} - Score: {r.score:.2f}")

# Search with memory type filtering
note_results = search(
    query="postgres",
    user_id="demo_user",
    memory_type="note",
    limit=10
)

# Enhanced search control (v0.7.4+)
# Control result detail levels: "none" (minimal), "self" (default), "all" (maximum)
minimal_results = search(
    query="postgres docker",
    user_id="demo_user",
    include_details="none",  # Shows only display fields
    limit=5
)

# Search with graph expansion and full details
expanded_results = search(
    query="postgres setup",
    user_id="demo_user",
    include_details="all",    # Shows full payload for both seeds and neighbors
    hops=2,                   # Expand 2 levels in the knowledge graph
    limit=3
)

# Delete a memory using HRID
success = delete_memory(hrid=note_hrid, user_id="demo_user")
print(f"Deletion successful: {success}")
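
The result objects in the loop above expose `memory.memory_type`, `memory.hrid`, `memory.payload`, and `score`. A small formatting helper keeps that display logic in one place. This sketch uses stand-in objects built with `SimpleNamespace` so it runs without a database; `format_result` is a hypothetical helper, not part of memg-core.

```python
from types import SimpleNamespace


def format_result(result, field: str = "statement") -> str:
    """Render one search result as a single line.

    Falls back to an empty string when the payload lacks `field`.
    """
    memory = result.memory
    text = memory.payload.get(field, "")
    return f"[{memory.memory_type}] {memory.hrid}: {text} - Score: {result.score:.2f}"


# Stand-in for a real search result (shape taken from the loop in the Usage example).
fake = SimpleNamespace(
    memory=SimpleNamespace(
        memory_type="note",
        hrid="NOTE_AAA001",
        payload={"statement": "Set up Postgres with Docker for local development"},
    ),
    score=0.8731,
)
print(format_result(fake))
# [note] NOTE_AAA001: Set up Postgres with Docker for local development - Score: 0.87
```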

YAML Schema Examples

Core ships with example schemas under config/:

core.memo.yaml: Basic memory types (memo, note, document, task)
software_dev.yaml: Enhanced schema with bug and solution types for development workflows
core.test.yaml: Test configuration for development

Configure the schema:

export YAML_PATH="config/core.memo.yaml"  # Basic schema
# or
export YAML_PATH="config/software_dev.yaml"  # Enhanced with bug/solution types
# or
export YAML_PATH="config/core.test.yaml"  # For testing

New v0.7.4 YAML Features

Display Field Overrides: Customize what field is shown in search results

- name: task
  parent: memo
  fields:
    details: { type: string }
    status: { type: enum, choices: [todo, done] }
  override:
    display_field: details  # Show 'details' instead of 'statement' in results
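
To make the override concrete, here is a sketch of how a display field could be resolved from a parsed type definition. It assumes `statement` is the anchor field (as the YAML comment implies) and that the type definition is a plain dict; `resolve_display_field` is illustrative and is not memg-core's internal function.

```python
def resolve_display_field(type_def: dict, anchor_field: str = "statement") -> str:
    """Return the override display field if one is set, else the anchor field."""
    override = type_def.get("override") or {}
    return override.get("display_field", anchor_field)


task_type = {
    "name": "task",
    "parent": "memo",
    "fields": {
        "details": {"type": "string"},
        "status": {"type": "enum", "choices": ["todo", "done"]},
    },
    "override": {"display_field": "details"},
}
note_type = {"name": "note", "parent": "memo", "fields": {}}

print(resolve_display_field(task_type))  # details
print(resolve_display_field(note_type))  # statement
```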

Force/Exclude Display: Control field visibility

- name: document
  parent: memo
  fields:
    title: { type: string }
    content: { type: string }
    internal_notes: { type: string }
  override:
    force_display: [title]        # Always show title, even in minimal mode
    exclude_display: [internal_notes]  # Never show internal notes
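
One way to read these rules: start from the fields the detail level would show, add anything in `force_display`, then drop anything in `exclude_display`. The sketch below encodes that reading. The precedence (exclude wins over force) and the function name are assumptions, since the rules above do not spell out conflict handling.

```python
from collections.abc import Iterable


def visible_fields(
    payload: dict,
    base_fields: Iterable[str],
    force_display: Iterable[str] = (),
    exclude_display: Iterable[str] = (),
) -> dict:
    """Filter a payload down to the fields that should be shown.

    base_fields: what the chosen detail level shows on its own.
    Assumption: exclude_display takes precedence over force_display.
    """
    wanted = set(base_fields) | set(force_display)
    wanted -= set(exclude_display)
    # Preserve the payload's own key order for stable output.
    return {key: value for key, value in payload.items() if key in wanted}


doc = {
    "statement": "Docker Postgres Configuration Guide",
    "title": "Postgres in Docker",
    "content": "Full guide text",
    "internal_notes": "Draft, do not share",
}

# Minimal mode: only the display field, plus the forced title.
minimal = visible_fields(doc, ["statement"], ["title"], ["internal_notes"])
# Full mode: everything except the excluded field.
full = visible_fields(doc, list(doc), ["title"], ["internal_notes"])
print(sorted(minimal))  # ['statement', 'title']
print(sorted(full))     # ['content', 'statement', 'title']
```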

YAML-Based Datetime Formatting: Consistent timestamps

defaults:
  datetime_format: "%Y-%m-%d %H:%M:%S"  # Applied to all datetime fields
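
The format string uses the same directives as Python's `strftime`. As a quick check of what the format above produces (the sample timestamp is arbitrary, and this assumes the value is passed to `strftime` unchanged):

```python
from datetime import datetime

DATETIME_FORMAT = "%Y-%m-%d %H:%M:%S"

stamp = datetime(2025, 10, 1, 14, 30, 5)
formatted = stamp.strftime(DATETIME_FORMAT)
print(formatted)  # 2025-10-01 14:30:05

# The same format string parses the text back into a datetime.
parsed = datetime.strptime(formatted, DATETIME_FORMAT)
```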

Supported Field Types: Rich type system

- name: product
  fields:
    name: { type: string, required: true }  # Text (required)
    price: { type: float }           # Decimal numbers
    quantity: { type: int }          # Integers (also: integer)
    in_stock: { type: bool }         # Booleans (also: boolean)
    created_at: { type: datetime }   # Timestamps
    tags: { type: list }             # Lists
    category: { type: enum, choices: [A, B, C] }  # Enumerations
    embedding: { type: vector }      # Embedding vectors

Currently Enforced Constraints:

✅ required: true - Field must be present
✅ type: <type> - Type must match (string, int, float, bool, datetime, list, enum, vector)
✅ enum with choices: [...] - Value must be one of the specified choices
✅ system: true - System fields (id, user_id, timestamps) handled automatically
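
The checks above can be pictured as a small validation pass over a payload. The following is a simplified stand-in, not memg-core's validator: the field-spec dict mirrors the YAML, the mapping from YAML type names to Python types is an assumption, and `datetime`, `list`, and `vector` handling is reduced to basic `isinstance` checks.

```python
from datetime import datetime

# Assumed mapping from YAML type names to Python types.
PY_TYPES = {
    "string": str, "int": int, "integer": int, "float": (int, float),
    "bool": bool, "boolean": bool, "datetime": datetime,
    "list": list, "vector": list,
}


def validate_payload(fields: dict, payload: dict) -> list[str]:
    """Return a list of human-readable errors (empty means valid)."""
    errors = []
    for name, spec in fields.items():
        if name not in payload:
            if spec.get("required"):
                errors.append(f"{name}: required field is missing")
            continue
        value = payload[name]
        kind = spec["type"]
        if kind == "enum":
            choices = spec.get("choices", [])
            if value not in choices:
                errors.append(f"{name}: {value!r} is not one of {choices}")
        elif isinstance(value, bool) and kind not in ("bool", "boolean"):
            # bool is a subclass of int in Python, so reject it explicitly.
            errors.append(f"{name}: expected {kind}, got bool")
        elif not isinstance(value, PY_TYPES[kind]):
            errors.append(f"{name}: expected {kind}, got {type(value).__name__}")
    return errors


product_fields = {
    "name": {"type": "string", "required": True},
    "price": {"type": "float"},
    "category": {"type": "enum", "choices": ["A", "B", "C"]},
}

print(validate_payload(product_fields, {"name": "Widget", "price": 9.5, "category": "A"}))  # []
print(validate_payload(product_fields, {"price": "cheap", "category": "Z"}))
```

The second call reports three problems: the missing required `name`, the non-numeric `price`, and the `category` outside its choices.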

Not Yet Implemented (tracked in [#issue]):

⚠️ default: value - Defaults are not auto-applied (fields absent if not provided)
⚠️ max_length: N - String length validation not enforced
⚠️ Other Pydantic-style constraints (min_value, pattern, etc.)
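
Until these constraints are enforced, callers can apply them before calling `add_memory`. A minimal sketch, assuming the caller keeps its own copy of the defaults and length limits (the `DEFAULTS` and `MAX_LENGTHS` tables and `prepare_payload` are hypothetical):

```python
DEFAULTS = {"status": "todo"}       # hypothetical per-field defaults
MAX_LENGTHS = {"statement": 200}    # hypothetical per-field limits


def prepare_payload(payload: dict) -> dict:
    """Fill in missing defaults and enforce string length limits client-side."""
    prepared = {**DEFAULTS, **payload}  # explicit values win over defaults
    for field, limit in MAX_LENGTHS.items():
        value = prepared.get(field)
        if isinstance(value, str) and len(value) > limit:
            raise ValueError(f"{field} exceeds {limit} characters ({len(value)})")
    return prepared


print(prepare_payload({"statement": "Write release notes"}))
# {'status': 'todo', 'statement': 'Write release notes'}
```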

Embedding Configuration

Default: FastEmbed (Offline, No API Keys)

memg-core uses FastEmbed by default for 100% offline, local embeddings:

# Optional: Configure a different FastEmbed model
export EMBEDDER_MODEL="Snowflake/snowflake-arctic-embed-xs"  # Default
# Other options: intfloat/e5-small, BAAI/bge-small-en-v1.5, etc.

Custom Embedders

You can provide your own embedder (OpenAI, Cohere, custom models, etc.) by implementing a simple protocol:

from memg_core import MemgClient, EmbedderProtocol

class OpenAIEmbedder:
    """Example: Use OpenAI embeddings instead of FastEmbed."""

    def __init__(self, api_key: str, model: str = "text-embedding-3-small"):
        import openai
        self.client = openai.OpenAI(api_key=api_key)
        self.model = model

    def get_embedding(self, text: str) -> list[float]:
        """Generate embedding for a single text."""
        response = self.client.embeddings.create(input=[text], model=self.model)
        return response.data[0].embedding

    def get_embeddings(self, texts: list[str]) -> list[list[float]]:
        """Generate embeddings for multiple texts."""
        response = self.client.embeddings.create(input=texts, model=self.model)
        return [item.embedding for item in response.data]

# Use your custom embedder
embedder = OpenAIEmbedder(api_key="your-api-key")
client = MemgClient(
    yaml_path="config/core.memo.yaml",
    db_path="/path/to/db",
    embedder=embedder
)

# Now all memory operations use your custom embedder
hrid = client.add_memory(
    memory_type="note",
    payload={"statement": "Using OpenAI embeddings"},
    user_id="user123"
)

The embedder protocol is simple - just implement two methods:

get_embedding(text: str) -> list[float] - for single text
get_embeddings(texts: list[str]) -> list[list[float]] - for batch processing

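Note: When using custom embedders, ensure the vector dimension matches your Qdrant configuration (default: 384 for FastEmbed's arctic-xs model). OpenAI's text-embedding-3-small, used in the example above, returns 1536-dimensional vectors by default, so it cannot share a collection created for 384-dimensional embeddings.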
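
For tests, or to check that a class fits the two-method shape, a structural `Protocol` and a deterministic fake embedder can be useful. The protocol below is a local stand-in written from the method list above; the actual `EmbedderProtocol` exported by memg_core may differ in detail. The hash-based vectors carry no semantic meaning and are only suitable for plumbing tests.

```python
import hashlib
from typing import Protocol, runtime_checkable


@runtime_checkable
class EmbedderLike(Protocol):
    """Local stand-in for the two-method embedder shape described above."""

    def get_embedding(self, text: str) -> list[float]: ...

    def get_embeddings(self, texts: list[str]) -> list[list[float]]: ...


class HashEmbedder:
    """Deterministic fake embedder: the same text always yields the same vector."""

    def __init__(self, dim: int = 384):  # 384 matches the default noted above
        self.dim = dim

    def get_embedding(self, text: str) -> list[float]:
        values: list[float] = []
        counter = 0
        while len(values) < self.dim:
            digest = hashlib.sha256(f"{counter}:{text}".encode("utf-8")).digest()
            # Map each byte (0..255) to a float in [-1.0, 1.0].
            values.extend(byte / 127.5 - 1.0 for byte in digest)
            counter += 1
        return values[: self.dim]

    def get_embeddings(self, texts: list[str]) -> list[list[float]]:
        return [self.get_embedding(text) for text in texts]


embedder = HashEmbedder()
# runtime_checkable only verifies that the methods exist, not their signatures.
assert isinstance(embedder, EmbedderLike)
assert len(embedder.get_embedding("hello")) == 384
```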

Configuration

Configure via environment variables:

# Required: Storage paths
export QDRANT_STORAGE_PATH="$HOME/.local/share/qdrant"
export KUZU_DB_PATH="$HOME/.local/share/kuzu/memg"
export YAML_PATH="config/core.memo.yaml"

# Optional: Embeddings
export EMBEDDER_MODEL="Snowflake/snowflake-arctic-embed-xs"  # Default

# Optional: For MCP server (if using)
export MEMORY_SYSTEM_MCP_PORT=8787
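
An application can fail fast on a missing variable by reading the settings once at startup. The sketch below is one way to do that. `MemgSettings` and `load_settings` are hypothetical names, and treating an unset `MEMORY_SYSTEM_MCP_PORT` as `None` is an assumption, since no default port is documented here.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional

REQUIRED_VARS = ("QDRANT_STORAGE_PATH", "KUZU_DB_PATH", "YAML_PATH")
DEFAULT_EMBEDDER_MODEL = "Snowflake/snowflake-arctic-embed-xs"


@dataclass(frozen=True)
class MemgSettings:
    qdrant_storage_path: str
    kuzu_db_path: str
    yaml_path: str
    embedder_model: str
    mcp_port: Optional[int]


def load_settings(env: Mapping[str, str] = os.environ) -> MemgSettings:
    """Read settings from `env`, raising if a required variable is unset or empty."""
    missing = [name for name in REQUIRED_VARS if not env.get(name)]
    if missing:
        raise RuntimeError("Missing required environment variables: " + ", ".join(missing))
    port = env.get("MEMORY_SYSTEM_MCP_PORT")
    return MemgSettings(
        qdrant_storage_path=env["QDRANT_STORAGE_PATH"],
        kuzu_db_path=env["KUZU_DB_PATH"],
        yaml_path=env["YAML_PATH"],
        embedder_model=env.get("EMBEDDER_MODEL", DEFAULT_EMBEDDER_MODEL),
        mcp_port=int(port) if port else None,
    )


# Example with an explicit mapping instead of the real environment.
example = load_settings({
    "QDRANT_STORAGE_PATH": "/tmp/qdrant",
    "KUZU_DB_PATH": "/tmp/kuzu/memg",
    "YAML_PATH": "config/core.memo.yaml",
})
print(example.embedder_model)  # Snowflake/snowflake-arctic-embed-xs
```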

Requirements

Python 3.11+
No API keys required!

Architecture

memg-core provides a deterministic, YAML-driven memory layer with dual storage:

YAML-driven schema engine - Define custom memory types with zero hardcoded fields
Qdrant/Kuzu dual-store - Vector similarity + graph relationships
Public Python API - Clean interface for all memory operations
Configurable schemas - Examples in config/ for different use cases
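
To illustrate how the two stores complement each other, here is a toy, in-memory version of the flow: rank memories by vector similarity to pick seeds, then walk graph edges for a number of hops to collect neighbors. It uses plain dicts in place of Qdrant and Kuzu, made-up 3-dimensional vectors, and hypothetical HRIDs. It is a conceptual sketch, not a description of how memg-core is implemented.

```python
import math
from collections import deque

# Stand-in "vector store": HRID -> embedding (made-up 3-d vectors).
VECTORS = {
    "NOTE_AAA001": [0.9, 0.1, 0.0],
    "DOC_AAA001": [0.8, 0.2, 0.1],
    "TASK_AAA001": [0.0, 0.1, 0.9],
}
# Stand-in "graph store": HRID -> related HRIDs.
EDGES = {
    "NOTE_AAA001": ["DOC_AAA001"],
    "DOC_AAA001": ["TASK_AAA001"],
    "TASK_AAA001": [],
}


def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def search_with_hops(query_vec: list[float], limit: int = 1, hops: int = 0):
    """Return (seeds, neighbors): top `limit` by similarity, then BFS up to `hops`."""
    ranked = sorted(VECTORS, key=lambda h: cosine(query_vec, VECTORS[h]), reverse=True)
    seeds = ranked[:limit]
    seen, neighbors = set(seeds), []
    queue = deque((seed, 0) for seed in seeds)
    while queue:
        node, depth = queue.popleft()
        if depth == hops:
            continue  # do not expand past the requested depth
        for nxt in EDGES.get(node, []):
            if nxt not in seen:
                seen.add(nxt)
                neighbors.append(nxt)
                queue.append((nxt, depth + 1))
    return seeds, neighbors


print(search_with_hops([1.0, 0.0, 0.0], limit=1, hops=2))
# (['NOTE_AAA001'], ['DOC_AAA001', 'TASK_AAA001'])
```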

In Scope

✅ YAML schema definition and validation
✅ Memory CRUD operations with dual storage
✅ Semantic search with memory type filtering
✅ Public Python API with HRID-based interface
✅ User isolation with per-user HRID scoping
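
HRIDs in the examples on this page look like `NOTE_AAA001`: an uppercase type prefix, an underscore, three letters, and three digits. Assuming that shape holds (the format is not specified here), a parser sketch such as the following can split an HRID into its parts, for example to group results by type. `parse_hrid` and the regular expression are hypothetical.

```python
import re

# Assumed shape: TYPE_AAA001 (inferred from the example HRID only).
HRID_PATTERN = re.compile(r"^(?P<type>[A-Z][A-Z0-9_]*)_(?P<alpha>[A-Z]{3})(?P<num>\d{3})$")


def parse_hrid(hrid: str) -> tuple[str, str, int]:
    """Split an HRID into (memory_type, letter_block, number)."""
    match = HRID_PATTERN.match(hrid)
    if match is None:
        raise ValueError(f"Unrecognized HRID format: {hrid!r}")
    return match["type"].lower(), match["alpha"], int(match["num"])


print(parse_hrid("NOTE_AAA001"))  # ('note', 'AAA', 1)
```

If the same HRID can exist under different users, as per-user scoping suggests, keep the `user_id` alongside the HRID whenever you store a reference outside memg-core.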

Coming in Full MEMG System

🔄 Schema contracts and multi-agent coordination
🔄 Async job processing and bulk operations
🔄 Advanced memory policies and retention
🔄 Multi-agent memory orchestration

License

MIT License - see LICENSE file for details.

Release history

Version	Release date	Notes
0.7.5	Oct 1, 2025	this version
0.7.4	Sep 24, 2025
0.7.3	Sep 20, 2025
0.7.2	Sep 16, 2025
0.7.1	Sep 9, 2025
0.7.0	Sep 3, 2025
0.6.8	Sep 3, 2025
0.6.7	Sep 2, 2025
0.6.6	Sep 1, 2025
0.6.4	Sep 1, 2025
0.6.3	Sep 1, 2025
0.6.2	Aug 30, 2025
0.6.1	Aug 28, 2025
0.6.0	Aug 28, 2025
0.5.2	Aug 19, 2025
0.5.1	Aug 16, 2025
0.5.0	Aug 16, 2025
0.3.7.dev1	Aug 14, 2025	pre-release
0.3.3	Aug 19, 2025
0.3.0	Aug 13, 2025
0.2.0	Aug 13, 2025
0.1.2.dev0	Aug 11, 2025	pre-release
0.1.dev77	Aug 11, 2025	pre-release
0.1.dev75	Aug 10, 2025	pre-release
0.1.dev69	Aug 10, 2025	pre-release
0.1.dev68	Aug 10, 2025	pre-release
0.0.1.dev80	Aug 11, 2025	pre-release

Download files

Source Distribution

memg_core-0.7.5.tar.gz (107.8 kB), uploaded Oct 1, 2025, Source

Built Distribution

memg_core-0.7.5-py3-none-any.whl (64.2 kB), uploaded Oct 1, 2025, Python 3

File details

Details for the file memg_core-0.7.5.tar.gz.

File metadata

Download URL: memg_core-0.7.5.tar.gz
Upload date: Oct 1, 2025
Size: 107.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for memg_core-0.7.5.tar.gz
Algorithm	Hash digest
SHA256	`daf23d1a0a68f8667b02981847b70eaf68f5f0a21f37cf67108bc7bab6a2e391`
MD5	`c41f684f281736af370e2cb6efa334b6`
BLAKE2b-256	`73ae9f9819058e5b76f178a74528a9a55683f5959f457bd62f8399f9e3a1f746`
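
A downloaded file can be checked against the SHA256 digest listed above with Python's standard `hashlib`. This is a generic sketch. The function names are illustrative, and the local path passed to `verify_download` is whatever location you saved the archive to.

```python
import hashlib

# SHA256 digest listed above for memg_core-0.7.5.tar.gz.
EXPECTED_SHA256 = "daf23d1a0a68f8667b02981847b70eaf68f5f0a21f37cf67108bc7bab6a2e391"


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large archives do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True when the file's SHA256 matches the expected hex digest."""
    return sha256_of_file(path) == expected.strip().lower()
```

For example, `verify_download("memg_core-0.7.5.tar.gz")` should return `True` for an intact copy of the source distribution.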

Provenance

The following attestation bundles were made for memg_core-0.7.5.tar.gz:

Publisher: workflow.yml on genovo-ai/memg-core

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memg_core-0.7.5.tar.gz
- Subject digest: daf23d1a0a68f8667b02981847b70eaf68f5f0a21f37cf67108bc7bab6a2e391
- Sigstore transparency entry: 574891578
- Sigstore integration time: Oct 1, 2025
Source repository:
- Permalink: genovo-ai/memg-core@253722d32714429939563e5292faefd9cc6df44d
- Branch / Tag: refs/tags/v0.7.5
- Owner: https://github.com/genovo-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@253722d32714429939563e5292faefd9cc6df44d
- Trigger Event: push

File details

Details for the file memg_core-0.7.5-py3-none-any.whl.

File metadata

Download URL: memg_core-0.7.5-py3-none-any.whl
Upload date: Oct 1, 2025
Size: 64.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for memg_core-0.7.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4e161f37bf7a40a29cec9b537b239f538413ec2cf3270ef0322159f3c327e8c8`
MD5	`42c62f51abd1dd3118196b53cf01a427`
BLAKE2b-256	`1c198bf9495ba77883dc21941b6111662b9d3f81ef5f746bbbf919cb51dd710c`

Provenance

The following attestation bundles were made for memg_core-0.7.5-py3-none-any.whl:

Publisher: workflow.yml on genovo-ai/memg-core

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memg_core-0.7.5-py3-none-any.whl
- Subject digest: 4e161f37bf7a40a29cec9b537b239f538413ec2cf3270ef0322159f3c327e8c8
- Sigstore transparency entry: 574891579
- Sigstore integration time: Oct 1, 2025
Source repository:
- Permalink: genovo-ai/memg-core@253722d32714429939563e5292faefd9cc6df44d
- Branch / Tag: refs/tags/v0.7.5
- Owner: https://github.com/genovo-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@253722d32714429939563e5292faefd9cc6df44d
- Trigger Event: push
