Local-first semantic memory server with vector search

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

veedubin

These details have not been verified by PyPI

Project description

Memini-ai

"I remember" in Latin (pronounced meh-mee-nee)

Local-first semantic memory server with vector search, MCP-compatible.

Overview

Memini-ai is a Python rewrite of Super-Memory-TS, designed as a local-first semantic memory server with vector search capabilities. It provides persistent memory storage and retrieval using PostgreSQL with pgvector as the backend.

Key Features

MCP-Compatible: Works with any MCP-compatible client (OpenCode, Claude Desktop, etc.)
Vector Search: BGE-Large embeddings (1024-dim) with MiniLM fallback (384-dim)
Hybrid Search: Combines vector similarity with BM25 text search
Project Isolation: Memories are isolated by project ID
File Indexing: Index and search project files with semantic chunking
Knowledge Graph: Live D3.js visualization of entities and relationships
CPU-First: Designed to run on CPU, optional GPU acceleration

Installation

Prerequisites

Python 3.11+
PostgreSQL 15+ with pgvector extension

Quick Start

# Install memini-ai
pip install memini-ai-dev

# Run the server
memini-ai --stdio

Development Installation

# Clone the repository
git clone https://github.com/Veedubin/memini-ai-dev.git
cd memini-ai-dev

# Create virtual environment
python -m venv .venv
source .venv/bin/activate  # Linux/Mac
# or: .venv\Scripts\activate  # Windows

# Install with dev dependencies
pip install -e ".[dev]"

Configuration

Memini-ai can be configured via environment variables or a JSON config file.

Environment Variables

Core Settings

Variable	Default	Description
`MEMINI_DB_URL`	(empty)	PostgreSQL connection URL (e.g., `postgresql://postgres:password@localhost:5434/postgres`)
`MEMINI_PROJECT_ID`	auto-generated	Project identifier for isolation
`MEMINI_EMBEDDING_DIM`	`1024`	Embedding dimension (1024 for BGE-Large or 384 for MiniLM)
`MEMINI_CHUNK_SIZE`	`512`	Chunk size for file indexing
`MEMINI_CHUNK_OVERLAP`	`50`	Overlap between chunks
`MEMINI_BATCH_SIZE`	`32`	Batch size for embedding generation
`MEMINI_WORKERS`	(cpu_count)	Number of worker threads
`MEMINI_LOG_LEVEL`	`info`	Logging level (debug, info, warning, error)
`MEMINI_DEVICE`	`auto`	Device for embeddings (`auto`, `cpu`, `cuda`)
`MEMINI_CONFIG_PATH`	None	Path to JSON config file

PostgreSQL Settings

Variable	Default	Description
`MEMINI_TABLE_NAME`	`memories`	Qdrant table name (legacy)
`MEMINI_DB_POOL_SIZE`	`10`	Connection pool size
`MEMINI_DB_MIN_SIZE`	`2`	Min pool connections
`MEMINI_DB_MAX_SIZE`	`20`	Max pool connections

⭐ Optional Features (all disabled by default)

All optional features are disabled by default. Set to true or appropriate value to enable.

Trust Engine

Variable	Default	Description
`MEMINI_TRUST_ENGINE`	`false`	Enable trust engine
`MEMINI_TRUST_THRESHOLD_ARCHIVE`	`0.2`	Archive memories below this trust
`MEMINI_TRUST_THRESHOLD_PROMOTE`	`0.8`	Promote to L1 above this trust
`MEMINI_TRUST_DELTA_USE`	`+0.05`	Trust delta for `agent_used` signal
`MEMINI_TRUST_DELTA_IGNORED`	`-0.02`	Trust delta for `agent_ignored` signal
`MEMINI_TRUST_DELTA_CORRECT`	`-0.15`	Trust delta for `user_corrected` signal
`MEMINI_TRUST_DELTA_CONFIRM`	`+0.10`	Trust delta for `user_confirmed` signal

Memory Graph

Variable	Default	Description
`MEMINI_MEMORY_GRAPH`	`false`	Enable memory graph
`MEMINI_GRAPH_ENTITY_EXTRACTION`	`true`	Auto extract entities from memories
`MEMINI_GRAPH_RELATIONSHIP_SUGGESTIONS`	`true`	Suggest relationships between memories

Knowledge Graph

Variable	Default	Description
`MEMINI_KG_ENABLED`	`false`	Enable knowledge graph (required for live visualization)
`MEMINI_KG_ENTITY_EXTRACTION`	`true`	Auto extract entities from memories
`MEMINI_KG_INFERENCE_DEPTH`	`3`	Max inference path depth
`MEMINI_KG_MAX_RESULTS`	`100`	Max knowledge graph query results

Tiered Loading

Variable	Default	Description
`MEMINI_TIERED_LOADING`	`false`	Enable tiered loading (L0/L1/L2 summaries)
`MEMINI_TIER0_MAX_TOKENS`	`100`	L0 summary max tokens (~100 tokens)
`MEMINI_TIER1_MAX_TOKENS`	`2000`	L1 summary max tokens (~2K tokens)
`MEMINI_TIER0_CACHE_TTL`	`3600`	L0 cache TTL in seconds
`MEMINI_TIER1_CACHE_TTL`	`7200`	L1 cache TTL in seconds

Auto-Extract

Variable	Default	Description
`MEMINI_AUTO_EXTRACT`	`false`	Enable automatic memory extraction
`MEMINI_AUTO_EXTRACT_TURNS`	`5`	Conversation turns between auto-extractions
`MEMINI_LLM_URL`	`http://localhost:11434/api/generate`	LLM endpoint for extraction
`MEMINI_LLM_MODEL`	`llama3.2`	LLM model name

Pre-Compression

Variable	Default	Description
`MEMINI_PRECOMPRESS`	`false`	Enable pre-compression extraction
`MEMINI_PRECOMPRESS_THRESHOLD`	`0.8`	Threshold for pre-compression

User Modeling

Variable	Default	Description
`MEMINI_USER_MODELING`	`false`	Enable user modeling
`MEMINI_USER_MODEL_MIN_SESSIONS`	`50`	Min sessions to build user profile

Memory Decay

Variable	Default	Description
`MEMINI_DECAY_ENABLED`	`false`	Enable memory decay
`MEMINI_DECAY_HALF_LIFE_DAYS`	`90`	Memory half-life in days
`MEMINI_CONSOLIDATION_INTERVAL_HOURS`	`168`	Consolidation interval (hours)
`MEMINI_CONSOLIDATION_SIMILARITY_THRESHOLD`	`0.92`	Merge similar memories above this

Multi-Peer

Variable	Default	Description
`MEMINI_MULTI_PEER_ENABLED`	`false`	Enable multi-peer sharing
`MEMINI_MULTI_PEER_GUEST_SHARING`	`true`	Allow guest memory sharing

Dialectic (Contradiction Detection)

Variable	Default	Description
`MEMINI_DIALECTIC_ENABLED`	`false`	Enable dialectic reasoning
`MEMINI_DIALECTIC_LLM_PROVIDER`	`ollama`	LLM provider for dialectic
`MEMINI_DIALECTIC_LLM_MODEL`	`llama3`	LLM model for dialectic
`MEMINI_DIALECTIC_AUTO_THRESHOLD`	`0.5`	Auto-trigger threshold

Quick Start with All Features Enabled

export MEMINI_DB_URL="postgresql://postgres:password@localhost:5434/postgres"
export MEMINI_EMBEDDING_DIM=384
export MEMINI_TRUST_ENGINE=true
export MEMINI_MEMORY_GRAPH=true
export MEMINI_KG_ENABLED=true
export MEMINI_TIERED_LOADING=true
export MEMINI_AUTO_EXTRACT=true
export MEMINI_PRECOMPRESS=true
export MEMINI_USER_MODELING=true
export MEMINI_DECAY_ENABLED=true
export MEMINI_MULTI_PEER_ENABLED=true
export MEMINI_DIALECTIC_ENABLED=true

# Run the server
memini-ai --stdio

JSON Config File

{
  "database": {
    "url": "postgresql://postgres:password@localhost:5432/postgres"
  },
  "model": {
    "embedding_dim": 1024
  },
  "indexer": {
    "chunk_size": 512,
    "chunk_overlap": 64
  },
  "logging": {
    "level": "INFO"
  }
}

Usage

MCP Tools

Memini-ai provides 6 MCP tools:

`query_memories`

Semantic search over memories.

{
  "query": "What files were modified yesterday?",
  "limit": 10,
  "strategy": "tiered"
}

Strategies:

tiered (default): MiniLM primary + BGE fallback
vector_only: Pure semantic similarity
text_only: BM25 keyword search
parallel: Dual-tier with RRF fusion

`add_memory`

Store a new memory entry.

{
  "content": "Remember to update the config file",
  "sourceType": "session",
  "sourcePath": "/path/to/file",
  "metadata": {"key": "value"}
}

`search_project`

Search indexed project files.

{
  "query": "authentication middleware",
  "topK": 20,
  "fileTypes": [".py", ".ts"],
  "paths": ["src/"]
}

`index_project`

Trigger project indexing.

{
  "path": ".",
  "force": false,
  "background": true
}

`get_file_contents`

Reconstruct a file from indexed chunks.

{
  "filePath": "src/main.py",
  "triggerIndex": false
}

`get_status`

Get server component status.

{}

Python API

from memini_ai.memory.system import MemorySystem
from memini_ai.memory.schema import MemoryEntry, MemorySourceType, SearchOptions, SearchStrategy

async def main():
    # Create and initialize
    system = MemorySystem()
    await system.initialize()

    # Add a memory
    entry = MemoryEntry(
        text="Python list comprehension tutorial",
        source_type=MemorySourceType.session,
    )
    memory_id = await system.add_memory(entry)

    # Query memories
    options = SearchOptions(topK=10, strategy=SearchStrategy.TIERED)
    results = await system.query_memories("list comprehension", options)

    # Delete memory
    await system.delete_memory(memory_id)

asyncio.run(main())

Docker Compose

For local development with PostgreSQL/pgvector:

version: '3.8'

services:
  postgres:
    image: timescale/timescaledb:latest-pg15
    ports:
      - "5432:5432"
    environment:
      - POSTGRES_PASSWORD=password
    volumes:
      - postgres_data:/var/lib/postgresql/data

  memini-ai:
    build: .
    depends_on:
      - postgres
    environment:
      - MEMINI_DB_URL=postgresql://postgres:password@postgres:5432/postgres
    volumes:
      - .:/app

volumes:
  postgres_data:

docker-compose up -d

Testing

# Run all tests
pytest tests/ -v

# Run unit tests only (skip integration)
pytest tests/ -v --ignore=tests/integration/

# Run integration tests (requires PostgreSQL with pgvector)
pytest tests/integration/ -v

# Run with coverage
pytest tests/ --cov=src/memini_ai --cov-report=term-missing

Integration Tests with Docker

# Start PostgreSQL with pgvector for integration tests
docker run -d --name postgres-test -e POSTGRES_PASSWORD=test -e POSTGRES_DB=memini_test -p 5432:5432 pgvector/pgvector:pg16

# Run integration tests
pytest tests/integration/ -v

# Cleanup
docker stop postgres-test && docker rm postgres-test

Quality Gates

Before submitting changes, ensure:

# Lint
ruff check src/

# Format
ruff format src/

# Type check
mypy src/

# Tests
pytest tests/ -x

Performance

Memini-ai is designed for sub-10ms query latency on cached queries.

Typical performance:

Query latency: < 10ms (after warmup)
Indexing: ~1000 files/second
Memory footprint: ~500MB (without model)

Performance Tuning

# Use faster embedding model (384-dim instead of 1024)
export MEMINI_EMBEDDING_DIM=384

# Increase workers for faster indexing
export MEMINI_WORKERS=8

# Larger batch size for embedding generation
export MEMINI_BATCH_SIZE=64

Architecture

memini_ai/
├── config.py           # Configuration management
├── server.py          # FastMCP server (35 tools)
├── api/
│   ├── visualization.py  # FastAPI server for live KG visualization
│   └── d3_template.py     # D3.js visualization template
├── memory/
│   ├── schema.py      # Pydantic models
│   ├── database.py    # VectorDatabase ABC
│   ├── search.py      # 4 search strategies
│   └── system.py      # MemorySystem coordinator
├── postgres/          # PostgreSQL/pgvector backend
│   ├── database.py    # PostgresDatabase implementation
│   ├── schema.py      # SQL schema definitions
│   └── queries.py     # SQL query builders
├── model/
│   ├── manager.py     # ModelManager singleton
│   └── embeddings.py  # Embedding generation
├── indexer/
│   ├── indexer.py     # ProjectIndexer
│   ├── chunker.py     # Semantic chunking
│   ├── watcher.py     # File watching
│   └── file_tracker.py # SQLite persistence
└── utils/
    ├── logger.py      # Structured logging
    └── hash.py        # SHA-256 utilities

License

MIT License - see LICENSE file for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

veedubin

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.5.0

May 21, 2026

0.4.0

May 21, 2026

0.3.1

May 20, 2026

This version

0.3.0

May 20, 2026

0.2.8

May 19, 2026

0.2.7

May 19, 2026

0.2.6

May 18, 2026

0.2.5

May 18, 2026

0.2.3

May 18, 2026

0.2.2

May 18, 2026

0.2.0

May 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

memini_ai_dev-0.3.0.tar.gz (526.8 kB view details)

Uploaded May 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

memini_ai_dev-0.3.0-py3-none-any.whl (140.2 kB view details)

Uploaded May 20, 2026 Python 3

File details

Details for the file memini_ai_dev-0.3.0.tar.gz.

File metadata

Download URL: memini_ai_dev-0.3.0.tar.gz
Upload date: May 20, 2026
Size: 526.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for memini_ai_dev-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`1596ff8eee4c5276c689f96fd7615972e7e3f5ec6a1a951bf10c361222dfc47c`
MD5	`2d2d105f77f63932dce0dda8a764f7b0`
BLAKE2b-256	`330bd10544154df39748b9c94e8d69f8fb47d60744dfd5d42cadd6409cfc0c66`

See more details on using hashes here.

Provenance

The following attestation bundles were made for memini_ai_dev-0.3.0.tar.gz:

Publisher: workflow.yml on Veedubin/memini-ai-dev

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memini_ai_dev-0.3.0.tar.gz
- Subject digest: 1596ff8eee4c5276c689f96fd7615972e7e3f5ec6a1a951bf10c361222dfc47c
- Sigstore transparency entry: 1576211741
- Sigstore integration time: May 20, 2026
Source repository:
- Permalink: Veedubin/memini-ai-dev@f202324b9ce9fca10b4015eea6ec721811fa2c1c
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/Veedubin
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@f202324b9ce9fca10b4015eea6ec721811fa2c1c
- Trigger Event: push

File details

Details for the file memini_ai_dev-0.3.0-py3-none-any.whl.

File metadata

Download URL: memini_ai_dev-0.3.0-py3-none-any.whl
Upload date: May 20, 2026
Size: 140.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for memini_ai_dev-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`72c3f4c654458ea688843c1235e0b6d9d6e0756e777dca30449055f49bab48dd`
MD5	`e8997fe73658290c7f5cdbe679927eb9`
BLAKE2b-256	`88cda20f768dfe96d406a6973eca4ffd9a847694858cd754ab19ab569d2d106b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for memini_ai_dev-0.3.0-py3-none-any.whl:

Publisher: workflow.yml on Veedubin/memini-ai-dev

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memini_ai_dev-0.3.0-py3-none-any.whl
- Subject digest: 72c3f4c654458ea688843c1235e0b6d9d6e0756e777dca30449055f49bab48dd
- Sigstore transparency entry: 1576211747
- Sigstore integration time: May 20, 2026
Source repository:
- Permalink: Veedubin/memini-ai-dev@f202324b9ce9fca10b4015eea6ec721811fa2c1c
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/Veedubin
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@f202324b9ce9fca10b4015eea6ec721811fa2c1c
- Trigger Event: push

memini-ai-dev 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Memini-ai

Overview

Key Features

Installation

Prerequisites

Quick Start

Development Installation

Configuration

Environment Variables

Core Settings

PostgreSQL Settings

⭐ Optional Features (all disabled by default)

Trust Engine

Memory Graph

Knowledge Graph

Tiered Loading

Auto-Extract

Pre-Compression

User Modeling

Memory Decay

Multi-Peer

Dialectic (Contradiction Detection)

Quick Start with All Features Enabled

JSON Config File

Usage

MCP Tools

query_memories

add_memory

search_project

index_project

get_file_contents

get_status

Python API

Docker Compose

Testing

Integration Tests with Docker

Quality Gates

Performance

Performance Tuning

Architecture

License

Links

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`query_memories`

`add_memory`

`search_project`

`index_project`

`get_file_contents`

`get_status`