Local-first semantic memory server with vector search

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

veedubin

These details have not been verified by PyPI

Project description

Memini-ai v3.0

"I remember" in Latin (pronounced meh-mee-nee)

Local-first semantic memory server with vector search, MCP-compatible.

Overview

Memini-ai is a Python rewrite of Super-Memory-TS, designed as a local-first semantic memory server with vector search capabilities. It provides persistent memory storage and retrieval using Qdrant as the backend vector database.

Key Features

MCP-Compatible: Works with any MCP-compatible client (OpenCode, Claude Desktop, etc.)
Vector Search: BGE-Large embeddings (1024-dim) with MiniLM fallback (384-dim)
Hybrid Search: Combines vector similarity with BM25 text search
Project Isolation: Memories are isolated by project ID
File Indexing: Index and search project files with semantic chunking
Graceful Degradation: Works without Qdrant (returns errors for memory operations)
CPU-First: Designed to run on CPU, optional GPU acceleration

Installation

Prerequisites

Python 3.11+
Qdrant running locally or remotely

Quick Start

# Install memini-ai
pip install memini-ai

# Start Qdrant (if not running)
docker run -d --name qdrant -p 6333:6333 qdrant/qdrant

# Run the server
memini-ai --stdio

Development Installation

# Clone the repository
git clone https://github.com/veedubin/memini-ai.git
cd memini-ai

# Create virtual environment
python -m venv .venv
source .venv/bin/activate  # Linux/Mac
# or: .venv\Scripts\activate  # Windows

# Install with dev dependencies
pip install -e ".[dev]"

Configuration

Memini-ai can be configured via environment variables or a JSON config file.

Environment Variables

Variable	Default	Description
`MEMINI_QDRANT_URL`	`http://localhost:6333`	Qdrant server URL
`MEMINI_PROJECT_ID`	auto-generated	Project identifier for isolation
`MEMINI_EMBEDDING_DIM`	`1024`	Embedding dimension (1024 or 384)
`MEMINI_CHUNK_SIZE`	`512`	Chunk size for file indexing
`MEMINI_CHUNK_OVERLAP`	`64`	Overlap between chunks
`MEMINI_BATCH_SIZE`	`32`	Batch size for embedding generation
`MEMINI_WORKERS`	`4`	Number of worker threads
`MEMINI_LOG_LEVEL`	`INFO`	Logging level
`MEMINI_CONFIG_PATH`	None	Path to JSON config file

JSON Config File

{
  "qdrant": {
    "url": "http://localhost:6333"
  },
  "model": {
    "embedding_dim": 1024
  },
  "indexer": {
    "chunk_size": 512,
    "chunk_overlap": 64
  },
  "logging": {
    "level": "INFO"
  }
}

Usage

MCP Tools

Memini-ai provides 6 MCP tools:

`query_memories`

Semantic search over memories.

{
  "query": "What files were modified yesterday?",
  "limit": 10,
  "strategy": "tiered"
}

Strategies:

tiered (default): MiniLM primary + BGE fallback
vector_only: Pure semantic similarity
text_only: BM25 keyword search
parallel: Dual-tier with RRF fusion

`add_memory`

Store a new memory entry.

{
  "content": "Remember to update the config file",
  "sourceType": "session",
  "sourcePath": "/path/to/file",
  "metadata": {"key": "value"}
}

`search_project`

Search indexed project files.

{
  "query": "authentication middleware",
  "topK": 20,
  "fileTypes": [".py", ".ts"],
  "paths": ["src/"]
}

`index_project`

Trigger project indexing.

{
  "path": ".",
  "force": false,
  "background": true
}

`get_file_contents`

Reconstruct a file from indexed chunks.

{
  "filePath": "src/main.py",
  "triggerIndex": false
}

`get_status`

Get server component status.

{}

Python API

from memini_ai.memory.system import MemorySystem
from memini_ai.memory.schema import MemoryEntry, MemorySourceType, SearchOptions, SearchStrategy

async def main():
    # Create and initialize
    system = MemorySystem()
    await system.initialize()

    # Add a memory
    entry = MemoryEntry(
        text="Python list comprehension tutorial",
        source_type=MemorySourceType.session,
    )
    memory_id = await system.add_memory(entry)

    # Query memories
    options = SearchOptions(topK=10, strategy=SearchStrategy.TIERED)
    results = await system.query_memories("list comprehension", options)

    # Delete memory
    await system.delete_memory(memory_id)

asyncio.run(main())

Docker Compose

For local development with Qdrant:

version: '3.8'

services:
  qdrant:
    image: qdrant/qdrant
    ports:
      - "6333:6333"
      - "6334:6334"
    volumes:
      - qdrant_data:/qdrant/storage

  memini-ai:
    build: .
    depends_on:
      - qdrant
    environment:
      - MEMINI_QDRANT_URL=http://qdrant:6333
    volumes:
      - .:/app

volumes:
  qdrant_data:

docker-compose up -d

Testing

# Run all tests
pytest tests/ -v

# Run unit tests only (skip integration)
pytest tests/ -v --ignore=tests/integration/

# Run integration tests (requires Qdrant)
pytest tests/integration/ -v

# Run with coverage
pytest tests/ --cov=src/memini_ai --cov-report=term-missing

Integration Tests with Docker

# Start Qdrant for integration tests
docker run -d --name qdrant-test -p 6333:6333 qdrant/qdrant

# Run integration tests
pytest tests/integration/ -v

# Cleanup
docker stop qdrant-test && docker rm qdrant-test

Quality Gates

Before submitting changes, ensure:

# Lint
ruff check src/

# Format
ruff format src/

# Type check
mypy src/

# Tests
pytest tests/ -x

Performance

Memini-ai is designed for sub-10ms query latency on cached queries.

Typical performance:

Query latency: < 10ms (after warmup)
Indexing: ~1000 files/second
Memory footprint: ~500MB (without model)

Performance Tuning

# Use faster embedding model (384-dim instead of 1024)
export MEMINI_EMBEDDING_DIM=384

# Increase workers for faster indexing
export MEMINI_WORKERS=8

# Larger batch size for embedding generation
export MEMINI_BATCH_SIZE=64

Architecture

memini_ai/
├── config.py           # Configuration management
├── server.py          # FastMCP server (6 tools)
├── memory/
│   ├── schema.py      # Pydantic models
│   ├── database.py    # Qdrant CRUD operations
│   ├── search.py      # 4 search strategies
│   └── system.py      # MemorySystem coordinator
├── model/
│   ├── manager.py     # ModelManager singleton
│   └── embeddings.py # Embedding generation
├── indexer/
│   ├── indexer.py     # ProjectIndexer
│   ├── chunker.py     # Semantic chunking
│   ├── watcher.py     # File watching
│   └── file_tracker.py # SQLite persistence
└── utils/
    ├── logger.py     # Structured logging
    └── hash.py       # SHA-256 utilities

License

MIT License - see LICENSE file for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

veedubin

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.5.0

May 21, 2026

0.4.0

May 21, 2026

0.3.1

May 20, 2026

0.3.0

May 20, 2026

0.2.8

May 19, 2026

0.2.7

May 19, 2026

0.2.6

May 18, 2026

0.2.5

May 18, 2026

0.2.3

May 18, 2026

This version

0.2.2

May 18, 2026

0.2.0

May 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

memini_ai_dev-0.2.2.tar.gz (496.0 kB view details)

Uploaded May 18, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

memini_ai_dev-0.2.2-py3-none-any.whl (119.2 kB view details)

Uploaded May 18, 2026 Python 3

File details

Details for the file memini_ai_dev-0.2.2.tar.gz.

File metadata

Download URL: memini_ai_dev-0.2.2.tar.gz
Upload date: May 18, 2026
Size: 496.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for memini_ai_dev-0.2.2.tar.gz
Algorithm	Hash digest
SHA256	`eee3cd7c40a24bdaa0eddb2b54700602cbbb4c1df55eaeb1a7803d1067f18a21`
MD5	`8de122e4cfd87b787495bb40944cf4b1`
BLAKE2b-256	`013d71b426bc0cb5c2cce0e2b0f3da9ee662cfc462e10607e69af16b38ed45a9`

See more details on using hashes here.

Provenance

The following attestation bundles were made for memini_ai_dev-0.2.2.tar.gz:

Publisher: workflow.yml on Veedubin/memini-ai-dev

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memini_ai_dev-0.2.2.tar.gz
- Subject digest: eee3cd7c40a24bdaa0eddb2b54700602cbbb4c1df55eaeb1a7803d1067f18a21
- Sigstore transparency entry: 1566681434
- Sigstore integration time: May 18, 2026
Source repository:
- Permalink: Veedubin/memini-ai-dev@04083ab7fff4b1190d6550d57cf0179184a54997
- Branch / Tag: refs/tags/v0.2.2
- Owner: https://github.com/Veedubin
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@04083ab7fff4b1190d6550d57cf0179184a54997
- Trigger Event: push

File details

Details for the file memini_ai_dev-0.2.2-py3-none-any.whl.

File metadata

Download URL: memini_ai_dev-0.2.2-py3-none-any.whl
Upload date: May 18, 2026
Size: 119.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for memini_ai_dev-0.2.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3de5f2afc1694f3fc61740338ef79febfa93248e30017db0e3e377c1aaf432e9`
MD5	`de1e9788b6ae8c0dcd26bbae700e36aa`
BLAKE2b-256	`f3e304eb227c28282181b7eefb6622afb80528d773111c634177f94c56e5b88f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for memini_ai_dev-0.2.2-py3-none-any.whl:

Publisher: workflow.yml on Veedubin/memini-ai-dev

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memini_ai_dev-0.2.2-py3-none-any.whl
- Subject digest: 3de5f2afc1694f3fc61740338ef79febfa93248e30017db0e3e377c1aaf432e9
- Sigstore transparency entry: 1566681439
- Sigstore integration time: May 18, 2026
Source repository:
- Permalink: Veedubin/memini-ai-dev@04083ab7fff4b1190d6550d57cf0179184a54997
- Branch / Tag: refs/tags/v0.2.2
- Owner: https://github.com/Veedubin
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflow.yml@04083ab7fff4b1190d6550d57cf0179184a54997
- Trigger Event: push

memini-ai-dev 0.2.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Memini-ai v3.0

Overview

Key Features

Installation

Prerequisites

Quick Start

Development Installation

Configuration

Environment Variables

JSON Config File

Usage

MCP Tools

query_memories

add_memory

search_project

index_project

get_file_contents

get_status

Python API

Docker Compose

Testing

Integration Tests with Docker

Quality Gates

Performance

Performance Tuning

Architecture

License

Links

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`query_memories`

`add_memory`

`search_project`

`index_project`

`get_file_contents`

`get_status`