Batteries-included memory for voice bots—FAISS + SQLite + embeddings + encryption in one pip install

These details have not been verified by PyPI

Project links

Project description

brainfart

Batteries-included memory for voice bots. One pip install, one env var, done.

FAISS vector search + SQLite storage + fastembed embeddings + Gemini extraction + at-rest encryption—all bundled and ready to use.

Quick Start

Installation

pip install brainfart

Set your Gemini API key

export GOOGLE_API_KEY="your-gemini-api-key"

Add to your Pipecat bot

from pipecat.pipeline.pipeline import Pipeline
from brainfart import MemoryProcessor

# That's it—zero configuration needed
memory = MemoryProcessor(user_id="user123")

pipeline = Pipeline([
    transport.input(),
    stt_service,
    memory,           # <- Add memory here
    llm_service,
    tts_service,
    transport.output(),
])

What you get

Vector similarity search with FAISS (no external database)
Persistent storage in SQLite (survives restarts)
Automatic embeddings using all-MiniLM-L6-v2 (fast, ~90MB)
Smart extraction with Gemini Flash (extracts what's worth remembering)
At-rest encryption with Fernet (crash-safe, optional)

How it works

The memory system has two modes:

Extraction: As the user speaks, the processor buffers messages and periodically uses Gemini to extract memorable facts (identity, preferences, relationships, etc.)
Retrieval: Before each LLM response, relevant memories are retrieved using semantic search and injected into the context.

User speaks → Buffer → Extract (Gemini) → Store (FAISS + SQLite)
                                              ↓
                                         Retrieve → Inject → LLM

Configuration

Everything works with defaults, but you can customize:

Environment Variables

Variable	Default	Description
`GOOGLE_API_KEY`	Required	Gemini API key for extraction
`BRAINFART_TOP_K`	`5`	Memories to retrieve per query
`BRAINFART_DATA_DIR`	`~/.cache/brainfart`	Storage location
`BRAINFART_EMBEDDING_MODEL`	`all-MiniLM-L6-v2`	Embedding model
`BRAINFART_ENCRYPTION_KEY`	None	Enable encryption with this key

Constructor Parameters

memory = MemoryProcessor(
    user_id="user123",           # Required: user identifier
    agent_id="my-bot",           # Optional: for multi-agent isolation
    gemini_api_key="your-key",   # Optional: override env var
    top_k=10,                    # Optional: more memories
    embedding_model="all-mpnet-base-v2",  # Optional: larger model
    encryption_key="secret",     # Optional: enable encryption
    inject_memories=True,        # Optional: inject into LLM context
    extract_memories=True,       # Optional: extract from conversation
    extraction_interval=5,       # Optional: extract every N messages
)

Encryption

Enable at-rest encryption for security:

export BRAINFART_ENCRYPTION_KEY="your-secret-passphrase"

Or pass directly:

memory = MemoryProcessor(
    user_id="user123",
    encryption_key="your-secret-passphrase",
)

Data is encrypted on disk, decrypted only in memory. Crash-safe—if the process dies, data remains encrypted.

Multi-Agent Isolation

Different agents maintain separate memories for the same user:

# Sales bot memories
sales_memory = MemoryProcessor(user_id="user123", agent_id="sales-bot")

# Support bot memories (separate store)
support_memory = MemoryProcessor(user_id="user123", agent_id="support-bot")

Storage structure:

~/.cache/brainfart/
├── sales-bot/
│   └── user123.index
│   └── user123.db
└── support-bot/
    └── user123.index
    └── user123.db

Direct API Usage

Use the memory system without Pipecat:

from brainfart import LocalMemory, MemorySettings

settings = MemorySettings(
    data_dir="/path/to/data",
    encryption_key="secret",
)

memory = LocalMemory(settings, user_id="user123")
await memory.load()

# Store
await memory.store("User lives in SF", category="identity", importance=5)

# Retrieve
results = await memory.retrieve("Where does the user live?")
for r in results:
    print(f"[{r.category}] {r.content} (similarity: {r.similarity:.2f})")

await memory.close()

Memory Categories

Memories are categorized for better retrieval:

Category	Description	Example
`identity`	Core facts about who the user is	"User lives in San Francisco"
`preference`	Likes, dislikes, communication style	"User prefers concise answers"
`context`	Current projects, ongoing situations	"User is working on a Python project"
`relationship`	Emotional moments, shared experiences	"User was excited about promotion"
`surprise`	Unusual or noteworthy facts	"User has visited 50 countries"

Performance

First import: 5-15 seconds (model loading)
Subsequent imports: <1 second (cached)
Embedding: ~7-8ms per text
Retrieval: ~1-2ms (excluding embedding)
Install size: ~150MB (uses fastembed/ONNX instead of PyTorch)

Docker

Pre-download models during build for faster cold starts:

FROM python:3.11-slim

RUN pip install brainfart

# Pre-download embedding model
RUN python -c "from fastembed import TextEmbedding; TextEmbedding('sentence-transformers/all-MiniLM-L6-v2')"

# Set cache location (fastembed uses ~/.cache/fastembed by default)
ENV FASTEMBED_CACHE_PATH=/app/.cache/fastembed

Full Example

See examples/bot.py for a complete working bot.

Tutorial

For a detailed walkthrough of building a production memory system, including observability, custom extraction triggers, and lessons learned, see docs/TUTORIAL.md.

Known Issues

Redundant memories: The extraction process may create duplicate or near-duplicate memories. Deduplication is not yet implemented.

License

BSD-2-Clause (same as Pipecat)

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.3.0

Dec 30, 2025

0.2.0

Dec 30, 2025

0.1.0

Dec 30, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

brainfart-0.3.0.tar.gz (26.2 kB view details)

Uploaded Dec 30, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

brainfart-0.3.0-py3-none-any.whl (25.1 kB view details)

Uploaded Dec 30, 2025 Python 3

File details

Details for the file brainfart-0.3.0.tar.gz.

File metadata

Download URL: brainfart-0.3.0.tar.gz
Upload date: Dec 30, 2025
Size: 26.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for brainfart-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`5e60a35401346502b19f737d7d95c113fcff2804482d6c9a201f71cc9f03b43c`
MD5	`62d33d502e016ad12e23dd4ff528a782`
BLAKE2b-256	`25260964138dd85d2688322f0b1296011a7261f11c3786576fb2bdf7483f66cf`

See more details on using hashes here.

File details

Details for the file brainfart-0.3.0-py3-none-any.whl.

File metadata

Download URL: brainfart-0.3.0-py3-none-any.whl
Upload date: Dec 30, 2025
Size: 25.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for brainfart-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`070eca9c902ce181154e27fac4de609246bde937e409c48efa0bf5dba6031dd1`
MD5	`7aa207d6dc325db7553f35e028e3d2e1`
BLAKE2b-256	`d37bd613900a6690bc90cac5587803fb9b1ba074fd86aa31072a7d89bc80dedf`

See more details on using hashes here.

brainfart 0.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

brainfart

Quick Start

Installation

Set your Gemini API key

Add to your Pipecat bot

What you get

How it works

Configuration

Environment Variables

Constructor Parameters

Encryption

Multi-Agent Isolation

Direct API Usage

Memory Categories

Performance

Docker

Full Example

Tutorial

Known Issues

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes