Dead-simple local vector database powered by usearch HNSW.

These details have not been verified by PyPI

Project links

Project description

SimpleVecDB

The dead-simple, local-first vector database.

SimpleVecDB brings Chroma-like simplicity to a single SQLite file. Built on usearch HNSW indexing, it offers high-performance vector search, quantization, and zero infrastructure headaches. Perfect for local RAG, offline agents, and indie hackers who need production-grade vector search without the operational overhead.

Why SimpleVecDB?

Zero Infrastructure — Just a .db file. No Docker, no Redis, no cloud bills.
Blazing Fast — 10-100x faster search via usearch HNSW. Adaptive: brute-force for <10k vectors (perfect recall), HNSW for larger collections.
Truly Portable — Runs anywhere SQLite runs: Linux, macOS, Windows, even WASM.
Async Ready — Full async/await support with optional executor injection for thread-safe ONNX/usearch sharing.
Batteries Included — Optional FastAPI embeddings server + LangChain/LlamaIndex integrations via [integrations] extra.
Production Ready — Hybrid search (BM25 + vector), metadata filtering, multi-collection support, and automatic hardware acceleration.

When to Choose SimpleVecDB

Use Case	SimpleVecDB	Cloud Vector DB
Local RAG applications	✅ Perfect fit	❌ Overkill + latency
Offline-first agents	✅ No internet needed	❌ Requires connectivity
Prototyping & MVPs	✅ Zero config	⚠️ Setup overhead
Multi-tenant SaaS at scale	⚠️ Consider sharding	✅ Built for this
Budget-conscious projects	✅ $0/month	❌ $50-500+/month

Prerequisites

System Requirements:

Python 3.10+
SQLite 3.35+ with FTS5 support (included in Python 3.8+ standard library)
50MB+ disk space for core library, 500MB+ with [server] extras

Optional for GPU Acceleration:

CUDA 11.8+ for NVIDIA GPUs
Metal Performance Shaders (MPS) for Apple Silicon

Note: If using custom-compiled SQLite, ensure -DSQLITE_ENABLE_FTS5 is enabled for full-text search support.

Installation

# Standard installation (includes clustering, encryption)
pip install simplevecdb

# With LangChain & LlamaIndex integrations
pip install "simplevecdb[integrations]"

# With local embeddings server (adds 500MB+ models)
pip install "simplevecdb[server]"

What's included by default:

Vector search with HNSW indexing
Clustering (K-means, MiniBatch K-means, HDBSCAN)
Encryption (SQLCipher AES-256)
Async support

Verify Installation:

python -c "from simplevecdb import VectorDB; print('SimpleVecDB installed successfully!')"

Quickstart

SimpleVecDB is just a vector storage layer—it doesn't include an LLM or generate embeddings. This design keeps it lightweight and flexible. Choose your integration path:

Option 1: With OpenAI (Simplest)

Best for: Quick prototypes, production apps with OpenAI subscriptions.

from simplevecdb import VectorDB
from openai import OpenAI

db = VectorDB("knowledge.db")
collection = db.collection("docs")
client = OpenAI()

texts = ["Paris is the capital of France.", "Mitochondria powers cells."]
embeddings = [
    client.embeddings.create(model="text-embedding-3-small", input=t).data[0].embedding
    for t in texts
]

collection.add_texts(
    texts=texts,
    embeddings=embeddings,
    metadatas=[{"category": "geography"}, {"category": "biology"}]
)

# Search
query_emb = client.embeddings.create(
    model="text-embedding-3-small",
    input="capital of France"
).data[0].embedding

results = collection.similarity_search(query_emb, k=1)
print(results[0][0].page_content)  # "Paris is the capital of France."

# Filter by metadata
filtered = collection.similarity_search(query_emb, k=10, filter={"category": "geography"})

Option 2: Fully Local (Privacy-First)

Best for: Offline apps, sensitive data, zero API costs.

pip install "simplevecdb[server]"

from simplevecdb import VectorDB
from simplevecdb.embeddings.models import embed_texts

db = VectorDB("local.db")
collection = db.collection("docs")

texts = ["Paris is the capital of France.", "Mitochondria powers cells."]
embeddings = embed_texts(texts)  # Local HuggingFace models

collection.add_texts(texts=texts, embeddings=embeddings)

# Search
query_emb = embed_texts(["capital of France"])[0]
results = collection.similarity_search(query_emb, k=1)

# Hybrid search (BM25 + vector)
hybrid = collection.hybrid_search("powerhouse cell", k=2)

Optional: Run embeddings server (OpenAI-compatible)

simplevecdb-server --port 8000                # Default model, auto warm-up
simplevecdb-server --host 0.0.0.0 --port 9000 # Bind to all interfaces
simplevecdb-server --no-warmup                # Skip model preload on startup
simplevecdb-server --help                     # Show all options

See Setup Guide for configuration: model registry, rate limits, API keys, CORS, CUDA optimization.

Option 3: With LangChain or LlamaIndex

Best for: Existing RAG pipelines, framework-based workflows.

pip install "simplevecdb[integrations]"

from simplevecdb.integrations.langchain import SimpleVecDBVectorStore
from langchain_openai import OpenAIEmbeddings

store = SimpleVecDBVectorStore(
    db_path="langchain.db",
    embedding=OpenAIEmbeddings(model="text-embedding-3-small")
)

store.add_texts(["Paris is the capital of France."])
results = store.similarity_search("capital of France", k=1)
hybrid = store.hybrid_search("France capital", k=3)  # BM25 + vector

LlamaIndex:

from simplevecdb.integrations.llamaindex import SimpleVecDBLlamaStore
from llama_index.embeddings.openai import OpenAIEmbedding

store = SimpleVecDBLlamaStore(
    db_path="llama.db",
    embedding=OpenAIEmbedding(model="text-embedding-3-small")
)

See Examples for complete RAG workflows with Ollama.

Core Features

Multi-Collection Support

Organize vectors by domain within a single database file:

from simplevecdb import VectorDB, Quantization

db = VectorDB("app.db")
users = db.collection("users", quantization=Quantization.FLOAT16)  # 2x memory savings
products = db.collection("products", quantization=Quantization.BIT)  # 32x compression

# Isolated namespaces
users.add_texts(["Alice likes hiking"], embeddings=[[0.1]*384])
products.add_texts(["Hiking boots"], embeddings=[[0.9]*384])

Search Capabilities

# Vector similarity (cosine/L2) - adaptive search by default
results = collection.similarity_search(query_vector, k=10)

# Force exact search for perfect recall (brute-force)
results = collection.similarity_search(query_vector, k=10, exact=True)

# Force HNSW approximate search (faster, may miss some results)
results = collection.similarity_search(query_vector, k=10, exact=False)

# Parallel search with explicit thread count
results = collection.similarity_search(query_vector, k=10, threads=8)

# Batch search - 10x throughput for multiple queries
queries = [query1, query2, query3]  # List of embedding vectors
batch_results = collection.similarity_search_batch(queries, k=10)

# Keyword search (BM25)
results = collection.keyword_search("exact phrase", k=10)

# Hybrid (BM25 + vector fusion)
results = collection.hybrid_search("machine learning", k=10)
results = collection.hybrid_search("ML concepts", query_vector=my_vector, k=10)

# Metadata filtering
results = collection.similarity_search(
    query_vector,
    k=10,
    filter={"category": "technical", "verified": True}
)

Tip: LangChain and LlamaIndex integrations support all search methods.

Encryption (v2.1+)

Protect sensitive data with AES-256 at-rest encryption:

pip install "simplevecdb[encryption]"

from simplevecdb import VectorDB

# Create encrypted database
db = VectorDB("secure.db", encryption_key="your-secret-key")
collection = db.collection("confidential")

collection.add_texts(["sensitive data"], embeddings=[[0.1]*384])
db.close()

# Reopen requires same key
db = VectorDB("secure.db", encryption_key="your-secret-key")

Streaming Insert (v2.1+)

Memory-efficient ingestion for large datasets:

def load_documents():
    for line in open("large_file.jsonl"):
        doc = json.loads(line)
        yield (doc["text"], doc.get("metadata"), doc.get("embedding"))

for progress in collection.add_texts_streaming(load_documents(), batch_size=1000):
    print(f"Processed {progress['docs_processed']} documents")

Document Hierarchies (v2.1+)

Organize documents in parent-child relationships:

# Add parent document
parent_ids = collection.add_texts(["Main document"], embeddings=[[0.1]*384])

# Add children
child_ids = collection.add_texts(
    ["Chunk 1", "Chunk 2"],
    embeddings=[[0.11]*384, [0.12]*384],
    parent_ids=[parent_ids[0], parent_ids[0]]
)

# Navigate hierarchy
children = collection.get_children(parent_ids[0])
parent = collection.get_parent(child_ids[0])
descendants = collection.get_descendants(parent_ids[0])

Document Management (v2.4+)

Query and update documents without touching private internals:

# Get all documents (with optional metadata filter)
docs = collection.get_documents(filter_dict={"category": "tech"})
for doc_id, text, metadata in docs:
    print(f"[{doc_id}] {text[:50]}...")

# Paginated access (v2.5+)
page1 = collection.get_documents(limit=100)
page2 = collection.get_documents(limit=100, offset=100)

# Fetch stored embeddings
embeddings = collection.get_embeddings_by_ids([1, 2, 3])

# Batch update metadata (shallow merge)
collection.update_metadata([
    (1, {"reviewed": True}),
    (2, {"reviewed": True, "score": 0.95}),
])

# Quick stats
print(f"Collection has {collection.count()} documents, dim={collection.dim}")

# Delete an entire collection (v2.5+)
db.delete_collection("old_data")

Vector Clustering (v2.2+)

Discover natural groupings in your embeddings:

# Cluster documents and auto-generate tags
result = collection.cluster(n_clusters=5)
tags = collection.auto_tag(result, method="tfidf")
collection.assign_cluster_metadata(result, tags)

# Save for fast assignment of new documents
collection.save_cluster("categories", result)
collection.assign_to_cluster("categories", new_doc_ids)

Supports K-means, MiniBatch K-means, and HDBSCAN. See Clustering Guide for details.

Feature Matrix

Feature	Status	Description
Single-File Storage	✅	SQLite `.db` file or in-memory mode
Multi-Collection	✅	Isolated namespaces per database
HNSW Indexing	✅	usearch HNSW for 10-100x faster search
Adaptive Search	✅	Auto brute-force for <10k vectors, HNSW for larger
Vector Search	✅	Cosine, Euclidean metrics (L1 removed in v2.0)
Hybrid Search	✅	BM25 + vector fusion (Reciprocal Rank Fusion)
Quantization	✅	FLOAT32, FLOAT16, INT8, BIT for 2-32x compression
Parallel Operations	✅	`threads` parameter for add/search
Metadata Filtering	✅	SQL `WHERE` clause support
Framework Integration	✅	LangChain & LlamaIndex adapters via `[integrations]` extra
Hardware Acceleration	✅	Auto-detects CUDA/MPS/CPU + SIMD via usearch
Local Embeddings	✅	HuggingFace models via `[server]` extras
Built-in Encryption	✅	SQLCipher AES-256 at-rest encryption via `[encryption]` extras
Streaming Insert	✅	Memory-efficient large-scale ingestion with progress callbacks
Document Hierarchies	✅	Parent/child relationships for chunked docs
Vector Clustering	✅	K-means, MiniBatch K-means, HDBSCAN with auto-tagging (v2.2+)
Cluster Persistence	✅	Save/load cluster centroids for fast assignment (v2.2+)
Public Catalog API	✅	`get_documents`, `get_embeddings_by_ids`, `update_metadata` (v2.4+)
Executor Injection	✅	Share thread pool across async instances for ONNX safety (v2.4+)
Collection Management	✅	`delete_collection()`, paginated `get_documents(limit=, offset=)` (v2.5+)
Cross-Process Safety	✅	Advisory file locking on usearch index files (v2.5+)
FLOAT16 Quantization	✅	Half-precision storage with 2x compression (v2.5+)
Embeddings Server	✅	CORS, graceful shutdown, input validation, model warm-up (v2.5+)

Performance Benchmarks

10,000 vectors, 384 dimensions, k=10 search — Full benchmarks →

Quantization	Storage	Query Time	Compression
FLOAT32	36.0 MB	0.20 ms	1x
FLOAT16	28.7 MB	0.20 ms	2x
INT8	25.0 MB	0.16 ms	4x
BIT	21.8 MB	0.08 ms	32x

Key highlights:

3-34x faster than brute-force for collections >10k vectors
Adaptive search: perfect recall for small collections, HNSW for large
FLOAT16 recommended: best balance of speed, memory, and precision

Documentation

Setup Guide — Environment variables, server configuration, authentication
API Reference — Complete class/method documentation with type signatures
Benchmarks — Quantization strategies, batch sizes, hardware optimization
Integration Examples — RAG notebooks, Ollama workflows, production patterns
Contributing Guide — Development setup, testing, PR guidelines

Troubleshooting

Import Error: sqlite3.OperationalError: no such module: fts5

# Your Python's SQLite was compiled without FTS5
# Solution: Install Python from python.org (includes FTS5) or compile SQLite with:
# -DSQLITE_ENABLE_FTS5

Dimension Mismatch Error

# Ensure all vectors in a collection have identical dimensions
collection = db.collection("docs", dim=384)  # Explicit dimension

CUDA Not Detected (GPU Available)

# Verify CUDA installation
python -c "import torch; print(torch.cuda.is_available())"

# Reinstall PyTorch with CUDA support
pip install torch --index-url https://download.pytorch.org/whl/cu118

Slow Queries on Large Datasets

Enable quantization: collection = db.collection("docs", quantization=Quantization.INT8)
For >10k vectors, HNSW is automatic; tune with rebuild_index(connectivity=32)
Use exact=False to force HNSW even on smaller collections
Use metadata filtering to reduce search space

Roadmap

Hybrid Search (BM25 + Vector)
Multi-collection support
HNSW indexing (usearch backend)
Adaptive search (brute-force/HNSW)
SQLCipher encryption (at-rest data protection)
Streaming insert API for large-scale ingestion
Hierarchical document relationships (parent/child)
Cross-collection search
Vector clustering and auto-tagging (v2.2)
Public catalog API for document management (v2.4)
Async executor injection for thread-safe sharing (v2.4)
Collection management: delete_collection(), pagination (v2.5)
Cross-process file locking and connection health checks (v2.5)
Embeddings server hardening: CORS, graceful shutdown, input validation (v2.5)
Incremental clustering (online learning)
Cluster visualization exports

Vote on features or propose new ones in GitHub Discussions.

Contributing

Contributions are welcome! Whether you're fixing bugs, improving documentation, or proposing new features:

Read CONTRIBUTING.md for development setup
Check existing Issues and Discussions
Open a PR with clear description and tests

Community & Support

Get Help:

GitHub Discussions — Q&A and feature requests
GitHub Issues — Bug reports

Stay Updated:

GitHub Releases — Changelog and updates
Examples Gallery — Community-contributed notebooks

License

MIT License — Free for personal and commercial use.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

2.6.1

May 10, 2026

This version

2.6.0

May 6, 2026

2.5.0

Apr 7, 2026

2.4.0

Mar 22, 2026

2.3.0

Mar 8, 2026

2.2.1

Jan 27, 2026

2.2.0

Jan 17, 2026

2.1.0

Jan 2, 2026

2.0.0

Dec 23, 2025

1.3.0

Dec 7, 2025

1.2.0

Nov 26, 2025

1.1.1

Nov 24, 2025

1.1.0

Nov 23, 2025

1.0.0

Nov 23, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

simplevecdb-2.6.0.tar.gz (596.7 kB view details)

Uploaded May 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

simplevecdb-2.6.0-py3-none-any.whl (96.5 kB view details)

Uploaded May 6, 2026 Python 3

File details

Details for the file simplevecdb-2.6.0.tar.gz.

File metadata

Download URL: simplevecdb-2.6.0.tar.gz
Upload date: May 6, 2026
Size: 596.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.10 {"installer":{"name":"uv","version":"0.11.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for simplevecdb-2.6.0.tar.gz
Algorithm	Hash digest
SHA256	`5912a02e87a4bbe7e00f428b4beef5976f2918b90e3ff340b285c0830cfd8282`
MD5	`66b0853bb013dda5beec860d75d504a8`
BLAKE2b-256	`2f5e0da24a312ddfd5d53aa9fed977f65a1889f8d00422e2d30c4d2d5494dc97`

See more details on using hashes here.

File details

Details for the file simplevecdb-2.6.0-py3-none-any.whl.

File metadata

Download URL: simplevecdb-2.6.0-py3-none-any.whl
Upload date: May 6, 2026
Size: 96.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.10 {"installer":{"name":"uv","version":"0.11.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for simplevecdb-2.6.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2fa0175a5217782ea4fc7b699767b33ecb8467a0044f73c39e0a5551b93da149`
MD5	`4cc613c5496791fa73453a2ae420714c`
BLAKE2b-256	`9023390e75765cace3e46e3c160102d77f394fccfd25d47f925f84c7d9bcd483`

See more details on using hashes here.

simplevecdb 2.6.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

SimpleVecDB

Why SimpleVecDB?

When to Choose SimpleVecDB

Prerequisites

Installation

Quickstart

Option 1: With OpenAI (Simplest)

Option 2: Fully Local (Privacy-First)

Option 3: With LangChain or LlamaIndex

Core Features

Multi-Collection Support

Search Capabilities

Encryption (v2.1+)

Streaming Insert (v2.1+)

Document Hierarchies (v2.1+)

Document Management (v2.4+)

Vector Clustering (v2.2+)

Feature Matrix

Performance Benchmarks

Documentation

Troubleshooting

Roadmap

Contributing

Community & Support

Sponsors

Other Ways to Support

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes