
vectlite

Embedded vector store for local-first AI applications.

vectlite is a single-file, zero-dependency vector database written in Rust with Python bindings. It gives you dense + sparse hybrid search, HNSW indexing, metadata filtering, transactions, and crash-safe persistence in a single .vdb file -- no server, no Docker, no network calls.

Installation

pip install vectlite

Requires Python 3.9+. Pre-built wheels are available for macOS (x86_64, arm64), Linux (x86_64, aarch64), and Windows (x86_64).

Quick Start

import vectlite

# embedding, embedding2, and embedding_query are 384-dimensional
# float lists produced by your embedding model
with vectlite.open("knowledge.vdb", dimension=384) as db:
    # Insert records with vectors and metadata
    db.upsert("doc1", embedding, {"source": "blog", "title": "Auth Guide"})
    db.upsert("doc2", embedding2, {"source": "notes", "title": "Billing"})

    # Search with filters
    results = db.search(embedding_query, k=5, filter={"source": "blog"})

    # Query-free inspection
    print(db.count(filter={"source": "blog"}))

Features

Core

  • Single-file storage -- one .vdb file per database, portable and easy to back up
  • Dense vectors -- cosine similarity with automatic HNSW indexing for large collections
  • Sparse vectors -- BM25-scored inverted index for keyword retrieval
  • Hybrid search -- dense + sparse fusion with linear or RRF strategies
  • Vector quantization -- scalar (int8, 4x), binary (32x), and product quantization (PQ) with 2-stage rescoring
  • Rich metadata -- str, int, float, bool, None, list, dict values
  • Crash-safe WAL -- writes land in a write-ahead log first, then checkpoint with compact()
  • Transactions -- atomic batched writes with db.transaction()
  • File locking -- advisory locks prevent corruption from concurrent access

Search & Retrieval

  • Metadata filters -- MongoDB-style operators: $eq, $ne, $gt, $gte, $lt, $lte, $in, $nin, $contains, $exists, $and, $or, $not
  • Nested filters -- dot-path traversal (author.name), $elemMatch, $size on lists and dicts
  • Named vectors -- multiple vector spaces per record (vectors={"title": [...], "body": [...]})
  • Multi-vector queries -- weighted search across vector spaces in a single call
  • MMR diversification -- mmr_lambda controls relevance vs. diversity trade-off
  • Namespaces -- logical isolation with per-namespace or cross-namespace search
  • Rerankers -- built-in text_match(), metadata_boost(), cross_encoder(), bi_encoder(), composable with compose()
  • Observability -- search_with_stats() returns timings, BM25 term scores, ANN stats, and per-result explain payloads
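As background for mmr_lambda: MMR greedily picks each next result by relevance to the query minus similarity to results already selected, with lambda weighting the two. A minimal pure-Python sketch of the technique (illustrative only, not vectlite's implementation):

```python
# Maximal marginal relevance (MMR) diversification sketch.
# query_scores: {id: relevance to the query}
# pairwise_sim: {(id_a, id_b): similarity between results}
def mmr(query_scores, pairwise_sim, k, lam):
    selected = []
    candidates = set(query_scores)
    while candidates and len(selected) < k:
        def mmr_score(c):
            # Penalize by the closest already-selected result
            max_sim = max(
                (pairwise_sim.get((c, s), pairwise_sim.get((s, c), 0.0))
                 for s in selected),
                default=0.0,
            )
            return lam * query_scores[c] - (1 - lam) * max_sim
        best = max(candidates, key=mmr_score)
        selected.append(best)
        candidates.remove(best)
    return selected
```

With lam=1.0 this degenerates to pure relevance ordering; lowering lam pushes near-duplicate results down the list.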

Data Management

  • Physical collections -- vectlite.open_store() manages a directory of independent databases
  • Bulk ingestion -- bulk_ingest() with deferred index rebuilds for fast imports
  • Listing & filtered counts -- list() and count(namespace=..., filter=...) without a vector query
  • Delete by filter -- remove matching records across a namespace slice in one call
  • Snapshots -- db.snapshot(path) creates a self-contained copy
  • Backup / Restore -- db.backup(dir) and vectlite.restore(dir, path) for full roundtrips
  • Read-only mode -- vectlite.open(path, read_only=True) for safe concurrent readers
  • Explicit close -- db.close() or with vectlite.open(...) as db: to release locks deterministically
  • Lock timeouts -- lock_timeout= retries for bounded lock acquisition waits
  • Text analyzers -- configurable tokenizer pipeline with stopwords, stemming, and n-grams

Usage

Hybrid Search with Reranking

import vectlite

db = vectlite.open("knowledge.vdb", dimension=384)

# Upsert with dense + sparse vectors
db.upsert(
    "doc1",
    dense_embedding,
    {"source": "docs", "title": "Auth Setup", "text": "How to configure SSO..."},
    sparse=vectlite.sparse_terms("How to configure SSO authentication"),
)

# Hybrid search with reranking
results = db.search(
    query_embedding,
    k=10,
    sparse=vectlite.sparse_terms("SSO authentication"),
    fusion="rrf",
    filter={"source": "docs"},
    explain=True,
    rerank=vectlite.rerankers.compose(
        vectlite.rerankers.text_match(),
        vectlite.rerankers.metadata_boost("source", {"docs": 0.5}),
    ),
)

for result in results:
    print(result["id"], result["score"])

Bulk Ingestion (Recommended for Large Imports)

For ingesting more than a few hundred records, use bulk_ingest() instead of calling upsert() in a loop. It writes records in WAL batches and rebuilds indexes only once at the end, making it orders of magnitude faster.

records = [
    {
        "id": f"doc{i}",
        "vector": embeddings[i],
        "metadata": {"source": "corpus", "chunk": i},
        "sparse": vectlite.sparse_terms(texts[i]),  # optional
    }
    for i in range(len(texts))
]

count = db.bulk_ingest(records, batch_size=5000)
print(f"Ingested {count} records")

The records parameter is a list[dict] where each dict has keys:

  • id (str, required) -- unique record identifier
  • vector (list[float], required) -- dense embedding vector
  • metadata (dict, optional) -- arbitrary metadata
  • sparse (dict[str, float], optional) -- sparse terms from sparse_terms()
  • vectors (dict[str, list[float]], optional) -- named vectors
  • namespace (str, optional) -- namespace override per record

upsert_many() and insert_many() also accept the same list[dict] format and rebuild indexes once, but don't batch WAL writes internally.

Collections

store = vectlite.open_store("./my_collections")
products = store.create_collection("products", dimension=384)
products.upsert("p1", embedding, {"name": "Widget", "price": 9.99})

logs = store.open_or_create_collection("logs", dimension=128)
print(store.collections())  # ["logs", "products"]

Transactions

with db.transaction() as tx:
    tx.upsert("doc1", emb1, {"source": "a"})
    tx.upsert("doc2", emb2, {"source": "b"})
    tx.delete("old_doc")
# All operations commit atomically or roll back on exception

Text Helpers

# Handles embedding + sparse term generation for you
vectlite.upsert_text(db, "doc1", "Auth setup guide", embed_fn, {"source": "docs"})
results = vectlite.search_text(db, "how to authenticate", embed_fn, k=5)

Analyzers

analyzer = vectlite.analyzers.Analyzer().lowercase().stopwords("en").stemmer("english")
terms = analyzer.sparse_terms("How to authenticate users with SSO")
# Use with upsert: db.upsert("doc1", emb, meta, sparse=terms)
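For intuition, here is a minimal pure-Python sketch of what such a tokenizer pipeline produces: a dict of term -> weight suitable for sparse upserts. The toy stopword list and frequency weighting are simplifying assumptions, not vectlite's analyzer:

```python
from collections import Counter

# Toy stopword list for illustration only
STOPWORDS = {"how", "to", "with", "the", "a", "an"}

def sparse_terms(text, stopwords=STOPWORDS):
    # Lowercase, drop non-alphabetic tokens and stopwords
    tokens = [t for t in text.lower().split()
              if t.isalpha() and t not in stopwords]
    counts = Counter(tokens)
    total = sum(counts.values())
    # Weight each surviving term by its relative frequency
    return {term: n / total for term, n in counts.items()}
```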

Snapshots & Backup

db.snapshot("/backups/knowledge_2024.vdb")  # Self-contained copy
db.backup("/backups/full/")                 # Full backup with ANN sidecars

restored = vectlite.restore("/backups/full/", "restored.vdb")

Read-Only Mode

ro = vectlite.open("knowledge.vdb", read_only=True, lock_timeout=5.0)
results = ro.search(query, k=5)  # Reads work
ro.upsert(...)                    # Raises VectLiteError

Listing, Counting, and Lifecycle

db = vectlite.open("knowledge.vdb", dimension=384, lock_timeout=5.0)

records = db.list(namespace="docs", filter={"stale": False}, limit=20)
count = db.count(namespace="docs", filter={"source": "blog"})
deleted = db.delete_by_filter({"stale": True}, namespace="docs")

db.close()

Search Diagnostics

outcome = db.search_with_stats(query, k=5, sparse=terms, explain=True)

print(outcome["stats"]["timings"])       # {"dense_us": 120, "sparse_us": 45, ...}
print(outcome["stats"]["used_ann"])      # True
print(outcome["results"][0]["explain"])  # Detailed scoring breakdown

Vector Quantization

Reduce memory usage and accelerate search with quantized vectors. All methods use a 2-stage pipeline: fast quantized candidate selection followed by exact float32 rescoring.

# Scalar quantization (int8) -- 4x memory reduction, minimal recall loss
db.enable_quantization("scalar")

# Binary quantization -- 32x memory reduction, best for normalized embeddings
db.enable_quantization("binary", rescore_multiplier=10)

# Product quantization -- configurable compression for very large datasets
db.enable_quantization("product", num_sub_vectors=16, num_centroids=256)

# Search is transparently accelerated
results = db.search(query_embedding, k=10)

# Check status
print(db.is_quantized)         # True
print(db.quantization_method)  # "scalar", "binary", or "product"

# Disable
db.disable_quantization()

Quantization parameters persist across reopens in a .vdb.quant sidecar file.
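As an illustration of the scalar (int8) scheme, a minimal pure-Python sketch (not vectlite's internal code): each float32 (4 bytes) maps to a single int8 (1 byte), which is where the 4x reduction comes from, and the exact float vectors are kept around so the top candidates can be rescored precisely:

```python
def quantize_int8(vec, lo, hi):
    """Map floats in [lo, hi] onto the int8 range [-128, 127]."""
    scale = (hi - lo) / 255.0
    return [round((x - lo) / scale) - 128 for x in vec]

def dequantize_int8(qvec, lo, hi):
    """Approximate reconstruction; quantization error stays within one step."""
    scale = (hi - lo) / 255.0
    return [(q + 128) * scale + lo for q in qvec]
```

The 2-stage pipeline then scores candidates on the int8 codes first and rescores the shortlist against the original float32 vectors.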

Filter Operators

  • $eq -- {"field": {"$eq": "value"}} -- Equal (shorthand: {"field": "value"})
  • $ne -- {"field": {"$ne": "value"}} -- Not equal
  • $gt / $gte -- {"field": {"$gt": 5}} -- Greater than (or equal)
  • $lt / $lte -- {"field": {"$lt": 20}} -- Less than (or equal)
  • $in / $nin -- {"field": {"$in": ["a", "b"]}} -- In / not in set
  • $contains -- {"field": {"$contains": "auth"}} -- Substring match
  • $exists -- {"field": {"$exists": True}} -- Field presence
  • $and / $or -- {"$and": [{...}, {...}]} -- Logical combinators
  • $not -- {"$not": {...}} -- Logical negation
  • $elemMatch -- {"tags": {"$elemMatch": {"$eq": "rust"}}} -- Match list elements
  • $size -- {"tags": {"$size": 3}} -- List length
  • dot-path -- {"author.name": "Alice"} -- Nested field access
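These operators read as predicates over a record's metadata. As a rough sketch of the evaluation semantics for a subset of them (an assumption for illustration, not vectlite's exact evaluator -- e.g. it conflates missing fields with None):

```python
def get_path(meta, path):
    """Resolve a dot-path like 'author.name'; None if any segment is missing."""
    cur = meta
    for part in path.split("."):
        if not isinstance(cur, dict) or part not in cur:
            return None
        cur = cur[part]
    return cur

def matches(meta, flt):
    for key, cond in flt.items():
        if key == "$and":
            if not all(matches(meta, c) for c in cond):
                return False
        elif key == "$or":
            if not any(matches(meta, c) for c in cond):
                return False
        elif key == "$not":
            if matches(meta, cond):
                return False
        else:
            value = get_path(meta, key)
            if not isinstance(cond, dict):
                cond = {"$eq": cond}  # {"field": "value"} shorthand
            for op, arg in cond.items():
                ok = {
                    "$eq": lambda: value == arg,
                    "$ne": lambda: value != arg,
                    "$gt": lambda: value is not None and value > arg,
                    "$gte": lambda: value is not None and value >= arg,
                    "$lt": lambda: value is not None and value < arg,
                    "$lte": lambda: value is not None and value <= arg,
                    "$in": lambda: value in arg,
                    "$nin": lambda: value not in arg,
                    "$contains": lambda: isinstance(value, str) and arg in value,
                    "$exists": lambda: (value is not None) == arg,
                    "$size": lambda: isinstance(value, list) and len(value) == arg,
                }[op]()
                if not ok:
                    return False
    return True
```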

Database Methods Reference

Write Methods

  • db.upsert(id, vector, metadata, sparse=..., vectors=...) -- Insert or update a single record
  • db.insert(id, vector, metadata, sparse=..., vectors=...) -- Insert a record (raises on duplicate id)
  • db.upsert_many(records, namespace=None) -- Upsert a batch of records (single index rebuild)
  • db.insert_many(records, namespace=None) -- Insert a batch (raises on duplicate ids)
  • db.bulk_ingest(records, namespace=None, batch_size=10000) -- Fastest bulk import with batched WAL writes
  • db.delete(id, namespace=None) -- Delete a single record
  • db.delete_many(ids, namespace=None) -- Delete multiple records by id
  • db.delete_by_filter(filter, namespace=None) -- Delete all matching records in one filtered pass

Read Methods

  • db.get(id, namespace=None) -- Get a single record by id
  • db.search(query, k=10, ...) -- Search and return a list of results
  • db.search_with_stats(query, k=10, ...) -- Search with detailed performance stats
  • db.count(namespace=None, filter=None) or len(db) -- Count records, optionally scoped by namespace/filter
  • db.list(namespace=None, filter=None, limit=0, offset=0) -- List records without issuing a vector query
  • db.namespaces() -- List all namespaces
  • db.dimension -- Vector dimension (property)
  • db.path -- Database file path (property)
  • db.read_only -- Whether the database is read-only (property)

Quantization Methods

  • db.enable_quantization(method, ...) -- Enable quantization ("scalar", "binary", or "product")
  • db.disable_quantization() -- Disable quantization and remove persisted parameters
  • db.is_quantized -- Whether quantization is enabled (property)
  • db.quantization_method -- Active method name or None (property)

Maintenance Methods

  • db.compact() -- Fold the WAL into the snapshot and persist ANN indexes
  • db.flush() -- Alias for compact()
  • db.snapshot(dest) -- Create a self-contained .vdb copy
  • db.backup(dest_dir) -- Full backup including ANN sidecar files
  • db.transaction() -- Begin an atomic transaction (use as a context manager)
  • db.close() -- Flush pending state, release the file lock, and invalidate the handle
  • with vectlite.open(...): -- Context-manager form of automatic close

How It Works

  • Records are stored in a compact binary .vdb snapshot file
  • Writes go through a crash-safe WAL (.wal) before being applied in memory
  • compact() folds the WAL into the snapshot and persists HNSW sidecar files
  • Dense search uses HNSW indexes (auto-built for collections above ~128 records)
  • Sparse search uses an inverted index with BM25 scoring
  • Hybrid fusion combines dense + sparse via linear combination or reciprocal rank fusion
  • Advisory file locks (flock) prevent concurrent write corruption
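For intuition, reciprocal rank fusion can be sketched in a few lines: each ranking contributes 1 / (k0 + rank) per id, so items near the top of any list accumulate score. The constant k0=60 is the common default from the RRF literature, assumed here rather than confirmed for vectlite:

```python
def rrf(rankings, k0=60):
    """Fuse several ordered id lists (best first) into one ranking."""
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k0 + rank)
    # Highest fused score first
    return sorted(scores, key=scores.get, reverse=True)
```

Because RRF only uses ranks, it sidesteps the scale mismatch between cosine similarities and BM25 scores that a linear combination has to normalize away.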

License

MIT


Download files

Download the file for your platform.

Source Distribution

  • vectlite-0.1.12.tar.gz (73.8 kB) -- Source

Built Distributions

  • vectlite-0.1.12-cp39-abi3-win_amd64.whl (1.6 MB) -- CPython 3.9+, Windows x86-64
  • vectlite-0.1.12-cp39-abi3-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (1.8 MB) -- CPython 3.9+, manylinux glibc 2.17+, x86-64
  • vectlite-0.1.12-cp39-abi3-manylinux_2_17_aarch64.manylinux2014_aarch64.whl (1.8 MB) -- CPython 3.9+, manylinux glibc 2.17+, ARM64
  • vectlite-0.1.12-cp39-abi3-macosx_11_0_arm64.whl (1.6 MB) -- CPython 3.9+, macOS 11.0+, ARM64
  • vectlite-0.1.12-cp39-abi3-macosx_10_12_x86_64.whl (1.7 MB) -- CPython 3.9+, macOS 10.12+, x86-64
