Fast LMDB-backed vector cache with Python bindings and embedding helpers

Project description

vector_cache_lmdb

vector_cache_lmdb is a Rust library with Python bindings for storing and retrieving Vec<f32> embeddings by text key.

It uses LMDB via heed for durable mmap-backed storage with many concurrent readers and safely serialized writers across multiple processes.

Installation

pip install vector-cache-lmdb

Python usage

from vector_cache_lmdb import VectorCache

cache = VectorCache("/var/opt/cache.bin", max_items=1_000_000)

cache.put("this is a test string", [0.1, 0.5, 1.1])
print(cache.get("this is a test string"))

cache.put_multi(
    ["first string", "second string"],
    [[0.1, 0.5, 1.1], [2.0, 3.0, 4.0]],
)

print(cache.get_multi(["first string", "second string", "missing"]))
print(cache.delete("this is a test string"))
print(cache.delete_multi(["first string", "second string", "missing"]))

cache.reset()

Capacity can be configured in one of two mutually exclusive ways:

# Count-based capacity
cache = VectorCache("/var/opt/cache.bin", max_items=5_000_000)

# Byte-based capacity (vector payload bytes), fixed dimension required
cache = VectorCache("/var/opt/cache.bin", max_gb=2.0)

Passing both max_items and max_gb raises ValueError.

OpenAI Embeddings Helper

Use get_embeddings_with_cache with make_openai_embed_fn to keep ordering exact while filling cache misses:

from openai import OpenAI

from vector_cache_lmdb import (
    VectorCache,
    get_embeddings_with_cache,
    make_openai_embed_fn,
)

client = OpenAI()
cache = VectorCache("/var/opt/cache.bin", max_items=1_000_000)
embed_fn = make_openai_embed_fn("text-embedding-3-small", client=client)

texts = ["first string", "second string", "first string"]
embeddings = get_embeddings_with_cache(
    texts=texts,
    cache=cache,
    embed_fn=embed_fn,
)

make_openai_embed_fn loads .env automatically when client is omitted.

Sentence-Transformers Embeddings Helper

from sentence_transformers import SentenceTransformer

from vector_cache_lmdb import (
    VectorCache,
    get_embeddings_with_cache,
    make_sentence_transformers_embed_fn,
)

encoder = SentenceTransformer("sentence-transformers/all-MiniLM-L6-v2")
cache = VectorCache("/var/opt/cache.bin", max_items=1_000_000)
embed_fn = make_sentence_transformers_embed_fn(encoder)

texts = ["first string", "second string", "first string"]
embeddings = get_embeddings_with_cache(
    texts=texts,
    cache=cache,
    embed_fn=embed_fn,
)

Generic Custom Provider

from vector_cache_lmdb import get_embeddings_with_cache

def my_embed_fn(texts: list[str]) -> list[list[float]]:
    return my_provider.embed(texts)  # must return list[list[float]]

embeddings = get_embeddings_with_cache(
    texts=texts,
    cache=cache,
    embed_fn=my_embed_fn,
)

Notes

The path argument is an LMDB environment directory (not a single data file).
Hashing uses BLAKE3 for very fast fixed-size text keys.
get_multi preserves input order and returns None for missing entries.
Capacity is enforced with an on-disk LRU index shared across processes.
Eviction runs on writes:
- max_items: when len() > max_items
- max_gb: when bytes_len() > max_gb budget (vector payload bytes)
In max_gb mode, vector dimension is detected and must remain fixed.
For correctness, use a local filesystem path. LMDB locking semantics are not guaranteed on remote/network filesystems (for example SSHFS/NFS mounts).

Project details

Release history Release notifications | RSS feed

0.1.7

Apr 11, 2026

0.1.6

Apr 11, 2026

0.1.4

Apr 11, 2026

This version

0.1.3

Apr 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vector_cache_lmdb-0.1.3-cp313-cp313-manylinux_2_38_x86_64.whl (378.9 kB view details)

Uploaded Apr 11, 2026 CPython 3.13manylinux: glibc 2.38+ x86-64

File details

Details for the file vector_cache_lmdb-0.1.3-cp313-cp313-manylinux_2_38_x86_64.whl.

File metadata

Download URL: vector_cache_lmdb-0.1.3-cp313-cp313-manylinux_2_38_x86_64.whl
Upload date: Apr 11, 2026
Size: 378.9 kB
Tags: CPython 3.13, manylinux: glibc 2.38+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.11

File hashes

Hashes for vector_cache_lmdb-0.1.3-cp313-cp313-manylinux_2_38_x86_64.whl
Algorithm	Hash digest
SHA256	`024ee45ffa4ba6af8353599f843140f6d2fcc4557dfe8b44f554286559effb6e`
MD5	`f736323feb0755fce7825a3201dbc3f8`
BLAKE2b-256	`3ece7fd8ced9978a2aa8dc910f43f152edcdba23708f387e30364f7aa499d521`

See more details on using hashes here.

vector-cache-lmdb 0.1.3

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

vector_cache_lmdb

Installation

Python usage

OpenAI Embeddings Helper

Sentence-Transformers Embeddings Helper

Generic Custom Provider

Notes

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes