Embedded single-file property-graph database with vector and BM25 full-text search
Project description
LatticeDB Python Bindings
Python bindings for LatticeDB, an embedded single-file property-graph database with native vector and BM25 full-text search.
Installation
pip install latticedb
Published wheels are expected to bundle the native shared library on supported platforms.
If you are installing from a source checkout, the package build can either:
- bundle a prebuilt
liblatticefromLATTICE_BUNDLE_LIB_DIR/LATTICE_BUNDLE_LIB_PATH, or - build
liblatticewith Zig during the wheel build
For example, to bundle a staged installed library into a locally built wheel:
export LATTICE_BUNDLE_LIB_DIR=/tmp/lattice-install/lib
pip wheel . -w dist
pip install dist/latticedb-*.whl
At runtime, explicit library discovery overrides still work via LATTICE_LIB_PATH, LATTICE_PREFIX, and pkg-config.
Migration note: embedding helpers now live in the dedicated latticedb.embedding module. See ../../docs/client_api_migration.md for the preferred API names and deprecated compatibility aliases.
Installed-prefix workflow:
zig build install --prefix /tmp/lattice-install
export LATTICE_PREFIX=/tmp/lattice-install
Alternatively, discovery can use pkg-config:
export PKG_CONFIG_PATH=/tmp/lattice-install/lib/pkgconfig
Quick Start
import numpy as np
from latticedb import Database
with Database("knowledge.db", create=True, enable_vectors=True, vector_dimensions=4) as db:
# Create nodes, edges, and index content
with db.write() as txn:
alice = txn.create_node(
labels=["Person"],
properties={"name": "Alice", "age": 30},
)
bob = txn.create_node(
labels=["Person"],
properties={"name": "Bob", "age": 25},
)
txn.create_edge(alice.id, bob.id, "KNOWS")
# Index text for full-text search
txn.fts_index(alice.id, "Alice works on machine learning research")
txn.fts_index(bob.id, "Bob studies deep learning and neural networks")
# Store vector embeddings
txn.set_vector(alice.id, "embedding", np.array([1.0, 0.0, 0.0, 0.0], dtype=np.float32))
txn.set_vector(bob.id, "embedding", np.array([0.0, 1.0, 0.0, 0.0], dtype=np.float32))
txn.commit()
# Query with Cypher
result = db.query("MATCH (n:Person) WHERE n.age > 20 RETURN n.name, n.age")
for row in result:
print(row)
# Vector similarity search
query_vec = np.array([0.9, 0.1, 0.0, 0.0], dtype=np.float32)
for r in db.vector_search(query_vec, k=2):
print(f"Node {r.node_id}: distance={r.distance:.4f}")
# Full-text search
for r in db.fts_search("machine learning"):
print(f"Node {r.node_id}: score={r.score:.4f}")
# Fuzzy search (typo-tolerant)
for r in db.fts_search_fuzzy("machin lerning"):
print(f"Node {r.node_id}: score={r.score:.4f}")
API Reference
Database
Database(
path: str | Path,
*,
create: bool = False, # Create if doesn't exist
read_only: bool = False, # Open in read-only mode
cache_size_mb: int = 100, # Page cache size
enable_vectors: bool | None = None, # Preferred vector config flag
enable_vector: bool | None = None, # Deprecated compatibility alias
vector_dimensions: int = 128 # Vector dimensions
)
Methods
open()/close()- Open/close the database (also works as context manager)read()- Start a read-only transaction (context manager)write()- Start a read-write transaction (context manager)query(cypher, parameters=None)- Execute a Cypher queryvector_search(vector, k=10, ef_search=64)- k-NN vector searchfts_search(query, limit=10)- Full-text searchfts_search_fuzzy(query, limit=10, max_distance=0, min_term_length=0)- Fuzzy full-text searchread_stream(stream, after_sequence=0, limit=100, timeout_ms=0)- Read durable stream records by cursorget_stream_offset(stream, consumer)- Read a committed consumer offsetchanges(after_sequence=0, limit=100, timeout_ms=0)- Read the built-in graph changefeedcache_clear()- Clear the query cachecache_stats()- Get cache hit/miss statistics
Transaction
Read Operations
get_node(node_id)- Get a node by ID, returnsNodeorNonenode_exists(node_id)- Check if a node existsget_property(node_id, key)- Get a property valueget_outgoing_edges(node_id)- Get outgoing edges from a nodeget_incoming_edges(node_id)- Get incoming edges to a nodeis_read_only/is_active- Transaction state
Write Operations
create_node(labels=[], properties=None)- Create a nodedelete_node(node_id)- Delete a nodeset_property(node_id, key, value)- Set a property on a nodeset_vector(node_id, key, vector)- Set a vector embeddingbatch_insert_vectors(label, vectors)- Insert vector-bearing nodes in one callbatch_insert(label, vectors)- Deprecated compatibility alias forbatch_insert_vectorsfts_index(node_id, text)- Index text for full-text searchcreate_edge(source_id, target_id, edge_type, properties=None)- Create an edgedelete_edge(source_id, target_id, edge_type)- Delete an edgeset_edge_property(edge_id, key, value)- Set an edge property by stable edge IDget_edge_property(edge_id, key)- Get an edge property by stable edge IDremove_edge_property(edge_id, key)- Remove an edge property by stable edge IDpublish_stream(stream, payload, kind="message")- Publish a durable stream recordset_stream_offset(stream, consumer, sequence)- Commit a durable consumer offsettrim_stream(stream, through_sequence)- Delete stream records through a sequencecommit()/rollback()- Commit or rollback the transaction
Bulk Vector Insertion
Insert many nodes with vectors in a single efficient call:
import numpy as np
with Database("vectors.db", create=True, enable_vectors=True, vector_dimensions=128) as db:
with db.write() as txn:
vectors = np.random.rand(1000, 128).astype(np.float32)
node_ids = txn.batch_insert_vectors("Document", vectors)
print(f"Created {len(node_ids)} nodes")
txn.commit()
Full-Text Search
Exact Search
results = db.fts_search("machine learning", limit=10)
for r in results:
print(f"Node {r.node_id}: score={r.score:.4f}")
Fuzzy Search (Typo-Tolerant)
# Finds "machine learning" even with typos
results = db.fts_search_fuzzy("machne lerning", limit=10)
# Control fuzzy matching sensitivity
results = db.fts_search_fuzzy(
"machne",
limit=10,
max_distance=2, # Max edit distance (default: 2)
min_term_length=4, # Min term length for fuzzy matching (default: 4)
)
Embeddings
LatticeDB includes a built-in hash embedding function and an HTTP client for external embedding services. For new code, prefer the dedicated latticedb.embedding module. The package root still exposes deprecated compatibility aliases.
Hash Embeddings (Built-in)
Deterministic, no external service needed. Useful for testing or simple keyword-based similarity:
from latticedb.embedding import hash_embed
vec = hash_embed("hello world", dimensions=128)
print(vec.shape) # (128,)
HTTP Embedding Client
Connect to Ollama, OpenAI, or compatible APIs:
from latticedb.embedding import EmbeddingClient, EmbeddingApiFormat
# Ollama (default)
with EmbeddingClient("http://localhost:11434") as client:
vec = client.embed("hello world")
# OpenAI-compatible API
with EmbeddingClient(
"https://api.openai.com/v1",
model="text-embedding-3-small",
api_format=EmbeddingApiFormat.OPENAI,
api_key="sk-...",
) as client:
vec = client.embed("hello world")
Edge Traversal
with db.read() as txn:
outgoing = txn.get_outgoing_edges(node_id)
for edge in outgoing:
print(f"{edge.source_id} --[{edge.edge_type}]--> {edge.target_id}")
incoming = txn.get_incoming_edges(node_id)
for edge in incoming:
print(f"{edge.source_id} --[{edge.edge_type}]--> {edge.target_id}")
Cypher Queries
# Pattern matching
result = db.query("MATCH (n:Person) RETURN n.name")
# With parameters
result = db.query(
"MATCH (n:Person) WHERE n.name = $name RETURN n",
parameters={"name": "Alice"},
)
# Vector similarity in Cypher
result = db.query(
"MATCH (n:Document) WHERE n.embedding <=> $vec < 0.5 RETURN n.title",
parameters={"vec": query_vector},
)
# Full-text search in Cypher
result = db.query(
'MATCH (n:Document) WHERE n.content @@ "machine learning" RETURN n.title'
)
# Data mutation
db.query("CREATE (n:Person {name: 'Charlie', age: 35})")
db.query("MATCH (n:Person {name: 'Charlie'}) SET n.age = 36")
db.query("MATCH (n:Person {name: 'Charlie'}) DETACH DELETE n")
Query Cache
# Get cache statistics
stats = db.cache_stats()
print(f"Entries: {stats['entries']}, Hits: {stats['hits']}, Misses: {stats['misses']}")
# Clear the cache
db.cache_clear()
Durable Streams and Changefeeds
Streams are durable named event logs stored inside the database file. Records are published in write transactions, sequence numbers are per stream, and reads use an explicit cursor. Reads do not acknowledge records; commit offsets separately when your consumer has processed a batch.
with Database("events.db", create=True) as db:
with db.write() as txn:
txn.publish_stream("jobs", {"id": 1, "status": "queued"}, kind="job.queued")
txn.commit()
records = db.read_stream("jobs", after_sequence=0, limit=100, timeout_ms=0)
with db.write() as txn:
txn.set_stream_offset("jobs", "worker-a", records[-1].sequence)
txn.trim_stream("jobs", records[-1].sequence - 1)
txn.commit()
db.changes() reads the reserved __lattice_changes stream. It emits semantic
graph events such as node.insert, node.property_set, edge.delete, and
edge.property_remove, with payloads represented as normal Python values.
Supported Property Types
None- Null valuebool- Booleanint- 64-bit integerfloat- 64-bit floatstr- UTF-8 stringbytes- Binary data- NumPy
ndarray(float32) - Vector embeddings
Nested list and dict values are not currently exposed by the public bindings/C API.
Error Handling
from latticedb import LatticeError, LatticeNotFoundError, LatticeIOError
try:
with Database("nonexistent.db") as db:
pass
except LatticeNotFoundError:
print("Database not found")
except LatticeIOError:
print("I/O error")
except LatticeError as e:
print(f"Error: {e}")
Requirements
- Python 3.9+
- NumPy (for vector operations)
- The native LatticeDB library (
liblattice.dylib/liblattice.so)
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file latticedb-0.8.2.tar.gz.
File metadata
- Download URL: latticedb-0.8.2.tar.gz
- Upload date:
- Size: 40.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
223460bd78759a1e813f57a47b21b388dace4e236d549d71aeb71d40bc27e279
|
|
| MD5 |
f7509bd8ac44e3a39957a7248d2782db
|
|
| BLAKE2b-256 |
15bbaa651b7707d568bb9ae29ede1197526d3f28c3fe1108a4993292f5330415
|
Provenance
The following attestation bundles were made for latticedb-0.8.2.tar.gz:
Publisher:
release.yml on jeffhajewski/latticedb
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
latticedb-0.8.2.tar.gz -
Subject digest:
223460bd78759a1e813f57a47b21b388dace4e236d549d71aeb71d40bc27e279 - Sigstore transparency entry: 1452476885
- Sigstore integration time:
-
Permalink:
jeffhajewski/latticedb@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Branch / Tag:
refs/tags/v0.8.2 - Owner: https://github.com/jeffhajewski
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Trigger Event:
push
-
Statement type:
File details
Details for the file latticedb-0.8.2-py3-none-manylinux_2_17_x86_64.whl.
File metadata
- Download URL: latticedb-0.8.2-py3-none-manylinux_2_17_x86_64.whl
- Upload date:
- Size: 3.2 MB
- Tags: Python 3, manylinux: glibc 2.17+ x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
752c7ed87b0e8602ea976c732da910f46f36596a01e5c6d8ad8d6b63f3ef6fba
|
|
| MD5 |
3935687d3de4ed444873e0835976ebd2
|
|
| BLAKE2b-256 |
3016ff667a948e7352b6ee928b3312f2ee2c1f90048462f59034be3dfc8fa8dc
|
Provenance
The following attestation bundles were made for latticedb-0.8.2-py3-none-manylinux_2_17_x86_64.whl:
Publisher:
release.yml on jeffhajewski/latticedb
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
latticedb-0.8.2-py3-none-manylinux_2_17_x86_64.whl -
Subject digest:
752c7ed87b0e8602ea976c732da910f46f36596a01e5c6d8ad8d6b63f3ef6fba - Sigstore transparency entry: 1452477058
- Sigstore integration time:
-
Permalink:
jeffhajewski/latticedb@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Branch / Tag:
refs/tags/v0.8.2 - Owner: https://github.com/jeffhajewski
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Trigger Event:
push
-
Statement type:
File details
Details for the file latticedb-0.8.2-py3-none-manylinux_2_17_aarch64.whl.
File metadata
- Download URL: latticedb-0.8.2-py3-none-manylinux_2_17_aarch64.whl
- Upload date:
- Size: 3.1 MB
- Tags: Python 3, manylinux: glibc 2.17+ ARM64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d6b9a14e9ea718ed3678004bf3db1c32572bd1045d18e30c59822685bfc765fb
|
|
| MD5 |
168c1ca9ea6158e028cb0ca43b55605a
|
|
| BLAKE2b-256 |
9a7953dd866e12e6ef96487ad8e0d608a27c57b620d720350f58cb82198977ae
|
Provenance
The following attestation bundles were made for latticedb-0.8.2-py3-none-manylinux_2_17_aarch64.whl:
Publisher:
release.yml on jeffhajewski/latticedb
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
latticedb-0.8.2-py3-none-manylinux_2_17_aarch64.whl -
Subject digest:
d6b9a14e9ea718ed3678004bf3db1c32572bd1045d18e30c59822685bfc765fb - Sigstore transparency entry: 1452477224
- Sigstore integration time:
-
Permalink:
jeffhajewski/latticedb@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Branch / Tag:
refs/tags/v0.8.2 - Owner: https://github.com/jeffhajewski
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Trigger Event:
push
-
Statement type:
File details
Details for the file latticedb-0.8.2-py3-none-macosx_11_0_arm64.whl.
File metadata
- Download URL: latticedb-0.8.2-py3-none-macosx_11_0_arm64.whl
- Upload date:
- Size: 776.2 kB
- Tags: Python 3, macOS 11.0+ ARM64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3bd24ac9069b28c6e5aac53e4348772167e5d25366b2e08c4d658f9480e2aaf3
|
|
| MD5 |
68ba577ec5c0dc9acad3e891fdaae183
|
|
| BLAKE2b-256 |
5c701677b6cc232c29960e3503ec712ac9e3d512fddcc888419d0ae24f9aa059
|
Provenance
The following attestation bundles were made for latticedb-0.8.2-py3-none-macosx_11_0_arm64.whl:
Publisher:
release.yml on jeffhajewski/latticedb
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
latticedb-0.8.2-py3-none-macosx_11_0_arm64.whl -
Subject digest:
3bd24ac9069b28c6e5aac53e4348772167e5d25366b2e08c4d658f9480e2aaf3 - Sigstore transparency entry: 1452476969
- Sigstore integration time:
-
Permalink:
jeffhajewski/latticedb@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Branch / Tag:
refs/tags/v0.8.2 - Owner: https://github.com/jeffhajewski
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Trigger Event:
push
-
Statement type:
File details
Details for the file latticedb-0.8.2-py3-none-macosx_10_9_x86_64.whl.
File metadata
- Download URL: latticedb-0.8.2-py3-none-macosx_10_9_x86_64.whl
- Upload date:
- Size: 841.0 kB
- Tags: Python 3, macOS 10.9+ x86-64
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
22b1d3eabb4422ef3f1c473b42f92f245296c75eb4bb29eda96f7810744ffab8
|
|
| MD5 |
faea2707f29fd4a6011a9183eb58500b
|
|
| BLAKE2b-256 |
42ffa0185551a0d10a0a0f3911f0132b041e4fe3ec3ec14ceb2c1f48e7d91815
|
Provenance
The following attestation bundles were made for latticedb-0.8.2-py3-none-macosx_10_9_x86_64.whl:
Publisher:
release.yml on jeffhajewski/latticedb
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
latticedb-0.8.2-py3-none-macosx_10_9_x86_64.whl -
Subject digest:
22b1d3eabb4422ef3f1c473b42f92f245296c75eb4bb29eda96f7810744ffab8 - Sigstore transparency entry: 1452477149
- Sigstore integration time:
-
Permalink:
jeffhajewski/latticedb@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Branch / Tag:
refs/tags/v0.8.2 - Owner: https://github.com/jeffhajewski
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9259f915d31bcf3d7ffae880d94bf2d336385dcf -
Trigger Event:
push
-
Statement type: