LlamaIndex VectorStore adapter for skeg.
Project description
skeg-llamaindex
LlamaIndex VectorStore adapter for skeg.
Compatible with llama-index-core >= 0.10.
Install
pip install skeg-llamaindex
Pulls in skeg (Python client) and llama-index-core automatically.
Server install (skeg engine) is separate:
brew tap skegdb/tap
brew install skeg
Usage
from llama_index.core import VectorStoreIndex, StorageContext, Document
from llama_index.core.node_parser import SentenceSplitter
from skeg_llamaindex import SkegVectorStore
# 1. Start a skeg server:
# skeg --data-dir ./data --addr 127.0.0.1:7379
# 2. Point the adapter at it. `dim` must match the embedding model.
store = SkegVectorStore.from_uri(
"skeg://127.0.0.1:7379/notes",
dim=1024, # mxbai-embed-large-v1 dimension
kind="int8", # tier-1 quantisation
backend="flat", # in-RAM flat index for <50K vectors
)
# 3. Wire into LlamaIndex.
ctx = StorageContext.from_defaults(vector_store=store)
docs = [Document(text="hello world"), Document(text="goodbye world")]
index = VectorStoreIndex.from_documents(docs, storage_context=ctx)
# 4. Query as usual.
engine = index.as_query_engine()
print(engine.query("what does the first doc say?"))
Index backend choice
| Use case | backend |
Notes |
|---|---|---|
| Personal AI, < 50K nodes | flat |
Exhaustive scan; fast on M-series CPUs |
| RAG over a fixed corpus, > 50K nodes | disk_vamana (pre-build) |
Use skeg-tool build, then serve read-only |
| Streaming insert with eventual large size | disk_vamana (RW) |
Delta WAL handles streaming |
For the pre-build path, build offline once and start the server in serve mode; this adapter then queries it read-only:
# Build the index offline (one shell):
skeg-cli build --input embeddings.npy --output ./data --name notes
# Serve it read-only (another shell):
skeg --mode serve --data-dir ./data --tier pq:128:256
What this adapter handles
add(nodes): VSET each embedding + KV-store the text + metadataquery(VectorStoreQuery): VSEARCH top-k, returns node_ids + similaritydelete(ref_doc_id): VDEL + drop KV keys- Stable mapping
node_id (str) → vec_id (u64)via xxh3
What this adapter does not do
- Metadata filter pushdown: LlamaIndex post-filters returned hits.
- Batched VSET: one VSET per node on the synchronous path. For large
corpora build the index offline with
skeg-cli buildand serve it read-only. - Hybrid sparse+dense search: skeg's surface is dense-only.
- Async API: this adapter is synchronous.
Test-suite safety
The pytest suite spawns its own skeg-server via the conftest fixture
and tears it down at the end. The tests create VINDEX entries with
names like notes-<test_name> and drop them after each test. If you
ever override the fixture to point at an external server, those
VINDEX names may collide with yours. The fixture is the safe default.
License
Apache-2.0.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file skeg_llamaindex-0.1.0.tar.gz.
File metadata
- Download URL: skeg_llamaindex-0.1.0.tar.gz
- Upload date:
- Size: 17.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
941073eb67d27db320697a46d35deb0cb90dfc5cb874786cb074499ef8ac9b84
|
|
| MD5 |
70634029c4e834611f717032a8c3f137
|
|
| BLAKE2b-256 |
64ddcae07a27791046f4702a64a8ed092587ea5bf9f32948e566e5ca542a974f
|
File details
Details for the file skeg_llamaindex-0.1.0-py3-none-any.whl.
File metadata
- Download URL: skeg_llamaindex-0.1.0-py3-none-any.whl
- Upload date:
- Size: 13.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7bd6673a8ffb7457a067b57c7a82edc897dfd5102a5919666a0699a7c52e82b2
|
|
| MD5 |
383420b3d44fa93dd56502f050d3bd90
|
|
| BLAKE2b-256 |
b00e177b0946e778379297b1558f0e6e46aa2215efde1a340273904764fd6b0c
|