LlamaIndex VectorStore adapter for skeg.

These details have not been verified by PyPI

Project links

Project description

skeg-llamaindex

LlamaIndex VectorStore adapter for skeg. Compatible with llama-index-core >= 0.10.

Install

pip install skeg-llamaindex

Pulls in skeg (Python client) and llama-index-core automatically. Server install (skeg engine) is separate:

brew tap skegdb/tap
brew install skeg

Usage

from llama_index.core import VectorStoreIndex, StorageContext, Document
from llama_index.core.node_parser import SentenceSplitter
from skeg_llamaindex import SkegVectorStore

# 1. Start a skeg server:
#    skeg --data-dir ./data --addr 127.0.0.1:7379

# 2. Point the adapter at it. `dim` must match the embedding model.
store = SkegVectorStore.from_uri(
    "skeg://127.0.0.1:7379/notes",
    dim=1024,             # mxbai-embed-large-v1 dimension
    kind="int8",          # tier-1 quantisation
    backend="flat",       # in-RAM flat index for <50K vectors
)

# 3. Wire into LlamaIndex.
ctx = StorageContext.from_defaults(vector_store=store)
docs = [Document(text="hello world"), Document(text="goodbye world")]
index = VectorStoreIndex.from_documents(docs, storage_context=ctx)

# 4. Query as usual.
engine = index.as_query_engine()
print(engine.query("what does the first doc say?"))

Index backend choice

Use case	`backend`	Notes
Personal AI, < 50K nodes	`flat`	Exhaustive scan; fast on M-series CPUs
RAG over a fixed corpus, > 50K nodes	`disk_vamana` (pre-build)	Use `skeg-tool build`, then serve read-only
Streaming insert with eventual large size	`disk_vamana` (RW)	Delta WAL handles streaming

For the pre-build path, build offline once and start the server in serve mode; this adapter then queries it read-only:

# Build the index offline (one shell):
skeg-cli build --input embeddings.npy --output ./data --name notes

# Serve it read-only (another shell):
skeg --mode serve --data-dir ./data --tier pq:128:256

What this adapter handles

add(nodes): VSET each embedding + KV-store the text + metadata
query(VectorStoreQuery): VSEARCH top-k, returns node_ids + similarity
delete(ref_doc_id): VDEL + drop KV keys
Stable mapping node_id (str) → vec_id (u64) via xxh3

What this adapter does not do

Metadata filter pushdown: LlamaIndex post-filters returned hits.
Batched VSET: one VSET per node on the synchronous path. For large corpora build the index offline with skeg-cli build and serve it read-only.
Hybrid sparse+dense search: skeg's surface is dense-only.
Async API: this adapter is synchronous.

Test-suite safety

The pytest suite spawns its own skeg-server via the conftest fixture and tears it down at the end. The tests create VINDEX entries with names like notes-<test_name> and drop them after each test. If you ever override the fixture to point at an external server, those VINDEX names may collide with yours. The fixture is the safe default.

License

Apache-2.0.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.1

May 27, 2026

This version

0.1.0

May 26, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

skeg_llamaindex-0.1.0.tar.gz (17.7 kB view details)

Uploaded May 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

skeg_llamaindex-0.1.0-py3-none-any.whl (13.2 kB view details)

Uploaded May 26, 2026 Python 3

File details

Details for the file skeg_llamaindex-0.1.0.tar.gz.

File metadata

Download URL: skeg_llamaindex-0.1.0.tar.gz
Upload date: May 26, 2026
Size: 17.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for skeg_llamaindex-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`941073eb67d27db320697a46d35deb0cb90dfc5cb874786cb074499ef8ac9b84`
MD5	`70634029c4e834611f717032a8c3f137`
BLAKE2b-256	`64ddcae07a27791046f4702a64a8ed092587ea5bf9f32948e566e5ca542a974f`

See more details on using hashes here.

File details

Details for the file skeg_llamaindex-0.1.0-py3-none-any.whl.

File metadata

Download URL: skeg_llamaindex-0.1.0-py3-none-any.whl
Upload date: May 26, 2026
Size: 13.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for skeg_llamaindex-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7bd6673a8ffb7457a067b57c7a82edc897dfd5102a5919666a0699a7c52e82b2`
MD5	`383420b3d44fa93dd56502f050d3bd90`
BLAKE2b-256	`b00e177b0946e778379297b1558f0e6e46aa2215efde1a340273904764fd6b0c`

See more details on using hashes here.

skeg-llamaindex 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

skeg-llamaindex

Install

Usage

Index backend choice

What this adapter handles

What this adapter does not do

Test-suite safety

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes