A fast, lightweight, and zero-setup in-memory vector store powered by NumPy

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

tvanreenen

These details have not been verified by PyPI

Project description

NumPy Vector Store

A fast, lightweight, and zero-setup in-memory vector store powered by NumPy.

High-performance index-free O(n) cosine similarity searches
Vectorized metadata queries with NumPy operations
Easy persistence to and from compressed NumPy binary files (.npz)
Zero dependency on external services or data stores

Why?

This library is purpose-built for small to medium-scale vector search tasks and offers a simple, lightweight alternative to heavyweight solutions like Pinecone, Qdrant, Weaviate, Postgres + pgvector, or Azure AI Search—no complex setup or infrastructure required.

Sometimes you don't need a sledgehammer to crack a nut.

When/Where?

Below are benchmark results for the module's search method to help you assess its suitability for your use case.

Embedding Type	Dimensions	~5ms	~25ms	~100ms	~500ms
Sentence Transformers	384	1K vectors 1.5MB	10K vectors 15MB	100K vectors 147MB	500K vectors 732MB
OpenAI Small	1536	500 vectors 3MB	5K vectors 29MB	25K vectors 147MB	100K vectors 586MB
OpenAI Large	3072	200 vectors 2MB	2.5K vectors 29MB	5K vectors 59MB	25K vectors 293MB

Benchmarks performed on Apple M2 hardware

Installation

⚠️ Pending submission to PyPI

uv add numpy-vector-store

Quick Start

import numpy as np
from numpy_vector_store import VectorStore

# Load your vector store
store = VectorStore(dimensions=1536, file_path='vectors.npz')
store.load()

# Embed your search query
query = np.array([0.2, 0.3, 0.4, ...])

# Search using cosine similarity
results = store.search(query, top_k=3)

# Compare the results
for index, similarity, meta in results:
    print(f"{meta['title']}: {similarity:.3f}")

Usage Examples

Adding Vectors

Adding Vectors in Batch

The add_vectors method takes a 2D NumPy array where each row is a vector, and a 1D NumPy array of metadata objects.

# 2D np.array of vectors
embeddings = np.array([
    [0.1, 0.2, 0.3, ...],  # Text embedding 1
    [0.4, 0.5, 0.6, ...],  # Text embedding 2
    [0.7, 0.8, 0.9, ...]   # Text embedding 3
])

# 1D np.array of metadata objects
metadata = np.array([
    {"title": "AI Overview", "word_count": 12},
    {"title": "Python Guide", "word_count": 10},
    {"title": "Vector DBs", "word_count": 8}
])

# Vectors and metadata added using efficient vectorized NumPy operations
store.add_vectors(embeddings, metadata)

Adding Vectors Individually

Batch operations are generally preferable and more efficient. But individual vectors can be added with the same method.

store.add_vectors(
    np.array([new_embedding]), # or np.atleast_2d(new_embedding)
    np.array([{"title": "Neural Networks Paper", "word_count": 15}])
)

Save the Vector Store

Save to a gzip compressed NumPy binary file directly

store.save()

Or use context manager for automatic persistence

with VectorStore(dimensions=3, file_path="vectors.npz") as store:
    store.add_vectors(vectors_2d, metadata_array)

# Automatically saves when exiting the context

Working with Metadata

Work with flexible, unstructured metadata using standard Python operations. No schema required - perfect for getting started quickly:

# Access individual metadata
first_metadata = store.metadata[0]
print(f"First entry: {first_metadata}")

# Iterate through all metadata
for i, metadata in enumerate(store.metadata):
    print(f"Entry {i}: {metadata}")

Advanced: Structured Metadata for Performance

If you define a homogeneous NumPy schema upfront, you get significant performance improvements for metadata operations. Instead of Python loops and dictionary lookups, you get vectorized NumPy operations that are orders of magnitude faster.

Learn more: NumPy Structured Arrays Documentation

# Define schema for performance
store = VectorStore(
    dimensions=512,
    metadata_schema={'title': 'U200', 'year': 'i4', 'citations': 'i4'}
)
store.add_vectors(
    np.array([vector1, vector2]),
    np.array([
        {"title": "Paper 1", "year": 2023, "citations": 100},
        {"title": "Paper 2", "year": 2022, "citations": 50}
    ])
)

# Perform vectorized NumPy operations on metadata
recent_mask = store.metadata['year'] == 2023
recent_vectors = np.array(store.vectors)[recent_mask]

# Sort by citations
sorted_indices = np.argsort(store.metadata['citations'])[::-1]
top_vectors = np.array(store.vectors)[sorted_indices]

# Complex filtering
high_impact = store.metadata['citations'] > 100
recent = store.metadata['year'] > 2020
combined_mask = high_impact & recent
filtered_vectors = np.array(store.vectors)[combined_mask]

Contributing

Setup Development Environment

git clone https://github.com/tvanreenen/numpy-vector-store.git
cd numpy-vector-store
uv sync --frozen --group dev

Before Submitting a Pull Request

Please ensure:

Code Quality: Run uv run ruff check - should show no issues
Formatting: Run uv run ruff format - should show "files left unchanged"
Type Checking: Run uv run mypy src/ - should show no errors
Tests: Run uv run pytest - all tests should pass

License

MIT License - see LICENSE file for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

tvanreenen

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Mar 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

numpy_vector_store-0.1.0.tar.gz (52.4 kB view details)

Uploaded Mar 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

numpy_vector_store-0.1.0-py3-none-any.whl (7.4 kB view details)

Uploaded Mar 27, 2026 Python 3

File details

Details for the file numpy_vector_store-0.1.0.tar.gz.

File metadata

Download URL: numpy_vector_store-0.1.0.tar.gz
Upload date: Mar 27, 2026
Size: 52.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for numpy_vector_store-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`47a9c87edc909da868248fe8e66aa9e749856ce1ae5c9ede2d1bcb61b2e0c369`
MD5	`5cf6b0f947da982337dbaa3df996c863`
BLAKE2b-256	`c795d1b170ed7d4aa1a3f2080344cb5fdf43c0df342106cd9416bdbbb5041f7d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for numpy_vector_store-0.1.0.tar.gz:

Publisher: publish-pypi.yml on tvanreenen/numpy-vector-store

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: numpy_vector_store-0.1.0.tar.gz
- Subject digest: 47a9c87edc909da868248fe8e66aa9e749856ce1ae5c9ede2d1bcb61b2e0c369
- Sigstore transparency entry: 1187781473
- Sigstore integration time: Mar 27, 2026
Source repository:
- Permalink: tvanreenen/numpy-vector-store@a9aed836a149ee1a23ed79dfa3839e924edde971
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/tvanreenen
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@a9aed836a149ee1a23ed79dfa3839e924edde971
- Trigger Event: release

File details

Details for the file numpy_vector_store-0.1.0-py3-none-any.whl.

File metadata

Download URL: numpy_vector_store-0.1.0-py3-none-any.whl
Upload date: Mar 27, 2026
Size: 7.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for numpy_vector_store-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9f403ed67ba861a616d4d387d8900b714e2f448a43cfd3e28f6c8961df5d2f94`
MD5	`6f5f04c80a99418dfa26b3ee506874b8`
BLAKE2b-256	`03f7ef4d7b194fce36e186fd124b63d920bf743b556c6dd3722c6be072a6f7a8`

See more details on using hashes here.

Provenance

The following attestation bundles were made for numpy_vector_store-0.1.0-py3-none-any.whl:

Publisher: publish-pypi.yml on tvanreenen/numpy-vector-store

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: numpy_vector_store-0.1.0-py3-none-any.whl
- Subject digest: 9f403ed67ba861a616d4d387d8900b714e2f448a43cfd3e28f6c8961df5d2f94
- Sigstore transparency entry: 1187781477
- Sigstore integration time: Mar 27, 2026
Source repository:
- Permalink: tvanreenen/numpy-vector-store@a9aed836a149ee1a23ed79dfa3839e924edde971
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/tvanreenen
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@a9aed836a149ee1a23ed79dfa3839e924edde971
- Trigger Event: release

numpy-vector-store 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

NumPy Vector Store

Why?

When/Where?

Installation

Quick Start

Usage Examples

Adding Vectors

Adding Vectors in Batch

Adding Vectors Individually

Save the Vector Store

Working with Metadata

Advanced: Structured Metadata for Performance

Contributing

Setup Development Environment

Before Submitting a Pull Request

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance