My Vector Database

A lightweight vector database built with Python

A lightweight, production-ready vector database with a RESTful API and Python SDK. Built with FastAPI and Pydantic, it supports storing documents with vector embeddings and provides efficient similarity search using custom-implemented vector indexes.

What's New

v0.3.0 - IVF Index Implementation

  • ⚡ IVF Index - Fast approximate search for large datasets
    • 10x+ faster than FLAT on large datasets (>10k vectors)
    • Configurable speed/accuracy tradeoff via nlist and nprobe parameters
    • 80-95% recall with cluster-based search
    • See IVF Index Documentation

v0.2.1 - Interactive CLI

  • 🎯 Interactive CLI - New optional CLI for database management
    • Install with: pip install "my-vector-db[cli]"
    • Commands for libraries, documents, chunks, search, and persistence
    • See CLI Documentation

Features

Core Features ✅

  • CRUD Operations: Full create, read, update, delete for libraries, documents, and chunks
  • Vector Indexes: FLAT index implementation with cosine, euclidean, and dot product metrics
  • K-Nearest Neighbor Search: Efficient similarity search with configurable result count
  • Metadata Support: Flexible metadata dictionaries for all entities
  • Thread-Safe Operations: Proper concurrency control for read/write operations

Advanced Features ✅

  • Metadata Filtering: Server-side declarative filters with complex logic (AND/OR, nested filters)
  • Custom Filter Functions: Client-side Python filter functions for advanced use cases
  • Data Persistence: Save/restore database snapshots with automatic and manual triggers
  • Python SDK Client: Fully-featured, type-safe SDK with comprehensive documentation
  • Batch Operations: Efficient bulk insert for chunks with atomic transactions
  • Comprehensive Testing: Unit tests with >80% coverage
  • Agent Framework Integration: Integration with the Agno framework for basic RAG applications
  • MCP Server: MCP server exposing vector database operations

Prerequisites

  • uv
  • Docker or Podman

Note: This project uses uv for all Python operations including dependency management, testing, and running the server.

Core Concepts

My Vector Database organizes data in a three-tier hierarchy:

Libraries → Documents → Chunks
  • Libraries: Top-level containers with their own vector index configuration (FLAT, IVF)
  • Documents: Logical groupings of related text chunks within a library
  • Chunks: Individual pieces of text with vector embeddings and metadata (the searchable units)

This structure allows you to organize and search your data at different levels of granularity. For example, a library might contain multiple PDF documents, each document split into searchable chunks with embeddings.

Quick Start

Start the Server

See the Run the Vector Database Server guide for detailed setup options.

Quick start with Docker Compose (recommended):

# Clone the repository
git clone https://github.com/ajshedivy/my-vector-db.git
cd my-vector-db

# Start with Docker Compose
docker compose up -d

Or run directly with Docker:

docker run -d --name my-vector-db -p 8000:8000 ghcr.io/ajshedivy/my-vector-db:latest

The API will be available at http://localhost:8000.

Install the SDK

# Install from PyPI
pip install my-vector-db

# Or install from source
git clone https://github.com/ajshedivy/my-vector-db.git
cd my-vector-db
uv sync

Using the Python SDK

Here's a complete workflow in ~15 lines:

from my_vector_db.sdk import VectorDBClient

client = VectorDBClient(base_url="http://localhost:8000")

# Create library → document → chunks
library = client.create_library(name="docs", index_type="flat")
document = client.create_document(library_id=library.id, name="intro")

# Add chunks with embeddings
chunks = client.add_chunks(
    document_id=document.id,
    chunks=[
        {
            "text": "AI enables pattern recognition",
            "embedding": [0.9, 0.8, 0.1, 0.2, 0.3],
        },
        {
            "text": "Deep learning uses neural networks",
            "embedding": [0.85, 0.75, 0.15, 0.25, 0.35],
        },
    ],
)

# Search for similar content
results = client.search(
    library_id=library.id, embedding=[0.88, 0.78, 0.12, 0.22, 0.32], k=5
)
for result in results.results:
    print(f"{result.score:.4f} - {result.text}")
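Why the search above ranks the first chunk higher can be checked by hand. The snippet below computes cosine similarity (the default metric for a FLAT index with `"metric": "cosine"`) between the query and each chunk embedding, independent of the server:

```python
import math

def cosine(a, b):
    # Cosine similarity: dot(a, b) / (|a| * |b|)
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

query = [0.88, 0.78, 0.12, 0.22, 0.32]
chunk1 = [0.9, 0.8, 0.1, 0.2, 0.3]      # "AI enables pattern recognition"
chunk2 = [0.85, 0.75, 0.15, 0.25, 0.35]  # "Deep learning uses neural networks"

print(cosine(query, chunk1))  # ~0.9994
print(cosine(query, chunk2))  # ~0.9987
```

Both chunks are close to the query, but the first scores slightly higher, matching the ordering the server returns.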

For comprehensive SDK documentation, see the SDK Reference Guide which covers:

  • Detailed API reference for all operations
  • Advanced filtering (metadata, time-based, custom functions)
  • Persistence management (snapshots, backups, restore)
  • Vector index configuration and best practices
  • Performance optimization and error handling

For more usage examples, see the examples/ directory.

Architecture

Data Model

The vector database has three main data components organized in a hierarchical structure:


Key Relationships:

  • One Library contains many Documents (1:N)
  • One Document contains many Chunks (1:N)
  • Search operations are performed at the Library level across all chunks
  • Vector indexes are built and configured per Library


Key Design Principles:

  1. Layered Architecture: Clean separation (API → Service → Storage → Index)
  2. Thread-Safe: RLock-based synchronization for concurrent operations
  3. Type-Safe: Full Pydantic validation throughout
  4. Persistence: JSON snapshots with atomic writes
  5. Filtering: Post-filtering strategy with declarative and custom options
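The RLock-based synchronization in principle 2 can be sketched with a minimal thread-safe store. This is an illustration of the pattern, not the project's actual storage class:

```python
import threading

class ThreadSafeStore:
    """Minimal key-value store guarded by a reentrant lock."""

    def __init__(self):
        self._lock = threading.RLock()
        self._data = {}

    def put(self, key, value):
        with self._lock:
            self._data[key] = value

    def get(self, key):
        with self._lock:
            return self._data.get(key)

    def update(self, key, fn):
        # Because the lock is reentrant, this method can call
        # get() and put() while already holding it.
        with self._lock:
            self.put(key, fn(self.get(key)))

store = ThreadSafeStore()
store.put("count", 0)
store.update("count", lambda v: v + 1)
print(store.get("count"))  # 1
```

A reentrant lock matters here precisely because composite operations like `update` re-acquire the lock held by the outer call; a plain `Lock` would deadlock.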

Libraries

A library is a top-level container for documents with its own vector index configuration.

Attributes:

  • id: Unique identifier (UUID)
  • name: Library name
  • index_type: Type of vector index (flat, ivf)
  • index_config: Index-specific configuration (e.g., distance metric)
  • metadata: Optional metadata dictionary
  • document_ids: List of document IDs in the library
  • created_at, updated_at: Timestamps

Documents

A document represents a logical grouping of text chunks within a library.

Attributes:

  • id: Unique identifier (UUID)
  • library_id: Parent library ID
  • name: Document name
  • metadata: Optional metadata dictionary
  • chunk_ids: List of chunk IDs in the document
  • created_at, updated_at: Timestamps

Chunks

A chunk is a single piece of text with its vector embedding and metadata (the searchable unit).

Attributes:

  • id: Unique identifier (UUID)
  • document_id: Parent document ID
  • text: Text content
  • embedding: Vector embedding (list of floats)
  • metadata: Optional metadata dictionary
  • created_at, updated_at: Timestamps
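The attribute lists above map naturally onto typed models. A hedged sketch of the Chunk entity follows, shown with stdlib dataclasses to stay dependency-free (the project itself uses Pydantic, and its exact field definitions may differ):

```python
from dataclasses import dataclass, field
from datetime import datetime, timezone
from typing import Optional
from uuid import UUID, uuid4

@dataclass
class Chunk:
    document_id: UUID              # parent document ID
    text: str                      # text content
    embedding: list[float]         # vector embedding
    metadata: Optional[dict] = None
    id: UUID = field(default_factory=uuid4)
    created_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))
    updated_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))

chunk = Chunk(
    document_id=uuid4(),
    text="AI enables pattern recognition",
    embedding=[0.9, 0.8, 0.1, 0.2, 0.3],
)
print(len(chunk.embedding))  # 5
```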

Project Structure

The project follows Domain-Driven Design principles with clear separation of concerns:

  • Domain Layer (src/my_vector_db/domain/): Core business models using Pydantic
  • Service Layer (src/my_vector_db/services/): Business logic and orchestration
  • API Layer (src/my_vector_db/api/): FastAPI endpoints and request/response DTOs
  • Infrastructure Layer (src/my_vector_db/infrastructure/): Storage and vector index implementations

Vector Indexes

| Index Type | Status | Search Type | Time Complexity | Recall | Best For |
|---|---|---|---|---|---|
| FLAT | ✅ Implemented | Exact | O(n·d) | 100% | Small to medium datasets (<10k vectors) |
| IVF | ✅ Implemented | Approximate | O(k + (n/k)·d) | 80-95% | Large datasets (>10k vectors), speed/accuracy balance |
| HNSW | ⚠️ Planned | Approximate | O(log n) | 95-99% | Large datasets (>10k vectors), speed priority |

FLAT Index (Exact Search) ✅ Implemented

Exhaustive brute-force search comparing the query vector against every stored vector.

Characteristics:

  • Time Complexity: O(n·d) search, O(1) insert
  • Space Complexity: O(n·d)
  • Recall: 100% (exact results, guaranteed true nearest neighbors)
  • Best For: Small to medium datasets (< 10,000 vectors), when accuracy is critical

Supported Metrics: cosine, euclidean, dot_product

library = client.create_library(
    name="my_library",
    index_type="flat",
    index_config={"metric": "cosine"}
)
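What a FLAT index does internally can be sketched in a few lines: score the query against every stored vector and keep the top k. This is an illustrative sketch of exhaustive O(n·d) search, not the project's implementation:

```python
import math

def flat_search(query, vectors, k=5):
    """Exhaustive k-NN: cosine-score every stored vector, return top k."""
    def cosine(a, b):
        dot = sum(x * y for x, y in zip(a, b))
        norm_a = math.sqrt(sum(x * x for x in a))
        norm_b = math.sqrt(sum(y * y for y in b))
        return dot / (norm_a * norm_b)

    scored = [(cosine(query, v), i) for i, v in enumerate(vectors)]
    scored.sort(reverse=True)  # highest similarity first
    return scored[:k]

vectors = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
print(flat_search([1.0, 0.1], vectors, k=2))
```

Because every vector is scored, the result is exact, which is why FLAT guarantees 100% recall.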

IVF Index (Approximate Search) ✅ Implemented

Inverted File (IVF) index using K-means clustering to partition the vector space for faster approximate search.

Characteristics:

  • Time Complexity: O(k + (n/k)·d) search, O(1) insert
  • Space Complexity: O(n·d + k·d)
  • Recall: 80-95% (configurable via nlist/nprobe)
  • Best For: Large datasets (> 10,000 vectors), balancing speed and accuracy
  • Configurable Parameters:
    • nlist: Number of clusters (e.g., 100)
    • nprobe: Number of clusters to search (e.g., 10)

Supported Metrics: cosine, euclidean, dot_product

library = client.create_library(
    name="my_library",
    index_type="ivf",
    index_config={"nlist": 100, "nprobe": 10}
)
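The nlist/nprobe tradeoff can be illustrated with a toy IVF search: partition the vectors into nlist clusters with a few k-means iterations, then scan only the nprobe clusters closest to the query. This is a simplified sketch using the euclidean metric, not the project's implementation:

```python
import math
import random

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def kmeans(vectors, nlist, iters=10, seed=0):
    """A few Lloyd iterations to build the coarse quantizer (centroids)."""
    rng = random.Random(seed)
    centroids = rng.sample(vectors, nlist)
    for _ in range(iters):
        clusters = [[] for _ in range(nlist)]
        for v in vectors:
            nearest = min(range(nlist), key=lambda c: euclidean(v, centroids[c]))
            clusters[nearest].append(v)
        for c, members in enumerate(clusters):
            if members:
                dim = len(members[0])
                centroids[c] = [sum(m[d] for m in members) / len(members) for d in range(dim)]
    return centroids

def ivf_search(query, vectors, nlist=4, nprobe=2, k=1):
    centroids = kmeans(vectors, nlist)
    # Inverted lists: vector indices grouped by nearest centroid
    lists = [[] for _ in range(nlist)]
    for i, v in enumerate(vectors):
        lists[min(range(nlist), key=lambda c: euclidean(v, centroids[c]))].append(i)
    # Probe only the nprobe closest clusters instead of scanning all vectors
    probed = sorted(range(nlist), key=lambda c: euclidean(query, centroids[c]))[:nprobe]
    candidates = [i for c in probed for i in lists[c]]
    return sorted(candidates, key=lambda i: euclidean(query, vectors[i]))[:k]

random.seed(1)
vectors = [[random.random(), random.random()] for _ in range(200)]
print(ivf_search([0.5, 0.5], vectors, nlist=8, nprobe=3, k=3))
```

Raising nprobe scans more clusters, improving recall at the cost of speed; a true nearest neighbor living in an unprobed cluster is what makes the search approximate.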

HNSW Index (Approximate Search) ⚠️ Planned

Hierarchical Navigable Small World graph-based approximate nearest neighbor algorithm.

Planned Characteristics:

  • Time Complexity: O(log n) search (approximate)
  • Space Complexity: O(n·M·log n)
  • Recall: 95-99% (configurable)
  • Best For: Large datasets (> 10,000 vectors), when speed is more important than perfect accuracy

Note: The HNSW index is defined in the API but not yet fully implemented.

For detailed index configuration and performance considerations, see the Vector Indexes Documentation.

API Reference

Libraries

| Method | Endpoint | Description |
|---|---|---|
| POST | /libraries | Create a new library |
| GET | /libraries | List all libraries |
| GET | /libraries/{library_id} | Get library by ID |
| PUT | /libraries/{library_id} | Update library |
| DELETE | /libraries/{library_id} | Delete library |
| POST | /libraries/{library_id}/build-index | Build or rebuild vector index |

Documents

| Method | Endpoint | Description |
|---|---|---|
| POST | /libraries/{library_id}/documents | Create document in library |
| POST | /libraries/{library_id}/documents/batch | Batch create documents in library |
| GET | /libraries/{library_id}/documents | List documents in library |
| GET | /documents/{document_id} | Get document by ID |
| PUT | /documents/{document_id} | Update document |
| DELETE | /documents/{document_id} | Delete document |

Chunks

| Method | Endpoint | Description |
|---|---|---|
| POST | /documents/{document_id}/chunks | Create chunk in document |
| POST | /documents/{document_id}/chunks/batch | Batch create chunks in document |
| GET | /documents/{document_id}/chunks | List chunks in document |
| GET | /chunks/{chunk_id} | Get chunk by ID |
| PUT | /chunks/{chunk_id} | Update chunk |
| DELETE | /chunks/{chunk_id} | Delete chunk |

Search

| Method | Endpoint | Description |
|---|---|---|
| POST | /libraries/{library_id}/query | Perform k-nearest neighbor search |

Persistence

| Method | Endpoint | Description |
|---|---|---|
| POST | /persistence/save | Save database snapshot to disk |
| POST | /persistence/restore | Restore from latest snapshot |
| GET | /persistence/status | Get persistence status and statistics |

API Design Note: The API uses a simplified flat structure for direct resource access. Once created, documents and chunks can be accessed by their globally unique UUIDs without specifying parent resources, reducing API verbosity.

For interactive API exploration, visit http://localhost:8000/docs after starting the server.

Demo

Here is an end-to-end demo showcasing key features of the vector database. The demo is divided into several parts:

  • Database setup and Python client SDK
  • Vector Search
  • Vector Search with Filtering
  • Persistence
  • RAG pipeline with Agno Framework
  • End-to-end document processing and Agent integration
  • MCP Server

The demo videos can be found at the links below. Set the playback quality to 1080p for best clarity. (Also set the speed to 1.25x or 1.5x to save time!).

The videos walk through the demo notebook found here: Demo Notebook

1. Database setup and Python client SDK

video link

This video covers:

  • Installing dependencies
  • Starting the server with Docker
  • Using the Python SDK to create libraries, documents, and chunks

📚 Documentation:

2. Vector Search

video link

This video covers:

  • Performing k-nearest neighbor search
  • Understanding search results and scores

📚 Documentation:

3. Vector Search with Filtering

video link

This video covers:

  • Using metadata filters in search queries
  • Creating custom filter functions for advanced filtering
  • Combining multiple filter conditions

📚 Documentation:

4. Persistence

video link

This video covers:

  • Saving database snapshots to disk
  • Restoring the database from snapshots
  • Configuring automatic persistence settings

📚 Documentation:

5. RAG pipeline with Agno Framework

video link

This video covers:

  • Integrating the vector database with the Agno agent framework
  • Creating a knowledge base using the vector database
  • Building an agent that utilizes the knowledge base for retrieval-augmented generation (RAG)

📚 Documentation:

6. End-to-end document processing and Agent integration

video link

This video covers:

  • Processing documents and adding them to the vector database
  • Using the Agno agent to answer questions based on the stored documents
  • Demonstrating the complete workflow from document ingestion to question answering

📚 Documentation:

7. MCP Server

video link

This video covers:

  • MCP server setup and configuration
  • Connecting the MCP server to LM Studio
  • Invoking tools via chat to fetch library and document info
  • Running vector search from the chat using the search tool

📚 Documentation:

Integrations

Agno Framework

The vector database integrates seamlessly with the Agno agent framework for building RAG (Retrieval-Augmented Generation) applications.

from agno.agent import Agent
from agno.knowledge.knowledge import Knowledge
from agno.models.anthropic import Claude
from my_vector_db.db import MyVectorDB

# Create vector database
vector_db = MyVectorDB(
    api_base_url="http://localhost:8000",
    library_name="knowledge_base",
    index_type="flat"
)

# Create knowledge base
knowledge = Knowledge(
    name="My Knowledge Base",
    vector_db=vector_db,
    max_results=5
)

# Add content
knowledge.add_content(
    name="example",
    text_content="Your content here"
)

# Create agent with knowledge
agent = Agent(
    name="Assistant",
    knowledge=knowledge,
    model=Claude(id="claude-sonnet-4-5"),
    search_knowledge=True
)

agent.cli_app(stream=True)

For a complete working example, see examples/agno_example.py.

Testing

# Run all tests
uv run pytest

# Run with coverage report
uv run pytest --cov=my_vector_db --cov-report=html

# Run specific test file
uv run pytest tests/test_sdk.py -v

# Run with verbose output
uv run pytest -vv

Configuration

Environment Variables

# API server settings
VECTOR_DB_HOST=0.0.0.0
VECTOR_DB_PORT=8000

# SDK client settings
VECTOR_DB_BASE_URL=http://localhost:8000
VECTOR_DB_TIMEOUT=30

# Persistence settings (optional)
ENABLE_STORAGE_PERSISTENCE=false
STORAGE_DIR=./data
STORAGE_SAVE_EVERY=-1  # -1 disables automatic saves

For detailed persistence configuration and workflows, see the Persistence Management Documentation.
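The atomic snapshot writes mentioned under the design principles typically follow the write-temp-then-rename pattern, so a reader never observes a half-written file. An illustrative sketch (file layout and function name assumed, not taken from the project):

```python
import json
import os
import tempfile

def save_snapshot(data: dict, path: str) -> None:
    """Write JSON atomically: stage to a temp file, then rename into place."""
    dir_name = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=dir_name, suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as f:
            json.dump(data, f)
            f.flush()
            os.fsync(f.fileno())   # ensure bytes hit disk before the rename
        os.replace(tmp_path, path)  # atomic on POSIX and Windows
    except BaseException:
        os.unlink(tmp_path)         # clean up the partial temp file
        raise

path = os.path.join(tempfile.gettempdir(), "demo_snapshot.json")
save_snapshot({"libraries": {}}, path)
with open(path) as f:
    print(f.read())
```

Staging in the same directory as the target matters: `os.replace` is only atomic when source and destination are on the same filesystem.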

Examples

The examples/ directory contains complete usage examples.

Development

Code Quality Tools

# Format code
uv run ruff format

# Lint code
uv run ruff check

# Type checking
uvx ty check src/my_vector_db

Documentation

Author

Adam Shedivy 📧 ajshedivyaj@gmail.com 🔗 GitHub

