ORM layer for vector databases

These details have not been verified by PyPI

Project links

repository

Project description

Vectorm

Static Badge

Vectorm is a Python ORM-like abstraction layer for vector databases, enabling declarative modeling, querying, filtering, and schema migrations — inspired by popular ORMs but purpose-built for vector stores like Qdrant.

Features

🚀 Simple Document modeling with fields and tensor annotations
🔍 Rich filtering, sorting, and cursor-based pagination
🎯 Supports dense and sparse vector search (depending on backend)
🔄 Declarative migrations system
🧩 Pluggable backend architecture (currently supports Qdrant)
🧪 Typed and fully async, suitable for large-scale pipelines

Installation

pip install vectorm

Usage

Basic Example

The core concept in Vectorm is the Document, which represents a record in your vector store. You define your document schema by subclassing Document and adding fields with type annotations much like any other Pydantic model. All documents have a document ID field by default, which is a UUID. It is accessible via the document_id property.

There are special annotations used to define tensor fields and optionally their indexing parameters. The tensor annotations support both multi-dimensional arrays as well as sparse tensors. When defining a tensor field, you can specify the dimensions and data type which will enforce array validation and sometimes reshaping. You can also specify indexing parameters like the distance metric to use for similarity search. Additionally, by defining the shape and data type, Vectorm is able to generate the appropriate JSON schema.

Most backends support 1D vectors, but some (like Qdrant) support 2D matrices as well. Other dimensions may be supported in the future, but currently will be flattened to 1D (configurable in the indexing parameters).

The following example demonstrates defining a document schema, saving documents, and performing a vector similarity query with filtering. There are two tensor fields: embeddings, which is a 1D dense vector with a length of 4, and configured to use cosine distance for indexing; and sparse_embeddings, which is a sparse vector also of length 4.

import asyncio

import asyncio
import numpy as np
from typing import Annotated, Literal

from vectorm import (
    Document, Field, FieldIndexParams, FieldIndexDataType,
    Tensor, SparseTensor, TensorIndexParams, DistanceMetric,
    VectorStore, Migrator, Revision, MigrateOp, SortOrder, Dim
)
from vectorm.backend.qdrant import QdrantVectorStoreBackend


# 1. Define your document schema
class MyDocument(Document):
    _collection_name_ = "my_documents"

    name: str
    age: int
    status: Literal["active", "inactive"]
    embeddings: Annotated[Tensor[Dim[4], np.float32], TensorIndexParams(metric=DistanceMetric.cosine)]
    sparse_embeddings: SparseTensor[Dim[4], np.float32]


async def main():
    backend = QdrantVectorStoreBackend(":memory:")

    async with VectorStore(backend) as store:
        # Save documents to the store
        await store.save_documents([
            MyDocument(
                name=f"Doc {i}",
                age=20 + i,
                status="active" if i % 2 == 0 else "inactive",
                embeddings=np.random.rand(4).astype(np.float32),
                sparse_embeddings={j: float(j) for j in range(i % 4)},
            )
            for i in range(10)
        ])

        # Query documents via vector similarity with an optional filter
        results = await store.query_documents(
            document_type=MyDocument,
            query_tensor=np.random.rand(4).astype(np.float32),
            tensor_field=Field.embeddings,
            filter=Field.age.ge(22) & Field.status.eq("active"),
            limit=5,
        )
        print("Query results:", results)

        # Retrieve documents based on IDs
        results = await store.get_documents(
            document_type=MyDocument,
            document_ids=[doc.document_id for doc in results]
        )
        print("Retrieved documents:", results)

if __name__ == "__main__":
    asyncio.run(main())

Filtering & Sorting

Vectorm supports rich filtering based on non-tensor fields, as well as sorting and cursor-based pagination. There are two primary methods for searching for documents:

find_documents: Retrieve documents based on filters, sorting, and pagination.
query_documents: Perform vector similarity search with optional filters.

Filters are defined using the Field class, which provides methods for common operations like equality, inequality, greater than, less than, etc. The collection of expressions built using these methods end up as a Filter, and is automatically converted to the appropriate backend format, allowing swapping out backends without changing your code.

documents = await store.find_documents(
    document_type=MyDocument,
    filter=Field.status.eq("active") & Field.age.ge(25),
    limit=10,
    sort=(Field.age, SortOrder.asc),
)

documents = await store.query_documents(
    document_type=MyDocument,
    query_tensor=np.random.rand(4).astype(np.float32),
    tensor_field=Field.embeddings,
    filter=Field.age.ge(22) & Field.status.eq("active"),
    limit=5,
)

paginated_docs = await store.find_documents(
    document_type=MyDocument,
    filter=Field.status.eq("active"),
    limit=5,
    cursor=previous_page.next_cursor,
    sort=(Field.age, SortOrder.asc)
)

Migrations

Vectorm includes a simple migrations system to manage schema changes over time. You define migrations as subclasses of Revision, specifying the operations to perform in the upgrade and downgrade methods. The Migrator class is used to apply these migrations to the vector store.

class InitialRevision(Revision):
    revision_id = "initial"

    async def upgrade(self, op: MigrateOp) -> None:
        await op.create_collection(
            collection_name=MyDocument.collection_name(),
            document_type=MyDocument
        )

    async def downgrade(self, op: MigrateOp) -> None:
        await op.delete_collection(MyDocument.collection_name())


class AddStatusIndexRevision(Revision):
    revision_id = "add_status_index"
    previous_revision_id = "initial"

    async def upgrade(self, op: MigrateOp) -> None:
        await op.create_index(
            collection_name=MyDocument.collection_name(),
            field_name=Field.status,
            index_params=FieldIndexParams(data_type=FieldIndexDataType.keyword)
        )

    async def downgrade(self, op: MigrateOp) -> None:
        await op.delete_index(
            collection_name=MyDocument.collection_name(),
            field_name=Field.status
        )


migrator = Migrator([InitialRevision(), AddStatusIndexRevision()])

await store.upgrade_migrations(migrator)  # Upgrade to head
await store.upgrade_migrations(migrator, revision_id="add_status_index")

License

This project is licensed under ISC License.

Support & Feedback

If you encounter any issues or have feedback, please open an issue. We'd love to hear from you!

Made with ❤️ by Timothy Pogue

Project details

These details have not been verified by PyPI

Project links

repository

Release history Release notifications | RSS feed

This version

0.1.2

Oct 14, 2025

0.1.1

Oct 11, 2025

0.1.0

Oct 9, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vectorm-0.1.2-py3-none-any.whl (25.7 kB view details)

Uploaded Oct 14, 2025 Python 3

File details

Details for the file vectorm-0.1.2-py3-none-any.whl.

File metadata

Download URL: vectorm-0.1.2-py3-none-any.whl
Upload date: Oct 14, 2025
Size: 25.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.2

File hashes

Hashes for vectorm-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`af379dade61ead6b6473b37d05ee8cc7fe53bb4983881c6c52c4454b2ddff364`
MD5	`9beaa4af169a38b9abcf5bf5cf35c298`
BLAKE2b-256	`c975ccf35ad871536e461cd69e3aa8d5586fbedbd0e7e677281d97a49e72b1ce`

See more details on using hashes here.

vectorm 0.1.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Vectorm

Features

Installation

Usage

Basic Example

Filtering & Sorting

Migrations

License

Support & Feedback

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes