Skip to main content

Django ORM support for MariaDB Vector field (MariaDB 11.8.2+)

Project description

Django MariaDB Vector

Django ORM support for the MariaDB Vector field (introduced in MariaDB 11.8.2).

Requirements

  • Python 3.12+
  • Django 5.2+, or 6.0+
  • MariaDB 11.8.2 or newer

Installation

pip install django-mariadb-vector

Usage

from django.db import models
from django_mariadb_vector import MariaDBVectorField, MariaDBVectorIndex


class MyModel(models.Model):
    embedding = MariaDBVectorField(dimensions=1536)

    class Meta:
        indexes = [
            # Vector index (MariaDB 11.8.2+)
            MariaDBVectorIndex(fields=["embedding"], name="v_idx", dimensions=1536),
        ]

Querying with Vector Functions

You can use Search (which uses VEC_DISTANCE_COSINE by default) or VecDistance to perform similarity searches.

from django_mariadb_vector import Search

from .models import MyModel

# Find 5 most similar records to a reference vector
reference_vector = [0.1, 0.2, 0.3, ...]
results = MyModel.objects.annotate(
    distance=Search("embedding", reference_vector)
).order_by("distance")[:5]

Recommended Manager Pattern

Using a custom manager can simplify vector searches in your application:

from django.db import models
from django_mariadb_vector import MariaDBVectorField, Search

class RecommendationManager(models.Manager):
    def similar_to_vector(self, vector, limit=5, exclude_id=None):
        queryset = self.get_queryset().annotate(
            distance=Search("embedding", vector)
        ).order_by("distance")

        if exclude_id:
            queryset = queryset.exclude(id=exclude_id)

        return queryset[:limit]

    def similar_to(self, id: int, limit=5):
        try:
            vector = self.get_queryset().values_list("embedding", flat=True).get(id=id)
        except self.model.DoesNotExist:
            return self.get_queryset().none()
        
        # Pass the ID to similar_to_vector to exclude it there
        return self.similar_to_vector(vector, limit=limit, exclude_id=id)

class MyModel(models.Model):
    embedding = MariaDBVectorField(dimensions=1536)
    objects = RecommendationManager()

Example of usage Manager

from .models import MyModel

reference_vector:list[float] = [0.1, 0.2, 0.3, ...]

# Find 5 most similar records to a reference vector
results = MyModel.objects.similar_to_vector(reference_vector, limit=5)

for item in results:
    print(f"{item.name} - Distance: {item.distance}")
from .models import MyModel

reference_vector:int = 1
# Find 5 most similar records to a reference object by id
results = MyModel.objects.similar_to(reference_vector, limit=5)

for item in results:
    print(f"{item.name} - Distance: {item.distance}")

Advanced Functions

The following functions are available in django_mariadb_vector.functions:

  • Search(expression, vector): Convenient wrapper for COSINE distance search.
  • VecDistance(expression, vector): Generic VEC_DISTANCE function.
  • VecDistanceCosine(expression, vector): Native MariaDB VEC_DISTANCE_COSINE.
  • VecDistanceEuclidean(expression, vector): Native MariaDB VEC_DISTANCE_EUCLIDEAN.
  • VecFromText(text): Converts JSON string to MariaDB VECTOR format.
  • VecToText(expression): Converts MariaDB VECTOR format to JSON string.

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

django_mariadb_vector-0.1.1.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

django_mariadb_vector-0.1.1-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file django_mariadb_vector-0.1.1.tar.gz.

File metadata

File hashes

Hashes for django_mariadb_vector-0.1.1.tar.gz
Algorithm Hash digest
SHA256 0c94be629b1081177dff7860da6a3d4491ee88bba505c900edf9eaaab5927b62
MD5 7cb01bd0be37524dfb51713a4c437b85
BLAKE2b-256 04137874452eed353ea257bfd9c4951260d51c9f99341c3e69803dc8b7fa0446

See more details on using hashes here.

File details

Details for the file django_mariadb_vector-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for django_mariadb_vector-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ab43261ca1d393c5432ab5cf8031b45a8a9964551db630665f79157c72e81cfe
MD5 3ee8d93587885802c31c7f7d62b9ad0c
BLAKE2b-256 869ceec5236c14b28d1dfd04db558442f326a575b4bef771cdd2cc9a39156ccd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page