Skip to main content

Django ORM support for MariaDB Vector field (MariaDB 11.8.2+)

Project description

Django MariaDB Vector

Django ORM support for the MariaDB Vector field (introduced in MariaDB 11.8.2).

Requirements

  • Python 3.12+
  • Django 5.2+, or 6.0+
  • MariaDB 11.8.2 or newer

Installation

pip install django-mariadb-vector

Usage

from django.db import models
from django_mariadb_vector import MariaDBVectorField, MariaDBVectorIndex


class MyModel(models.Model):
    embedding = MariaDBVectorField(dimensions=1536)

    class Meta:
        indexes = [
            # Vector index (MariaDB 11.8.2+)
            MariaDBVectorIndex(fields=["embedding"], name="v_idx", dimensions=1536),
        ]

Querying with Vector Functions

You can use Search (which uses VEC_DISTANCE_COSINE by default) or VecDistance to perform similarity searches.

from django_mariadb_vector import Search

from .models import MyModel

# Find 5 most similar records to a reference vector
reference_vector = [0.1, 0.2, 0.3, ...]
results = MyModel.objects.annotate(
    distance=Search("embedding", reference_vector)
).order_by("distance")[:5]

Recommended Manager Pattern

Using a custom manager can simplify vector searches in your application:

from django.db import models
from django_mariadb_vector import MariaDBVectorField, VecDistance

class RecommendationManager(models.Manager):
    def similar_to_vector(self, vector, limit=5, exclude_id=None):
        queryset = self.get_queryset().annotate(
            distance=VecDistance("embedding", vector)
        ).order_by("distance")

        if exclude_id:
            queryset = queryset.exclude(pk=exclude_id)

        return queryset[:limit]

    def similar_to(self, id: int, limit=5):
        try:
            vector = self.get_queryset().values_list("embedding", flat=True).get(id=id)
        except self.model.DoesNotExist:
            return self.get_queryset().none()
        
        # Pass the ID to similar_to_vector to exclude it there
        return self.similar_to_vector(vector, limit=limit, exclude_id=id)

class MyModel(models.Model):
    embedding = MariaDBVectorField(dimensions=1536)
    objects = RecommendationManager()

Example of usage Manager

from .models import MyModel

reference_vector:list[float] = [0.1, 0.2, 0.3, ...]

# Find 5 most similar records to a reference vector
results = MyModel.objects.similar_to_vector(reference_vector, limit=5)

for item in results:
    print(f"{item.name} - Distance: {item.distance}")
from .models import MyModel

reference_id:int = 1
# Find 5 most similar records to a reference object by id
results = MyModel.objects.similar_to(reference_id, limit=5)

for item in results:
    print(f"{item.name} - Distance: {item.distance}")

Advanced Functions

The following functions are available in django_mariadb_vector.functions:

  • Search(expression, vector): Convenient wrapper for COSINE distance search.
  • VecDistance(expression, vector): Generic VEC_DISTANCE function.
  • VecDistanceCosine(expression, vector): Native MariaDB VEC_DISTANCE_COSINE.
  • VecDistanceEuclidean(expression, vector): Native MariaDB VEC_DISTANCE_EUCLIDEAN.
  • VecFromText(text): Converts JSON string to MariaDB VECTOR format.
  • VecToText(expression): Converts MariaDB VECTOR format to JSON string.

Reference

Contributing

Pull requests are welcome. For major changes, please open an issue first to discuss what you would like to change.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

django_mariadb_vector-0.1.2.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

django_mariadb_vector-0.1.2-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file django_mariadb_vector-0.1.2.tar.gz.

File metadata

File hashes

Hashes for django_mariadb_vector-0.1.2.tar.gz
Algorithm Hash digest
SHA256 e566d746da68f9e1bd4cd179cc4cb3b9595fe800d592d87fe9f3ea63759f63b1
MD5 77635fe5e1a6bd8ce965c4a2fbda2af7
BLAKE2b-256 a8f824aedefd6c7ba1c54477086ea2462015baa583b182c3878b6573bbf10683

See more details on using hashes here.

File details

Details for the file django_mariadb_vector-0.1.2-py3-none-any.whl.

File metadata

File hashes

Hashes for django_mariadb_vector-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 6d657fb9b0b1cce35c283c5ac00dede38043a79e8fb05d6ac73478e3f663772b
MD5 0e9204bb39922649ac9928215bef6aa8
BLAKE2b-256 32dd127e4eb459bb494183b805843201da46399eda86c6b7545232043c4d2e24

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page