Vector search for Django models with graph relations, optional LangGraph pipeline, conversational search, smart indexing and streaming.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

chitsalex

These details have not been verified by PyPI

Project description

Django Graph Search

Production-ready semantic vector search for Django — searches across FK, M2M, and reverse relations by traversing your model graph. Pluggable backends: ChromaDB, FAISS, Qdrant.

pip install django-graph-search[chromadb]

Why Django Graph Search?

Most Django search solutions (Haystack, Elasticsearch, full-text) treat each model in isolation. Django Graph Search builds rich search context by traversing the ORM relation graph before indexing:

A Product becomes searchable by its category__name, tags__name, brand__description, etc. — automatically
Uses sentence-transformers embeddings for multilingual semantic similarity
Delta indexing — only re-index what changed
Admin UI — semantic search inside /admin/ out of the box
REST API — ready-to-use search endpoint

Installation

# ChromaDB backend (recommended for local/dev)
pip install django-graph-search[chromadb]

# FAISS backend (fast CPU similarity, no server needed)
pip install django-graph-search[faiss]

# Qdrant backend (production, scalable)
pip install django-graph-search[qdrant]

# All backends
pip install django-graph-search[all]

Quick Start (5 minutes)

1. Add to INSTALLED_APPS

INSTALLED_APPS = [
    ...,
    "django_graph_search",
]

2. Configure GRAPH_SEARCH

# settings.py
GRAPH_SEARCH = {
    "MODELS": [
        {
            "model": "shop.Product",
            # Index local fields + traverse relations with __ notation
            "fields": ["name", "description", "category__name", "tags__name"],
            "follow_relations": True,
            "relation_depth": 2,
        },
        # Or index all concrete fields:
        # {"model": "shop.Review", "fields": "__all__"}
    ],
    "VECTOR_STORE": {
        "BACKEND": "django_graph_search.backends.ChromaDBBackend",
        "OPTIONS": {
            "persist_directory": "vector_db",
            "collection_name": "django_search",
        },
    },
    "EMBEDDINGS": {
        "default": {
            "BACKEND": "django_graph_search.embeddings.SentenceTransformerBackend",
            # Multilingual model — works with Russian, English, etc.
            "MODEL_NAME": "sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2",
        },
        "fast": {
            "BACKEND": "django_graph_search.embeddings.SentenceTransformerBackend",
            "MODEL_NAME": "sentence-transformers/all-MiniLM-L6-v2",
        },
    },
    "DEFAULT_EMBEDDING": "default",
    "DEFAULT_RESULTS_LIMIT": 20,
    "DELTA_INDEXING": True,
    "CACHE": {
        "BACKEND": "file",   # Options: file | redis | db
        "OPTIONS": {"path": "graph_search_cache"},
        "TTL": 86400,
    },
}

3. Add URLs

# urls.py
from django.urls import path, include

urlpatterns = [
    ...,
    path("api/search/", include("django_graph_search.urls")),
]

4. Build the index

python manage.py build_search_index

5. Search

# REST API
GET /api/search/?q=wireless+headphones&models=shop.Product&limit=5

# Find similar items
GET /api/search/similar/shop.Product/42/?limit=5

How It Works

Django ORM Model Graph
        │
        ▼
  Relation Traversal    <- FK, M2M, reverse relations up to depth N
        │
        ▼
  Text Concatenation    <- fields + related fields merged into one document
        │
        ▼
  Sentence Transformer  <- multilingual embeddings (768-dim vectors)
        │
        ▼
  Vector Store          <- ChromaDB / FAISS / Qdrant
        │
        ▼
  Semantic Search       <- cosine similarity, top-K results

Python API

from django_graph_search import search, index, get_similar

# Semantic search across models
results = search("red smartphone", models=["shop.Product"], limit=5)

# Index a single instance (e.g. in a signal)
index(product_instance)

# Find similar objects
similar = get_similar(product_instance, limit=5)

REST API

Endpoint	Method	Description
`/api/search/?q=...&models=...&limit=...`	`GET`	Semantic full-text search
`/api/search/similar/{app}.{Model}/{id}/`	`GET`	Find similar objects
`/api/search/conversation/`	`POST`	Session-aware conversational search (optional, see below)
`/api/search/conversation/?conversation_id=...`	`DELETE`	Clear a conversation history

Management Commands

python manage.py build_search_index                  # Index all configured models
python manage.py build_search_index --model shop.Product  # Index one model
python manage.py clear_search_index                  # Remove all vectors
python manage.py search_index_status                 # Show index statistics

Admin UI

After installation, navigate to /admin/graph-search/ for a semantic search interface directly in Django Admin — useful for content managers and debugging.

Supported Backends

Backend	Best for	Server required
ChromaDB	Development, small-medium datasets	No
FAISS	High-speed CPU search, offline	No
Qdrant	Production, large datasets, filtering	Yes

Delta Indexing & Cache

Enable DELTA_INDEXING: True to skip objects that haven’t changed since last index run. Choose a cache backend:

Backend	Config	Use case
`file`	`OPTIONS.path`	Local dev
`redis`	`OPTIONS.alias`	Production
`db`	`OPTIONS.alias`	Simple setup

LangGraph-powered search pipeline (optional)

Starting with this version, django-graph-search ships with an optional orchestration layer built on top of LangGraph. It is disabled by default; the public API (Searcher.search, Searcher.find_similar, REST endpoints) is fully backwards-compatible.

When enabled, the pipeline runs as a small graph:

analyze_query → [expand_query] → vector_search → [rerank] → postprocess

Steps in [brackets] are toggled via settings, and each one degrades gracefully: if the LLM backend fails or is not configured, the pipeline keeps working using the deterministic vector search.

GRAPH_SEARCH = {
    # ... your existing config ...
    "LANGGRAPH": {
        "ENABLED": True,                # Master switch.
        "QUERY_EXPANSION": True,        # Generate semantic reformulations.
        "RERANKING": True,              # Rerank top-K candidates.
        "MAX_EXPANDED_QUERIES": 3,
        "RERANK_TOP_K": 20,
        "TIMEOUT_SECONDS": 15,
        "MAX_QUERY_LENGTH": 1024,
        "FALLBACK_ON_ERROR": True,      # Fall back to legacy search on graph errors.
        "USE_FOR_SIMILAR": False,       # Route find_similar through the graph.
        "LLM": {
            # Leave BACKEND=None to use the deterministic dummy backend.
            "BACKEND": None,
            "MODEL": None,
            "OPTIONS": {},
        },
    },
}

Bring your own LLM backend

Implement django_graph_search.llm.BaseLLMBackend and point LANGGRAPH.LLM.BACKEND at the dotted path. The contract is intentionally tiny — expand_query(query, models, max_variants) and rerank(query, candidates, top_k) — so you can wrap any provider (OpenAI, Ollama, vLLM, your in-house service) in a few lines.

Why optional?

The library refuses to add hard dependencies on langgraph or any LLM SDK. If langgraph is not installed, the pipeline transparently uses an in-tree sequential runner with the same node structure, so behaviour and tests stay identical.

Conversational search (optional)

For session-aware semantic search (follow-ups like "more", "only products", "similar") enable the conversational endpoint. It is a thin search-first shell on top of Searcher and never invents user intent: ambiguous follow-ups are surfaced as a structured clarification_needed flag instead of a hallucinated query.

GRAPH_SEARCH = {
    # ... existing config ...
    "CONVERSATIONAL": {
        "ENABLED": True,
        "MEMORY_BACKEND": "inmemory",   # or "cache" / dotted path.
        "MAX_HISTORY_ITEMS": 10,
        "ALLOW_CLARIFICATIONS": True,
    },
}

Endpoint: POST /api/search/conversation/

// Request
{
  "query": "only products",
  "conversation_id": "abc-123",
  "models": ["shop.Product"],
  "limit": 5
}

// Response
{
  "conversation_id": "abc-123",
  "query": "only products",
  "interpreted_query": "red phone",
  "clarification_needed": false,
  "results": [...],
  "total": 5
}

Use DELETE /api/search/conversation/?conversation_id=abc-123 to clear a conversation.

Built-in memory backends:

Alias	Class	Best for
`inmemory`	`InMemoryBackend`	Tests, single-worker dev
`cache` / `redis`	`DjangoCacheBackend`	Production via Django cache (Redis, memcached)

Bring your own by subclassing BaseMemoryBackend and pointing MEMORY_BACKEND at the dotted path.

Smart indexing (optional)

The classic indexer joins selected fields with whitespace. That works, but the embedding model loses the role of each value: a category name and a body paragraph become indistinguishable tokens. The optional SmartIndexer builds structured documents with labelled sections so the embedder sees something closer to:

Title: Pixel 8
Description:
Camera-first Android phone with Tensor G3.
Category: Phones

Enable it from settings — your existing index, signals, and management command keep working because the resolver and get_indexer() factory pick the new implementation transparently:

GRAPH_SEARCH = {
    # ... your existing config ...
    "SMART_INDEXING": {
        "ENABLED": True,
        # Optional per-model templates; the indexer falls back to a heuristic
        # template based on your MODELS config when one is missing.
        "TEMPLATES": {
            "shop.Product": {
                "title_field": "name",
                "sections": [
                    {"label": "Description", "field": "description", "multiline": True},
                    {"label": "Category", "field": "category__name"},
                ],
            }
        },
    },
}

The original deterministic text is always appended as a safety net so smart indexing never produces less searchable content than the legacy pipeline. Disable the flag to fall back instantly — no reindex required to switch back.

Streaming search endpoint (optional)

Long-running pipelines (query expansion, vector search, reranking) can stream lifecycle events to the client so users see progress instead of staring at a spinner. Two transports are supported:

ndjson (default): one JSON object per line, ideal for fetch + ReadableStream and CLI tools like jq.
sse: Server-Sent Events for EventSource clients.

Enable from settings:

GRAPH_SEARCH = {
    # ... your existing config ...
    "STREAMING": {
        "ENABLED": True,
        "FORMAT": "ndjson",  # or "sse"
        "INCLUDE_INTERNAL_EVENTS": True,
    },
}

The endpoint is registered at /<API_URL_PREFIX>/stream/ (default /api/search/stream/) and returns HTTP 404 when disabled, so it is safe to leave the URL config untouched.

Quick test:

curl -N "http://localhost:8000/api/search/stream/?q=phone"

Example event sequence (NDJSON):

{"type": "query_received", "query": "phone"}
{"type": "vector_search_completed", "candidate_count": 12}
{"type": "completed", "total": 5}
{"type": "results", "results": [...], "total": 5}
{"type": "end"}

Under the hood the view subscribes a queue.Queue to a per-request EventHub, runs the search in a worker thread, and yields events as soon as the nodes publish them. The hub also powers structured logging and any custom subscribers you register from your own apps.

Comparison

Feature	django-graph-search	Haystack	django-elasticsearch-dsl
Relation traversal	✅ Auto	❌ Manual	❌ Manual
Semantic / vector search	✅	❌	Partial
No external server (local)	✅ ChromaDB/FAISS	❌	❌
Multilingual out of box	✅	❌	❌
Admin UI	✅	Partial	❌
Delta indexing	✅	❌	❌

Contributing

Pull requests are welcome! Please open an issue first to discuss significant changes.

Fork the repo
git checkout -b feature/my-feature
Commit and open a PR

License

MIT — see LICENSE

Author

Alexander Valenchits — GitHub

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

chitsalex

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.3.3

May 19, 2026

0.3.1a1 pre-release

May 18, 2026

0.3.0a1 pre-release

May 18, 2026

This version

0.2.0

May 8, 2026

0.1.2

Jan 31, 2026

0.1.1

Jan 31, 2026

0.1.0

Jan 31, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

django_graph_search-0.2.0.tar.gz (60.0 kB view details)

Uploaded May 8, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

django_graph_search-0.2.0-py3-none-any.whl (56.9 kB view details)

Uploaded May 8, 2026 Python 3

File details

Details for the file django_graph_search-0.2.0.tar.gz.

File metadata

Download URL: django_graph_search-0.2.0.tar.gz
Upload date: May 8, 2026
Size: 60.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for django_graph_search-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`193d6424588eae6132d1e80ace9d46847b54758163f4de4f92c4232c5c2ffb4c`
MD5	`4e1979382393b03d87923329fac1e255`
BLAKE2b-256	`f241c31ccdf7ae841b7c959350987c7b694b4a0c3f2238c6fd8308e8d370ff92`

See more details on using hashes here.

Provenance

The following attestation bundles were made for django_graph_search-0.2.0.tar.gz:

Publisher: workflows.yaml on svalench/django_graph_search

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: django_graph_search-0.2.0.tar.gz
- Subject digest: 193d6424588eae6132d1e80ace9d46847b54758163f4de4f92c4232c5c2ffb4c
- Sigstore transparency entry: 1471334318
- Sigstore integration time: May 8, 2026
Source repository:
- Permalink: svalench/django_graph_search@9b21ac9cf1a733684fa6ab12d8aa3dde0b3c5725
- Branch / Tag: refs/tags/0.2.0
- Owner: https://github.com/svalench
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflows.yaml@9b21ac9cf1a733684fa6ab12d8aa3dde0b3c5725
- Trigger Event: release

File details

Details for the file django_graph_search-0.2.0-py3-none-any.whl.

File metadata

Download URL: django_graph_search-0.2.0-py3-none-any.whl
Upload date: May 8, 2026
Size: 56.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for django_graph_search-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b62d7030b95c7bbd88d01f2cc1051a762e46bc4ec1c662c28c8b7bde2f1a9d5c`
MD5	`500b895b0fc41ea4a18a44307d9a3e39`
BLAKE2b-256	`36db4689d455b86e8b92ae7ceac234ef6951f6cde7600794fafc3221b214298e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for django_graph_search-0.2.0-py3-none-any.whl:

Publisher: workflows.yaml on svalench/django_graph_search

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: django_graph_search-0.2.0-py3-none-any.whl
- Subject digest: b62d7030b95c7bbd88d01f2cc1051a762e46bc4ec1c662c28c8b7bde2f1a9d5c
- Sigstore transparency entry: 1471334437
- Sigstore integration time: May 8, 2026
Source repository:
- Permalink: svalench/django_graph_search@9b21ac9cf1a733684fa6ab12d8aa3dde0b3c5725
- Branch / Tag: refs/tags/0.2.0
- Owner: https://github.com/svalench
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: workflows.yaml@9b21ac9cf1a733684fa6ab12d8aa3dde0b3c5725
- Trigger Event: release

django-graph-search 0.2.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Django Graph Search

Why Django Graph Search?

Installation

Quick Start (5 minutes)

1. Add to INSTALLED_APPS

2. Configure GRAPH_SEARCH

3. Add URLs

4. Build the index

5. Search

How It Works

Python API

REST API

Management Commands

Admin UI

Supported Backends

Delta Indexing & Cache

LangGraph-powered search pipeline (optional)

Bring your own LLM backend

Why optional?

Conversational search (optional)

Smart indexing (optional)

Streaming search endpoint (optional)

Comparison

Contributing

License

Author

Links

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance