
Multimodal RAG with knowledge graph and contextual intelligence


DlightRAG


Multimodal RAG with knowledge graph and contextual intelligence. Understands what your documents say, how concepts connect, and what the pages look like. Production-ready.

Most RAG systems treat documents as text and search by similarity alone — visual context is lost, entity relationships are missed, and context filtering is limited. DlightRAG combines knowledge graph understanding with dynamic multimodal retrieval to close these gaps.

From text-heavy reports to chart-filled presentations — it adapts to your documents without compromising information. Answers come with inline citations grounded in actual document content. Ship it as a ready-to-run service, integrate it into your backend, or expose it as a tool for AI agents.

Features

  • Dual multimodal RAG modes — Caption mode (parse → caption → embed) for the pipeline-based multimodal paradigm; Unified mode (render → multimodal embed) for the modern multimodal paradigm
  • Knowledge graph + vector + visual retrieval — Multi-strategy retrieval across the knowledge graph and vector similarity (via LightRAG), visual content, and dynamic metadata filters
  • Multimodal ingestion — PDF, images, and Office documents from the local filesystem, Azure Blob Storage, and more
  • Broad LLM support — Native SDKs for OpenAI, Anthropic, Gemini, plus any OpenAI-compatible endpoint
  • Cross-workspace federation — Query across embedding-compatible workspaces with well-managed result merging
  • Citation and highlighting — Inline citations with source, page, and highlight attribution
  • Observability — Zero-overhead telemetry via Langfuse for tracking pipelines, queries, and generations
  • Four interfaces — Web UI, REST API, Python SDK, and MCP server

Architecture

DlightRAG Architecture

Source: docs/architecture.drawio

Quick Start

Defaults: qwen/qwen3.5-flash-02-23 (chat via OpenRouter) + voyage-multimodal-3.5 (embedding via Voyage) in unified mode. To use other providers or models, edit config.yaml — see Configuration.

Web UI

Watch the demo on YouTube.

If you already have the REST API running (via Docker or dlightrag-api), the Web UI is available at:

http://localhost:8100/web/

Without Docker:

uv add dlightrag        # or: pip install dlightrag
cp .env.example .env    # set API keys in .env
dlightrag-api

Docker (Self-Hosted)

git clone https://github.com/hanlianlu/dlightrag.git && cd dlightrag
cp .env.example .env    # set API keys in .env; edit config.yaml for models/providers
docker compose up

Includes PostgreSQL (pgvector + AGE), REST API (:8100), and MCP server (:8101).

Local models (Ollama, Xinference, etc.): use host.docker.internal instead of localhost in base_url settings.

curl http://localhost:8100/health

curl -X POST http://localhost:8100/ingest \
  -H "Content-Type: application/json" \
  -d '{"source_type": "local", "path": "/app/dlightrag_storage/sources"}'

curl -X POST http://localhost:8100/retrieve \
  -H "Content-Type: application/json" \
  -d '{"query": "What are the key findings?"}'

curl -X POST http://localhost:8100/answer \
  -H "Content-Type: application/json" \
  -d '{"query": "What are the key findings?", "stream": true}'

Python SDK

uv add dlightrag        # or: pip install dlightrag
cp .env.example .env    # set API keys in .env

import asyncio
from dotenv import load_dotenv
from dlightrag import RAGServiceManager, DlightragConfig

load_dotenv()  # load .env

async def main():
    config = DlightragConfig()
    manager = RAGServiceManager(config)

    await manager.aingest(workspace="default", source_type="local", path="./docs")

    result = await manager.aretrieve(query="What are the key findings?")
    print(result.contexts)

    result = await manager.aanswer(query="What are the key findings?")
    print(result.answer)

asyncio.run(main())

Requires PostgreSQL with pgvector + AGE, or JSON fallback for development (see Configuration).

MCP Server (for AI Agents)

uv tool install dlightrag   # or: pip install dlightrag
cp .env.example .env        # set API keys in .env
dlightrag-mcp

Register the server in your MCP client configuration:

{
  "mcpServers": {
    "dlightrag": {
      "command": "uvx",
      "args": ["dlightrag-mcp", "--env-file", "/absolute/path/to/.env"]
    }
  }
}

Tools: retrieve, answer, ingest, list_files, delete_files, list_workspaces — all with workspace isolation.

API & Internals

| Method | Endpoint | Description |
|--------|----------|-------------|
| POST | /ingest | Ingest from local, Azure Blob, or Snowflake |
| POST | /retrieve | Contexts + sources (no LLM answer) |
| POST | /answer | LLM answer + contexts + sources (stream: true for SSE) |
| GET | /files | List ingested documents |
| DELETE | /files | Delete documents |
| GET | /api/files/{path} | Serve/download a file (local: stream, Azure: 302 SAS redirect) |
| POST | /reset | Reset workspace(s) — drop storage, clear indexes |
| GET | /workspaces | List available workspaces |
| GET | /health | Health check with storage status |

All write endpoints accept optional workspace; read endpoints accept workspaces list for cross-workspace federated search. Set DLIGHTRAG_API_AUTH_TOKEN in .env to enable bearer auth.
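For example, a federated retrieval request body might look like this (the workspace names here are placeholders):

```json
{
  "query": "What are the key findings?",
  "workspaces": ["reports", "slides"]
}
```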

  • Request/response schema — see docs/response-schema.md for ingestion parameters, retrieval contexts, sources, media, SSE streaming, citations, and multimodal queries.
  • Retrieval & answer pipeline — see docs/retrieval_answer_mechanism.md for unified vs caption mode, visual resolution, reranking, and the Step 1+2 merge.

Configuration

Configuration uses a hybrid system — structured app settings in config.yaml, secrets and deployment in .env.

Priority: constructor args > env vars > .env > config.yaml > defaults

See config.yaml for all application settings and .env.example for secrets/deployment reference.

Env var naming: all variables use the DLIGHTRAG_ prefix. A single underscore (_) is part of the field name (e.g. DLIGHTRAG_POSTGRES_HOST → postgres_host). A double underscore (__) denotes a nested object (e.g. DLIGHTRAG_CHAT__MODEL → chat.model). See .env.example for details.
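The mapping can be sketched as a small Python helper (a hypothetical illustration, not the library's actual config loader):

```python
PREFIX = "DLIGHTRAG_"

def env_to_config(environ: dict) -> dict:
    """Map DLIGHTRAG_* env vars to a nested config dict.

    Single underscores are part of the field name; a double
    underscore descends one level into a nested object.
    """
    config: dict = {}
    for key, value in environ.items():
        if not key.startswith(PREFIX):
            continue
        path = key[len(PREFIX):].lower().split("__")
        node = config
        for part in path[:-1]:
            node = node.setdefault(part, {})
        node[path[-1]] = value
    return config

env = {
    "DLIGHTRAG_POSTGRES_HOST": "localhost",
    "DLIGHTRAG_CHAT__MODEL": "qwen3-vl-32b",
    "UNRELATED": "ignored",
}
print(env_to_config(env))
# {'postgres_host': 'localhost', 'chat': {'model': 'qwen3-vl-32b'}}
```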

RAG Mode

The first decision — determines your ingestion pipeline, model requirements, and retrieval behavior.

| Mode | Pipeline | Best for |
|------|----------|----------|
| caption | Document parsing → VLM captioning → text embedding → KG | Text-heavy documents, structured elements |
| unified (default) | Page rendering → multimodal embedding → VLM entity extraction → KG | Visually rich documents (charts, diagrams, complex layouts) |

Caption mode parsers (parser in config.yaml):

| Parser | Description |
|--------|-------------|
| mineru (default) | MinerU PDF parser — fast, good for text-heavy documents |
| docling | Docling parser — alternative structure-aware parser |
| vlm | VLM-based OCR — renders pages and uses the chat model (must be a VLM) to extract structured content; no external parser dependency |

All caption mode parsers use Docling's HybridChunker for structure-aware chunking.
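The idea behind structure-aware chunking is to keep each heading attached to its own body text while packing sections into a token or word budget. A simplified illustration of that idea (not Docling's HybridChunker implementation):

```python
def chunk_sections(sections, max_words=100):
    """Greedily pack (heading, body) sections into word-budgeted chunks,
    keeping each heading attached to its own text."""
    chunks, current, count = [], [], 0
    for heading, body in sections:
        words = len(heading.split()) + len(body.split())
        if current and count + words > max_words:
            chunks.append("\n".join(current))  # flush the full chunk
            current, count = [], 0
        current.append(f"{heading}\n{body}")
        count += words
    if current:
        chunks.append("\n".join(current))
    return chunks

docs = [
    ("Introduction", "Background and goals. " * 10),
    ("Methods", "Experimental setup. " * 20),
    ("Results", "Key findings. " * 15),
]
print(len(chunk_sections(docs, max_words=80)))
# 2
```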

Model usage by stage:

| Stage | Caption | Unified |
|-------|---------|---------|
| Image captioning | chat model (VLM) | chat model (VLM) |
| Table / equation captioning | chat model | — |
| Entity extraction | chat model | chat model (VLM) |
| Embedding | embedding model | embedding model (multimodal) |
| Rerank (llm_listwise) | ingest/chat model | chat model (VLM, pointwise scoring) |
| Rerank (API strategy) | cohere/jina/aliyun API | cohere/jina/aliyun API |
| Answer generation | chat model | chat model (VLM, sees text excerpts + page images) |

Important: The chat model must support vision (multimodal/VLM). It doubles as the vision model for image captioning, VLM parser, unified mode, and multimodal queries. A text-only chat model will fail on these tasks.

For unified mode, set rag_mode: unified in config.yaml and use multimodal models:

# config.yaml
rag_mode: unified

chat:
  model: qwen3-vl-32b          # must support vision

embedding:
  model: Qwen3-VL-Embedding    # must be multimodal
  dim: 4096

Limitations: Snowflake ingestion is text-only (no visual embedding). A workspace is locked to one mode after its first ingestion. Page images run roughly 3–7 MB per page at 250 DPI.

Providers

Three native SDKs — choose per model block in config.yaml:

| Provider | SDK | Use for |
|----------|-----|---------|
| openai (default) | AsyncOpenAI | OpenAI, Azure OpenAI, Qwen/DashScope, MiniMax, Ollama, Xinference, OpenRouter, any OpenAI-compatible endpoint |
| anthropic | Anthropic SDK | Anthropic Claude models |
| gemini | Google GenAI SDK | Google Gemini models |

Optional providers require extra dependencies:

pip install dlightrag[anthropic]   # Anthropic SDK
pip install dlightrag[gemini]      # Google GenAI SDK
pip install dlightrag[all]         # all optional providers

# config.yaml — OpenAI-compatible (Ollama example)
chat:
  provider: openai
  model: qwen3:8b
  base_url: http://localhost:11434/v1

# config.yaml — Anthropic (native SDK)
chat:
  provider: anthropic
  model: claude-sonnet-4-20250514

# config.yaml — Google Gemini (native SDK)
chat:
  provider: gemini
  model: gemini-2.5-pro

API keys go in .env:

DLIGHTRAG_CHAT__API_KEY=sk-...
DLIGHTRAG_EMBEDDING__API_KEY=sk-...

Storage Backends

Set in config.yaml:

| Setting | Default | Options |
|---------|---------|---------|
| vector_storage | PGVectorStorage | PGVectorStorage, MilvusVectorDBStorage, NanoVectorDBStorage, ... |
| graph_storage | PGGraphStorage | PGGraphStorage, Neo4JStorage, NetworkXStorage, ... |
| kv_storage | PGKVStorage | PGKVStorage, JsonKVStorage, RedisKVStorage, ... |
| doc_status_storage | PGDocStatusStorage | PGDocStatusStorage, JsonDocStatusStorage, ... |

Note: When using PostgreSQL backends, LightRAG maps its internal namespace names to different table names (e.g. text_chunks → LIGHTRAG_DOC_CHUNKS, full_docs → LIGHTRAG_DOC_FULL). DlightRAG's unified mode adds a visual_chunks table via its own KV storage.

Workspaces

Each workspace has its own knowledge graph, vector store, and document index. workspace in config.yaml (default: default) is automatically bridged to backend-specific env vars — no manual setup needed.

| Backend type | Isolation mechanism |
|--------------|---------------------|
| PostgreSQL (PG*) | workspace column / graph name in the same database |
| Neo4j / Memgraph | Label prefix |
| Milvus / Qdrant | Collection prefix |
| MongoDB / Redis | Collection scope |
| JSON / Nano / NetworkX / Faiss | Subdirectory under working_dir/<workspace>/ |
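Cross-workspace federation has to merge per-workspace result lists into one ranking. A minimal sketch of score-based merging with deduplication — illustrative only, not DlightRAG's actual merge logic:

```python
def merge_federated(results_by_workspace, top_k=5):
    """Merge ranked (doc_id, score) lists from several workspaces.

    Scores are assumed comparable because the workspaces share an
    embedding model; duplicate doc_ids keep their best score.
    """
    best = {}
    for workspace, hits in results_by_workspace.items():
        for doc_id, score in hits:
            if doc_id not in best or score > best[doc_id][0]:
                best[doc_id] = (score, workspace)
    # Sort all deduplicated hits by score, descending
    ranked = sorted(best.items(), key=lambda kv: kv[1][0], reverse=True)
    return [(doc_id, score, ws) for doc_id, (score, ws) in ranked[:top_k]]

print(merge_federated({
    "reports": [("doc-a", 0.91), ("doc-b", 0.72)],
    "slides": [("doc-c", 0.88), ("doc-a", 0.64)],
}, top_k=3))
# [('doc-a', 0.91, 'reports'), ('doc-c', 0.88, 'slides'), ('doc-b', 0.72, 'reports')]
```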

Reranking

Set in config.yaml under the rerank: block:

| Setting | Default | Description |
|---------|---------|-------------|
| rerank.strategy | llm_listwise | llm_listwise, cohere, jina, aliyun, azure_cohere |
| rerank.model | (strategy default) | Model name sent to the endpoint |
| rerank.base_url | (provider default) | Custom endpoint URL for any compatible service |
| rerank.api_key | — | Set in .env as DLIGHTRAG_RERANK__API_KEY |

Default model and API key per strategy:

| Strategy | Default model | API key |
|----------|---------------|---------|
| llm_listwise | (uses ingest/chat model) | (reuses chat/ingest key) |
| cohere | rerank-v4.0-pro | DLIGHTRAG_RERANK__API_KEY |
| jina | jina-reranker-v3 | DLIGHTRAG_RERANK__API_KEY |
| aliyun | qwen3-rerank | DLIGHTRAG_RERANK__API_KEY |
| azure_cohere | Cohere-rerank-v4.0-pro | DLIGHTRAG_RERANK__API_KEY |

Point any backend at a local reranker (Xinference, etc.) via rerank.base_url + rerank.model in config.yaml.
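A local reranker configuration might look like the following sketch (the model name, port, and strategy pairing are placeholders; check config.yaml for the exact fields your backend expects):

```yaml
# config.yaml — pointing a rerank strategy at a local server
rerank:
  strategy: jina                 # an API-style strategy with a custom endpoint
  model: bge-reranker-v2-m3      # whatever model the local server exposes
  base_url: http://localhost:9997/v1/rerank
```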

Observability (Langfuse)

DlightRAG includes native, zero-overhead tracing using Langfuse. When configured, you get detailed waterfall traces of every RAG pipeline stage, LLM generation, and embedding call. If keys are omitted, the tracing module operates as a pure no-op with zero performance penalty.
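The zero-overhead behavior follows a common pattern: when no keys are configured, the tracing decorator is simply the identity function. A hypothetical sketch of that pattern (not DlightRAG's actual implementation):

```python
import functools
import os

def traced(name):
    """Return a tracing decorator, or a no-op when Langfuse is unconfigured."""
    if not os.environ.get("DLIGHTRAG_LANGFUSE_PUBLIC_KEY"):
        return lambda fn: fn  # identity: zero overhead on the hot path

    def decorator(fn):
        @functools.wraps(fn)
        def wrapper(*args, **kwargs):
            # A real implementation would open a Langfuse span here.
            print(f"[trace] {name} start")
            try:
                return fn(*args, **kwargs)
            finally:
                print(f"[trace] {name} end")
        return wrapper
    return decorator

@traced("retrieve")
def retrieve(query):
    return [f"context for {query!r}"]

print(retrieve("key findings"))
```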

To enable observability, set the following in your .env:

DLIGHTRAG_LANGFUSE_PUBLIC_KEY=pk-...
DLIGHTRAG_LANGFUSE_SECRET_KEY=sk-...
# DLIGHTRAG_LANGFUSE_HOST=https://cloud.langfuse.com  # Optional: defaults to cloud

This will automatically track retrieve, answer, and ingest operations at the service level.

Development

git clone https://github.com/hanlianlu/dlightrag.git && cd dlightrag
cp .env.example .env && uv sync
docker compose up -d                # PostgreSQL + API + MCP
docker compose up postgres -d       # PostgreSQL only
uv run pytest tests/unit            # unit tests (no external services)
uv run pytest tests/integration     # integration tests (requires PostgreSQL)
uv run ruff check src/ tests/ scripts/ --fix && uv run ruff format src/ tests/ scripts/

License

Apache License 2.0 — see LICENSE.


Built by HanlianLyu. Contributions welcome!
