DlightRAG - Dual-mode multi-modal RAG service based on LightRAG

DlightRAG

Dual-mode multimodal RAG built on LightRAG: knowledge graph and vector retrieval, delivered as a unified, production-ready service.

Features

Dual multimodal RAG modes: caption mode (parse → caption → embed) follows the pipeline-based multimodal paradigm; unified mode (render → multimodal embed) follows the more recent multimodal-embedding paradigm
Knowledge graph + vector retrieval: hybrid search built on LightRAG
Multimodal ingestion: PDF, Word, Excel, PowerPoint, images, etc.
Reranking: generic LLM-based listwise reranking; specialized rerankers from Cohere, Jina, Aliyun, and Azure Cohere; any additional backend via a custom endpoint
Cross-workspace federation: query across workspaces with round-robin merging
Content-aware dedup: files are hashed by content, which prevents duplicate ingestion
Flexible sourcing: local filesystem, Azure Blob Storage, Snowflake
Three interfaces: Python SDK, REST API, MCP server
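
Content-aware dedup can be pictured as hashing each file's bytes and skipping any hash that has already been seen. The sketch below only illustrates that idea and is not DlightRAG's implementation: the names `file_digest` and `filter_new_files`, and the choice of SHA-256, are assumptions.

```python
import hashlib
from pathlib import Path


def file_digest(path, chunk_size=1 << 20):
    """Return the SHA-256 hex digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def filter_new_files(paths, seen_digests):
    """Yield only the files whose content hash is not yet in seen_digests.

    seen_digests is updated in place, so a duplicate later in the same
    batch is skipped as well.
    """
    for path in paths:
        digest = file_digest(path)
        if digest in seen_digests:
            continue  # same bytes already ingested, possibly under another name
        seen_digests.add(digest)
        yield path
```

Because the key is the content rather than the file name, a renamed copy of an already ingested file is filtered out.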

Quick Start

Python SDK

uv add dlightrag        # or: pip install dlightrag

import asyncio
from dlightrag import RAGService, DlightragConfig

async def main():
    config = DlightragConfig(openai_api_key="sk-...")
    service = await RAGService.create(config=config)

    await service.aingest(source_type="local", path="./docs")

    result = await service.aretrieve(query="What are the key findings?")
    print(result.contexts)

    result = await service.aanswer(query="What are the key findings?")
    print(result.answer)

    await service.close()

asyncio.run(main())

Requires PostgreSQL with pgvector + AGE, or JSON fallback for development (see Configuration).

Docker (Self-Hosted)

git clone https://github.com/hanlianlu/dlightrag.git && cd dlightrag
cp .env.example .env    # edit .env — at minimum set DLIGHTRAG_OPENAI_API_KEY
docker compose up

Includes PostgreSQL (pgvector + AGE), REST API (:8100), and MCP server (:8101).

Local models (Ollama, Xinference, etc.): use host.docker.internal instead of localhost in base URL settings.
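
If base URLs are generated by a script, the localhost substitution can be automated. This is a small sketch using only the standard library; `to_docker_host` is a hypothetical helper, not part of DlightRAG, and it drops any user:password part of the URL.

```python
from urllib.parse import urlsplit, urlunsplit

LOOPBACK_HOSTS = {"localhost", "127.0.0.1"}


def to_docker_host(base_url):
    """Rewrite a loopback base URL so it resolves from inside a container."""
    parts = urlsplit(base_url)
    if parts.hostname not in LOOPBACK_HOSTS:
        return base_url  # already points at a non-loopback host
    netloc = "host.docker.internal"
    if parts.port is not None:
        netloc = f"{netloc}:{parts.port}"
    return urlunsplit(parts._replace(netloc=netloc))
```

For example, `to_docker_host("http://localhost:11434/v1")` returns `http://host.docker.internal:11434/v1`, while a remote URL is returned unchanged.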

curl http://localhost:8100/health

curl -X POST http://localhost:8100/ingest \
  -H "Content-Type: application/json" \
  -d '{"source_type": "local", "path": "/app/dlightrag_storage/sources"}'

curl -X POST http://localhost:8100/retrieve \
  -H "Content-Type: application/json" \
  -d '{"query": "What are the key findings?"}'

curl -X POST http://localhost:8100/answer \
  -H "Content-Type: application/json" \
  -d '{"query": "What are the key findings?", "stream": true}'
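
The same non-streaming calls can be made from Python without extra dependencies. This is a minimal sketch: `build_post` and `post_json` are hypothetical helper names, the response body is assumed to be JSON, and the bearer header only matters when DLIGHTRAG_API_AUTH_TOKEN is set on the server.

```python
import json
import urllib.request
from typing import Optional

BASE_URL = "http://localhost:8100"  # REST API port from the Docker setup


def build_post(endpoint: str, payload: dict, token: Optional[str] = None) -> urllib.request.Request:
    """Build a JSON POST request for one of the REST endpoints."""
    headers = {"Content-Type": "application/json"}
    if token:
        headers["Authorization"] = f"Bearer {token}"
    return urllib.request.Request(
        f"{BASE_URL}{endpoint}",
        data=json.dumps(payload).encode("utf-8"),
        headers=headers,
        method="POST",
    )


def post_json(endpoint: str, payload: dict, token: Optional[str] = None):
    """Send the request and decode the response (assumed to be JSON)."""
    with urllib.request.urlopen(build_post(endpoint, payload, token)) as response:
        return json.load(response)
```

With the service running, `post_json("/retrieve", {"query": "What are the key findings?"})` mirrors the second curl command. Streaming responses need a line-by-line reader instead of `json.load`.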

MCP Server (for AI Agents)

uv tool install dlightrag   # or: pip install dlightrag
dlightrag-mcp --env-file /path/to/.env

{
  "mcpServers": {
    "dlightrag": {
      "command": "uvx",
      "args": ["dlightrag-mcp", "--env-file", "/absolute/path/to/.env"]
    }
  }
}

Tools: retrieve, answer, ingest, list_files, delete_files, list_workspaces — all with workspace isolation.

Configuration

All settings can be supplied through DLIGHTRAG_-prefixed environment variables, a .env file, or constructor arguments. See .env.example for the full reference.

Priority: constructor args > env vars > .env file > defaults
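
The priority order behaves like a layered lookup in which the first layer that defines a key wins. The sketch below models it with `collections.ChainMap`; it illustrates the rule only, not how DlightragConfig is implemented, and `resolve_settings` and the key names are hypothetical.

```python
from collections import ChainMap


def resolve_settings(constructor_args, env_vars, dotenv_values, defaults):
    """Merge four setting layers; earlier arguments take priority."""
    return dict(ChainMap(constructor_args, env_vars, dotenv_values, defaults))


settings = resolve_settings(
    constructor_args={},
    env_vars={"workspace": "team-a"},
    dotenv_values={"rag_mode": "unified", "workspace": "from-dotenv"},
    defaults={"rag_mode": "caption", "workspace": "default"},
)
# workspace comes from the environment, rag_mode from the .env file
```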

RAG Mode

Choose the RAG mode first: it determines your ingestion pipeline, model requirements, and retrieval behavior.

Mode	Pipeline	Best for
`caption` (default)	Document parsing → VLM captioning → text embedding → KG	Text-heavy documents, structured elements
`unified`	Page rendering → multimodal embedding → VLM entity extraction → KG	Visually rich documents (charts, diagrams, complex layouts)

Model usage by stage:

Stage	Caption	Unified
Image captioning	`VISION_MODEL` ¹	`VISION_MODEL`
Table / equation captioning	`CHAT_MODEL`	—
Entity extraction	`CHAT_MODEL`	`CHAT_MODEL`
Embedding	`EMBEDDING_MODEL`	`EMBEDDING_MODEL` (multimodal)
Rerank	`RERANK_*` via LightRAG	`VISION_MODEL` ² or `RERANK_*` API
Answer generation	`CHAT_MODEL`	`VISION_MODEL` (sees page images)

¹ Falls back to CHAT_MODEL if no vision model is configured.
² When RERANK_BACKEND=llm (pointwise VLM scoring).

For unified mode, set DLIGHTRAG_RAG_MODE=unified and point embedding/vision at multimodal models:

DLIGHTRAG_RAG_MODE=unified
DLIGHTRAG_EMBEDDING_MODEL=Qwen3-VL-Embedding    # must be multimodal
DLIGHTRAG_EMBEDDING_DIM=4096
DLIGHTRAG_VISION_MODEL=qwen3-vl-32b

Limitations: Snowflake sources are text-only (no visual embedding). A workspace is locked to one mode after its first ingestion. Rendered page images take roughly 3-7 MB per page at 250 DPI.
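
The per-page figure makes storage planning for unified mode simple arithmetic. A rough estimator, using only the 3-7 MB range quoted above (`page_image_storage_mb` is a hypothetical helper):

```python
def page_image_storage_mb(pages, mb_per_page=(3.0, 7.0)):
    """Return the (low, high) storage estimate in MB for rendered page images."""
    low, high = mb_per_page
    return pages * low, pages * high


low_mb, high_mb = page_image_storage_mb(200)
# A 200-page document needs roughly 600 to 1400 MB of page images.
```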

Providers

Variable	Default	Description
`DLIGHTRAG_LLM_PROVIDER`	`openai`	`openai`, `azure_openai`, `anthropic`, `google_gemini`, `qwen`, `minimax`, `xinference`, `openrouter`
`DLIGHTRAG_EMBEDDING_PROVIDER`	(follows `llm_provider`)	Override embedding provider
`DLIGHTRAG_VISION_PROVIDER`	(follows `llm_provider`)	Override vision provider
`DLIGHTRAG_EMBEDDING_MODEL`	`text-embedding-3-large`	Embedding model

Each provider uses its own API key. For Ollama, use the openai provider and point DLIGHTRAG_OPENAI_BASE_URL at the Ollama server.

Storage Backends

Variable	Default	Options
`DLIGHTRAG_VECTOR_STORAGE`	`PGVectorStorage`	PGVectorStorage, MilvusVectorDBStorage, NanoVectorDBStorage, ...
`DLIGHTRAG_GRAPH_STORAGE`	`PGGraphStorage`	PGGraphStorage, Neo4JStorage, NetworkXStorage, ...
`DLIGHTRAG_KV_STORAGE`	`PGKVStorage`	PGKVStorage, JsonKVStorage, RedisKVStorage, ...
`DLIGHTRAG_DOC_STATUS_STORAGE`	`PGDocStatusStorage`	PGDocStatusStorage, JsonDocStatusStorage, ...

Workspaces

Each workspace has its own knowledge graph, vector store, and document index. DLIGHTRAG_WORKSPACE (default: default) is automatically bridged to backend-specific env vars — no manual setup needed.

Backend type	Isolation mechanism
PostgreSQL (PG*)	`workspace` column / graph name in same database
Neo4j / Memgraph	Label prefix
Milvus / Qdrant	Collection prefix
MongoDB / Redis	Collection scope
JSON / Nano / NetworkX / Faiss	Subdirectory under `working_dir/<workspace>/`
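
For the file-based backends in the last row, isolation amounts to one subdirectory per workspace. The sketch below shows that layout under stated assumptions: `workspace_dir` is a hypothetical helper, and the name validation is an added safeguard, not documented DlightRAG behavior.

```python
from pathlib import Path


def workspace_dir(working_dir, workspace="default"):
    """Return working_dir/<workspace>, rejecting names that would escape it."""
    if workspace in {"", ".", ".."} or Path(workspace).name != workspace:
        raise ValueError(f"invalid workspace name: {workspace!r}")
    return Path(working_dir) / workspace
```

Two workspaces under the same `working_dir` therefore never share files, for example `/data/rag/default` and `/data/rag/team-a`.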

Reranking

Variable	Default	Description
`DLIGHTRAG_RERANK_BACKEND`	`llm`	`llm`, `cohere`, `jina`, `aliyun`, `azure_cohere`
`DLIGHTRAG_RERANK_MODEL`	(backend default)	Model name sent to the endpoint
`DLIGHTRAG_RERANK_BASE_URL`	(provider default)	Custom endpoint URL for any compatible service
`DLIGHTRAG_RERANK_API_KEY`	—	API key (falls back to provider-specific keys)

Backend	Default model	Key
`llm`	(follows `CHAT_MODEL`)	(follows `LLM_PROVIDER`)
`cohere`	`rerank-v4.0-pro`	`DLIGHTRAG_COHERE_API_KEY`
`jina`	`jina-reranker-v3`	`DLIGHTRAG_JINA_API_KEY`
`aliyun`	`qwen3-rerank`	`DLIGHTRAG_ALIYUN_RERANK_API_KEY`
`azure_cohere`	`Cohere-rerank-v4.0-pro`	`DLIGHTRAG_AZURE_COHERE_API_KEY`

Point any backend at a local reranker (Xinference, LiteLLM, etc.) via RERANK_BASE_URL + RERANK_MODEL.
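
The two tables combine into a simple resolution rule: explicit RERANK_* values win, otherwise the backend's default model and provider-specific key apply. The sketch below encodes the tables as data; `resolve_rerank` is a hypothetical helper, not DlightRAG's code, and a `None` model for the `llm` backend stands for "follow CHAT_MODEL".

```python
RERANK_DEFAULTS = {
    # backend: (default model, provider-specific key variable)
    "llm": (None, None),  # follows CHAT_MODEL / LLM_PROVIDER
    "cohere": ("rerank-v4.0-pro", "DLIGHTRAG_COHERE_API_KEY"),
    "jina": ("jina-reranker-v3", "DLIGHTRAG_JINA_API_KEY"),
    "aliyun": ("qwen3-rerank", "DLIGHTRAG_ALIYUN_RERANK_API_KEY"),
    "azure_cohere": ("Cohere-rerank-v4.0-pro", "DLIGHTRAG_AZURE_COHERE_API_KEY"),
}


def resolve_rerank(env):
    """Resolve effective rerank settings from a mapping of env vars."""
    backend = env.get("DLIGHTRAG_RERANK_BACKEND", "llm")
    if backend not in RERANK_DEFAULTS:
        raise ValueError(f"unknown rerank backend: {backend!r}")
    default_model, key_var = RERANK_DEFAULTS[backend]
    fallback_key = env.get(key_var) if key_var else None
    return {
        "backend": backend,
        "model": env.get("DLIGHTRAG_RERANK_MODEL", default_model),
        "base_url": env.get("DLIGHTRAG_RERANK_BASE_URL"),  # None = provider default
        "api_key": env.get("DLIGHTRAG_RERANK_API_KEY") or fallback_key,
    }
```

Passing `os.environ` as `env` resolves the settings of the current process; a plain dict works for experiments.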

REST API

Method	Endpoint	Description
`POST`	`/ingest`	Ingest from local, Azure Blob, or Snowflake
`POST`	`/retrieve`	Contexts + sources (no LLM answer)
`POST`	`/answer`	LLM answer + contexts + sources (`stream: true` for SSE)
`GET`	`/files`	List ingested documents
`DELETE`	`/files`	Delete documents
`GET`	`/workspaces`	List available workspaces
`GET`	`/health`	Health check with storage status

All write endpoints accept optional workspace; read endpoints accept workspaces list for cross-workspace federated search. Set DLIGHTRAG_API_AUTH_TOKEN to enable bearer auth.
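
Round-robin merging, as used for federated search, interleaves the per-workspace result lists rank by rank so that no single workspace dominates the top of the merged list. A generic sketch of the technique follows; it is not DlightRAG's code, and `round_robin_merge`, the duplicate filtering, and the `limit` parameter are assumptions.

```python
from itertools import zip_longest

_MISSING = object()  # sentinel for lists that have run out of items


def round_robin_merge(*ranked_lists, limit=None):
    """Interleave ranked lists: all rank-1 items, then all rank-2 items, etc.

    Items must be hashable; an item seen earlier is not repeated.
    """
    merged, seen = [], set()
    for tier in zip_longest(*ranked_lists, fillvalue=_MISSING):
        for item in tier:
            if item is _MISSING or item in seen:
                continue
            seen.add(item)
            merged.append(item)
    return merged if limit is None else merged[:limit]
```

Merging `["a1", "a2", "a3"]`, `["b1"]`, and `["c1", "c2"]` yields `["a1", "b1", "c1", "a2", "c2", "a3"]`.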

SSE Streaming

Set "stream": true to receive Server-Sent Events:

Event type	Payload	Description
`context`	`{type, data, raw}`	Contexts and sources (sent first)
`token`	`{type, content}`	LLM answer token
`done`	`{type}`	Stream complete
`error`	`{type, message}`	Error mid-stream
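
A client can consume the stream by parsing `data:` lines and dispatching on the `type` field of each payload. The sketch below assumes every event carries its JSON payload on `data:` lines and ends with a blank line, which is standard SSE framing; `iter_sse_events` and `collect_answer` are hypothetical names, not part of DlightRAG.

```python
import json


def iter_sse_events(lines):
    """Yield decoded JSON payloads from an iterable of SSE text lines."""
    data_lines = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":  # a blank line terminates the current event
            if data_lines:
                yield json.loads("\n".join(data_lines))
                data_lines = []
            continue
        if line.startswith(":"):  # SSE comment / keep-alive
            continue
        field, _, value = line.partition(":")
        if field == "data":
            data_lines.append(value[1:] if value.startswith(" ") else value)
    # An event without its terminating blank line is dropped.


def collect_answer(events):
    """Fold a stream of events into (answer_text, contexts)."""
    contexts, tokens = None, []
    for event in events:
        kind = event.get("type")
        if kind == "context":
            contexts = event.get("data")
        elif kind == "token":
            tokens.append(event.get("content", ""))
        elif kind == "error":
            raise RuntimeError(event.get("message", "stream error"))
        elif kind == "done":
            break
    return "".join(tokens), contexts
```

Any iterable of decoded text lines works as input, for example the lines of an HTTP response body read incrementally.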

Development

git clone https://github.com/hanlianlu/dlightrag.git && cd dlightrag
cp .env.example .env && uv sync
docker compose up -d                # PostgreSQL + API + MCP
docker compose up postgres -d       # PostgreSQL only

uv run pytest tests/unit            # unit tests (no external services)
uv run pytest tests/integration     # integration tests (requires PostgreSQL)
uv run ruff check src/ tests/ scripts/ && uv run ruff format --check src/ tests/ scripts/

Skip PostgreSQL for development:

DLIGHTRAG_VECTOR_STORAGE=NanoVectorDBStorage
DLIGHTRAG_GRAPH_STORAGE=NetworkXStorage
DLIGHTRAG_KV_STORAGE=JsonKVStorage
DLIGHTRAG_DOC_STATUS_STORAGE=JsonDocStatusStorage

Architecture

[Figure: DlightRAG Architecture. Source: docs/architecture.drawio]

License

Apache License 2.0 — see LICENSE.

Built by HanlianLyu. Contributions welcome!

Project details

Maintainers

fumoffu

Release history

1.3.6

Apr 30, 2026

1.3.5

Apr 1, 2026

1.3.4

Mar 31, 2026

1.3.3

Mar 30, 2026

1.3.1

Mar 27, 2026

1.3.0

Mar 26, 2026

1.2.9

Mar 26, 2026

1.2.8

Mar 25, 2026

1.2.7

Mar 25, 2026

1.2.6

Mar 25, 2026

1.2.5

Mar 25, 2026

1.2.4.1

Mar 24, 2026

1.2.4

Mar 23, 2026

1.2.3

Mar 23, 2026

1.2.2.2

Mar 18, 2026

1.2.2.1

Mar 18, 2026

1.2.2

Mar 18, 2026

1.2.1.1

Mar 18, 2026

1.2.1

Mar 18, 2026

1.2.0.1

Mar 16, 2026

1.2.0

Mar 16, 2026

1.1.6.7

Mar 15, 2026

1.1.6.5

Mar 14, 2026

1.1.6.3

Mar 13, 2026

1.1.6.1

Mar 12, 2026

1.1.6

Mar 12, 2026

1.1.4

Mar 10, 2026

1.1.3

Mar 10, 2026

1.1.0

Mar 8, 2026

This version

1.0.0

Mar 8, 2026

0.2.6

Mar 6, 2026

0.2.4

Mar 5, 2026

0.2.3

Mar 5, 2026

Download files

Source Distribution

dlightrag-1.0.0.tar.gz (736.6 kB)

Uploaded Mar 8, 2026 Source

Built Distribution

dlightrag-1.0.0-py3-none-any.whl (103.6 kB)

Uploaded Mar 8, 2026 Python 3

File details

Details for the file dlightrag-1.0.0.tar.gz.

File metadata

Download URL: dlightrag-1.0.0.tar.gz
Upload date: Mar 8, 2026
Size: 736.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dlightrag-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`93a28d1e63b5be0b5be0d2bf9230d5148af5fab88c9faa535216dc356a7a695e`
MD5	`1b25b05bc6a8601a11fe4dc62cbf297c`
BLAKE2b-256	`03e5bf13d41ffbac68d1c2d4043c3ba9c7a85ee6c81c322446d1ad06e1cce3d5`
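
A downloaded file can be checked against the published SHA256 digest with the standard library. This is a minimal sketch; `sha256_matches` is a hypothetical helper name.

```python
import hashlib
import hmac


def sha256_matches(path, expected_hex, chunk_size=1 << 20):
    """Return True if the file's SHA-256 digest equals expected_hex."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return hmac.compare_digest(digest.hexdigest(), expected_hex.strip().lower())
```

For example, `sha256_matches("dlightrag-1.0.0.tar.gz", "93a28d1e63b5be0b5be0d2bf9230d5148af5fab88c9faa535216dc356a7a695e")` should return True for an intact download of the source distribution.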

Provenance

The following attestation bundles were made for dlightrag-1.0.0.tar.gz:

Publisher: publish.yml on hanlianlu/DlightRAG

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dlightrag-1.0.0.tar.gz
- Subject digest: 93a28d1e63b5be0b5be0d2bf9230d5148af5fab88c9faa535216dc356a7a695e
- Sigstore transparency entry: 1059800648
- Sigstore integration time: Mar 8, 2026
Source repository:
- Permalink: hanlianlu/DlightRAG@72e3ba42bef5c49eb5f4f54e6f117e261a6816fd
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/hanlianlu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@72e3ba42bef5c49eb5f4f54e6f117e261a6816fd
- Trigger Event: push

File details

Details for the file dlightrag-1.0.0-py3-none-any.whl.

File metadata

Download URL: dlightrag-1.0.0-py3-none-any.whl
Upload date: Mar 8, 2026
Size: 103.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dlightrag-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`21db151e3542663af5ca3ed97eab78d5d172989f696d10dc6e6db32563d32f09`
MD5	`b3fbf5a6f63fdca2022f9c09b6143ee3`
BLAKE2b-256	`86189aea76d3348466bb1edea6231ebce730a0bb813a416cfa54f2f425940392`

Provenance

The following attestation bundles were made for dlightrag-1.0.0-py3-none-any.whl:

Publisher: publish.yml on hanlianlu/DlightRAG

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dlightrag-1.0.0-py3-none-any.whl
- Subject digest: 21db151e3542663af5ca3ed97eab78d5d172989f696d10dc6e6db32563d32f09
- Sigstore transparency entry: 1059800649
- Sigstore integration time: Mar 8, 2026
Source repository:
- Permalink: hanlianlu/DlightRAG@72e3ba42bef5c49eb5f4f54e6f117e261a6816fd
- Branch / Tag: refs/tags/v1.0.0
- Owner: https://github.com/hanlianlu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@72e3ba42bef5c49eb5f4f54e6f117e261a6816fd
- Trigger Event: push
