The most accurate Graph RAG framework. Build knowledge graphs and query them with natural language. Built on FalkorDB.

GraphRAG SDK

Python 3.10+ License: Apache 2.0 Version: 1.0.0rc1 Tests: 582 passing

GraphRAG SDK builds knowledge graphs from documents and answers questions over them using retrieval-augmented generation. Every algorithmic concern (chunking, extraction, resolution, retrieval, reranking) is a swappable strategy behind an abstract interface. The default pipeline scores ~85% accuracy on a 100-question benchmark using GPT-4.1.

Quick Start

import asyncio
from graphrag_sdk import GraphRAG, ConnectionConfig, LiteLLM, LiteLLMEmbedder

async def main():
    async with GraphRAG(
        connection=ConnectionConfig(host="localhost", graph_name="my_graph"),
        llm=LiteLLM(model="openai/gpt-4o"),
        embedder=LiteLLMEmbedder(model="openai/text-embedding-3-small"),
    ) as rag:
        result = await rag.ingest("my_document.txt")
        print(f"Created {result.nodes_created} nodes, {result.relationships_created} edges")

        answer = await rag.completion("What is the main theme?")
        print(answer.answer)

asyncio.run(main())

Installation

pip install graphrag-sdk[litellm]       # OpenAI, Azure, Anthropic, 100+ models
pip install graphrag-sdk[openrouter]    # OpenRouter models
pip install graphrag-sdk[pdf]           # PDF ingestion
pip install graphrag-sdk[all]           # Everything

Prerequisites

  • Python >= 3.10
  • FalkorDB: docker run -p 6379:6379 falkordb/falkordb
  • An LLM API key (OpenAI, Azure OpenAI, OpenRouter, etc.)
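
Before running the snippets below, it can help to confirm that FalkorDB is actually reachable on its port. A minimal stdlib check (a hypothetical helper for illustration, not part of the SDK):

```python
import socket

def port_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False

# Check the default FalkorDB port started by the docker command above.
if __name__ == "__main__":
    print("FalkorDB reachable:", port_open("localhost", 6379))
```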

Usage

Ingest & Query

import asyncio
from graphrag_sdk import GraphRAG, ConnectionConfig, LiteLLM, LiteLLMEmbedder

async def main():
    async with GraphRAG(
        connection=ConnectionConfig(host="localhost", graph_name="my_graph"),
        llm=LiteLLM(model="openai/gpt-4o"),
        embedder=LiteLLMEmbedder(model="openai/text-embedding-3-small"),
    ) as rag:
        await rag.ingest("report.pdf")                              # PDF
        await rag.ingest("source_id", text="Alice works at Acme.")  # Raw text
        await rag.finalize()                                        # Dedup + index

        # Retrieve context only
        context = await rag.retrieve("Where does Alice work?")

        # Full RAG: retrieve + generate answer
        result = await rag.completion("Where does Alice work?")
        print(result.answer)

asyncio.run(main())

Multi-Turn Conversations

completion() supports multi-turn conversations. With the built-in providers (LiteLLM, OpenRouterLLM), messages are passed natively to the LLM's chat API. Custom providers that only implement invoke() get automatic fallback via message concatenation.

from graphrag_sdk import ChatMessage

answer = await rag.completion(
    "What happened next?",
    history=[
        ChatMessage(role="user", content="Who is Alice?"),
        ChatMessage(role="assistant", content="Alice is an engineer at Acme Corp."),
    ],
)

Supported roles: "system", "user", "assistant". Invalid roles raise ValueError.
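
The concatenation fallback described above can be sketched in plain Python. This is an illustration of the idea only, not the SDK's actual code; the `ChatMessage` stand-in below mirrors the role validation mentioned above:

```python
from dataclasses import dataclass

VALID_ROLES = {"system", "user", "assistant"}

@dataclass
class ChatMessage:
    role: str
    content: str

    def __post_init__(self):
        # Mirrors the SDK's behavior: invalid roles raise ValueError.
        if self.role not in VALID_ROLES:
            raise ValueError(f"invalid role: {self.role!r}")

def concat_history(history: list[ChatMessage], prompt: str) -> str:
    """Flatten a chat history into one prompt string, for providers
    that only implement invoke() on raw strings."""
    lines = [f"{m.role}: {m.content}" for m in history]
    lines.append(f"user: {prompt}")
    return "\n".join(lines)
```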

Schema Definition

from graphrag_sdk import GraphSchema, EntityType, RelationType, SchemaPattern

schema = GraphSchema(
    entities=[
        EntityType(label="Person", description="A human being"),
        EntityType(label="Organization", description="A company or institution"),
    ],
    relations=[
        RelationType(label="WORKS_AT", description="Is employed by"),
    ],
    patterns=[
        SchemaPattern(source="Person", relationship="WORKS_AT", target="Organization"),
    ],
)

rag = GraphRAG(connection=conn, llm=llm, embedder=embedder, schema=schema)  # conn, llm, embedder from above
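
Conceptually, patterns act as an allow-list for extracted triples: a (source, relationship, target) combination outside the declared patterns can be rejected. A minimal sketch of that idea (illustrative only, not the SDK's internals):

```python
# The schema's patterns, as (source_label, relation, target_label) tuples.
ALLOWED_PATTERNS = {("Person", "WORKS_AT", "Organization")}

def triple_allowed(source_label: str, relation: str, target_label: str) -> bool:
    """Return True if an extracted triple matches a declared pattern."""
    return (source_label, relation, target_label) in ALLOWED_PATTERNS
```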

Strategy Customization

Override any pipeline step by passing a strategy:

from graphrag_sdk.ingestion.chunking_strategies.fixed_size import FixedSizeChunking
from graphrag_sdk import GraphExtraction, LLMExtractor
from graphrag_sdk.ingestion.resolution_strategies import SemanticResolution

# Custom chunking
await rag.ingest("doc.txt", chunker=FixedSizeChunking(chunk_size=1500, chunk_overlap=200))

# LLM-based entity extraction instead of GLiNER
await rag.ingest("doc.txt", extractor=GraphExtraction(llm=llm, entity_extractor=LLMExtractor(llm)))

Strategy Reference

Every algorithmic concern is a swappable strategy behind an abstract base class:

  • Loading (LoaderStrategy): TextLoader, PdfLoader. Default: auto-detect by extension.
  • Chunking (ChunkingStrategy): FixedSizeChunking, SentenceTokenCapChunking, ContextualChunking, CallableChunking. Default: FixedSizeChunking.
  • Extraction (ExtractionStrategy): GraphExtraction (GLiNER2 + LLM). Default: GraphExtraction.
  • Resolution (ResolutionStrategy): ExactMatchResolution, DescriptionMergeResolution, SemanticResolution, LLMVerifiedResolution. Default: ExactMatchResolution.
  • Retrieval (RetrievalStrategy): LocalRetrieval, MultiPathRetrieval. Default: MultiPathRetrieval (5-path).
  • Reranking (RerankingStrategy): CosineReranker. Default: CosineReranker.
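
The pattern behind these ABCs can be illustrated with a self-contained reranking sketch. The class and method names below are hypothetical stand-ins, not the SDK's actual interfaces; the point is the shape of the strategy pattern, with cosine similarity as the concrete implementation:

```python
import math
from abc import ABC, abstractmethod

class Reranker(ABC):
    """Strategy interface: order candidate chunks by relevance to a query."""

    @abstractmethod
    def rerank(self, query_vec: list[float],
               chunks: list[tuple[str, list[float]]]) -> list[str]:
        ...

class CosineReranker(Reranker):
    """Rank chunks by cosine similarity between query and chunk embeddings."""

    @staticmethod
    def _cosine(a: list[float], b: list[float]) -> float:
        dot = sum(x * y for x, y in zip(a, b))
        norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
        return dot / norm if norm else 0.0

    def rerank(self, query_vec, chunks):
        # Sort most-similar first; return chunk texts only.
        return [text for text, _ in
                sorted(chunks, key=lambda c: self._cosine(query_vec, c[1]),
                       reverse=True)]
```

Because callers depend only on the abstract interface, swapping in a different reranker (or chunker, extractor, and so on) is a constructor argument rather than a code change.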

LLM & Embedding Providers

  • LiteLLM: LiteLLM / LiteLLMEmbedder. Models: OpenAI, Azure, Anthropic, Cohere, 100+.
  • OpenRouter: OpenRouterLLM / OpenRouterEmbedder. Models: all OpenRouter models.
  • Custom: subclass LLMInterface / subclass Embedder. Models: anything.

Benchmark

~85% accuracy (8.5/10) on a 100-question benchmark over 20 Project Gutenberg novels.

  • Accuracy: ~85% (8.5/10)
  • Questions: 100 (fact retrieval, complex reasoning, summarization)
  • Documents: 20 novels (Project Gutenberg)
  • Query latency (P50): 5.4 s

See docs/benchmark.md for full methodology and reproduction instructions.

Examples

  • 01_quickstart.py: minimal ingest & query
  • 02_pdf_with_schema.py: PDF with a custom schema
  • 03_custom_strategies.py: the benchmark-winning pipeline
  • 04_custom_provider.py: custom LLM/embedder
  • 05_notebook_demo.ipynb: interactive notebook walkthrough

Documentation

License

Apache License 2.0
