A graph-based reasoning library with embedding search and multi-hop traversal

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

berkgokden

These details have not been verified by PyPI

Project description

ReasonGraph

A graph-based reasoning library with embedding search, multi-hop traversal, and automatic entity/causal extraction.

Installation

pip install reasongraph[all]        # everything included

Or install only what you need:

pip install reasongraph             # core: in-memory backend, NER extraction, embeddings
pip install reasongraph[sqlite]     # + SQLite backend with sqlite-vec
pip install reasongraph[gliner2]    # + GLiNER2 entity + causal extraction (recommended)
pip install reasongraph[postgres]   # + PostgreSQL + pgvector backend

Quick Start

Using a built-in dataset

from reasongraph import ReasonGraph

graph = ReasonGraph()
graph.initialize_sync()
graph.load_dataset_sync("financial")

results = graph.query_sync("What caused the 2008 financial crisis?")
for i, text in enumerate(results, 1):
    print(f"{i}. {text}")

graph.close_sync()

Output -- a connected reasoning chain, not just keyword matches:

1. Lehman Brothers filed for bankruptcy in September 2008 after massive MBS losses.
2. Loose lending standards fueled a housing price bubble across the United States.
3. Lehman's collapse triggered a global credit freeze as interbank lending stopped.
4. Mortgage-backed securities built on subprime loans collapsed when defaults surged.
5. The U.S. government enacted TARP, a $700 billion bailout to stabilize the financial system.
6. Banks issued subprime mortgages to borrowers with poor credit histories.

Parsing free-form text

For free-form text, install with pip install reasongraph[gliner2] to get automatic entity and causal relation extraction.

from reasongraph import ReasonGraph

graph = ReasonGraph()
graph.initialize_sync()

graph.add_text_sync("Lehman Brothers filed for bankruptcy in September 2008 after massive MBS losses.")
graph.add_text_sync("The U.S. government enacted TARP, a $700 billion bailout to stabilize the financial system.")
graph.add_text_sync("The Federal Reserve cut interest rates to near zero after the 2008 crisis.")

results = graph.query_sync("What happened after Lehman collapsed?")
for i, text in enumerate(results, 1):
    print(f"{i}. {text}")

graph.close_sync()

1. Lehman Brothers filed for bankruptcy in September 2008 after massive MBS losses.
2. The Federal Reserve cut interest rates to near zero after the 2008 crisis.
3. The U.S. government enacted TARP, a $700 billion bailout to stabilize the financial system.

Async API

import asyncio
from reasongraph import ReasonGraph

async def main():
    async with ReasonGraph() as graph:
        await graph.load_dataset("financial")
        results = await graph.query("What caused the 2008 crisis?")
        for text in results:
            print(text)

asyncio.run(main())

Cross-Source Discovery

ReasonGraph's real power is connecting facts across independent sources. Feed in two unrelated reports and query across them -- the graph bridges shared entities and causal relations that flat embedding search cannot.

import asyncio
from reasongraph import ReasonGraph

source_a = [  # Tech industry report
    "Meridian Technologies opened a semiconductor fabrication plant in Phoenix, Arizona in 2023.",
    "The Phoenix fabrication plant consumes 10 million gallons of water daily for semiconductor manufacturing.",
    "Meridian Technologies signed a five-year supply contract with Apex Electronics to deliver next-generation processors.",
]

source_b = [  # Water crisis report (never mentions Meridian or semiconductors)
    "Phoenix, Arizona declared a water emergency in 2024 due to declining Colorado River levels.",
    "Large-scale manufacturing facilities in Phoenix face production shutdowns under the new water restrictions.",
    "Apex Electronics warned investors that supply chain disruptions from its key suppliers could delay product launches through 2026.",
]

async def main():
    async with ReasonGraph() as graph:
        await graph.add_texts(source_a)
        await graph.add_texts(source_b)
        results = await graph.query("How might the water crisis affect chip manufacturing?")
        for text in results:
            print(text)

asyncio.run(main())

Dr. Sarah Chen, chief engineer at Meridian Technologies, developed a new chip architecture requiring enormous water usage for cooling.
Large-scale manufacturing facilities in Phoenix face production shutdowns under the new water restrictions.
The Arizona Department of Water Resources imposed mandatory 40% water cuts on industrial users in the Phoenix metropolitan area.
Phoenix, Arizona declared a water emergency in 2024 due to declining Colorado River levels.
The Phoenix fabrication plant consumes 10 million gallons of water daily for semiconductor manufacturing.
Meridian Technologies opened a semiconductor fabrication plant in Phoenix, Arizona in 2023.
Meridian Technologies signed a five-year supply contract with Apex Electronics to deliver next-generation processors.

Neither source mentions the other's topic. GLiNER2 extracts "Phoenix" and "Apex Electronics" as shared entities, and the graph traversal connects water restrictions -> Phoenix manufacturing -> Meridian's plant -> Apex supply chain.

Full demo: uv run python examples/cross_source_discovery.py

Features

Cross-source discovery -- connect facts across independent sources through shared entities and causal relations
Automatic extraction -- GLiNER2 extracts entities and causal relations in one pass (falls back to BERT NER when gliner2 is not installed)
Hybrid search -- combine embedding similarity, keyword (trigram) matching, or both
Multi-hop traversal -- follow graph edges to discover connected reasoning chains
Cross-encoder reranking -- rerank results at each hop with ms-marco-MiniLM-L-6-v2
Built-in datasets -- load curated reasoning graphs for immediate use
Async-first -- native async API with sync convenience wrappers
Pluggable backends -- in-memory (zero-config default), SQLite, or PostgreSQL with pgvector

Built-in Datasets

Dataset	Description
`syllogisms`	Classical syllogistic reasoning chains
`causal`	Cause-effect reasoning with entity annotations
`taxonomy`	Hierarchical concept taxonomy
`financial`	Financial crisis causal chains (2008 crisis, dot-com, inflation, eurozone)
`medical`	Medical causal chains (heart disease, diabetes, infectious disease, cancer)
`analysis_patterns`	Data analysis reasoning: scenario detection, technique selection, implementation patterns

graph.load_dataset_sync("financial")

Search Modes

# Pure embedding similarity (default)
results = graph.query_sync("credit freeze", search_mode="embedding")

# Pure keyword/trigram matching
results = graph.query_sync("credit freeze", search_mode="keyword")

# Hybrid: Reciprocal Rank Fusion of embedding + trigram rankings
results = graph.query_sync("credit freeze", search_mode="hybrid")

# Tune the RRF smoothing constant (default 60, lower = more weight to top ranks)
results = graph.query_sync("credit freeze", search_mode="hybrid", rrf_k=30)

Entity and Causal Extraction

When gliner2 is installed, add_text() / add_texts() automatically use GLiNER2 for both entity extraction and causal relation detection. Without gliner2, it falls back to BERT NER (entities only).

from reasongraph import ReasonGraph, NERExtractor, GLiNER2Extractor

graph = ReasonGraph()
graph.initialize_sync()

# Default: GLiNER2 (entities + causal relations) if installed, else BERT NER
entities = graph.add_text_sync("Apple released the iPhone in 2007.")
print(entities)  # ['Apple', 'iPhone']

# Explicit: force BERT NER even if GLiNER2 is installed
entities = graph.add_text_sync("Apple released the iPhone in 2007.", extractor=NERExtractor())

# Explicit: GLiNER2 with custom entity types
gliner = GLiNER2Extractor(entity_types=["company", "product", "date"])
entities = graph.add_text_sync("Apple released the iPhone in 2007.", extractor=gliner)

# Any callable works
entities = graph.add_text_sync("some text", extractor=lambda t: ["custom"])

Backends

By default, ReasonGraph() uses a pure Python in-memory backend (MemoryBackend). This works everywhere with zero dependencies beyond numpy. For persistence, pass a file path to save/load as JSON:

from reasongraph import ReasonGraph, MemoryBackend

# In-memory only (default)
graph = ReasonGraph()

# In-memory with JSON file persistence (loads on init, saves on close)
graph = ReasonGraph(backend=MemoryBackend(file_path="graph.json"))

SQLite Backend

For larger graphs or concurrent access, use the SQLite backend with sqlite-vec for vector search. Requires pip install reasongraph[sqlite].

from reasongraph import ReasonGraph
from reasongraph.backends import SqliteBackend

graph = ReasonGraph(backend=SqliteBackend(db_path="graph.db"))

PostgreSQL Backend

from reasongraph import ReasonGraph
from reasongraph.backends import PostgresBackend

graph = ReasonGraph(backend=PostgresBackend(database_url="postgresql://user:pass@localhost/db"))

Requires pip install reasongraph[postgres] and the pgvector + pg_trgm extensions enabled on your database.

Evaluation: Mixed-Domain Reasoning

We evaluate reasoning quality by loading all 6 built-in datasets into a single graph (~130 text nodes, ~104 entity nodes, ~280 edges) and testing whether the library can trace the correct causal chains, syllogistic proofs, taxonomic hierarchies, and data analysis patterns -- without being distracted by unrelated facts from other domains.

32 test cases simulate agent-style queries like "I need to understand what caused the 2008 financial crisis", "How does insulin resistance lead to kidney failure?", or "I have two numeric columns, check if related" and check whether the returned reasoning chain matches the expected ground truth.

Per-domain results (hybrid search, top_k=5, hops=4, rerank_top_k=4):

Domain	Cases	Chain Completeness	Recall@5	Precision@5	Domain Accuracy
Causal	5	100%	100%	92%	100%
Financial	6	100%	82%	60%	100%
Medical	5	100%	92%	76%	92%
Syllogisms	5	100%	100%	92%	85%
Taxonomy	3	100%	83%	53%	92%
Analysis Patterns	8	96%	75%	45%	96%
Overall	32	99%	88%	68%	95%

32/32 cases pass (>= 50% chain completeness). Split reranking gives chain continuations (text-to-text edges) priority over bridge discoveries (entity-to-text edges), keeping traversal focused.

Search mode comparison:

Mode	Chain Completeness	Recall@5	Precision@5	Domain Accuracy
Embedding	99%	88%	68%	95%
Keyword	0%	0%	0%	0%
Hybrid	99%	88%	68%	95%

Keyword-only mode scores 0% because the eval queries are natural language questions that don't substring-match the dataset's declarative statements. This is expected -- keyword search is designed for known-term lookups, not question answering.

Reproduce: uv run python tests/eval_financial_reasoning.py

API Reference

`ReasonGraph(backend=None, embed_model=None, rerank_model=None, forget_after=30)`

Method	Description
`add_nodes(nodes)`	Add `(content, type)` tuples to the graph
`add_edges(edges)`	Add `(from, to)` content edges
`add_text(text, extractor=None)`	Add text with automatic entity extraction
`add_texts(texts, extractor=None, causal_extractor=None)`	Batch add with entity + causal extraction (auto-enabled with GLiNER2)
`query(query, top_k=5, hops=4, rerank_top_k=4, search_mode="embedding", rrf_k=60)`	Search and traverse the graph
`load_dataset(name)`	Load a built-in dataset
`delete_stale()`	Remove nodes not accessed within `forget_after` days
`get_all_nodes()` / `get_all_edges()`	Inspect graph contents

All methods are async. Sync variants are available with a _sync suffix (e.g. query_sync).

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

berkgokden

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.1

Feb 23, 2026

0.3.0

Feb 23, 2026

0.2.0

Feb 23, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

reasongraph-0.3.1.tar.gz (42.6 kB view details)

Uploaded Feb 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

reasongraph-0.3.1-py3-none-any.whl (34.0 kB view details)

Uploaded Feb 23, 2026 Python 3

File details

Details for the file reasongraph-0.3.1.tar.gz.

File metadata

Download URL: reasongraph-0.3.1.tar.gz
Upload date: Feb 23, 2026
Size: 42.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for reasongraph-0.3.1.tar.gz
Algorithm	Hash digest
SHA256	`bceb42786666cd487569a284caea1fd8bd4ea708c99328832676740549cebd8c`
MD5	`e4d5ed14643642349d304b4e25c47f0e`
BLAKE2b-256	`2c21a72f0587ec28c5d1ac0fd5fa977aa926050d4704d56e52b227099748bb67`

See more details on using hashes here.

Provenance

The following attestation bundles were made for reasongraph-0.3.1.tar.gz:

Publisher: publish.yml on bgokden/reasongraph

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: reasongraph-0.3.1.tar.gz
- Subject digest: bceb42786666cd487569a284caea1fd8bd4ea708c99328832676740549cebd8c
- Sigstore transparency entry: 983922506
- Sigstore integration time: Feb 23, 2026
Source repository:
- Permalink: bgokden/reasongraph@17122991269a97df30d8bfcb0a5949b4e3857a59
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/bgokden
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@17122991269a97df30d8bfcb0a5949b4e3857a59
- Trigger Event: release

File details

Details for the file reasongraph-0.3.1-py3-none-any.whl.

File metadata

Download URL: reasongraph-0.3.1-py3-none-any.whl
Upload date: Feb 23, 2026
Size: 34.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for reasongraph-0.3.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a00e22be7f5983c7fdc086ebe777280a80d7f2ac94ab8c40228f13078fa5d3e8`
MD5	`0fc826caab60113417f177eb4571ecbb`
BLAKE2b-256	`6355f5aee164754f61eb7ec1c7cc64f90d17e49792dcb45fee6874f57192c1f1`

See more details on using hashes here.

Provenance

The following attestation bundles were made for reasongraph-0.3.1-py3-none-any.whl:

Publisher: publish.yml on bgokden/reasongraph

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: reasongraph-0.3.1-py3-none-any.whl
- Subject digest: a00e22be7f5983c7fdc086ebe777280a80d7f2ac94ab8c40228f13078fa5d3e8
- Sigstore transparency entry: 983922511
- Sigstore integration time: Feb 23, 2026
Source repository:
- Permalink: bgokden/reasongraph@17122991269a97df30d8bfcb0a5949b4e3857a59
- Branch / Tag: refs/tags/v0.3.1
- Owner: https://github.com/bgokden
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@17122991269a97df30d8bfcb0a5949b4e3857a59
- Trigger Event: release

reasongraph 0.3.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

ReasonGraph

Installation

Quick Start

Using a built-in dataset

Parsing free-form text

Async API

Cross-Source Discovery

Features

Built-in Datasets

Search Modes

Entity and Causal Extraction

Backends

SQLite Backend

PostgreSQL Backend

Evaluation: Mixed-Domain Reasoning

API Reference

ReasonGraph(backend=None, embed_model=None, rerank_model=None, forget_after=30)

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`ReasonGraph(backend=None, embed_model=None, rerank_model=None, forget_after=30)`