Natural language to Neo4j — query and build knowledge graphs automatically
Project description
GibsGraph
Natural language to Neo4j — query and build knowledge graphs automatically. Built by V.Gibson at Gibbr AB
GibsGraph connects to any Neo4j knowledge graph, auto-discovers its schema, and lets you ask questions in plain English — or build new graphs from unstructured text. It generates Cypher automatically, retrieves the relevant subgraph, and returns a grounded answer with the Cypher shown so you can verify what was queried.
What it does (v0.4.1)
- Natural language queries — ask anything about your Neo4j graph
- Intent classification — LLM-powered NL understanding extracts industry, region, regulations, and goal from free-form input
- Text-to-graph ingestion —
g.ingest("any text")extracts entities and relationships into Neo4j automatically - Post-ingest validation — checks generated graphs against Neo4j conventions (PascalCase labels, UPPER_SNAKE_CASE relationships, generic label detection)
- Schema introspection —
g.schema()returns node labels, relationship types, counts, and properties per label - Expert knowledge graph — 991 records: 36 clauses, 133 functions, 477 examples, 23 modeling patterns, 322 best practices
- Bundled expert data — works out of the box without loading data into Neo4j first (quality-filtered to ~800 records)
- Training data pipelines — 9,400 pairs across 7 domains: EUR-Lex (DORA, NIS2, MiCA, AI Act, GDPR), MITRE ATT&CK, GLEIF LEI, Hetionet, construction, expert NL-to-graph patterns
- 4-stage validation — syntactic → structural → semantic → domain, with enterprise severity levels
- Auto schema discovery — connects and learns your graph structure automatically
- PCST subgraph pruning — prunes vector neighbourhoods to the most query-relevant subset (opt-in, requires
gibsgraph[gnn]) - Dual retrieval — vector search (when index exists) with text-to-Cypher fallback
- Cypher self-healing — if generated Cypher fails, the error is sent back to the LLM for correction
- Cypher transparency — see exactly what was queried
- Visualization — Mermaid diagrams and Neo4j Bloom deep links
- Secure by default — read-only transactions, parameterized Cypher, injection validation
- LLM flexibility — OpenAI, Anthropic, Mistral, xAI/Grok — auto-detected from env keys
Planned (not yet implemented)
- G-Retriever GNN reasoning (training data ready, model pending)
Quick start
pip install gibsgraph
from gibsgraph import Graph
# Connect — auto-detects LLM from your env keys
g = Graph("bolt://localhost:7687", password="your-password")
# Ask anything
result = g.ask("What movies did Tom Hanks act in?")
print(result) # the answer
print(result.cypher) # Cypher that was run
print(result.confidence) # 0.0-1.0
print(result.visualization) # Mermaid diagram string
g.close() # or use: with Graph(...) as g:
# Ingest text into your graph
g = Graph("bolt://localhost:7687", password="your-password", read_only=False)
g.ingest("Apple acquired Beats Electronics for $3 billion in 2014.", source="news")
g.close()
Expert knowledge graph
GibsGraph ships with a curated knowledge graph of 991 Neo4j records (~800 after quality filtering): Cypher clauses, functions, query examples, modeling patterns, and best practices. This expertise is bundled as JSONL — no extra setup needed.
When you ask a question, the agent retrieves relevant expert knowledge to generate better Cypher, validate results, and avoid common mistakes. This is what separates GibsGraph from a raw LLM — it reasons with real Neo4j expertise, not just training data.
Configuration
Set environment variables (or pass them directly):
export NEO4J_URI=bolt://localhost:7687
export NEO4J_PASSWORD=your_password
export OPENAI_API_KEY=sk-... # or ANTHROPIC_API_KEY, MISTRAL_API_KEY, XAI_API_KEY
Or use a .env file — copy .env.example to get started.
Installation
pip install gibsgraph
Optional extras:
pip install "gibsgraph[mistral]" # Mistral LLM support
pip install "gibsgraph[gnn]" # PCST pruning + G-Retriever GNN
pip install "gibsgraph[ui]" # Streamlit demo UI
pip install "gibsgraph[full]" # everything including dev tools
Docker
cp .env.example .env
# Edit .env with your Neo4j password and API key
docker compose up
# Open http://localhost:8501
Architecture
User Query (natural language, any quality)
|
v
LangGraph Agent (agent.py)
+-- classify <- Intent classification (industry, region, regulations, goal)
+-- retrieval/ <- Auto schema discovery + text-to-Cypher + vector search + PCST pruning
+-- tools/ <- Cypher validator, Mermaid visualizer
+-- training/ <- 4-stage validation (syntactic/structural/semantic/domain)
+-- expert.py <- ExpertStore + BundledExpertStore (JSONL fallback)
+-- data/ <- Bundled expert JSONL (clauses, functions, examples, patterns, practices)
+-- kg_builder/ <- Text to Neo4j via SimpleKGPipeline + post-ingest validation
+-- gnn/ <- G-Retriever inference (planned)
|
v
Neo4j Knowledge Graph
Examples
| Example | Description |
|---|---|
examples/regulatory_kg.py |
EU regulatory knowledge graph (gibs.dev use case) |
Testing
299 tests, 80% coverage.
pytest # All tests
pytest tests/unit/ # Unit only
pytest tests/integration/ # Integration (requires Neo4j)
pytest --cov --cov-report=html # With coverage report
Contributing
Contributions welcome! Please read CONTRIBUTING.md first.
pip install -e ".[dev]"
ruff check src/ tests/
mypy src/gibsgraph
pytest
License
MIT — see LICENSE
Acknowledgements
Built on:
- neo4j-graphrag-python
- LangGraph
- G-Retriever (He et al., 2024)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gibsgraph-0.4.1.tar.gz.
File metadata
- Download URL: gibsgraph-0.4.1.tar.gz
- Upload date:
- Size: 392.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6535b3ade578044e5078c6840857d2225d7d554f891c311e282f3149b1929f5e
|
|
| MD5 |
da670ddff6c38072b5ff3379e13f06f5
|
|
| BLAKE2b-256 |
b8457f3a20791502ef6cd861dd4aca4b00c4c008ff9e18fd3ebc3fb263fec8cb
|
Provenance
The following attestation bundles were made for gibsgraph-0.4.1.tar.gz:
Publisher:
release.yml on gibbrdev/gibsgraph
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gibsgraph-0.4.1.tar.gz -
Subject digest:
6535b3ade578044e5078c6840857d2225d7d554f891c311e282f3149b1929f5e - Sigstore transparency entry: 1126532327
- Sigstore integration time:
-
Permalink:
gibbrdev/gibsgraph@8a8fa996dff313ce4ebbcfcce0268b711d41bad0 -
Branch / Tag:
refs/tags/v0.4.1 - Owner: https://github.com/gibbrdev
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@8a8fa996dff313ce4ebbcfcce0268b711d41bad0 -
Trigger Event:
push
-
Statement type:
File details
Details for the file gibsgraph-0.4.1-py3-none-any.whl.
File metadata
- Download URL: gibsgraph-0.4.1-py3-none-any.whl
- Upload date:
- Size: 217.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9fd545f22a007f5a51ff0485b119241988a24c4b7989282328ebdda8d4467f9b
|
|
| MD5 |
dd01f24271f6a315cfdb68b195e97c3b
|
|
| BLAKE2b-256 |
2945ab0e8a49b8fac241e189bc799c07dc6f1e66e58de9f0bd59f2d103dc1382
|
Provenance
The following attestation bundles were made for gibsgraph-0.4.1-py3-none-any.whl:
Publisher:
release.yml on gibbrdev/gibsgraph
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
gibsgraph-0.4.1-py3-none-any.whl -
Subject digest:
9fd545f22a007f5a51ff0485b119241988a24c4b7989282328ebdda8d4467f9b - Sigstore transparency entry: 1126532383
- Sigstore integration time:
-
Permalink:
gibbrdev/gibsgraph@8a8fa996dff313ce4ebbcfcce0268b711d41bad0 -
Branch / Tag:
refs/tags/v0.4.1 - Owner: https://github.com/gibbrdev
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@8a8fa996dff313ce4ebbcfcce0268b711d41bad0 -
Trigger Event:
push
-
Statement type: