A high-performance graph database library with Python bindings written in Rust

These details have not been verified by PyPI

Project links

Project description

KGLite

An embedded knowledge graph engine for Python. Import and go — no server, no setup.

For AI agents: see Using with AI Agents and kglite.pyi for type stubs.


Embedded, in-process	No server, no network; `import` and go
In-memory	Persistence via `save()`/`load()` snapshots
Cypher subset	Querying + mutations + `text_score()` for semantic search
Single-label nodes	Each node has exactly one type
Single-threaded	Designed for single-threaded use (see Threading)

Requirements: Python 3.10+ (CPython) | macOS (ARM/Intel), Linux (x86_64/aarch64), Windows (x86_64) | pandas >= 1.5

pip install kglite

Quick Start

import kglite

graph = kglite.KnowledgeGraph()

# Create nodes and relationships
graph.cypher("CREATE (:Person {name: 'Alice', age: 28, city: 'Oslo'})")
graph.cypher("CREATE (:Person {name: 'Bob', age: 35, city: 'Bergen'})")
graph.cypher("CREATE (:Person {name: 'Charlie', age: 42, city: 'Oslo'})")
graph.cypher("""
    MATCH (a:Person {name: 'Alice'}), (b:Person {name: 'Bob'})
    CREATE (a)-[:KNOWS]->(b)
""")

# Query — returns a ResultView (lazy; data stays in Rust until accessed)
result = graph.cypher("""
    MATCH (p:Person) WHERE p.age > 30
    RETURN p.name AS name, p.city AS city
    ORDER BY p.age DESC
""")
for row in result:
    print(row['name'], row['city'])

# Quick peek at first rows
result.head()      # first 5 rows (returns a new ResultView)
result.head(3)     # first 3 rows

# Or get a pandas DataFrame
df = graph.cypher("MATCH (p:Person) RETURN p.name, p.age ORDER BY p.age", to_df=True)

# Persist to disk and reload
graph.save("my_graph.kgl")
loaded = kglite.load("my_graph.kgl")

Loading Data from DataFrames

For bulk loading (thousands of rows), use the fluent API:

import pandas as pd

users_df = pd.DataFrame({
    'user_id': [1001, 1002, 1003],
    'name': ['Alice', 'Bob', 'Charlie'],
    'age': [28, 35, 42]
})

graph.add_nodes(data=users_df, node_type='User', unique_id_field='user_id', node_title_field='name')

edges_df = pd.DataFrame({'source_id': [1001, 1002], 'target_id': [1002, 1003]})
graph.add_connections(data=edges_df, connection_type='KNOWS', source_type='User',
                      source_id_field='source_id', target_type='User', target_id_field='target_id')

graph.cypher("MATCH (u:User) WHERE u.age > 30 RETURN u.name, u.age")

Core Concepts

Nodes have three built-in fields — type (label), title (display name), id (unique within type) — plus arbitrary properties. Each node has exactly one type.

Relationships connect two nodes with a type (e.g., :KNOWS) and optional properties. The Cypher API calls them "relationships"; the fluent API calls them "connections" — same thing.

Selections (fluent API) are lightweight views — a set of node indices that flow through chained operations like type_filter().filter().traverse(). They don't copy data.

Atomicity. Each cypher() call is atomic — if any clause fails, the graph remains unchanged. For multi-statement atomicity, use graph.begin() transactions. Durability only via explicit save().

How It Works

KGLite stores nodes and relationships in a Rust graph structure (petgraph). Python only sees lightweight handles — data converts to Python objects on access, not on query.

Cypher queries parse, optimize, and execute entirely in Rust, then return a ResultView (lazy — rows convert to Python dicts only when accessed)
Fluent API chains build a selection (a set of node indices) — no data is copied until you call get_nodes(), to_df(), etc.
Persistence is via save()/load() binary snapshots — there is no WAL or auto-save

Return Types

All node-related methods use a consistent key order: type, title, id, then other properties.

Cypher

Query type	Returns
Read (`MATCH...RETURN`)	`ResultView` — lazy container, rows converted on access
Read with `to_df=True`	`pandas.DataFrame`
Mutation (`CREATE`, `SET`, `DELETE`, `MERGE`)	`ResultView` with `.stats` dict
`EXPLAIN` prefix	`str` (query plan, not executed)

Spatial return types: point() values are returned as {'latitude': float, 'longitude': float} dicts.

ResultView

ResultView is a lazy result container returned by cypher(), centrality methods, get_nodes(), and sample(). Data stays in Rust and is only converted to Python objects when you access it — making cypher() calls fast even for large result sets.

result = graph.cypher("MATCH (n:Person) RETURN n.name, n.age ORDER BY n.age")

len(result)        # row count (O(1), no conversion)
result[0]          # single row as dict (converts that row only)
result[-1]         # negative indexing works

for row in result: # iterate rows as dicts (one at a time)
    print(row)

result.head()      # first 5 rows → new ResultView
result.head(3)     # first 3 rows → new ResultView
result.tail(2)     # last 2 rows → new ResultView

result.to_list()   # all rows as list[dict] (full conversion)
result.to_df()     # pandas DataFrame (full conversion)

result.columns     # column names: ['n.name', 'n.age']
result.stats       # mutation stats (None for read queries)

Because ResultView supports iteration and indexing, it works anywhere you'd use a list of dicts — existing code that iterates over cypher() results continues to work unchanged.

Node dicts

Every method that returns node data uses the same dict shape:

{'type': 'Person', 'title': 'Alice', 'id': 1, 'age': 28, 'city': 'Oslo'}
#  ^^^^             ^^^^^             ^^^       ^^^ other properties

Retrieval methods (cheapest to most expensive)

Method	Returns	Notes
`node_count()`	`int`	No materialization
`indices()`	`list[int]`	Raw graph indices
`id_values()`	`list[Any]`	Flat list of IDs
`get_ids()`	`list[{type, title, id}]`	Identification only
`get_titles()`	`list[str]`	Flat list (see below)
`get_properties(['a','b'])`	`list[tuple]`	Flat list (see below)
`get_nodes()`	`ResultView` or grouped dict	Full node dicts
`to_df()`	`DataFrame`	Columns: `type, title, id, ...props`
`get_node_by_id(type, id)`	`dict \| None`	O(1) hash lookup

Flat vs. grouped results

get_titles(), get_properties(), and get_nodes() automatically flatten when there is only one parent group (the common case). After a traversal with multiple parent groups, they return grouped dicts instead:

# No traversal (single group) → flat list
graph.type_filter('Person').get_titles()
# ['Alice', 'Bob', 'Charlie']

# After traversal (multiple groups) → grouped dict
graph.type_filter('Person').traverse('KNOWS').get_titles()
# {'Alice': ['Bob'], 'Bob': ['Charlie']}

# Override with flatten_single_parent=False to always get grouped
graph.type_filter('Person').get_titles(flatten_single_parent=False)
# {'Root': ['Alice', 'Bob', 'Charlie']}

Centrality methods

All centrality methods (pagerank, betweenness_centrality, closeness_centrality, degree_centrality) return:

Mode	Returns
Default	`ResultView` of `{type, title, id, score}` sorted by score desc
`as_dict=True`	`{id: score}` — keyed by node ID (unique per type)
`to_df=True`	`DataFrame` with columns `type, title, id, score`

API Quick Reference

Graph lifecycle

graph = kglite.KnowledgeGraph()     # create
graph.save("file.kgl")              # persist
graph = kglite.load("file.kgl")     # reload
graph.graph_info()                   # → dict with node_count, edge_count, fragmentation_ratio, ...
graph.get_schema()                   # → str summary of types and connections
graph.node_types                     # → ['Person', 'Product', ...]

Cypher (recommended for most tasks)

graph.cypher("MATCH (n:Person) RETURN n.name")                          # → ResultView
graph.cypher("MATCH (n:Person) RETURN n.name", to_df=True)              # → DataFrame
graph.cypher("MATCH (n:Person) RETURN n.name", params={'x': 1})         # parameterized
graph.cypher("CREATE (:Person {name: 'Alice'})")                        # → ResultView (.stats has counts)

Data loading (fluent API)

graph.add_nodes(data=df, node_type='T', unique_id_field='id')           # → report dict
graph.add_connections(data=df, connection_type='REL',
    source_type='A', source_id_field='src',
    target_type='B', target_id_field='tgt')                              # → report dict

Selection chain (fluent API)

graph.type_filter('Person')                        # select by type → KnowledgeGraph
    .filter({'age': {'>': 25}})                    # filter → KnowledgeGraph
    .sort('age', ascending=False)                  # sort → KnowledgeGraph
    .traverse('KNOWS', direction='outgoing')       # traverse → KnowledgeGraph
    .get_nodes()                                   # materialize → ResultView or grouped dict

Semantic search

# Text-level API (recommended) — register model once, embed & search by column name
graph.set_embedder(model)                                                    # register model (.dimension, .embed())
graph.embed_texts('Article', 'summary')                                      # embed text column → stored as summary_emb
graph.type_filter('Article').search_text('summary', 'find AI papers', top_k=10)  # text query search

# Low-level vector API — bring your own vectors
graph.set_embeddings('Article', 'summary', {id: vec, ...})             # store embeddings
graph.type_filter('Article').vector_search('summary', qvec, top_k=10)  # similarity search
graph.list_embeddings()                                                 # list all embedding stores
graph.remove_embeddings('Article', 'summary')                           # remove an embedding store
graph.get_embeddings('Article', 'summary')                              # retrieve all vectors for type
graph.type_filter('Article').get_embeddings('summary')                  # retrieve vectors for selection
graph.get_embedding('Article', 'summary', node_id)                      # single node vector (or None)
# Cypher: text_score(n, 'summary', 'query text') — semantic search in Cypher, needs set_embedder()

Introspection

graph.schema()                                # → full graph overview (types, counts, connections, indexes)
graph.connection_types()                      # → list of edge types with counts and endpoint types
graph.properties('Person')                    # → per-property stats (type, non_null, unique, values)
graph.properties('Person', max_values=50)     # → include values list for up to 50 unique values
graph.neighbors_schema('Person')              # → outgoing/incoming connection topology
graph.sample('Person', n=5)                   # → first N nodes as ResultView
graph.indexes()                               # → all indexes with type info
graph.agent_describe()                        # → XML string for LLM prompt context

Algorithms

graph.shortest_path(source_type, source_id, target_type, target_id)  # → {path, connections, length} | None
graph.all_paths(source_type, source_id, target_type, target_id)      # → list[{path, connections, length}]
graph.pagerank(top_k=10)                                             # → ResultView of {type, title, id, score}
graph.betweenness_centrality(top_k=10)                               # → ResultView of {type, title, id, score}
graph.louvain_communities()                                          # → {communities, modularity, num_communities}
graph.connected_components()                                         # → list[list[node_dict]]

Schema Introspection

Methods for exploring graph structure — what types exist, what properties they have, and how they connect. Useful for discovering an unfamiliar graph or building dynamic UIs.

`schema()` — Full graph overview

s = graph.schema()
# {
#   'node_types': {
#     'Person': {'count': 500, 'properties': {'age': 'Int64', 'city': 'String'}},
#     'Company': {'count': 50, 'properties': {'founded': 'Int64'}},
#   },
#   'connection_types': {
#     'KNOWS': {'count': 1200, 'source_types': ['Person'], 'target_types': ['Person']},
#     'WORKS_AT': {'count': 500, 'source_types': ['Person'], 'target_types': ['Company']},
#   },
#   'indexes': ['Person.city', 'Person.(city, age)'],
#   'node_count': 550,
#   'edge_count': 1700,
# }

`connection_types()` — Edge type inventory

graph.connection_types()
# [
#   {'type': 'KNOWS', 'count': 1200, 'source_types': ['Person'], 'target_types': ['Person']},
#   {'type': 'WORKS_AT', 'count': 500, 'source_types': ['Person'], 'target_types': ['Company']},
# ]

`properties(node_type, max_values=20)` — Property details

Per-property statistics for a single node type. Only properties that exist on at least one node are included. The values list is included when the unique count is at or below max_values (default 20). Set max_values=0 to never include values, or raise it to see more (e.g., max_values=100).

graph.properties('Person')
# {
#   'type':  {'type': 'str', 'non_null': 500, 'unique': 1, 'values': ['Person']},
#   'title': {'type': 'str', 'non_null': 500, 'unique': 500},
#   'id':    {'type': 'int', 'non_null': 500, 'unique': 500},
#   'city':  {'type': 'str', 'non_null': 500, 'unique': 3, 'values': ['Bergen', 'Oslo', 'Stavanger']},
#   'age':   {'type': 'int', 'non_null': 500, 'unique': 45},
#   'email': {'type': 'str', 'non_null': 250, 'unique': 250},
# }

# See all values even for higher-cardinality properties
graph.properties('Person', max_values=100)

Raises KeyError if the node type doesn't exist.

`neighbors_schema(node_type)` — Connection topology

Outgoing and incoming connections grouped by (connection type, endpoint type):

graph.neighbors_schema('Person')
# {
#   'outgoing': [
#     {'connection_type': 'KNOWS', 'target_type': 'Person', 'count': 1200},
#     {'connection_type': 'WORKS_AT', 'target_type': 'Company', 'count': 500},
#   ],
#   'incoming': [
#     {'connection_type': 'KNOWS', 'source_type': 'Person', 'count': 1200},
#   ],
# }

Raises KeyError if the node type doesn't exist.

`sample(node_type, n=5)` — Quick data peek

Returns the first N nodes of a type as a ResultView:

result = graph.sample('Person', n=3)
result[0]          # {'type': 'Person', 'title': 'Alice', 'id': 1, 'age': 28, 'city': 'Oslo'}
result.to_list()   # all rows as list[dict]
result.to_df()     # as DataFrame

Returns fewer than N if the type has fewer nodes. Raises KeyError if the node type doesn't exist.

`indexes()` — Unified index list

graph.indexes()
# [
#   {'node_type': 'Person', 'property': 'city', 'type': 'equality'},
#   {'node_type': 'Person', 'properties': ['city', 'age'], 'type': 'composite'},
# ]

`agent_describe()` — AI agent context

Returns a self-contained XML string summarizing the graph structure and supported Cypher syntax. Designed to be included directly in an LLM prompt:

xml = graph.agent_describe()
prompt = f"You have a knowledge graph:\n{xml}\nAnswer the user's question using cypher()."

The output includes:

Dynamic (per-graph): node types with counts and property schemas, connection types, indexes
Static (always the same): supported Cypher subset, key API methods, single-label model notes

Cypher Queries

A substantial Cypher subset. See the Supported Cypher Subset table for exact coverage.

Single-label note: Each node has exactly one type. labels(n) returns a string, not a list. SET n:OtherLabel is not supported.

result = graph.cypher("""
    MATCH (p:Person)-[:KNOWS]->(f:Person)
    WHERE p.age > 30 AND f.city = 'Oslo'
    RETURN p.name AS person, f.name AS friend, p.age AS age
    ORDER BY p.age DESC
    LIMIT 10
""")

# Read queries → ResultView (iterate, index, or convert)
for row in result:
    print(f"{row['person']} knows {row['friend']}")

# Pass to_df=True for a DataFrame
df = graph.cypher("MATCH (n:Person) RETURN n.name, n.age ORDER BY n.age", to_df=True)

WHERE Clause

# Comparisons: =, <>, <, >, <=, >=
graph.cypher("MATCH (n:Product) WHERE n.price >= 500 RETURN n.title, n.price")

# Boolean operators: AND, OR, NOT
graph.cypher("MATCH (n:Person) WHERE n.age > 25 AND NOT n.city = 'Oslo' RETURN n.name")

# Null checks
graph.cypher("MATCH (n:Person) WHERE n.email IS NOT NULL RETURN n.name")

# String predicates: CONTAINS, STARTS WITH, ENDS WITH
graph.cypher("MATCH (n:Person) WHERE n.name CONTAINS 'ali' RETURN n.name")

# IN lists
graph.cypher("MATCH (n:Person) WHERE n.city IN ['Oslo', 'Bergen'] RETURN n.name")

# Regex matching with =~
graph.cypher("MATCH (n:Person) WHERE n.name =~ '(?i)^ali.*' RETURN n.name")
graph.cypher("MATCH (n:Person) WHERE n.email =~ '.*@example\\.com$' RETURN n.name")

Relationship Properties

Relationships can have properties. Access them with r.property syntax:

# Create relationships with properties
graph.cypher("""
    MATCH (p:Person {name: 'Alice'}), (m:Movie {title: 'Inception'})
    CREATE (p)-[:RATED {score: 5, comment: 'Excellent'}]->(m)
""")

# Access, filter, aggregate, sort by relationship properties
graph.cypher("MATCH (p)-[r:RATED]->(m) RETURN p.name, r.score, r.comment, type(r)")
graph.cypher("MATCH (p)-[r:RATED]->(m) WHERE r.score >= 4 RETURN p.name, m.title")
graph.cypher("MATCH (p)-[r:RATED]->(m) RETURN avg(r.score) AS avg_rating")
graph.cypher("MATCH ()-[r:RATED]->(m) RETURN m.title, r.score ORDER BY r.score DESC")

Aggregation

graph.cypher("MATCH (n:Person) RETURN n.city, count(*) AS population ORDER BY population DESC")
graph.cypher("MATCH (n:Person) RETURN avg(n.age) AS avg_age, min(n.age), max(n.age)")

# DISTINCT
graph.cypher("MATCH (n:Person) RETURN DISTINCT n.city")
graph.cypher("MATCH (n:Person) RETURN count(DISTINCT n.city) AS unique_cities")

WITH Clause

graph.cypher("""
    MATCH (p:Person)-[:KNOWS]->(f:Person)
    WITH p, count(f) AS friend_count
    WHERE friend_count > 3
    RETURN p.name, friend_count
    ORDER BY friend_count DESC
""")

OPTIONAL MATCH

Left outer join — keeps rows even when no match:

graph.cypher("""
    MATCH (p:Person)
    OPTIONAL MATCH (p)-[:KNOWS]->(f:Person)
    RETURN p.name, count(f) AS friends
""")

Built-in Functions

Function	Description
`toUpper(expr)`	Convert to uppercase
`toLower(expr)`	Convert to lowercase
`toString(expr)`	Convert to string
`toInteger(expr)`	Convert to integer
`toFloat(expr)`	Convert to float
`size(expr)`	Length of string or list
`type(r)`	Relationship type
`id(n)`	Node ID
`labels(n)`	Node type (string, not list — single-label)
`coalesce(a, b, ...)`	First non-null argument
`length(p)`	Path hop count
`nodes(p)`	Nodes in a path
`relationships(p)`	Relationships in a path
`point(lat, lon)`	Create a geographic point
`distance(p1, p2)`	Haversine great-circle distance (km)
`wkt_contains(wkt, point)`	Point-in-polygon test
`wkt_intersects(wkt1, wkt2)`	Geometry intersection test
`wkt_centroid(wkt)`	Centroid of WKT geometry
`latitude(point)`	Extract latitude from point
`longitude(point)`	Extract longitude from point
`text_score(n, prop, query)`	Semantic similarity (auto-embeds query text; requires `set_embedder()`)
`text_score(n, prop, query, metric)`	With explicit metric (`'cosine'`, `'dot_product'`, `'euclidean'`)

Spatial Functions

Built-in spatial functions for geographic queries using the Haversine formula:

Function	Returns	Description
`point(lat, lon)`	Point	Create a geographic point
`distance(p1, p2)`	Float (km)	Haversine great-circle distance between two points
`distance(lat1, lon1, lat2, lon2)`	Float (km)	Haversine distance (4-arg shorthand)
`wkt_contains(wkt, point)`	Boolean	Point-in-polygon test
`wkt_contains(wkt, lat, lon)`	Boolean	Point-in-polygon (3-arg shorthand)
`wkt_intersects(wkt1, wkt2)`	Boolean	Geometry intersection test
`wkt_centroid(wkt)`	Point	Centroid of WKT geometry
`latitude(point)`	Float	Extract latitude component
`longitude(point)`	Float	Extract longitude component

# Distance filtering — cities within 100km of Oslo
graph.cypher("""
    MATCH (n:City)
    WHERE distance(point(n.latitude, n.longitude), point(59.91, 10.75)) < 100.0
    RETURN n.name, distance(n.latitude, n.longitude, 59.91, 10.75) AS dist_km
    ORDER BY dist_km
""")

# Spatial + graph traversal
graph.cypher("""
    MATCH (a:City)-[:CONNECTED_TO]->(b:City)
    WHERE distance(point(a.lat, a.lon), point(b.lat, b.lon)) < 50.0
    RETURN a.name, b.name
""")

# Point-in-polygon with WKT
graph.cypher("""
    MATCH (c:City), (a:Area)
    WHERE wkt_contains(a.geometry, point(c.latitude, c.longitude))
    RETURN c.name, a.name
""")

# Aggregation with spatial
graph.cypher("""
    MATCH (n:City)
    RETURN avg(distance(point(n.latitude, n.longitude), point(59.91, 10.75))) AS avg_dist,
           min(distance(point(n.latitude, n.longitude), point(59.91, 10.75))) AS min_dist
""")

Arithmetic

graph.cypher("MATCH (n:Product) RETURN n.title, n.price * 1.25 AS price_with_tax")

CASE Expressions

# Generic form
graph.cypher("""
    MATCH (n:Person)
    RETURN n.name,
           CASE WHEN n.age >= 18 THEN 'adult' ELSE 'minor' END AS category
""")

# Simple form
graph.cypher("""
    MATCH (n:Person)
    RETURN n.name,
           CASE n.city WHEN 'Oslo' THEN 'capital' WHEN 'Bergen' THEN 'west coast' ELSE 'other' END AS region
""")

List Comprehensions

[x IN list WHERE predicate | expression] syntax:

# Map: double each number
graph.cypher("UNWIND [1] AS _ RETURN [x IN [1, 2, 3, 4, 5] | x * 2] AS doubled")
# [2, 4, 6, 8, 10]

# Filter only
graph.cypher("UNWIND [1] AS _ RETURN [x IN [1, 2, 3, 4, 5] WHERE x > 3] AS filtered")
# [4, 5]

# Filter + map
graph.cypher("UNWIND [1] AS _ RETURN [x IN [1, 2, 3, 4, 5] WHERE x > 3 | x * 2] AS result")
# [8, 10]

# With collect() — transform aggregated values
graph.cypher("""
    MATCH (p:Person)
    WITH collect(p.name) AS names
    RETURN [x IN names | toUpper(x)] AS upper_names
""")

Note: List comprehensions require at least one row in the pipeline. Use UNWIND [1] AS _ or a preceding MATCH/WITH to provide the row context.

Map Projections

n {.prop1, .prop2, alias: expr} syntax — select specific properties from a node:

# Select only name and age (returns a dict per row)
graph.cypher("MATCH (p:Person) RETURN p {.name, .age} AS info")
# [{'info': {'name': 'Alice', 'age': 30}}, {'info': {'name': 'Bob', 'age': 25}}]

# Mix shorthand properties with computed values
graph.cypher("""
    MATCH (p:Person)-[:WORKS_AT]->(c:Company)
    RETURN p {.name, .age, company: c.name} AS info
""")

# System properties (id, type) work too
graph.cypher("MATCH (p:Person) RETURN p {.name, .type, .id} AS info LIMIT 1")
# [{'info': {'name': 'Alice', 'type': 'Person', 'id': 1}}]

Parameters

graph.cypher(
    "MATCH (n:Person) WHERE n.age > $min_age RETURN n.name, n.age",
    params={'min_age': 25}
)

# Parameters in inline pattern properties
graph.cypher(
    "MATCH (n:Person {name: $name}) RETURN n.age",
    params={'name': 'Alice'}
)

# Parameters with DataFrame output
df = graph.cypher(
    "MATCH (n:Person) WHERE n.age > $min_age RETURN n.name, n.age ORDER BY n.age",
    params={'min_age': 20}, to_df=True
)

UNWIND

Expand a list into rows:

graph.cypher("UNWIND [1, 2, 3] AS x RETURN x, x * 2 AS doubled")

UNION

graph.cypher("""
    MATCH (n:Person) WHERE n.city = 'Oslo' RETURN n.name AS name
    UNION
    MATCH (n:Person) WHERE n.age > 30 RETURN n.name AS name
""")

Variable-Length Paths

# 1 to 3 hops
graph.cypher("MATCH (a:Person)-[:KNOWS*1..3]->(b:Person) WHERE a.name = 'Alice' RETURN b.name")

# Exact 2 hops
graph.cypher("MATCH (a:Person)-[:KNOWS*2]->(b:Person) RETURN a.name, b.name")

WHERE EXISTS

Check for subpattern existence. Both brace { } and parenthesis (( )) syntax are supported:

# Brace syntax
graph.cypher("MATCH (p:Person) WHERE EXISTS { (p)-[:KNOWS]->(:Person) } RETURN p.name")

# Parenthesis syntax (equivalent)
graph.cypher("MATCH (p:Person) WHERE EXISTS((p)-[:KNOWS]->(:Person)) RETURN p.name")

# Negation
graph.cypher("""
    MATCH (p:Person)
    WHERE NOT EXISTS { (p)-[:PURCHASED]->(:Product) }
    RETURN p.name
""")

shortestPath()

BFS shortest path between two nodes. Supports directed (->) and undirected (-) syntax:

# Directed — only follows edges in their defined direction
result = graph.cypher("""
    MATCH p = shortestPath((a:Person {name: 'Alice'})-[:KNOWS*..10]->(b:Person {name: 'Dave'}))
    RETURN length(p), nodes(p), relationships(p), a.name, b.name
""")

# Undirected — traverses edges in both directions (same as fluent API)
result = graph.cypher("""
    MATCH p = shortestPath((a:Person {name: 'Alice'})-[:KNOWS*..10]-(b:Person {name: 'Dave'}))
    RETURN length(p), nodes(p), relationships(p)
""")

# No path → empty list (not an error)

Path functions: length(p) returns hop count, nodes(p) returns node list, relationships(p) returns edge type list.

CREATE / SET / DELETE / REMOVE / MERGE

# CREATE — returns ResultView with .stats
result = graph.cypher("CREATE (n:Person {name: 'Alice', age: 30, city: 'Oslo'})")
print(result.stats['nodes_created'])  # 1

# CREATE relationship between existing nodes
graph.cypher("""
    MATCH (a:Person {name: 'Alice'}), (b:Person {name: 'Bob'})
    CREATE (a)-[:KNOWS]->(b)
""")

# SET — update properties
result = graph.cypher("MATCH (n:Person {name: 'Bob'}) SET n.age = 26, n.city = 'Stavanger'")
print(result.stats['properties_set'])  # 2

# DELETE — plain DELETE errors if node has relationships; DETACH removes all
graph.cypher("MATCH (n:Person {name: 'Alice'}) DETACH DELETE n")

# REMOVE — remove properties (id/type are immutable)
graph.cypher("MATCH (n:Person {name: 'Alice'}) REMOVE n.city")

# MERGE — match or create
graph.cypher("""
    MERGE (n:Person {name: 'Alice'})
    ON CREATE SET n.created = 'today'
    ON MATCH SET n.updated = 'today'
""")

Transactions

Group multiple mutations into an atomic unit. On success, all changes apply; on exception, nothing changes.

with graph.begin() as tx:
    tx.cypher("CREATE (:Person {name: 'Alice', age: 30})")
    tx.cypher("CREATE (:Person {name: 'Bob', age: 25})")
    tx.cypher("""
        MATCH (a:Person {name: 'Alice'}), (b:Person {name: 'Bob'})
        CREATE (a)-[:KNOWS]->(b)
    """)
    # Commits automatically when the block exits normally
    # Rolls back if an exception occurs

# Manual control:
tx = graph.begin()
tx.cypher("CREATE (:Person {name: 'Charlie'})")
tx.commit()   # or tx.rollback()

DataFrame Output

df = graph.cypher("""
    MATCH (p:Person)-[:KNOWS]->(f:Person)
    WITH p, count(f) AS friends
    RETURN p.name, p.city, friends
    ORDER BY friends DESC
""", to_df=True)

EXPLAIN

Prefix any Cypher query with EXPLAIN to see the query plan without executing it:

plan = graph.cypher("""
    EXPLAIN
    MATCH (p:Person)
    OPTIONAL MATCH (p)-[:KNOWS]->(f:Person)
    WITH p, count(f) AS friends
    RETURN p.name, friends
""")
print(plan)
# Query Plan:
#   1. NodeScan (MATCH) :Person
#   2. FusedOptionalMatchAggregate (optimized OPTIONAL MATCH + count)
#   3. Projection (RETURN) [p.name, friends]
# Optimizations: optional_match_fusion=1

Supported Cypher Subset

Category	Supported
Clauses	`MATCH`, `OPTIONAL MATCH`, `WHERE`, `RETURN`, `WITH`, `ORDER BY`, `SKIP`, `LIMIT`, `UNWIND`, `UNION`/`UNION ALL`, `CREATE`, `SET`, `DELETE`, `DETACH DELETE`, `REMOVE`, `MERGE`, `EXPLAIN`
Patterns	Node `(n:Type)`, relationship `-[:REL]->`, variable-length `*1..3`, undirected `-[:REL]-`, properties `{key: val}`, `p = shortestPath(...)`
WHERE	`=`, `<>`, `<`, `>`, `<=`, `>=`, `=~` (regex), `AND`, `OR`, `NOT`, `IS NULL`, `IS NOT NULL`, `IN [...]`, `CONTAINS`, `STARTS WITH`, `ENDS WITH`, `EXISTS { pattern }`, `EXISTS(( pattern ))`
RETURN	`n.prop`, `r.prop`, `AS` aliases, `DISTINCT`, arithmetic `+`/`-`/`*`/`/`, map projections `n {.prop1, .prop2}`
Aggregation	`count(*)`, `count(expr)`, `sum`, `avg`/`mean`, `min`, `max`, `collect`, `std`
Expressions	`CASE WHEN...THEN...ELSE...END`, `$param`, `[x IN list WHERE ... \| expr]`
Functions	`toUpper`, `toLower`, `toString`, `toInteger`, `toFloat`, `size`, `length`, `type`, `id`, `labels`, `coalesce`, `nodes(p)`, `relationships(p)`
Spatial	`point(lat, lon)`, `distance(p1, p2)`, `wkt_contains(wkt, point)`, `wkt_intersects(wkt1, wkt2)`, `wkt_centroid(wkt)`, `latitude(point)`, `longitude(point)`
Semantic	`text_score(n, prop, query [, metric])` — auto-embeds query via `set_embedder()`, cosine/dot_product/euclidean
Mutations	`CREATE (n:Label {props})`, `CREATE (a)-[:TYPE]->(b)`, `SET n.prop = expr`, `DELETE`, `DETACH DELETE`, `REMOVE n.prop`, `MERGE ... ON CREATE SET ... ON MATCH SET`
Not supported	`CALL`/stored procedures, `FOREACH`, subqueries, `SET n:Label` (label mutation), `REMOVE n:Label`, multi-label

Fluent API: Data Loading

For most use cases, use Cypher queries. The fluent API is for bulk operations from DataFrames or complex data pipelines.

Adding Nodes

products_df = pd.DataFrame({
    'product_id': [101, 102, 103],
    'title': ['Laptop', 'Phone', 'Tablet'],
    'price': [999.99, 699.99, 349.99],
    'stock': [45, 120, 30]
})

report = graph.add_nodes(
    data=products_df,
    node_type='Product',
    unique_id_field='product_id',
    node_title_field='title',
    columns=['product_id', 'title', 'price', 'stock'],       # whitelist columns (None = all)
    column_types={'launch_date': 'datetime'},                  # explicit type hints
    conflict_handling='update'  # 'update' | 'replace' | 'skip' | 'preserve'
)
print(f"Created {report['nodes_created']} nodes in {report['processing_time_ms']}ms")

Property Mapping

When adding nodes, unique_id_field and node_title_field are mapped to id and title. The original column names become aliases — they work in Cypher queries and filter(), but results always use the canonical names.

Your DataFrame Column	Stored As	Alias?
`unique_id_field` (e.g., `user_id`)	`id`	`n.user_id` resolves to `n.id`
`node_title_field` (e.g., `name`)	`title`	`n.name` resolves to `n.title`
All other columns	Same name	—

# After adding with unique_id_field='user_id', node_title_field='name':
graph.cypher("MATCH (u:User) WHERE u.user_id = 1001 RETURN u")  # OK — alias resolves to id
graph.type_filter('User').filter({'user_id': 1001})              # OK — alias works here too
graph.type_filter('User').filter({'id': 1001})                   # Also OK — canonical name

# Results always use canonical names:
# {'id': 1001, 'title': 'Alice', 'type': 'User', ...}  — NOT 'user_id' or 'name'

Creating Connections

purchases_df = pd.DataFrame({
    'user_id': [1001, 1001, 1002],
    'product_id': [101, 103, 102],
    'date': ['2023-01-15', '2023-02-10', '2023-01-20'],
    'quantity': [1, 2, 1]
})

graph.add_connections(
    data=purchases_df,
    connection_type='PURCHASED',
    source_type='User',
    source_id_field='user_id',
    target_type='Product',
    target_id_field='product_id',
    columns=['date', 'quantity']
)

source_type and target_type each refer to a single node type. To connect nodes of the same type, set both to the same value (e.g., source_type='Person', target_type='Person').

Working with Dates

graph.add_nodes(
    data=estimates_df,
    node_type='Estimate',
    unique_id_field='estimate_id',
    node_title_field='name',
    column_types={'valid_from': 'datetime', 'valid_to': 'datetime'}
)

graph.type_filter('Estimate').filter({'valid_from': {'>=': '2020-06-01'}})
graph.type_filter('Estimate').valid_at('2020-06-15')
graph.type_filter('Estimate').valid_during('2020-01-01', '2020-06-30')

Batch Property Updates

result = graph.type_filter('Prospect').filter({'status': 'Inactive'}).update({
    'is_active': False,
    'deactivation_reason': 'status_inactive'
})

updated_graph = result['graph']
print(f"Updated {result['nodes_updated']} nodes")

Operation Reports

Operations that modify the graph return detailed reports:

report = graph.add_nodes(data=df, node_type='Product', unique_id_field='product_id')
# report keys: operation, timestamp, nodes_created, nodes_updated, nodes_skipped,
#              processing_time_ms, has_errors, errors

graph.get_last_report()       # most recent operation report
graph.get_operation_index()   # sequential index of last operation
graph.get_report_history()    # all reports

Fluent API: Querying

For most queries, prefer Cypher. The fluent API is for building reusable query chains or when you need explain() and selection-based workflows.

Filtering

graph.type_filter('Product').filter({'price': 999.99})
graph.type_filter('Product').filter({'price': {'<': 500.0}, 'stock': {'>': 50}})
graph.type_filter('Product').filter({'id': {'in': [101, 103]}})
graph.type_filter('Product').filter({'category': {'is_null': True}})

# Orphan nodes (no connections)
graph.filter_orphans(include_orphans=True)

Sorting

graph.type_filter('Product').sort('price')
graph.type_filter('Product').sort('price', ascending=False)
graph.type_filter('Product').sort([('stock', False), ('price', True)])

Traversing the Graph

alice = graph.type_filter('User').filter({'title': 'Alice'})
alice_products = alice.traverse(connection_type='PURCHASED', direction='outgoing')

# Filter and sort traversal targets
expensive = alice.traverse(
    connection_type='PURCHASED',
    filter_target={'price': {'>=': 500.0}},
    sort_target='price',
    max_nodes=10
)

# Get connection information
alice.get_connections(include_node_properties=True)

Set Operations

n3 = graph.type_filter('Prospect').filter({'geoprovince': 'N3'})
m3 = graph.type_filter('Prospect').filter({'geoprovince': 'M3'})

n3.union(m3)                    # all nodes from both (OR)
n3.intersection(m3)             # nodes in both (AND)
n3.difference(m3)               # nodes in n3 but not m3
n3.symmetric_difference(m3)     # nodes in exactly one (XOR)

Retrieving Results

people = graph.type_filter('Person')

# Lightweight (no property materialization)
people.node_count()                     # → 3
people.indices()                        # → [0, 1, 2]
people.id_values()                      # → [1, 2, 3]

# Medium (partial materialization)
people.get_ids()                        # → [{'type': 'Person', 'title': 'Alice', 'id': 1}, ...]
people.get_titles()                     # → ['Alice', 'Bob', 'Charlie']
people.get_properties(['age', 'city'])  # → [(28, 'Oslo'), (35, 'Bergen'), (42, 'Oslo')]

# Full materialization
people.get_nodes()                      # → [{'type': 'Person', 'title': 'Alice', 'id': 1, 'age': 28, ...}, ...]
people.to_df()                          # → DataFrame with columns type, title, id, age, city, ...

# Single node lookup (O(1))
graph.get_node_by_id('Person', 1)       # → {'type': 'Person', 'title': 'Alice', ...} or None

Debugging Selections

result = graph.type_filter('User').filter({'id': 1001})
print(result.explain())
# TYPE_FILTER User (1000 nodes) -> FILTER (1 nodes)

Pattern Matching

For simpler pattern-based queries without full Cypher clause support:

results = graph.match_pattern(
    '(p:Play)-[:HAS_PROSPECT]->(pr:Prospect)-[:BECAME_DISCOVERY]->(d:Discovery)'
)

for match in results:
    print(f"Play: {match['p']['title']}, Discovery: {match['d']['title']}")

# With property conditions
graph.match_pattern('(u:User)-[:PURCHASED]->(p:Product {category: "Electronics"})')

# Limit results for large graphs
graph.match_pattern('(a:Person)-[:KNOWS]->(b:Person)', max_matches=100)

Graph Algorithms

Shortest Path

result = graph.shortest_path(source_type='Person', source_id=1, target_type='Person', target_id=100)
if result:
    for node in result["path"]:
        print(f"{node['type']}: {node['title']}")
    print(f"Connections: {result['connections']}")
    print(f"Path length: {result['length']}")

Lightweight variants when you don't need full path data:

graph.shortest_path_length(...)    # → int | None (hop count only)
graph.shortest_path_ids(...)       # → list[id] | None (node IDs along path)
graph.shortest_path_indices(...)   # → list[int] | None (raw graph indices, fastest)

All path methods support connection_types, via_types, and timeout_ms for filtering and safety.

Batch variant for computing many distances at once:

distances = graph.shortest_path_lengths_batch('Person', [(1, 5), (2, 8), (3, 10)])
# → [2, None, 5]  (None where no path exists, same order as input)

Much faster than calling shortest_path_length in a loop — builds the adjacency list once.

All Paths

paths = graph.all_paths(
    source_type='Play', source_id=1,
    target_type='Wellbore', target_id=100,
    max_hops=4,
    max_results=100  # Prevent OOM on dense graphs
)

Connected Components

components = graph.connected_components()
# Returns list of lists: [[node_dicts...], [node_dicts...], ...]
print(f"Found {len(components)} connected components")
print(f"Largest component: {len(components[0])} nodes")

graph.are_connected(source_type='Person', source_id=1, target_type='Person', target_id=100)

Centrality Algorithms

All centrality methods return a ResultView of {type, title, id, score} rows, sorted by score descending.

graph.betweenness_centrality(top_k=10)
graph.betweenness_centrality(normalized=True, sample_size=500)
graph.pagerank(top_k=10, damping_factor=0.85)
graph.degree_centrality(top_k=10)
graph.closeness_centrality(top_k=10)

# Alternative output formats
graph.pagerank(as_dict=True)      # → {1: 0.45, 2: 0.32, ...} (keyed by id)
graph.pagerank(to_df=True)        # → DataFrame with type, title, id, score columns

Community Detection

# Louvain modularity optimization (recommended)
result = graph.louvain_communities()
# {'communities': {0: [{type, title, id}, ...], 1: [...]},
#  'modularity': 0.45, 'num_communities': 2}

for comm_id, members in result['communities'].items():
    names = [m['title'] for m in members]
    print(f"Community {comm_id}: {names}")

# With edge weights and resolution tuning
result = graph.louvain_communities(weight_property='strength', resolution=1.5)

# Label propagation (faster, less precise)
result = graph.label_propagation(max_iterations=100)

Node Degrees

degrees = graph.type_filter('Person').get_degrees()
# Returns: {'Alice': 5, 'Bob': 3, ...}

Semantic Search

Store embedding vectors alongside nodes and query them with fast similarity search. Embeddings are stored separately from node properties — they don't appear in get_nodes(), to_df(), or regular Cypher property access.

Text-Level API (Recommended)

Register an embedding model once, then embed and search using text column names. The model runs on the Python side — KGLite only stores the resulting vectors.

from sentence_transformers import SentenceTransformer

class Embedder:
    def __init__(self, model_name="all-MiniLM-L6-v2"):
        self._model_name = model_name
        self._model = None
        self._timer = None
        self.dimension = 384  # set in load() if unknown

    def load(self):
        """Called automatically before embedding. Loads model on demand."""
        import threading
        if self._timer:
            self._timer.cancel()
            self._timer = None
        if self._model is None:
            self._model = SentenceTransformer(self._model_name)
            self.dimension = self._model.get_sentence_embedding_dimension()

    def unload(self, cooldown=60):
        """Called automatically after embedding. Releases after cooldown."""
        import threading
        def _release():
            self._model = None
            self._timer = None
        self._timer = threading.Timer(cooldown, _release)
        self._timer.start()

    def embed(self, texts: list[str]) -> list[list[float]]:
        return self._model.encode(texts).tolist()

# Register once on the graph
graph.set_embedder(Embedder())

# Embed a text column — stores vectors as "summary_emb" automatically
graph.embed_texts("Article", "summary")
# Embedding Article.summary: 100%|████████| 1000/1000 [00:05<00:00]
# → {'embedded': 1000, 'skipped': 3, 'skipped_existing': 0, 'dimension': 384}

# Search with text — resolves "summary" → "summary_emb" internally
results = graph.type_filter("Article").search_text("summary", "machine learning", top_k=10)
# [{'id': 42, 'title': '...', 'type': 'Article', 'score': 0.95, ...}, ...]

Key details:

Auto-naming: text column "summary" → embedding store key "summary_emb" (auto-derived)
Incremental: re-running embed_texts skips nodes that already have embeddings — only new nodes get embedded. Pass replace=True to force re-embed.
Progress bar: shows a tqdm progress bar by default. Disable with show_progress=False.
Load/unload lifecycle: if the model has optional load() / unload() methods, they are called automatically before and after each embedding operation. Use this to load on demand and release after a cooldown.
Not serialized: the model is not saved with save() — call set_embedder() again after deserializing.

# Add new articles, then re-embed — only new ones are processed
graph.embed_texts("Article", "summary")
# → {'embedded': 50, 'skipped': 0, 'skipped_existing': 1000, ...}

# Force full re-embed
graph.embed_texts("Article", "summary", replace=True)

# Combine with filters
results = (graph
    .type_filter("Article")
    .filter({"category": "politics"})
    .search_text("summary", "foreign policy", top_k=10))

Calling embed_texts() or search_text() without set_embedder() raises an error with a full skeleton showing the required model interface.

Storing Embeddings (Low-Level)

If you manage vectors yourself, use the low-level API:

# Explicit: pass a dict of {node_id: vector}
graph.set_embeddings('Article', 'summary', {
    1: [0.1, 0.2, 0.3, ...],
    2: [0.4, 0.5, 0.6, ...],
})

# Or auto-detect during add_nodes with column_types
df = pd.DataFrame({
    'id': [1, 2, 3],
    'title': ['A', 'B', 'C'],
    'text_emb': [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]],
})
graph.add_nodes(df, 'Doc', 'id', 'title', column_types={'text_emb': 'embedding'})

Vector Search (Low-Level)

Search operates on the current selection — combine with type_filter() and filter() for scoped queries:

# Basic search — returns list of dicts sorted by similarity
results = graph.type_filter('Article').vector_search('summary', query_vec, top_k=10)
# [{'id': 5, 'title': '...', 'type': 'Article', 'score': 0.95, ...}, ...]
# 'score' is always included: cosine similarity [-1,1], dot_product, or negative euclidean distance

# Filtered search — only search within a subset
results = (graph
    .type_filter('Article')
    .filter({'category': 'politics'})
    .vector_search('summary', query_vec, top_k=10))

# DataFrame output
df = graph.type_filter('Article').vector_search('summary', query_vec, top_k=10, to_df=True)

# Distance metrics: 'cosine' (default), 'dot_product', 'euclidean'
results = graph.type_filter('Article').vector_search(
    'summary', query_vec, top_k=10, metric='dot_product')

Semantic Search in Cypher

text_score() enables semantic search directly in Cypher queries. It automatically embeds the query text using the registered model (via set_embedder()) and computes similarity:

# Requires: set_embedder() + embed_texts()
graph.cypher("""
    MATCH (n:Article)
    RETURN n.title, text_score(n, 'summary', 'machine learning') AS score
    ORDER BY score DESC LIMIT 10
""")

# With parameters
graph.cypher("""
    MATCH (n:Article)
    WHERE text_score(n, 'summary', $query) > 0.8
    RETURN n.title
""", params={'query': 'artificial intelligence'})

# Combine with graph filters
graph.cypher("""
    MATCH (n:Article)-[:CITED_BY]->(m:Article)
    WHERE n.category = 'politics'
    RETURN m.title, text_score(m, 'summary', 'foreign policy') AS score
    ORDER BY score DESC LIMIT 5
""")

Embedding Utilities

graph.list_embeddings()
# [{'node_type': 'Article', 'text_column': 'summary', 'dimension': 384, 'count': 1000}]

graph.remove_embeddings('Article', 'summary')

# Retrieve all embeddings for a type (no selection needed)
embs = graph.get_embeddings('Article', 'summary')
# {1: [0.1, 0.2, ...], 2: [0.4, 0.5, ...], ...}

# Retrieve embeddings for current selection only
embs = graph.type_filter('Article').filter({'category': 'politics'}).get_embeddings('summary')

# Get a single node's embedding (O(1) lookup, returns None if not found)
vec = graph.get_embedding('Article', 'summary', node_id)

Embeddings persist across save()/load() cycles automatically.

Spatial Operations

Spatial queries are also available in Cypher via point(), distance(), wkt_contains(), wkt_intersects(), and wkt_centroid(). See Spatial Functions.

Bounding Box

graph.type_filter('Discovery').within_bounds(
    lat_field='latitude', lon_field='longitude',
    min_lat=58.0, max_lat=62.0, min_lon=1.0, max_lon=5.0
)

Distance Queries (Haversine)

graph.type_filter('Wellbore').near_point_km(
    center_lat=60.5, center_lon=3.2, max_distance_km=50.0,
    lat_field='latitude', lon_field='longitude'
)

WKT Geometry Intersection

graph.type_filter('Field').intersects_geometry(
    'POLYGON((1 58, 5 58, 5 62, 1 62, 1 58))',
    geometry_field='wkt_geometry'
)

Point-in-Polygon

graph.type_filter('Block').contains_point(lat=60.5, lon=3.2, geometry_field='wkt_geometry')

Analytics

Statistics

price_stats = graph.type_filter('Product').statistics('price')
unique_cats = graph.type_filter('Product').unique_values(property='category', max_length=10)

Calculations

graph.type_filter('Product').calculate(expression='price * 1.1', store_as='price_with_tax')

graph.type_filter('User').traverse('PURCHASED').calculate(
    expression='sum(price * quantity)', store_as='total_spent'
)

graph.type_filter('User').traverse('PURCHASED').count(store_as='product_count', group_by_parent=True)

Connection Aggregation

graph.type_filter('Discovery').traverse('EXTENDS_INTO').calculate(
    expression='sum(share_pct)',
    aggregate_connections=True
)

Supported: sum, avg/mean, min, max, count, std.

Schema and Indexes

Schema Definition

graph.define_schema({
    'nodes': {
        'Prospect': {
            'required': ['npdid_prospect', 'prospect_name'],
            'optional': ['prospect_status'],
            'types': {'npdid_prospect': 'integer', 'prospect_name': 'string'}
        }
    },
    'connections': {
        'HAS_ESTIMATE': {'source': 'Prospect', 'target': 'ProspectEstimate'}
    }
})

errors = graph.validate_schema()
schema = graph.get_schema()

Indexes

Two index types:

Method	Accelerates	Use for
`create_index()`	Equality (`= value`)	Exact lookups
`create_range_index()`	Range (`>`, `<`, `>=`, `<=`)	Numeric/date filtering

Both also accelerate Cypher WHERE clauses. Composite indexes support multi-property equality.

graph.create_index('Prospect', 'prospect_geoprovince')        # equality index
graph.create_range_index('Person', 'age')                      # B-Tree range index
graph.create_composite_index('Person', ['city', 'age'])        # composite equality

graph.list_indexes()
graph.drop_index('Prospect', 'prospect_geoprovince')

Indexes are maintained automatically by all mutation operations.

Import and Export

Saving and Loading

graph.save("my_graph.kgl")
loaded_graph = kglite.load("my_graph.kgl")

Portability: Save files use bincode serialization and are not guaranteed portable across OS, CPU architecture, or library versions. Always re-export via a portable format (GraphML, CSV) when sharing across machines.

Export Formats

graph.export('my_graph.graphml', format='graphml')  # Gephi, yEd
graph.export('my_graph.gexf', format='gexf')        # Gephi native
graph.export('my_graph.json', format='d3')           # D3.js
graph.export('my_graph.csv', format='csv')           # creates _nodes.csv + _edges.csv

graphml_string = graph.export_string(format='graphml')

Subgraph Extraction

subgraph = (
    graph.type_filter('Company')
    .filter({'title': 'Acme Corp'})
    .expand(hops=2)
    .to_subgraph()
)
subgraph.export('acme_network.graphml', format='graphml')

Performance

Tips

Batch operations — add nodes/connections in batches, not individually
Specify columns — only include columns you need to reduce memory
Filter by type first — type_filter() before filter() for narrower scans
Create indexes — on frequently filtered equality conditions (~3x on 100k+ nodes)
Use lightweight methods — node_count(), indices(), get_node_by_id() skip property materialization
Cypher LIMIT — use LIMIT to avoid scanning entire result sets

Threading

The Python GIL is released during heavy Rust operations, allowing other Python threads to run concurrently:

Operation	GIL Released?	Notes
`save()`	Yes	Serialization + compression + file write
`load()`	Yes	File read + decompression + deserialization
`cypher()` (reads)	Yes	Query parsing, optimization, and execution
`vector_search()`	Yes	Similarity computation (uses rayon internally)
`search_text()`	Partial	Model embedding needs GIL; vector search releases it
`add_nodes()`	No	DataFrame conversion requires GIL throughout
`cypher()` (mutations)	No	Must hold exclusive lock on graph

For concurrent access from multiple threads, mutations (add_nodes, CREATE/SET/DELETE Cypher) require external synchronization. Read-only operations (cypher reads, vector_search, save) can run while other Python threads execute.

Common Gotchas

Single-label only. Each node has exactly one type. labels(n) returns a string, not a list. SET n:OtherLabel is not supported.
id and title are canonical. add_nodes(unique_id_field='user_id') stores the column as id. The original name works as an alias in Cypher (n.user_id resolves to n.id), but results always return canonical names (id, title).
Save files aren't portable. The binary format (bincode) may differ across OS, CPU architecture, or library versions. Use export() (GraphML, CSV) for sharing across machines.
Indexes: create_index() accelerates equality only (=). For range queries (>, <, >=, <=), use create_range_index().
Flat vs. grouped results. After traversal with multiple parents, get_titles(), get_nodes(), and get_properties() return grouped dicts instead of flat lists. Use flatten_single_parent=False to always get grouped output.
No auto-persistence. The graph lives in memory. save() is manual — crashes lose unsaved work.

Using with AI Agents

KGLite is designed to work as a self-contained knowledge layer for AI agents. No external database, no server process, no network — just a Python object with a Cypher interface that an agent can query directly.

The idea

Load or build a graph from your data (DataFrames, CSVs, APIs)
Give the agent agent_describe() — a single XML string containing the full schema, Cypher reference, property values, and embedding info
The agent writes Cypher queries using graph.cypher() — no other API to learn
Semantic search works natively — text_score() in Cypher, backed by any embedding model you wrap

No vector database, no graph database, no infrastructure. The graph lives in memory and persists to a single .kgl file.

Quick setup

xml = graph.agent_describe()  # schema + Cypher reference + property values as XML
prompt = f"You have a knowledge graph:\n{xml}\nAnswer the user's question using graph.cypher()."

MCP server

Expose the graph to any MCP-compatible agent (Claude, etc.) with a thin server:

from mcp.server.fastmcp import FastMCP
import kglite

graph = kglite.load("my_graph.kgl")
mcp = FastMCP("knowledge-graph")

@mcp.tool()
def describe() -> str:
    """Get the graph schema and Cypher reference."""
    return graph.agent_describe()

@mcp.tool()
def query(cypher: str) -> str:
    """Run a Cypher query and return results."""
    result = graph.cypher(cypher, to_df=True)
    return result.to_markdown()

mcp.run(transport="stdio")

The agent calls describe() once to learn the schema, then uses query() for everything — traversals, aggregations, filtering, and semantic search via text_score().

Semantic search in agent workflows

With an embedding model registered, agents can do semantic search directly in Cypher:

# Wrap any local or remote model — only needs .dimension and .embed()
class OpenAIEmbedder:
    dimension = 1536
    def embed(self, texts: list[str]) -> list[list[float]]:
        response = client.embeddings.create(input=texts, model="text-embedding-3-small")
        return [e.embedding for e in response.data]

graph.set_embedder(OpenAIEmbedder())
graph.embed_texts("Article", "summary")  # one-time: vectorize all articles

# Now agents can use text_score() in Cypher — no extra API needed
graph.cypher("""
    MATCH (a:Article)
    WHERE text_score(a, 'summary', 'climate policy') > 0.5
    RETURN a.title, text_score(a, 'summary', 'climate policy') AS score
    ORDER BY score DESC LIMIT 10
""")

The model wrapper pattern works with any provider (OpenAI, Cohere, local sentence-transformers, Ollama) — see the Semantic Search section for a full load/unload lifecycle example.

Tips for agent prompts

Start with agent_describe() — gives the agent schema, types, property names with sample values, counts, and full Cypher syntax in one XML string
Use properties(type) for deeper column discovery — shows types, nullability, unique counts, and sample values
Use sample(type, n=3) before writing queries — lets the agent see real data shapes
Prefer Cypher over the fluent API in agent contexts — closer to natural language, easier for LLMs to generate
Use parameters (params={'x': val}) to prevent injection when passing user input to queries
ResultView is lazy — agents can call len(result) to check row count without converting all rows

What `agent_describe()` returns

Dynamic (per-graph): node types with counts, property names/types/sample values, connection types with endpoints, indexes, field aliases, embedding stores
Static (always the same): supported Cypher clauses, WHERE operators, functions (including spatial and semantic), mutation syntax, notes

Graph Maintenance

After heavy mutation workloads (DELETE, REMOVE), internal storage accumulates tombstones. Monitor with graph_info().

info = graph.graph_info()
# {'node_count': 950, 'node_capacity': 1000, 'node_tombstones': 50,
#  'edge_count': 2800, 'fragmentation_ratio': 0.05,
#  'type_count': 3, 'property_index_count': 2, 'composite_index_count': 0}

if info['fragmentation_ratio'] > 0.3:
    result = graph.vacuum()
    print(f"Reclaimed {result['tombstones_removed']} slots, remapped {result['nodes_remapped']} nodes")

vacuum() rebuilds the graph with contiguous indices and rebuilds all indexes. Resets the current selection — call between query chains.

reindex() rebuilds indexes only. Recovery tool, not routine maintenance — indexes are maintained automatically by all mutations.

Common Recipes

Upsert with MERGE

graph.cypher("""
    MERGE (p:Person {email: 'alice@example.com'})
    ON CREATE SET p.created = '2024-01-01', p.name = 'Alice'
    ON MATCH SET p.last_seen = '2024-01-15'
""")

Top-K Nodes by Centrality

top_nodes = graph.pagerank(top_k=10)
for node in top_nodes:
    print(f"{node['title']}: {node['score']:.3f}")

2-Hop Neighborhood

graph.cypher("""
    MATCH (me:Person {name: 'Alice'})-[:KNOWS*2]-(fof:Person)
    WHERE fof <> me
    RETURN DISTINCT fof.name
""")

Export Subgraph

subgraph = (
    graph.type_filter('Person')
    .filter({'name': 'Alice'})
    .expand(hops=2)
    .to_subgraph()
)
subgraph.export('alice_network.graphml', format='graphml')

Parameterized Queries

graph.cypher(
    "MATCH (p:Person) WHERE p.city = $city AND p.age > $min_age RETURN p.name",
    params={'city': 'Oslo', 'min_age': 25}
)

Delete Subgraph

graph.cypher("""
    MATCH (u:User) WHERE u.status = 'inactive'
    DETACH DELETE u
""")

Aggregation with Relationship Properties

graph.cypher("""
    MATCH (p:Person)-[r:RATED]->(m:Movie)
    RETURN p.name, avg(r.score) AS avg_rating, count(m) AS movies_rated
    ORDER BY avg_rating DESC
""")

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.9.9

May 7, 2026

0.9.8

May 7, 2026

0.9.7

May 4, 2026

0.9.6

May 3, 2026

0.9.5

May 2, 2026

0.9.4

May 2, 2026

0.9.3

May 2, 2026

0.9.2

May 2, 2026

0.9.0

May 2, 2026

0.8.41

May 1, 2026

0.8.40

May 1, 2026

0.8.39

May 1, 2026

0.8.38

May 1, 2026

0.8.37

May 1, 2026

0.8.36

May 1, 2026

0.8.35

May 1, 2026

0.8.34

Apr 30, 2026

0.8.33

Apr 30, 2026

0.8.32

Apr 30, 2026

0.8.31

Apr 30, 2026

0.8.30

Apr 30, 2026

0.8.29

Apr 30, 2026

0.8.28

Apr 30, 2026

0.8.27

Apr 29, 2026

0.8.26

Apr 28, 2026

0.8.25

Apr 27, 2026

0.8.24

Apr 27, 2026

0.8.23

Apr 27, 2026

0.8.22

Apr 27, 2026

0.8.21

Apr 27, 2026

0.8.20

Apr 27, 2026

0.8.19

Apr 26, 2026

0.8.18

Apr 26, 2026

0.8.17

Apr 26, 2026

0.8.16

Apr 25, 2026

0.8.15

Apr 25, 2026

0.8.14

Apr 24, 2026

0.8.12

Apr 23, 2026

0.8.11

Apr 22, 2026

0.8.10

Apr 20, 2026

0.8.9

Apr 20, 2026

0.8.8

Apr 20, 2026

0.8.7

Apr 19, 2026

0.8.6

Apr 19, 2026

0.8.5

Apr 19, 2026

0.8.4

Apr 19, 2026

0.8.3

Apr 19, 2026

0.8.2

Apr 19, 2026

0.8.1

Apr 19, 2026

0.8.0

Apr 18, 2026

0.7.17

Apr 17, 2026

0.7.16

Apr 17, 2026

0.7.15

Apr 17, 2026

0.7.14

Apr 16, 2026

0.7.12

Apr 16, 2026

0.7.11

Apr 16, 2026

0.7.10

Apr 15, 2026

0.7.9

Apr 15, 2026

0.7.8

Apr 15, 2026

0.7.7

Apr 15, 2026

0.7.6

Apr 12, 2026

0.7.5

Apr 10, 2026

0.7.4

Apr 8, 2026

0.7.3

Apr 8, 2026

0.7.2

Apr 6, 2026

0.7.1

Apr 6, 2026

0.7.0

Apr 4, 2026

0.6.18

Mar 30, 2026

0.6.16

Mar 30, 2026

0.6.15

Mar 30, 2026

0.6.14

Mar 30, 2026

0.6.12

Mar 30, 2026

0.6.11

Mar 29, 2026

0.6.10

Mar 29, 2026

0.6.9

Mar 22, 2026

0.6.8

Mar 19, 2026

0.6.7

Mar 18, 2026

0.6.6

Mar 18, 2026

0.6.5

Mar 18, 2026

0.6.4

Mar 18, 2026

0.6.2

Mar 9, 2026

0.5.89

Mar 6, 2026

0.5.88

Mar 4, 2026

0.5.87

Mar 3, 2026

0.5.86

Mar 3, 2026

0.5.85

Mar 3, 2026

0.5.83

Mar 3, 2026

0.5.82

Mar 3, 2026

0.5.81

Mar 2, 2026

0.5.80

Mar 2, 2026

0.5.79

Mar 2, 2026

0.5.78

Mar 2, 2026

0.5.77

Mar 1, 2026

0.5.76

Mar 1, 2026

0.5.75

Mar 1, 2026

0.5.74

Mar 1, 2026

0.5.73

Feb 27, 2026

0.5.72

Feb 27, 2026

0.5.71

Feb 27, 2026

0.5.70

Feb 26, 2026

0.5.69

Feb 26, 2026

0.5.68

Feb 26, 2026

0.5.67

Feb 26, 2026

0.5.66

Feb 26, 2026

0.5.65

Feb 26, 2026

0.5.64

Feb 25, 2026

0.5.63

Feb 25, 2026

0.5.62

Feb 25, 2026

0.5.61

Feb 24, 2026

0.5.60

Feb 24, 2026

0.5.59

Feb 24, 2026

0.5.58

Feb 24, 2026

0.5.56

Feb 23, 2026

0.5.55

Feb 23, 2026

0.5.54

Feb 23, 2026

0.5.53

Feb 22, 2026

0.5.52

Feb 22, 2026

0.5.51

Feb 21, 2026

0.5.50

Feb 21, 2026

0.5.49

Feb 20, 2026

0.5.48

Feb 20, 2026

0.5.47

Feb 20, 2026

0.5.46

Feb 20, 2026

0.5.45

Feb 20, 2026

0.5.44

Feb 20, 2026

0.5.43

Feb 19, 2026

0.5.42

Feb 19, 2026

0.5.41

Feb 19, 2026

0.5.40

Feb 19, 2026

0.5.39

Feb 19, 2026

0.5.38

Feb 19, 2026

0.5.37

Feb 19, 2026

0.5.36

Feb 18, 2026

0.5.35

Feb 18, 2026

0.5.34

Feb 18, 2026

0.5.33

Feb 18, 2026

0.5.32

Feb 18, 2026

0.5.31

Feb 17, 2026

0.5.30

Feb 17, 2026

0.5.29

Feb 17, 2026

0.5.28

Feb 17, 2026

0.5.27

Feb 17, 2026

0.5.26

Feb 16, 2026

0.5.25

Feb 15, 2026

0.5.24

Feb 15, 2026

0.5.23

Feb 15, 2026

0.5.22

Feb 15, 2026

0.5.21

Feb 15, 2026

0.5.20

Feb 14, 2026

0.5.19

Feb 14, 2026

This version

0.5.18

Feb 14, 2026

0.5.17

Feb 14, 2026

0.5.16

Feb 13, 2026

0.5.15

Feb 13, 2026

0.5.14

Feb 12, 2026

0.5.13

Feb 12, 2026

0.5.11

Feb 12, 2026

0.5.10

Feb 12, 2026

0.5.9

Feb 11, 2026

0.5.8

Feb 11, 2026

0.5.7

Feb 11, 2026

0.5.6

Feb 11, 2026

0.5.5

Feb 11, 2026

0.5.4

Feb 11, 2026

0.5.2

Feb 11, 2026

0.5.1

Feb 11, 2026

0.5.0

Feb 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kglite-0.5.18-cp313-cp313-win_amd64.whl (2.0 MB view details)

Uploaded Feb 14, 2026 CPython 3.13Windows x86-64

kglite-0.5.18-cp313-cp313-manylinux_2_35_x86_64.whl (2.4 MB view details)

Uploaded Feb 14, 2026 CPython 3.13manylinux: glibc 2.35+ x86-64

kglite-0.5.18-cp313-cp313-macosx_11_0_arm64.whl (2.1 MB view details)

Uploaded Feb 14, 2026 CPython 3.13macOS 11.0+ ARM64

kglite-0.5.18-cp313-cp313-macosx_10_12_x86_64.whl (2.2 MB view details)

Uploaded Feb 14, 2026 CPython 3.13macOS 10.12+ x86-64

kglite-0.5.18-cp312-cp312-win_amd64.whl (2.0 MB view details)

Uploaded Feb 14, 2026 CPython 3.12Windows x86-64

kglite-0.5.18-cp312-cp312-manylinux_2_35_x86_64.whl (2.4 MB view details)

Uploaded Feb 14, 2026 CPython 3.12manylinux: glibc 2.35+ x86-64

kglite-0.5.18-cp312-cp312-macosx_11_0_arm64.whl (2.1 MB view details)

Uploaded Feb 14, 2026 CPython 3.12macOS 11.0+ ARM64

kglite-0.5.18-cp312-cp312-macosx_10_12_x86_64.whl (2.2 MB view details)

Uploaded Feb 14, 2026 CPython 3.12macOS 10.12+ x86-64

kglite-0.5.18-cp311-cp311-win_amd64.whl (2.0 MB view details)

Uploaded Feb 14, 2026 CPython 3.11Windows x86-64

kglite-0.5.18-cp311-cp311-manylinux_2_35_x86_64.whl (2.4 MB view details)

Uploaded Feb 14, 2026 CPython 3.11manylinux: glibc 2.35+ x86-64

kglite-0.5.18-cp311-cp311-macosx_11_0_arm64.whl (2.1 MB view details)

Uploaded Feb 14, 2026 CPython 3.11macOS 11.0+ ARM64

kglite-0.5.18-cp311-cp311-macosx_10_12_x86_64.whl (2.2 MB view details)

Uploaded Feb 14, 2026 CPython 3.11macOS 10.12+ x86-64

kglite-0.5.18-cp310-cp310-win_amd64.whl (2.0 MB view details)

Uploaded Feb 14, 2026 CPython 3.10Windows x86-64

kglite-0.5.18-cp310-cp310-manylinux_2_35_x86_64.whl (2.4 MB view details)

Uploaded Feb 14, 2026 CPython 3.10manylinux: glibc 2.35+ x86-64

kglite-0.5.18-cp310-cp310-macosx_11_0_arm64.whl (2.1 MB view details)

Uploaded Feb 14, 2026 CPython 3.10macOS 11.0+ ARM64

kglite-0.5.18-cp310-cp310-macosx_10_12_x86_64.whl (2.2 MB view details)

Uploaded Feb 14, 2026 CPython 3.10macOS 10.12+ x86-64

File details

Details for the file kglite-0.5.18-cp313-cp313-win_amd64.whl.

File metadata

Download URL: kglite-0.5.18-cp313-cp313-win_amd64.whl
Upload date: Feb 14, 2026
Size: 2.0 MB
Tags: CPython 3.13, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp313-cp313-win_amd64.whl
Algorithm	Hash digest
SHA256	`92505bfcd11307344efa5a2d1ca05c9d2f555c0595867700ac92647bdc732ba8`
MD5	`aaad15c134d7e6b5f68b5d3977efc72c`
BLAKE2b-256	`22cbc25fa044b4e949056728ba5f69e482b8033aa47a70faf73fa9f63fbfb2cd`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp313-cp313-manylinux_2_35_x86_64.whl.

File metadata

Download URL: kglite-0.5.18-cp313-cp313-manylinux_2_35_x86_64.whl
Upload date: Feb 14, 2026
Size: 2.4 MB
Tags: CPython 3.13, manylinux: glibc 2.35+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp313-cp313-manylinux_2_35_x86_64.whl
Algorithm	Hash digest
SHA256	`031511dfac4f93fcee4815e563fdc833c4a6507f618b6ae266a51485ac65160e`
MD5	`34a925108c0e77bd40025f6b77d8f0a8`
BLAKE2b-256	`013f4688baffb3627f81c70cf10c19037bda8d3946cc8401252ff7e12c5d9d3e`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

Download URL: kglite-0.5.18-cp313-cp313-macosx_11_0_arm64.whl
Upload date: Feb 14, 2026
Size: 2.1 MB
Tags: CPython 3.13, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp313-cp313-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`e14f8c40537b32229f8aba8686d1e7cea6dd5e0b5532ba5e7dcf124899a73fb6`
MD5	`62b4c46bfd3fad66637eb18ea6a22183`
BLAKE2b-256	`eb053363cb3800712ab76d4f908b1b0b2e6199767e960bc94e44f4452cabc842`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp313-cp313-macosx_10_12_x86_64.whl.

File metadata

Download URL: kglite-0.5.18-cp313-cp313-macosx_10_12_x86_64.whl
Upload date: Feb 14, 2026
Size: 2.2 MB
Tags: CPython 3.13, macOS 10.12+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp313-cp313-macosx_10_12_x86_64.whl
Algorithm	Hash digest
SHA256	`43f46cfbd5b7137fcacde39138db9b6e828e6f092aab7dd0b58820cd3c5765bc`
MD5	`0eeea7ac3418903367b0fcf0455280c9`
BLAKE2b-256	`837df99bfc9e418512d8fb68a29ba234c3cdb2dd34cc5bbfdb548467b94e14be`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp312-cp312-win_amd64.whl.

File metadata

Download URL: kglite-0.5.18-cp312-cp312-win_amd64.whl
Upload date: Feb 14, 2026
Size: 2.0 MB
Tags: CPython 3.12, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp312-cp312-win_amd64.whl
Algorithm	Hash digest
SHA256	`265102793f97445d7930af3aadc119c62560992d6dd078cd77ec51003080ce0a`
MD5	`15b5241f1b15bbfe13afccf1fb35af3a`
BLAKE2b-256	`2b72b848490a8cdd0fd86c65155dfa533a4c3d85d16cf50fa7301740c60ffd30`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp312-cp312-manylinux_2_35_x86_64.whl.

File metadata

Download URL: kglite-0.5.18-cp312-cp312-manylinux_2_35_x86_64.whl
Upload date: Feb 14, 2026
Size: 2.4 MB
Tags: CPython 3.12, manylinux: glibc 2.35+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp312-cp312-manylinux_2_35_x86_64.whl
Algorithm	Hash digest
SHA256	`38a83e3019f77c5c273dc71489208beb3e5f9157b3a974a4c366f289a263ff65`
MD5	`06d55ed5d52384f1c6c851046bc808dd`
BLAKE2b-256	`db9e7339cb9c3799559011e441679e3a79f34fb01d77b7116b7b38e577a289a1`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

Download URL: kglite-0.5.18-cp312-cp312-macosx_11_0_arm64.whl
Upload date: Feb 14, 2026
Size: 2.1 MB
Tags: CPython 3.12, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp312-cp312-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`8ee8bc00c4b56c29ef827e839369126d3f964812be0daf80a8b06d3289b86699`
MD5	`8b013b2ed582a910ad959876aa2fac89`
BLAKE2b-256	`49857ceb5262434aaf8c29d32f3229c9abc92d996ea9d7ffd65ad6ba4ba4ee36`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp312-cp312-macosx_10_12_x86_64.whl.

File metadata

Download URL: kglite-0.5.18-cp312-cp312-macosx_10_12_x86_64.whl
Upload date: Feb 14, 2026
Size: 2.2 MB
Tags: CPython 3.12, macOS 10.12+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp312-cp312-macosx_10_12_x86_64.whl
Algorithm	Hash digest
SHA256	`6135b53216ab9b9ec1cd82c70aa9c92b2a4b961893d40a94da8ef89e8794f4bf`
MD5	`65f0b811a109c0c46408c4b00253cb6d`
BLAKE2b-256	`fbde1a11f0d50ed656d0826218bb1e9bfd04588584dd35d3ebf7eaa0e2b0f6f1`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp311-cp311-win_amd64.whl.

File metadata

Download URL: kglite-0.5.18-cp311-cp311-win_amd64.whl
Upload date: Feb 14, 2026
Size: 2.0 MB
Tags: CPython 3.11, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp311-cp311-win_amd64.whl
Algorithm	Hash digest
SHA256	`fd4dc12e9e944f7e4c11a6bbb61df5ef8db045d31dfa50f8c3f21b95dc96c174`
MD5	`42e39b374d2a67d5ebddf03778f7ebc1`
BLAKE2b-256	`9d766eb1a31305ff0d8a5e02d9dc835239f40bfb73a8bbf312a09cd947d1d0e1`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp311-cp311-manylinux_2_35_x86_64.whl.

File metadata

Download URL: kglite-0.5.18-cp311-cp311-manylinux_2_35_x86_64.whl
Upload date: Feb 14, 2026
Size: 2.4 MB
Tags: CPython 3.11, manylinux: glibc 2.35+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp311-cp311-manylinux_2_35_x86_64.whl
Algorithm	Hash digest
SHA256	`4f96ce527e4a427f35b0c01c02c0698ccbf457affd69db219e7fb2bba67b40dd`
MD5	`23a5b0b399add8e4c4ab5155c7de422b`
BLAKE2b-256	`d63a6c53f47b5780d79ca4a8145c58b07d6f920db0afd07396ac35e584042765`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

Download URL: kglite-0.5.18-cp311-cp311-macosx_11_0_arm64.whl
Upload date: Feb 14, 2026
Size: 2.1 MB
Tags: CPython 3.11, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp311-cp311-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`2d58ac9bf5e2e3a1f6ff03997671e19d2ecf58233bb794a0109447df42b0fd09`
MD5	`3872578f7f76cda5d82689a7bd9c76e8`
BLAKE2b-256	`170bc8f0d7c57ea718661a85f27771447deef4708ed73f1ce854b0d4307cdf65`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp311-cp311-macosx_10_12_x86_64.whl.

File metadata

Download URL: kglite-0.5.18-cp311-cp311-macosx_10_12_x86_64.whl
Upload date: Feb 14, 2026
Size: 2.2 MB
Tags: CPython 3.11, macOS 10.12+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp311-cp311-macosx_10_12_x86_64.whl
Algorithm	Hash digest
SHA256	`17222c1fbb7582faaea320941f49a9f278a779d201c6314a549ad4315b7e14b5`
MD5	`c3994573d5ec90a141e0dd2441b770f5`
BLAKE2b-256	`91f71a17b136fccb5088a3c9ab59095997e0becfc6ff3b04ddd82a239eda3e1d`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp310-cp310-win_amd64.whl.

File metadata

Download URL: kglite-0.5.18-cp310-cp310-win_amd64.whl
Upload date: Feb 14, 2026
Size: 2.0 MB
Tags: CPython 3.10, Windows x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp310-cp310-win_amd64.whl
Algorithm	Hash digest
SHA256	`9f8a170e00cb7a9955d55fc6e48436bf73bdaec52b17883fd49c21b40c110003`
MD5	`42630c329bea7b805a0b3d958694345c`
BLAKE2b-256	`51d994fa3be5acf3f2bf7865e87aa128c510b7c17fc43f9cbb12b92a0ddf991e`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp310-cp310-manylinux_2_35_x86_64.whl.

File metadata

Download URL: kglite-0.5.18-cp310-cp310-manylinux_2_35_x86_64.whl
Upload date: Feb 14, 2026
Size: 2.4 MB
Tags: CPython 3.10, manylinux: glibc 2.35+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp310-cp310-manylinux_2_35_x86_64.whl
Algorithm	Hash digest
SHA256	`b1168a7b55d2207b5c8dc4b888bafcace62cf2620cdfae8c5f8cc18e8f5c60ef`
MD5	`08a4d6cb156377ff4e4380d8ca40b25b`
BLAKE2b-256	`5a988cb53cd2828c204e153fee896e817d702a7de79f721e304f308817ccc761`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

Download URL: kglite-0.5.18-cp310-cp310-macosx_11_0_arm64.whl
Upload date: Feb 14, 2026
Size: 2.1 MB
Tags: CPython 3.10, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp310-cp310-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`61bc0dc5111b783b531a76504585402b2131217f7ab480600878dd137e442fe4`
MD5	`2fcaf516364b3c0c52bd8b86c49314eb`
BLAKE2b-256	`4c2b7faf11c71c8f395af19dd41ca8b72750ea5b625d502a898a177819ba0d84`

See more details on using hashes here.

File details

Details for the file kglite-0.5.18-cp310-cp310-macosx_10_12_x86_64.whl.

File metadata

Download URL: kglite-0.5.18-cp310-cp310-macosx_10_12_x86_64.whl
Upload date: Feb 14, 2026
Size: 2.2 MB
Tags: CPython 3.10, macOS 10.12+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kglite-0.5.18-cp310-cp310-macosx_10_12_x86_64.whl
Algorithm	Hash digest
SHA256	`c6ccbd245faed8a0cc4908e97f51cca4d5145401efef800815c0173e5afe4707`
MD5	`504fbb7ec15d96dd4ae769306759ddf2`
BLAKE2b-256	`aa34ada21e8a93ae3b9d785368b77d48cc160442cd1e40e3008ef7930565dd69`

See more details on using hashes here.

kglite 0.5.18

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

KGLite

Quick Start

Loading Data from DataFrames

Core Concepts

How It Works

Return Types

Cypher

ResultView

Node dicts

Retrieval methods (cheapest to most expensive)

Flat vs. grouped results

Centrality methods

API Quick Reference

Graph lifecycle

Cypher (recommended for most tasks)

Data loading (fluent API)

Selection chain (fluent API)

Semantic search

Introspection

Algorithms

Schema Introspection

schema() — Full graph overview

connection_types() — Edge type inventory

properties(node_type, max_values=20) — Property details

neighbors_schema(node_type) — Connection topology

sample(node_type, n=5) — Quick data peek

indexes() — Unified index list

agent_describe() — AI agent context

Cypher Queries

WHERE Clause

Relationship Properties

Aggregation

WITH Clause

OPTIONAL MATCH

Built-in Functions

Spatial Functions

Arithmetic

CASE Expressions

List Comprehensions

Map Projections

Parameters

UNWIND

UNION

Variable-Length Paths

WHERE EXISTS

shortestPath()

CREATE / SET / DELETE / REMOVE / MERGE

Transactions

DataFrame Output

EXPLAIN

Supported Cypher Subset

Fluent API: Data Loading

Adding Nodes

Property Mapping

Creating Connections

Working with Dates

Batch Property Updates

Operation Reports

Fluent API: Querying

Filtering

Sorting

Traversing the Graph

Set Operations

Retrieving Results

Debugging Selections

Pattern Matching

Graph Algorithms

Shortest Path

All Paths

Connected Components

Centrality Algorithms

`schema()` — Full graph overview

`connection_types()` — Edge type inventory

`properties(node_type, max_values=20)` — Property details

`neighbors_schema(node_type)` — Connection topology

`sample(node_type, n=5)` — Quick data peek

`indexes()` — Unified index list

`agent_describe()` — AI agent context

What `agent_describe()` returns