A large-scale database and benchmark for graph learning.

GraphNetz

Statistically rigorous GNN benchmarking

Why GraphNetz

Most GNN benchmarks report point-estimate accuracies on a handful of citation graphs and declare a winner without confidence intervals, multiple-comparison correction, or rank aggregation across datasets. GraphNetz's default output is a structured statistical report, not a raw accuracy table:

  • multi-seed Student's t confidence intervals per cell,
  • Holm–Bonferroni paired t-tests (or Wilcoxon signed-rank) within each task,
  • Demšar critical-difference diagrams from Friedman ranks with a Nemenyi post-hoc.
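The Holm–Bonferroni step-down correction in the second bullet can be reproduced in a few lines of scipy. The snippet below is a standalone sketch, not the GraphNetz implementation, and the per-seed scores are synthetic:

```python
# Holm-Bonferroni step-down correction over paired t-tests (scipy only).
import numpy as np
from scipy import stats

def holm_correct(pvals):
    """Return Holm-adjusted p-values (step-down Bonferroni)."""
    m = len(pvals)
    order = np.argsort(pvals)
    adjusted = np.empty(m)
    running_max = 0.0
    for rank, idx in enumerate(order):
        adj = min(1.0, (m - rank) * pvals[idx])
        running_max = max(running_max, adj)  # enforce monotonicity
        adjusted[idx] = running_max
    return adjusted

# Synthetic per-seed accuracies for three models on one task.
rng = np.random.default_rng(0)
scores = {
    "GCN": 0.80 + 0.01 * rng.standard_normal(10),
    "GAT": 0.82 + 0.01 * rng.standard_normal(10),
    "GIN": 0.79 + 0.01 * rng.standard_normal(10),
}
names = list(scores)
raw = [stats.ttest_rel(scores[a], scores[b]).pvalue
       for i, a in enumerate(names) for b in names[i + 1:]]
adj = holm_correct(np.array(raw))
print(dict(zip(["GCN-GAT", "GCN-GIN", "GAT-GIN"], adj.round(4))))
```

Swapping `stats.ttest_rel` for `stats.wilcoxon` gives the non-parametric variant mentioned above.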

The catalogue is organised along a category × task taxonomy: 58 dataset loaders across 10 scientific categories crossed with 4 task kinds (node classification, graph classification, graph regression, link prediction). Five canonical architectures (GCN, GAT, GIN, GraphSAGE, Graph Transformer) plug into every kind via a small set of task-kind adapters; Deep Graph Infomax is exposed as an optional pre-training utility.

Install

uv add graphnetz
# or, in an existing environment:
pip install graphnetz

For local development:

git clone https://github.com/kleyt0n/graphnetz
cd graphnetz
uv sync --group dev

GraphNetz requires Python ≥ 3.10, torch ≥ 2.6, and torch-geometric ≥ 2.6.

Quick start

from graphnetz import GCN, train_node_classification, plot_history
from graphnetz.datasets.social import cora

ds = cora("data/cora")
model = GCN(ds.num_features, 64, ds.num_classes)
history = train_node_classification(model, ds[0], epochs=200)
fig, ax = plot_history(history, title="GCN on Cora")

For a full benchmark run with the default statistical report:

from graphnetz import GAT, GCN, GraphSAGE, GraphTransformer, run_benchmark

report = run_benchmark(
    "social",
    {"GCN": GCN, "GAT": GAT, "GraphSAGE": GraphSAGE, "GraphTransformer": GraphTransformer},
    seeds=(0, 1, 2, 3, 4, 5, 6, 7, 8, 9),
    kind="node_cls",          # restrict to one task family
)
print(report.summary())       # per-(task, model) mean ± t-CI
print(report.pairwise())      # Holm-corrected paired t-tests (or Wilcoxon)
fig, _ = report.plot_critical_difference(alpha=0.05)

Task kinds

| Kind | Symbol | Metric | Examples |
| --- | --- | --- | --- |
| Node classification | node_cls | test accuracy | Cora, Roman-empire |
| Graph classification | graph_cls | val accuracy | MUTAG, MNIST-superpixels |
| Graph regression | graph_reg | val MAE | ZINC, QM9 |
| Link prediction | link_pred | test AUC | FB15k-237, Internet AS |

Unlabelled graphs (Netzschleuder, synthetic combinatorial, Ising lattice) enter the benchmark through link prediction on a held-out edge split, so every cell carries a real test-time metric — there is no self-supervised pretext loss in the headline report.
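The held-out-edge recipe for unlabelled graphs can be illustrated without GraphNetz at all: hold out a fraction of edges as test positives, sample non-edges as negatives, and score with AUC. This sketch uses networkx and scikit-learn with a toy common-neighbour scorer, not the library's actual split or model:

```python
# Link prediction on an unlabelled graph: held-out edges + sampled
# non-edges give a real test-time AUC. Illustrative sketch only.
import networkx as nx
import numpy as np
from sklearn.metrics import roc_auc_score

rng = np.random.default_rng(0)
G = nx.karate_club_graph()
edges = np.array(G.edges())
rng.shuffle(edges)
n_test = len(edges) // 5                      # 20% held-out positives
test_pos, train_pos = edges[:n_test], edges[n_test:]

# Negative test examples: node pairs that are not edges in G.
nodes = list(G.nodes())
neg = []
while len(neg) < n_test:
    u, v = rng.choice(nodes, 2, replace=False)
    if not G.has_edge(u, v):
        neg.append((u, v))

# Toy scorer: common-neighbour count on the training graph only.
G_train = nx.Graph()
G_train.add_nodes_from(nodes)
G_train.add_edges_from(train_pos)

def score(u, v):
    return len(list(nx.common_neighbors(G_train, u, v)))

y_true = [1] * n_test + [0] * n_test
y_score = [score(u, v) for u, v in list(test_pos) + neg]
auc = roc_auc_score(y_true, y_score)
print("test AUC:", auc)
```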

Dataset categories

| Category | # | Task kinds | Loaders |
| --- | --- | --- | --- |
| Combinatorial | 6 | GC, GR, LP | random TSP, VRP, max-flow, bipartite matching, coloring, max-cut |
| Biology | 10 | GC, GR, NC, LP | MUTAG, PROTEINS, ENZYMES, Peptides-func/struct, PPI, C. elegans, Budapest connectome, hospital/high-school contacts |
| Social | 14 | NC, LP | Cora, CiteSeer, PubMed, WikiCS, Roman-empire, Amazon-ratings, Minesweeper, Tolokers, Questions, MovieLens-100k, Karate, Facebook friends, DBLP coauthor, DNC emails |
| Knowledge | 3 | LP | FB15k-237, WN18-RR, WordNet (Netz) |
| Infrastructure | 6 | LP | power grid, EuroRoad, US roads, EU airlines, London transport, urban streets |
| Finance | 4 | NC, LP | Elliptic Bitcoin, product space, board of directors, US patents |
| Computing | 4 | LP | Internet AS, Internet topology, AS-Skitter, route views |
| Vision | 5 | GC, NC | MNIST/CIFAR-10 superpixels, ModelNet10/40, ShapeNet |
| Physics | 3 | GR, LP | QM9, ZINC, Ising lattice |
| Security | 3 | GC, LP | MalNet-Tiny, 9/11 terrorists, train terrorists |
| OGB | 2 | NC, GC | ogbn-arxiv, ogbg-molhiv (requires pip install graphnetz[ogb]) |

Loaders are importable per category:
from graphnetz.datasets.social import cora, roman_empire
from graphnetz.datasets.biology import peptides_func
from graphnetz.datasets.computing import internet_as
from graphnetz.datasets.ogb import ogbn_arxiv, ogbg_molhiv

ds_cora = cora("data/cora")
ds_rom  = roman_empire("data/roman_empire")        # heterophilic
ds_pep  = peptides_func("data/peptides_func")      # LRGB
ds_inet = internet_as("data/internet_as")          # Netzschleuder

For arbitrary Netzschleuder networks:

from graphnetz import Netz
ds = Netz(root="data", dataset_name="urban_streets", network_name="brasilia")

Models

| Model | Kinds | Source |
| --- | --- | --- |
| GCN | all four | Kipf & Welling, ICLR 2017 |
| GAT | all four | Veličković et al., ICLR 2018 |
| GIN | graph_cls, graph_reg | Xu et al., ICLR 2019 |
| GraphSAGE | all four | Hamilton et al., NeurIPS 2017 |
| GraphTransformer | all four | Shi et al., 2021 |
| DGI | (utility) | Veličković et al., ICLR 2019 |

Node-level encoders enter every task kind through three small adapters: graph-level pooling head, dot-product link-prediction head, and the DGI self-supervised wrapper for optional unsupervised pre-training.
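Two of the three adapter ideas can be sketched in plain PyTorch. The class names and signatures below are illustrative only (a real encoder would also take edge_index; here a linear layer stands in), not the GraphNetz API:

```python
# Sketch: wrapping a node-level encoder with a pooling head and a
# dot-product link head. Illustrative names, not the library's API.
import torch

class PoolingHead(torch.nn.Module):
    """Graph-level head: mean-pool node embeddings, then classify."""
    def __init__(self, encoder, hidden, num_classes):
        super().__init__()
        self.encoder = encoder
        self.lin = torch.nn.Linear(hidden, num_classes)

    def forward(self, x, batch):
        h = self.encoder(x)                          # [num_nodes, hidden]
        num_graphs = int(batch.max()) + 1
        # mean pooling per graph id in `batch`
        pooled = torch.zeros(num_graphs, h.size(1)).index_add_(0, batch, h)
        counts = torch.bincount(batch, minlength=num_graphs).clamp(min=1)
        return self.lin(pooled / counts.unsqueeze(1))

class DotProductLinkHead(torch.nn.Module):
    """Link head: score a candidate edge (u, v) by <h_u, h_v>."""
    def __init__(self, encoder):
        super().__init__()
        self.encoder = encoder

    def forward(self, x, edge_index):
        h = self.encoder(x)
        src, dst = edge_index
        return (h[src] * h[dst]).sum(dim=-1)         # one logit per edge

# Toy demo with a linear stand-in encoder.
enc = torch.nn.Linear(8, 16)
x = torch.randn(5, 8)
batch = torch.tensor([0, 0, 1, 1, 1])                # two graphs
logits = PoolingHead(enc, 16, 3)(x, batch)
edge_scores = DotProductLinkHead(enc)(x, torch.tensor([[0, 1], [2, 3]]))
print(logits.shape, edge_scores.shape)
```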

Custom models

import torch
from graphnetz import register_model, run_benchmark

# 1. Decorator
@register_model(kinds="node_cls")
class MyGNN(torch.nn.Module):
    def __init__(self, in_channels, hidden_channels, out_channels): ...

# 2. Class attribute (no decorator)
class MyGNN(torch.nn.Module):
    task_kinds = {"node_cls", "graph_cls"}

# 3. Inline tuple at run-time
run_benchmark(
    "social",
    {"MyGNN": (MyGNN, "node_cls",
               lambda i, h, o: MyGNN(i, h, o, dropout=0.3))},
)

The statistical report

run_benchmark(...) returns a BenchmarkReport with the following methods:

| Method | Output |
| --- | --- |
| report.summary(ci=0.95) | per-(task, model) mean ± t-CI half-width DataFrame |
| report.pairwise(alpha=0.05) | Holm-corrected paired t-tests or Wilcoxon signed-rank tests within each task |
| report.plot_critical_difference() | Demšar / Nemenyi CD diagram across tasks |
| report.plot_pairwise(layout=...) | matrix or list view of pairwise significance |
| report.plot_forest() | per-task forest plot of mean ± CI |
| report.plot_learning_curves() | shared-y learning curves with t-CI bands |
| report.to_latex(path) | publication-ready bold-best LaTeX table |
| report.pairwise_to_latex(path) | Holm pairwise LaTeX table (parametric or non-parametric) |
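A mean ± t-CI half-width cell is just the Student-t interval over seeds. A hand computation with synthetic accuracies (scipy only, conceptually mirroring a 95% summary cell, not the library's code):

```python
# Student-t confidence interval over multi-seed accuracies.
import numpy as np
from scipy import stats

accs = np.array([0.812, 0.806, 0.821, 0.809, 0.815,
                 0.818, 0.803, 0.810, 0.817, 0.811])   # 10 seeds (synthetic)
mean = accs.mean()
sem = accs.std(ddof=1) / np.sqrt(len(accs))            # standard error
half = stats.t.ppf(0.975, df=len(accs) - 1) * sem      # 95% two-sided half-width
print(f"{mean:.3f} ± {half:.3f}")
```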

Notebooks

Worked examples live under examples/:

  • 01_benchmark.ipynb — the cross-category dashboard (multi-seed report, bootstrap CIs, custom-model integration).
  • 02_knowledge.ipynb — relational link prediction on FB15k-237 / WN18-RR using the DistMult decoder.

Reproducing the paper

PYTHONPATH=src uv run python paper/experiment.py   # train + cache + figures
latexmk -pdf paper/main.tex                        # compile PDF

The script trains 5 architectures × 10 seeds across the 10 surviving categories, caches the histories under paper/_cache_*.pkl, and writes every figure (paper/figures/) and LaTeX table (paper/tables/) referenced by paper/main.tex. Total runtime on a recent laptop CPU is under 30 minutes.

Issues

Track issues at github.com/kleyt0n/graphnetz/issues.

Citation

If GraphNetz is useful in your work, please cite the accompanying paper:

@article{dacosta2026graphnetz,
  title   = {GraphNetz: A Statistical-Reporting Layer for Graph Neural Network Benchmarks},
  author  = {da Costa, Kleyton and Modenesi, Bernardo},
  journal = {arXiv preprint},
  year    = {2026}
}

Contributing

Pull requests welcome. Read CONTRIBUTING.md first — the short version is: every benchmark cell must carry a real held-out metric, every change must thread through the multi-seed pipeline, and every PR must be ruff clean.

uv run pytest
uv run ruff check

License

MIT — see LICENCE.txt.
