Skip to main content

A library for generating synthetic graph families for inductive generalization experiments of graph learning models.

Project description

GraphUniverse: Enabling Systematic Evaluation of Inductive Generalization

PyPI Python License ICLR 2026

Generate families of graphs with finely controllable properties for systematic evaluation of inductive graph learning models.

Quick Start | Interactive UI | Validation | Paper Experiments

Example Graph Family

Key Features

Current graph learning benchmarks are limited to single-graph, transductive settings. GraphUniverse enables the first systematic evaluation of inductive generalization by generating entire families of graphs with:

  • Consistent Semantics: Communities maintain stable identities across graphs
  • Fine-grained Control: Tune homophily, degree distributions, community structure
  • Scalable Generation: Linear scaling, thousands of graphs per minute
  • Validated Framework: Comprehensive parameter sensitivity analysis
  • Interactive Tools: Web-based exploration and visualization

GraphUniverse Methodology Graphical Overview


Installation

Install from PyPI:

pip install graph-universe

For the interactive UI and visualization tools:

pip install graph-universe[viz]

Optional extras:

  • [viz] - Streamlit UI + seaborn visualization tools
  • [dev] - Development dependencies (testing, linting)
  • [notebook] - Jupyter notebook support
  • [all] - Everything (includes documentation tools)

Install from source (for contributors):

git clone https://github.com/LouisVanLangendonck/GraphUniverse.git
cd GraphUniverse
pip install -e ".[dev]"

Interactive UI

After installing with [viz], launch the interactive dashboard:

graph-universe-ui

This opens a browser-based UI where you can:

  • 🎛️ Configure all generation parameters with real-time preview
  • 📊 Generate graph families and visualize their properties
  • 🔍 Analyze community structure, homophily, and degree distributions
  • 💾 Download datasets directly in PyTorch Geometric format

No coding required! Perfect for exploration and quick dataset generation.

Hosted demo: Try it online at graphuniverse.streamlit.app

Launch from Python:

from graph_universe import launch_ui
launch_ui()  # Opens browser, press Ctrl+C to stop

Quick Start

Option 1: Python API with Individual Classes

from graph_universe import GraphUniverse, GraphFamilyGenerator

# Create universe with 8 communities and 10-dimensional features
universe = GraphUniverse(K=8, edge_propensity_variance=0.3, feature_dim=10)

# Generate family with full parameter control
family = GraphFamilyGenerator(
    universe=universe,
    n_nodes_range=(35, 50),
    n_communities_range=(2, 6),
    homophily_range=(0.2, 0.8),
    avg_degree_range=(2.0, 10.0),
    power_law_exponent_range=(2.0, 5.0),
    degree_separation_range=(0.1, 0.7),
    seed=42
)

# Generate 30 graphs
family.generate_family(n_graphs=30, show_progress=True)

print(f"Generated {len(family.graphs)} graphs!")

# Convert to PyTorch Geometric format for training
pyg_graphs = family.to_pyg_graphs(task="community_detection")

Option 2: Config-Driven Workflow

Create config.yaml:

universe_parameters:
  K: 10
  edge_propensity_variance: 0.5
  feature_dim: 16
  center_variance: 1.0
  cluster_variance: 0.3
  seed: 42

family_parameters:
  n_graphs: 100
  n_nodes_range: [25, 200]
  n_communities_range: [3, 7]
  homophily_range: [0.1, 0.9]
  avg_degree_range: [2.0, 8.0]
  power_law_exponent_range: [2.0, 3.0]
  degree_separation_range: [0.4, 0.8]
  seed: 42

task: "community_detection"

Then load and generate:

import yaml
from graph_universe import GraphUniverseDataset

with open("config.yaml") as f:
    config = yaml.safe_load(f)

dataset = GraphUniverseDataset(root="./data", parameters=config)
print(f"Generated dataset with {len(dataset)} graphs!")

Validation & Analysis

GraphUniverse includes built-in validation to ensure generated graphs match target properties:

# Validate standard graph properties
family_properties = family.analyze_graph_family_properties()
for property_name in ['node_counts', 'avg_degrees', 'homophily_levels']:
    values = family_properties[property_name]
    print(f"{property_name}: mean={np.mean(values):.3f}")

# Analyze within-graph community signals (fits Random Forest per graph)
family_signals = family.analyze_graph_family_signals()
for signal in ['structure_signal', 'feature_signal', 'degree_signal']:
    values = family_signals[signal]
    print(f"{signal}: mean={np.mean(values):.3f}")

# Measure between-graph consistency
family_consistency = family.analyze_graph_family_consistency()
for metric in ['structure_consistency', 'feature_consistency', 'degree_consistency']:
    value = family_consistency[metric]
    print(f"{metric}: {value:.3f}")

Documentation & Support


Citation

If you use GraphUniverse in your research, please cite:

@inproceedings{vanlangendonck2026graphuniverse,
  title={GraphUniverse: Enabling Systematic Evaluation of Inductive Generalization},
  author={Van Langendonck, Louis and Bernardez, Guillermo},
  booktitle={International Conference on Learning Representations},
  year={2026}
}

For Researchers & Contributors

The sections below contain resources for reproducing paper experiments and contributing to development.

Reproducing Paper Experiments

Clone the repository to access validation and experiment scripts:

git clone https://github.com/LouisVanLangendonck/GraphUniverse.git
cd GraphUniverse
pip install -e ".[dev]"

Run parameter sensitivity validation (reproduces paper results):

python experiments/validate_parameter_sensitivity.py --n-random-samples 100 --n-graphs 30

Run scalability experiments:

python experiments/scalability_experiment.py

Example Gallery & Additional Resources

Visit the GitHub repository for:

  • Complete example scripts in examples/ directory
  • Detailed methodology diagrams and visualizations
  • Additional documentation and tutorials

Contributing

We welcome contributions! To get started:

  1. Fork the repository
  2. Create a feature branch
  3. Install development dependencies: pip install -e ".[dev]"
  4. Run tests: pytest test/
  5. Submit a pull request

See the repository for detailed contribution guidelines.


License

MIT License - see LICENSE for details.

Copyright (c) 2025 Louis Van Langendonck and Guillermo Bernardez

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

graph_universe-0.1.1.tar.gz (52.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

graph_universe-0.1.1-py3-none-any.whl (49.5 kB view details)

Uploaded Python 3

File details

Details for the file graph_universe-0.1.1.tar.gz.

File metadata

  • Download URL: graph_universe-0.1.1.tar.gz
  • Upload date:
  • Size: 52.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.3

File hashes

Hashes for graph_universe-0.1.1.tar.gz
Algorithm Hash digest
SHA256 55e7496870c530a2923b0f7630301e6ddbda17854acfc5fa1ab5f6df67652d82
MD5 2c396e2ec2a66eafb8ffdd273ed17fd5
BLAKE2b-256 636881033fbc8029d4e2cc43413905ed69d4cf0bfb0bb9ce6c73793227decaf1

See more details on using hashes here.

File details

Details for the file graph_universe-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: graph_universe-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 49.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.3

File hashes

Hashes for graph_universe-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 f214d1f5f51e1e38ff6d9e68050733e9c0e739a9d90ff55becae69c9633e31f2
MD5 740f34b563232c78744f148a5c5cc9d2
BLAKE2b-256 bb1114fa0325f3a005c0877700c2b1621158981cf80689243f80e36b8a0c42b6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page