A Python tool to build, manipulate, and interoperate with flexible graph structures
Project description
annnet — Annotated Network Data Structure for Science
annnet (Annotated Network) is a unified, high‑expressivity graph platform that brings anndata‑style, annotated containers to networks, multilayer structures, and hypergraphs. It targets systems biology, network biology, omics integration, computational social science, and any domain needing fully flexible graph semantics with modern, stable storage and interoperability.
Why annnet?
Nothing else combines all of this at once:
- Maximum graph expressiveness: hyperedges; directed/undirected per‑edge; parallel edges; self‑loops; vertex–edge and edge–edge relations; graph/edge‑level semantics.
- Multilayer networks: support aligned with the Kivelä et al. framework (layers, aspects, inter‑layer edges).
- Powerful slicing: create subnetworks, clusters, and arbitrary “slices”; switch active slice quickly.
- Annotated tables everywhere: Narwhals-compatible tables for vertices, edges, layers, vertex-layer couples, slices, and graph metadata.
- anndata‑style feel: familiar patterns like obs/var/layers‑like concepts, caches, and indices.
- Interoperability without friction: import/export with NetworkX, igraph, GraphML, GEXF, SIF, SBML, CX2 (Cytoscape), Excel/CSV/JSON, Parquet graph directories, and DataFrames (via Narwhals).
- Algorithm interoperability: seamless, lazy calls into NetworkX/igraph/graph‑tool via the graph-owned
G.nx,G.ig, andG.gtaccessors. - Disk‑backed, lossless
.annnetstorage:- Zarr for sparse matrices
- Parquet for annotated tables
- JSON for metadata Reopen without loss of structure or metadata.
annnet is under active development. Feedback is welcome via GitHub issues: https://github.com/saezlab/annnet
Features
General graph modeling
- Simple graphs, directed graphs, multigraphs
- Hyperedges (undirected and directed head→tail)
- Signed/weighted edges and rich node/edge annotations
- Efficient indexing and fast lookups
Import/Export (file formats/data)
- GraphML, GEXF, SIF, SBML
- CX2 (Cytoscape)
- Parquet graph directory (directory of Parquet files)
- CSV/TSV, JSON/NDJSON
- Excel (via pandas/openpyxl)
- DataFrames via Narwhals (Polars, pandas, and friends)
Interoperability (runtime backends)
- NetworkX via
G.nxandto_nx/from_nx - igraph via
G.igandto_igraph/from_igraph - graph‑tool via
G.gtandto_graphtool(if installed)
Export (file formats/data)
- GraphML, GEXF, SIF
- CX2 (Cytoscape)
- Parquet graph directory (directory of Parquet files)
- JSON/NDJSON
- DataFrames via Narwhals (Polars, pandas, and friends)
Backend-specific integration
If networkx is installed, you can call algorithms directly from the AnnNet graph:
centrality = G.nx.degree_centrality(G)
Similarly, G.ig (igraph) and G.gt (graph-tool) are available if those libraries are present. On G.nx.<function>(G, ...), AnnNet resolves the NetworkX callable, converts the AnnNet graph to a NetworkX graph with the requested projection options, replaces the G argument with that backend graph, dispatches the call, and returns the backend result. Backend graphs are cached and refreshed after AnnNet mutations.
Use G.nx.backend(...), G.ig.backend(...), or G.gt.backend() when you want the concrete projected backend graph object. Use G.nx.<function>(G, ...), G.ig.<function>(G, ...), or G.gt.<namespace>.<function>(G, ...) when you want AnnNet to do the conversion and dispatch for an algorithm call. You can control directionality, hyperedge handling, slice selection, and multigraph collapsing where the backend supports those options.
Examples:
# Keep multiedges, drop hyperedges (default); operate on active slice
bc = G.nx.betweenness_centrality(G)
# Convert a specific slice and collapse multiedges using weights
nxG = G.nx.backend(
directed=True,
hyperedge_mode="skip", # or "expand"
slice="toy",
simple=True, # collapse Multi(Di)Graph -> (Di)Graph
)
# igraph interoperability (if python-igraph installed)
pagerank = G.ig.pagerank(G)
Installation
Base install:
pip install annnet
Backends and IO/storage extras (choose what you need):
# Backend libraries
pip install "annnet[networkx,igraph]"
# IO/serialization and storage (JSON/Parquet/Zarr, Excel, Narwhals)
pip install "annnet[io]"
# Or more granular extras
pip install "annnet[parquet]" # Parquet graph directory support (pyarrow)
pip install "annnet[zarr_io]" # Zarr + numcodecs for .annnet storage
pip install "annnet[excel]" # Excel loader (pandas/openpyxl)
pip install "annnet[sbml]" # SBML import (libxml2/lxml)
# Everything commonly available via pip (graph‑tool is not on PyPI)
pip install "annnet[all]"
Note: graph-tool is supported by adapters and G.gt interoperability if installed, but it is not available on PyPI; install it via your OS/package manager and annnet will detect it.
Dev/test setup with Pixi
Use Pixi to get a fully loaded dev environment (Python 3.12, all extras, graph‑tool from conda‑forge):
pixi install # solves the dev env
pixi run test-all # runs pytest in the Pixi dev env (with graph-tool)
Notes:
graph-toolis only available on some platforms (e.g., macOS arm may need Rosetta, linux-64 is supported); the Pixi manifest includes multiple platforms.- The Pixi env installs annnet editable with extras
allanddev, plus graph-tool from conda-forge.
Docs with uv
Use the module entrypoint instead of the mkdocs console script. In this repo, that path picks up the startup customizations used to avoid Jupyter warning noise during docs builds.
uv sync --group docs
uv run python -m mkdocs serve
For a one-off build:
uv run python -m mkdocs build --strict
Quick Start
import annnet as an
G = an.Graph(directed=True) # default direction; can be overridden per-edge
# Create slices and set active
G.slices.add_slice("toy")
G.slices.add_slice("train")
G.slices.add_slice("eval")
G.slices.active = "toy" # same effect as set_active_slice("toy")
# Add vertices (with attributes) in 'toy'
for v in ["A", "B", "C", "D"]:
G.add_vertices(v, label=v, kind="gene")
# 1) Binary directed
e_dir = G.add_edges("A", "B", weight=2.0, directed=True, relation="activates")
# 2) Binary undirected
e_undir = G.add_edges("B", "C", weight=1.0, directed=False, relation="binds")
# 3) Self-loop
e_loop = G.add_edges("D", "D", weight=0.5, directed=True, relation="self")
# 4) Parallel edge
e_parallel = G.add_edges("A", "B", weight=5.0, parallel="parallel", relation="alternative")
# 5) Vertex–edge (hybrid) edge
G.add_edges(edge_id="edge_e1", as_entity=True, description="signal")
e_vx = G.add_edges("edge_e1", "C", directed=True, as_entity=True, channel="edge->vertex")
# 6) Hyperedge (undirected, 3-way membership)
e_hyper_undir = G.add_edges(["A", "C", "D"], weight=1.0, directed=False, tag="complex")
# 7) Hyperedge (directed head→tail)
e_hyper_dir = G.add_edges(["A", "B"], ["C", "D"], weight=1.0, directed=True, reaction="A+B->C+D")
# 8) Run a NetworkX algorithm (if installed)
deg = G.nx.degree_centrality(G)
Interoperability
High‑fidelity conversions aim to preserve IDs, attributes, and directionality. When a conversion is lossy (e.g., hyperedges to NetworkX), functions return a manifest you can pass back to restore structure where possible.
Adapter types (what goes where):
- Runtime backend adapters (interoperability): used by
G.nx,G.ig, andG.gtfor algorithm calls and byto_nx/from_nx,to_igraph/from_igraph, andto_graphtool/from_graphtoolfor explicit in-memory conversions. - Format/data adapters (I/O): used to read/write files and tabular data. Examples: GraphML, GEXF, SIF, SBML, CX2 (Cytoscape), Parquet graph directories, JSON/NDJSON, Excel/CSV, and DataFrames via Narwhals.
In short: G.nx/G.ig/G.gt are runtime algorithm interoperability accessors; file formats use IO adapters.
NetworkX:
import annnet as an
nxG, manifest = an.adapters.to_nx(G, directed=True, hyperedge_mode="skip")
# ... run algorithms or edit nxG ...
G2 = an.adapters.from_nx(nxG, manifest) # use manifest to reduce loss
igraph / graph‑tool (if installed):
igG, ig_manifest = an.adapters.to_igraph(G, directed=True, hyperedge_mode="skip")
G2 = an.adapters.from_igraph(igG, ig_manifest)
File formats and dataframes:
import annnet as an
# GraphML / GEXF / SIF
an.io.to_graphml(G, "graph.graphml", directed=True, hyperedge_mode="reify")
G2 = an.io.from_graphml("graph.graphml")
an.io.to_sif(G, "graph.sif", lossless=True)
G3 = an.io.from_sif("graph.sif", manifest_path="graph.sif.manifest.json")
# JSON / NDJSON
an.io.to_json(G, "graph.json")
H = an.io.from_json("graph.json")
# Parquet graph directory
an.io.to_parquet(G, "graph_dir/")
K = an.io.from_parquet("graph_dir/")
# CX2 (Cytoscape)
cx2 = an.io.to_cx2(G, hyperedges="reify")
L = an.io.from_cx2(cx2, hyperedges="manifest")
Notes:
- Hyperedges may be dropped or reified depending on
hyperedge_modeand target format. - Slices: backend graphs are single‑view; multiple slices may be flattened unless specified.
- Use returned manifests when available to improve round‑trip fidelity.
Storage (.annnet)
annnet can write and read a lossless on‑disk format combining Zarr (sparse arrays) and Parquet (tables), plus JSON sidecars.
import annnet as an
an.io.write(G, "my_graph.annnet", overwrite=True) # Zarr + Parquet + JSON
R = an.io.read("my_graph.annnet") # full fidelity load
Layout highlights:
manifest.jsonwith versions, counts, and slice metadatastructure/incidence.zarrstores COO arrays (row/col/data)structure/*parquetfor indices, definitions, weights, kindstables/*parquetfor vertex/edge/slice attributeslayers/,slices/,audit/,uns/for multilayer, slicing, history, and unstructured data
Internal Design
annnet adapts internal representation for performance and compatibility:
- Sparse incidence matrix for core topology; DOK in‑memory, stored as COO
- Attributes decoupled from structure (Polars tables)
- Lazy conversion to external backends on demand
- Copy‑on‑write graph views and structured change tracking
- Optional caching for converted backend graphs within
G.nx/G.ig/G.gt
See the architecture overview in architecture.md for deeper details and examples.
Philosophy
- Simple, consistent interface for all graph types
- Interoperability‑first: integrate, don’t replace
- Performance‑aware, not performance‑obsessed
- Extensible and modular, not monolithic
Package Overview
The package is modular to separate core functionality, adapters for external libraries, and IO/storage:
annnet/
├── core/ # Graph class, managers, lazy interoperability accessors (nx/ig/gt)
├── adapters/ # Backend adapters: networkx/igraph/graph-tool conversions
├── io/ # Lossless .annnet storage and filesystem/tabular IO
├── algorithms/ # Pure-Python algorithms using core only
└── utils/ # Misc utilities (typing/validation/config)
See the architecture overview for a deeper design document.
License
annnet is licensed under the BSD‑3 License. See the LICENSE file for details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file annnet-0.2.0.tar.gz.
File metadata
- Download URL: annnet-0.2.0.tar.gz
- Upload date:
- Size: 290.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ea905d1e0e17d8a1a3b19174148a315eb7653ea0d6bbd67d091f893008977885
|
|
| MD5 |
6d54358a563c67517c01308560770bf4
|
|
| BLAKE2b-256 |
c8f393fa40339ca6e088b26f8bd6c3e6d2372130dca51d1543c90a24809ba05d
|
File details
Details for the file annnet-0.2.0-py3-none-any.whl.
File metadata
- Download URL: annnet-0.2.0-py3-none-any.whl
- Upload date:
- Size: 224.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c2a0abd310d8ac8bf3c02aa66e292571318e0828f6a31d9b8b965aeb906795b9
|
|
| MD5 |
1a2981bdf38e0e9a831e1a98c0e1282b
|
|
| BLAKE2b-256 |
b7b78b430f9ea2c5b708dd0813fd6dbb0233d2fc35d362a4ee088ecd39e07cd6
|