A Python framework for discovering isoform-switch and splicing modules from bulk RNA-seq by combining gene-local compositional modeling with splice-graph-aware latent network inference.
Project description
IsoGraph
IsoGraph is a Python research software package for discovering isoform-switch and splicing modules from bulk RNA-seq. It combines gene-local compositional modeling with network inference so researchers can move from transcript-level counts to gene-module structure, trait associations, and reproducible benchmark artifacts.
Core Capabilities
- Generate and benchmark against the permanent
core_v1fixture suite and the large-scalescale_v1suite (6k–12k genes, 25:1–50:1 genes-to-samples ratios). - Freeze the bundled
real_caudate_aa_v1real-data fixture from local BrainSeq inputs. - Fit the deterministic baseline backend from the command line on a prepared dataset bundle.
- Run
baseline,latent,graph,vae,wgcna, orgpu_latentbackends programmatically or through the benchmark runner. - Export reproducible artifacts, benchmark reports, calibration summaries, and snapshot comparisons.
Installation
Install the core package from PyPI:
pip install isograph
The core package supports Python 3.11 through 3.14.
Optional backends
The vae and gpu_latent backends require PyTorch:
pip install torch
See the PyTorch installation guide for GPU/CUDA builds.
The wgcna backend requires R with the WGCNA package and Rscript on PATH.
Quickstart
Run a minimal benchmark on the bundled toy fixture (VAE is the default backend):
isograph benchmark -- \
fixture_filter=toy_v1 \
stage_name=readme_smoke
This writes benchmark artifacts under artifacts/benchmarks/readme_smoke/toy_v1/ and
JSON reports under artifacts/reports/.
Using Your Own Data
IsoGraph expects a prepared dataset bundle containing a manifest.json, aligned sample
metadata, feature tables, and dense count matrices. The current command-line path for
custom data is:
isograph fit \
--dataset-path path/to/my_dataset_bundle \
--output-dir artifacts/fits/my_dataset
At present, fit runs the deterministic baseline backend. For latent, graph, or VAE
backends on your own bundle, use the Python API directly. The detailed walkthroughs live
in the Wiki, and the formal data model is documented in the RTD source tree.
Documentation
- Reference docs for publication on Read the Docs live in docs.
- Step-by-step tutorials for installation, data preparation, and own-data workflows live in the GitHub Wiki.
- Project planning and staged development history remain in docs/staged-roadmap.md.
Citation
If you use IsoGraph in research, cite the software repository using the metadata in CITATION.cff. If a manuscript or preprint becomes available later, that can be added as a preferred citation target without changing the software citation path.
Acknowledgements
IsoGraph is supported by the National Institute on Minority Health and Health Disparities
award R00 MD0169640 and the Alzheimer's Association award 25AARG-1413315.
Reproducibility and Data Provenance
- The benchmark suite is fixture-driven and designed to preserve regression targets across development stages.
- The bundled real-data workflow freezes a reproducible
real_caudate_aa_v1dataset from local BrainSeq-derived inputs and caches intermediate selections underbenchmarks/cache/real_data/. - Benchmark, calibration, runtime, and snapshot artifacts are written into versioned
directories under
artifacts/andsnapshots/.
Limitations
- The benchmark CLI is optimized for the bundled fixture suite rather than arbitrary user-defined suites.
- The
fitCLI currently exposes only the baseline backend for custom datasets. - The VAE and GPU-latent backends require a separate PyTorch installation.
- The WGCNA backend requires R with the
WGCNApackage installed. - The bundled
freeze-realworkflow depends on local BrainSeq-style source files and is not a generic data-ingestion command for arbitrary cohorts.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file isograph-0.1.1.tar.gz.
File metadata
- Download URL: isograph-0.1.1.tar.gz
- Upload date:
- Size: 61.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.3.3 CPython/3.14.4 Linux/6.19.10-arch1-1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f6288781737b1f9141b4950322db14381dc84095d0463f8bb9380151022dae4e
|
|
| MD5 |
e44e2fcb7edf60762778168bfa126a5f
|
|
| BLAKE2b-256 |
9202b062394eff3f23055f56ed19cc8a2891f0bd9f3a432ec3251153c35645c0
|
File details
Details for the file isograph-0.1.1-py3-none-any.whl.
File metadata
- Download URL: isograph-0.1.1-py3-none-any.whl
- Upload date:
- Size: 78.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.3.3 CPython/3.14.4 Linux/6.19.10-arch1-1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
333d49e0fa875ad49afba497410927a376e819426162827a25ba061efb91a717
|
|
| MD5 |
76338bf626c94c75f1bb1f2709394618
|
|
| BLAKE2b-256 |
a49c6d7029b56ac34f2cb445b9813459146243d403446fb5f5e948e0d8445bf5
|