Splicing-based Pattern Extraction and Clustering using TRAnscriptomics
Project description
SPECTRA: Unsupervised Analysis of Alternative Splicing
✂️ About
SPECTRA (Splicing-based Pattern Extraction and Clustering using TRAnscriptomics) is an end-to-end pipeline for discovering patient subtypes based on alternative splicing. It is a modernized and optimized version of the original OncoSplice algorithm.
SPECTRA leverages an iterative clustering strategy to identify stable and dominant splicing patterns across patient samples. The new implementation enhances the speed, accuracy, and modularity, allowing seamless integration into both command-line workflows and interactive Python environments.
📌 Installation
SPECTRA can installed as a Python package via pip. We recommend using conda environment with python version set to 3.12.
pip3 install splicespectrax==2025.0.3
📚 Documentation
Detailed documentation on how to perform SPECTRA analysis is provided on ReadTheDocs now.
👩🏫 Tutorial
Example datasets: PSI files for each TCGA cancer can be downloaded from here.
SPECTRA can be used in two ways:
- As a command-line tool for end-to-end execution
- As a modular workflow, where individual functions are called step-by-step
See the tutorials and example scripts for each approach:
Command-Line Interface (CLI)
Run the entire pipeline with a single command using main.py. This is ideal for multiple dataset processing and automated workflows.
Modular Usage
Import and run individual components such as preprocessing, clustering, or visualization in a custom step-by-step analysis.
📝 Overview of Modules
| Module | Description |
|---|---|
main.py |
Entry point for running the complete SPECTRA pipeline. Handles argument parsing and execution flow. |
round_wrapper.py |
Wraps a single iteration of clustering (SPECTRA performs 3 iterations by default). |
preprocess.py |
Performs variance-based and intercorrelation-based filtering of splicing events prior to clustering. |
remove_redundancy.py |
Removes redundant splicing events based on intra-gene correlation. |
feature_selection.py |
Implements PCA-based feature selection, similar to the splice-ICGS method in the original OncoSplice. |
median_impute.py |
Imputes missing values in the splicing matrix using the median for each event. |
visualizations.py |
Generates visual summaries, including splicing event annotation bar plots and cluster heatmaps. |
determine_rank.py |
Automatically determines the optimal NMF rank (if not user-specified). |
run_nmf.py |
Performs NMF clustering and assigns multi-label cluster memberships. |
metadata_analysis.py |
Analyzes and annotates differential splicing events across clusters. |
linear_svm.py |
Applies linear SVM for final cluster assignment. |
correlation_depletion.py |
Identifies and depletes splicing events associated with a clustering round. |
correlation_depletion_vectorized.py |
A faster version of correlationDepletion.py using imputed values and optimized calculations. |
📖 Citation
Coming soon — citation information for referencing SPECTRA in publications.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file splicespectrax-2025.0.5.tar.gz.
File metadata
- Download URL: splicespectrax-2025.0.5.tar.gz
- Upload date:
- Size: 30.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c71b4962071bf71e5f4d9178f2eed450434e50e772e5ecfce46617081e06897b
|
|
| MD5 |
5c76017860e2400db41c0f97cf91c965
|
|
| BLAKE2b-256 |
b4c97f0b7536121786378d30ea64b1b32e184764f0e6c5c834e06ff38e6d5934
|
File details
Details for the file splicespectrax-2025.0.5-py3-none-any.whl.
File metadata
- Download URL: splicespectrax-2025.0.5-py3-none-any.whl
- Upload date:
- Size: 34.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c44403bbab25604be98f53b7080102fde7236953dacea106ddea74f373480395
|
|
| MD5 |
c0b170abb437814df015508d450c2643
|
|
| BLAKE2b-256 |
70cbc315d53c34f64d94b4081b72bccb1bed86e264d51580afd1f728be8d88ad
|