Exonic Part centered allele-specific splicing analysis

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

isoLASER

About

IsoLASER performs gene-level variant calls, phasing, and splicing linkage analysis using third-generation RNA sequencing data.

Requirements
Installation
Preprocessing
Run IsoLASER
Run IsoLASER joint
Make a nigiri plot
Demo
Output
Debug

Requirements

IsoLASER is written in Python 3.8 and requires the following packages (with the tested versions):

multiprocess
numpy==1.24.4
scipy==1.10.1
pandas==2.0.3
pysam==0.16.0.1
pytabix==0.1
pyfaidx==0.5.8
biopython==1.76
vcfpy==1.0.3
HTSeq==0.12.4
networkx==3.1
scikit-learn==1.3.2

IsoLASER has been tested in the following operating systems:

CentOS Linux 7

External software requirements:

Installation

First, it is recommended to install IsoLASER in a virtual environment.

conda create -n isolaser_env python=3.8

You can clone this GitHub repository:

git clone git@github.com:gxiaolab/isoLASER.git 
cd isoLASER
python -m build
pip install .

Or you can install directly from PyPI:

pip install isolaser

Alternatively, you can also download the Singularity container:

singularity pull library://giovas/collection/s6
singularity exec s5_latest.sif isolaser

If successful, the program is ready to use. The installation incorporates console script entry points to directly call isoLASER:

isolaser --help

Installation time varies depending on the number of dependencies that need to be installed. Assuming all library dependencies are installed already, the installation of IsoLASER should only take a few seconds.

Preprocessing

Long-read RNA sequencing is notorious for its high base-calling error rate. As such, it is important to clean and preprocess the data to discard false transcripts resulting from misalignment, bad consensus, truncation, and other technical artifacts.

Reference fasta file (e.g., hg38.fa) must be indexed (.fai and .dict).

Transcript identification

IsoLASER needs a GTF file as input to annotate individual reads with their isoform membership. Ideally, this GTF file is built using a long-read annotation software such as Talon, Clair, Bambu, Espresso, or similar.

Details of the Talon pipeline can be found on their GitHub repository.TALON.

The pipeline consists of correcting alignments around splice junctions using TranscriptClean, and labeling the reads for internal priming using talon_label_reads.
The processed bam files are then ready to be annotated by first creating a database with talon_initialize_database and then annotating the individual reads with talon.
Finally, the transcripts are filtered with talon_filter_transcripts and a GTF file is constructed with the retained transcripts with the command talon_create_GTF.

Annotate bam file

From the annotation used in the previous step, use the GTF file to generate a transcriptome reference for alignment.
This step serves to assign transcript ids to every read of the target bam file.

# generate transcriptome reference from GTF
isolaser_convert_gtf_to_fasta -g {annot.gtf} -f {reference.fa} -o {transcriptome}

# convert bam file to fastq for re-alignment
samtools fastq {input.bam} > {input.fq}

# re-align against newly generated transcriptome reference
minimap2 -t 16 -ax splice:hq -uf --MD {transcriptome.fa} {input.fq} > {transcriptome.sam}

The output is a sam file where the contigs are transcript ids. Next, filter for secondary, supplementary and trans-gene reads whilst annotating with transcript ids.

isolaser_annotate -b {input.bam} -t {transcriptome.sam} -g {annot.gtf} -o {input.annot}

The output is a bam file: input.annot.bam after some basic filtering (secondary and supplemental reads), trans-gene filtering and the ZG and ZT tags with the name of the corresponding gene and transcript id for each read.

Extract exonic parts from GTF

IsoLASER uses an exon-centric approach to analyze splicing and exonic-parts are a great granular approach to understand local splicing changes.

isolaser_extract_exon_parts -g {annot.gtf} -o {transcript.db}

The output is a new directory transcript.db that contains a pickle file per gene encapsulating all the exonic parts and transcripts associated with them.

Run IsoLASER

IsoLASER requires the annotated bam file (with ZG and ZT tags), the transcriptome database with exonic parts, and a reference annotation (e.g. hg38.fa).

isolaser -b {input.annot.bam} -o {output.prefix} -t {transcript.db} -f {reference.fa}

# output:
{output.prefix}.vcf
{output.prefix}.mi_summary.tsv

The output is very extensive and includes information that is only relevant for the joint analysis or plotting. To obtain the significant allele-specific events (cis-directed splicing events) use the filter function:

isolaser_filter -m {output.prefix.mi_summary.tsv} -o {output.prefix.mi_summary.filtered.tsv}

Run IsoLASER joint

The first step is a wrapper of GATK functions to merge the variant calls from different samples.

The input file fofn contains information of the individual samples

# fofn.tsv
SM1 SM1.mi_summary.tsv SM1.vcf
SM2 SM2.mi_summary.tsv SM2.vcf

Run the combine vcf step:

isolaser_combine_vcf -i {fofn.tsv} -o {output.prefix} -f data/hg38.fa

# output:
{output.prefix}.combined.vcf
{output.prefix}.genotyped.vcf

Perform joint analysis

isolaser_joint -i {fofn.tsv} -o {output.prefix} -t {transcript.db} -v {output.prefix}.genotyped.vcf

# output:
{output.prefix}.merged.mi_summary.tsv

Make a nigiri plot

Parse the .mi_summary.tsv file to obtain the list of events to plot

nigiri_parse --mi {output.prefix.mi_summary.tsv} -o {output.plot} -t {transcriptome.db} 

# output:
{output.plot}.cis_events.bed
{output.plot}.cis_genes.tsv

Split the bam file

nigiri_split -b {input.annot.bam} -v {var.string} -o {fofn}

We use an adapted version of ggsashimi

nigiri_plot -b {fofn} -c {region} -o {output.plot}

Demo

For a complete demo please check the test_pipeline repository.

The complete pipeline is available as a snakemake workflow in the Snakefile of the test_pipeline repository.

snakemake -s  isolaser_test.smk

Debug

If you experience any issues please submit your question to the Issues tab on this website.

Project details

These details have not been verified by PyPI

Project links

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

0.0.3

Apr 6, 2026

0.0.2

Oct 30, 2025

0.0.1

Oct 30, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

isolaser-0.0.3.tar.gz (1.1 MB view details)

Uploaded Apr 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

isolaser-0.0.3-py3-none-any.whl (1.1 MB view details)

Uploaded Apr 6, 2026 Python 3

File details

Details for the file isolaser-0.0.3.tar.gz.

File metadata

Download URL: isolaser-0.0.3.tar.gz
Upload date: Apr 6, 2026
Size: 1.1 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.13

File hashes

Hashes for isolaser-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`d81258d9194d952b7a72ded62afd732018f57e3c2f9800b6ccfb56c93774e62d`
MD5	`1afa5385674a65b06c77df353d876c00`
BLAKE2b-256	`6f6d7dcbc164fa6518c4abb6ca51fc332ed0d4cb4ce2e43d5c5ef5d5495ad872`

See more details on using hashes here.

File details

Details for the file isolaser-0.0.3-py3-none-any.whl.

File metadata

Download URL: isolaser-0.0.3-py3-none-any.whl
Upload date: Apr 6, 2026
Size: 1.1 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.13

File hashes

Hashes for isolaser-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`9c80c51d833f29b07b2f183589a4178db090c6c126cfee229cb33d2be56e73d7`
MD5	`8967713e571391346847593c05af4581`
BLAKE2b-256	`4e450df6f7b6a7e682d3621bab607ee6895bdc6f8fe79405d54e56e39b269ee3`

See more details on using hashes here.

isolaser 0.0.3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

isoLASER

About

Table of contents

Requirements

Installation

Preprocessing

Transcript identification

Annotate bam file

Extract exonic parts from GTF

Run IsoLASER

Run IsoLASER joint

Make a nigiri plot

Demo

Debug

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes