Skip to main content

splicekit: comprehensive toolkit for splicing analysis from short-read RNA-seq

Project description

splicekit: an integrative toolkit for splicing analysis from short-read RNA-seq

splicekit is a modular platform for splicing analysis from short-read RNA-seq datasets. The platform also integrates an JBrowse2 instance, pybio for genomic operations and scanRBP for RNA-protein binding studies. The whole analysis is self-contained (one single folder) and the platform is written in Python, in a modular way.

Check a short video presentation about splicekit (poster) at ECCB 2023 on Youtube:

Quick start

The easiest way to install splicekit is to simply run:

$ pip install splicekit

Note that on some systems, pip is installing the executable scripts under ~/.local/bin. However this folder is not in the PATH which will result in command not found if you try to run $ splicekit on the command line. To fix this, please execute:

export PATH="$PATH:~/.local/bin"

Another suggestion is to install inside a virtual environment (using virtualenv).

Installing splicekit directly from the GitHub repository
pip install git+https://github.com/bedapub/splicekit.git@main
If you already have aligned reads in BAM files

All you need is samples.tab (note that this is a TAB delimited file) and splicekit.config in one folder (check datasets for examples).

You can easily download and prepare the reference genome (e.g. $ pybio genome homo_sapiens).

Finally run $ splicekit process (inside the folder with samples.tab and splicekit.config).

Easiest is to check datasets examples to see how the above files look like and also to check scripts if you need to map reads from FASTQ files with pybio.

Documentation

Changelog

v0.6: released in April 2024

  • updated reports
  • JUNE analysis (junction-events to classify skipped and mutually exclusive exons)
Past changenotes (click to view)

v0.4.9: released in November 2023

  • added rMATS analysis for splicing events
  • added Docker container that can be directly imported to singularity via ghcr.io
  • fixed dependencies
  • other small fixes

v0.4: released in May 2023

  • added singularity container with all dependencies
  • added local integrated JBrowse2
  • cluster or desktop runs
  • scanRBP and bootstrap analysis of RNA-protein binding
  • further development and integration with pybio
  • extended documentation of concepts, analysis and results

v0.3: released in January 2023 (click to show details)

  • re-coded junction analysis
    • independent junctions parsing from provided bam files
    • master table of all junctions in the samples of the analyzed project, including novel junctions (refseq/ensembl non-annotated)
  • clustering by logFC of pairwise-comparisons with dendrogram: junction, exon and gene levels (clusterlogfc module)
  • added first_exon annotation for junctions touching annotated first exons of transcripts
  • extended documentation of concepts, analysis and results

v0.2: released in October 2022

  • software architecture restructure with python modules
  • filtering of lowly expressed features by edgeR
  • DonJuan analysis (junction-anchor analysis)
  • more advanced motif analysis with DREME
  • filtering regulated junctions with regulated donors

v0.1: released in July 2022

  • initial version of splicekit
  • parsing of junction and exon counts
  • computing edgeR analysis from count tables and producing a results file with direct links to JBrowse2
  • basic motif analysis

Citing and Contact

If you find splicekit useful in your work and research, please cite:

Gregor Rot, Arne Wehling, Roland Schmucki, Nikolaos Berntenis, Jitao David Zhang, Martin Ebeling
splicekit: a comprehensive toolkit for splicing analysis from short-read RNA-seq
bioRxiv, 2023.05. 25.542256

In case of questions, issues and other ideas, please use the GitHub Issues or write directly to Gregor Rot.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

splicekit-0.6.tar.gz (60.0 kB view details)

Uploaded Source

Built Distribution

splicekit-0.6-py3-none-any.whl (71.2 kB view details)

Uploaded Python 3

File details

Details for the file splicekit-0.6.tar.gz.

File metadata

  • Download URL: splicekit-0.6.tar.gz
  • Upload date:
  • Size: 60.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.8

File hashes

Hashes for splicekit-0.6.tar.gz
Algorithm Hash digest
SHA256 66aae79b707255bd148c733b6cefa0408619f88619658e824c85cd4a34d83b67
MD5 a0c0844c90c9ca9beab881f684b41999
BLAKE2b-256 7c99343adbab1466dcca1d03ad2151c5204f08117f29c004fceda02f710efad9

See more details on using hashes here.

File details

Details for the file splicekit-0.6-py3-none-any.whl.

File metadata

  • Download URL: splicekit-0.6-py3-none-any.whl
  • Upload date:
  • Size: 71.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.8

File hashes

Hashes for splicekit-0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 a7070b628841b673e024f2d0a5ba8df4429eeaeef47ec449dc29f0f47617d418
MD5 282410c8e5f609911e2991e50c867aa2
BLAKE2b-256 c70b66a0aa6c40d6ef504c4d1296ca2e5bc04828e866c683667619e3938c0993

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page