splicekit: comprehensive toolkit for splicing analysis from short-read RNA-seq
Project description
splicekit: an integrative toolkit for splicing analysis from short-read RNA-seq
splicekit is a modular platform for splicing analysis from short-read RNA-seq datasets. The platform also integrates an JBrowse2 instance, pybio for genomic operations and scanRBP for RNA-protein binding studies. The whole analysis is self-contained (one single folder) and the platform is written in Python, in a modular way.
Quick start
The easiest way to install splicekit is to simply run:
$ pip install splicekit
Note that on some systems, pip is installing the executable scripts under ~/.local/bin
. However this folder is not in the PATH which will result in command not found
if you try to run $ splicekit
on the command line. To fix this, please execute:
export PATH="$PATH:~/.local/bin"
Another suggestion is to install inside a virtual environment (using virtualenv
).
Installing splicekit directly from the GitHub repository
# move to HOME folder (for example)
cd ~
# clone the repository
git clone git@github.com:bedapub/splicekit.git
# adjust PYTHONPATH and PATH
export PYTHONPATH=$PYTHONPATH:~/splicekit
export PATH=$PATH:~/splicekit/splicekit
If you already have aligned reads in BAM files
All you need is samples.tab
(note that this is a TAB delimited file) and splicekit.config
in one folder (check datasets for examples).
You can easily download and prepare the reference genome (e.g. $ pybio genome homo_sapiens
).
Finally run $ splicekit process
(inside the folder with samples.tab
and splicekit.config
).
Easiest is to check datasets examples to see how the above files look like and also to check scripts if you need to map reads from FASTQ files with pybio
.
Documentation
- PDF reference manual
- Google docs of the above PDF (comment if you like)
Changelog
v0.4.9: released in November 2023
- added rMATS analysis for splicing events
- added Docker container that can be directly imported to singularity via ghcr.io
- fixed dependencies
- other small fixes
v0.4: released in May 2023 (click to show details)
- added singularity container with all dependencies
- added local integrated JBrowse2
- cluster or desktop runs
- scanRBP and bootstrap analysis of RNA-protein binding
- further development and integration with pybio
- extended documentation of concepts, analysis and results
v0.3: released in January 2023 (click to show details)
- re-coded junction analysis
- independent junctions parsing from provided bam files
- master table of all junctions in the samples of the analyzed project, including novel junctions (refseq/ensembl non-annotated)
- clustering by logFC of pairwise-comparisons with dendrogram: junction, exon and gene levels (clusterlogfc module)
- added first_exon annotation for junctions touching annotated first exons of transcripts
- extended documentation of concepts, analysis and results
v0.2: released in October 2022 (click to show details)
- software architecture restructure with python modules
- filtering of lowly expressed features by edgeR
- DonJuan analysis (junction-anchor analysis)
- more advanced motif analysis with DREME
- filtering regulated junctions with regulated donors
v0.1: released in July 2022 (click to show details)
- initial version of splicekit
- parsing of junction and exon counts
- computing edgeR analysis from count tables and producing a results file with direct links to JBrowse2
- basic motif analysis
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file splicekit-0.5.1.tar.gz
.
File metadata
- Download URL: splicekit-0.5.1.tar.gz
- Upload date:
- Size: 57.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 054cd4736918578358a412445dde115f5a3380add728561057ab9f65644b666a |
|
MD5 | 994f5506e9f80bdad8d4873b6897ed97 |
|
BLAKE2b-256 | afe830e83c7972d39fb04a00a61a4fffacc8583adc7a2c90def8fc76112aa15d |
File details
Details for the file splicekit-0.5.1-py3-none-any.whl
.
File metadata
- Download URL: splicekit-0.5.1-py3-none-any.whl
- Upload date:
- Size: 68.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7a6867cfcb35eea3126abf4c81201a39adeb4c39ecc5191c2c39cde584beac21 |
|
MD5 | a775c5d1bdd440558e8a602929e8bd8d |
|
BLAKE2b-256 | 75414848f5d592d7072c0685885cbd1fa0f8bccd33a8495315981c1b0f28bcaf |