SLicer detects splice leader sequences from long-read data using motif detection and soft-clip analysis.
Project description
This is the slicer pipeline from the Sequana project
- Overview:
Detects splice leader sequences from long-read (PacBio) data using motif-based read filtering, soft-clip analysis, sequence clustering, and multiple sequence alignment.
- Input:
Long-read FASTQ files (PacBio) and a reference genome FASTA file.
- Output:
Clustered and aligned soft-clipped sequences (FASTA), per-sample HTML reports, and a merged CSV statistics file.
- Status:
beta
- Citation:
Cokelaer et al, (2017), ‘Sequana’: a Set of Snakemake NGS pipelines, Journal of Open Source Software, 2(16), 352, JOSS DOI doi:10.21105/joss.00352
Installation
sequana_slicer is based on Python3, just install the package as follows:
pip install sequana_slicer --upgrade
You will need third-party tools. Please see below for details.
Usage
sequana_slicer --help sequana_slicer --input-directory DATAPATH --reference-file reference.fasta
This creates a directory with the pipeline and configuration file. You will then need to execute the pipeline:
cd slicer sh slicer.sh # for a local run
This launch a snakemake pipeline. If you are familiar with snakemake, you can retrieve the pipeline itself and its configuration files and then execute the pipeline yourself with specific parameters:
snakemake -s slicer.rules -c config.yaml --cores 4 --stats stats.txt
Or use sequanix interface.
Usage with apptainer / singularity
With Apptainer, initiate the working directory as follows:
sequana_slicer --input-directory DATAPATH --reference-file reference.fasta \
--apptainer-prefix ~/.sequana/apptainers
Images are downloaded in the working directory but you can store them in a shared directory to avoid re-downloading them (recommended for repeated use):
sequana_slicer --input-directory DATAPATH --reference-file reference.fasta \
--apptainer-prefix ~/.sequana/apptainers
and then:
cd slicer sh slicer.sh
if you decide to use snakemake manually, do not forget to add apptainer options:
snakemake -s slicer.rules -c config.yaml --cores 4 --stats stats.txt \
--apptainer-prefix ~/.sequana/apptainers \
--apptainer-args "-B /home:/home"
By default, the home is already set for you. Additional binding path can be set using environment variables e.g.:
export APPTAINER_BINDPATH=" -B /pasteur"
Requirements
This pipelines requires the following executable(s):
seqkit
minimap2
samtools
cd-hit-est
mafft
graphviz
These tools are available as Apptainer/Singularity containers (see config.yaml for container URLs).
Details
This pipeline runs slicer in parallel on the input fastq files (paired or not). A brief sequana summary report is also produced.
Rules and configuration details
Here is the latest documented configuration file to be used with the pipeline. Each rule used in the pipeline may have a section in the configuration file.
Changelog
Version |
Description |
|---|---|
0.0.1 |
First release. |
Contribute & Code of Conduct
To contribute to this project, please take a look at the Contributing Guidelines first. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sequana_slicer-0.1.0.tar.gz.
File metadata
- Download URL: sequana_slicer-0.1.0.tar.gz
- Upload date:
- Size: 62.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.0.1 CPython/3.10.14 Linux/6.14.5-100.fc40.x86_64
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f332bfb614af2252b1ae810dce2a2dd4691d2c8ed93e7ce179f6e4593f584749
|
|
| MD5 |
ba8df4f620c6242bdaf1a6caa103e474
|
|
| BLAKE2b-256 |
3157f49756f72011c81ada116f1ca6e4b69b3b7f82500eb08635d7b6400dc9d5
|
File details
Details for the file sequana_slicer-0.1.0-py3-none-any.whl.
File metadata
- Download URL: sequana_slicer-0.1.0-py3-none-any.whl
- Upload date:
- Size: 62.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.0.1 CPython/3.10.14 Linux/6.14.5-100.fc40.x86_64
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
78ce912d4b7f0bbf9e88e31d1b037c65e9c8c90d12bb24a2745de8359166c598
|
|
| MD5 |
dce9b28033ea8bc17909f1cbd23d4c27
|
|
| BLAKE2b-256 |
873fd02b50a169b54efd5a185075773c645b5e0f4a9f5888ce38748b8a4e86bc
|