Skip to main content

SLicer detects splice leader sequences from long-read data using motif detection and soft-clip analysis.

Project description

https://raw.githubusercontent.com/sequana/slicer/main/doc/slicer_logo.png https://badge.fury.io/py/sequana-slicer.svg JOSS (journal of open source software) DOI https://github.com/sequana/slicer/actions/workflows/main.yml/badge.svg Python 3.11 | 3.12

This is the slicer pipeline from the Sequana project

Overview:

Detects splice leader sequences from long-read (PacBio) data using motif-based read filtering, soft-clip analysis, sequence clustering, and multiple sequence alignment.

Input:

Long-read FASTQ files (PacBio) and a reference genome FASTA file.

Output:

Clustered and aligned soft-clipped sequences (FASTA), per-sample HTML reports, and a merged CSV statistics file.

Status:

beta

Citation:

Cokelaer et al, (2017), ‘Sequana’: a Set of Snakemake NGS pipelines, Journal of Open Source Software, 2(16), 352, JOSS DOI doi:10.21105/joss.00352

Installation

sequana_slicer is based on Python3, just install the package as follows:

pip install sequana_slicer --upgrade

You will need third-party tools. Please see below for details.

Usage

sequana_slicer --help
sequana_slicer --input-directory DATAPATH --reference-file reference.fasta

This creates a directory with the pipeline and configuration file. You will then need to execute the pipeline:

cd slicer
sh slicer.sh  # for a local run

This launch a snakemake pipeline. If you are familiar with snakemake, you can retrieve the pipeline itself and its configuration files and then execute the pipeline yourself with specific parameters:

snakemake -s slicer.rules -c config.yaml --cores 4 --stats stats.txt

Or use sequanix interface.

Usage with apptainer / singularity

With Apptainer, initiate the working directory as follows:

sequana_slicer --input-directory DATAPATH --reference-file reference.fasta \
    --apptainer-prefix ~/.sequana/apptainers

Images are downloaded in the working directory but you can store them in a shared directory to avoid re-downloading them (recommended for repeated use):

sequana_slicer --input-directory DATAPATH --reference-file reference.fasta \
    --apptainer-prefix ~/.sequana/apptainers

and then:

cd slicer
sh slicer.sh

if you decide to use snakemake manually, do not forget to add apptainer options:

snakemake -s slicer.rules -c config.yaml --cores 4 --stats stats.txt \
    --apptainer-prefix ~/.sequana/apptainers \
    --apptainer-args "-B /home:/home"

By default, the home is already set for you. Additional binding path can be set using environment variables e.g.:

export APPTAINER_BINDPATH=" -B /pasteur"

Requirements

This pipelines requires the following executable(s):

  • seqkit

  • minimap2

  • samtools

  • cd-hit-est

  • mafft

  • graphviz

These tools are available as Apptainer/Singularity containers (see config.yaml for container URLs).

https://raw.githubusercontent.com/sequana/slicer/main/sequana_pipelines/slicer/dag.png

Details

This pipeline runs slicer in parallel on the input fastq files (paired or not). A brief sequana summary report is also produced.

Rules and configuration details

Here is the latest documented configuration file to be used with the pipeline. Each rule used in the pipeline may have a section in the configuration file.

Changelog

Version

Description

0.0.1

First release.

Contribute & Code of Conduct

To contribute to this project, please take a look at the Contributing Guidelines first. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sequana_slicer-0.1.0.tar.gz (62.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

sequana_slicer-0.1.0-py3-none-any.whl (62.7 kB view details)

Uploaded Python 3

File details

Details for the file sequana_slicer-0.1.0.tar.gz.

File metadata

  • Download URL: sequana_slicer-0.1.0.tar.gz
  • Upload date:
  • Size: 62.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.10.14 Linux/6.14.5-100.fc40.x86_64

File hashes

Hashes for sequana_slicer-0.1.0.tar.gz
Algorithm Hash digest
SHA256 f332bfb614af2252b1ae810dce2a2dd4691d2c8ed93e7ce179f6e4593f584749
MD5 ba8df4f620c6242bdaf1a6caa103e474
BLAKE2b-256 3157f49756f72011c81ada116f1ca6e4b69b3b7f82500eb08635d7b6400dc9d5

See more details on using hashes here.

File details

Details for the file sequana_slicer-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: sequana_slicer-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 62.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.0.1 CPython/3.10.14 Linux/6.14.5-100.fc40.x86_64

File hashes

Hashes for sequana_slicer-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 78ce912d4b7f0bbf9e88e31d1b037c65e9c8c90d12bb24a2745de8359166c598
MD5 dce9b28033ea8bc17909f1cbd23d4c27
BLAKE2b-256 873fd02b50a169b54efd5a185075773c645b5e0f4a9f5888ce38748b8a4e86bc

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page