Skip to main content

A Snakemake-based pipeline for amplicon processing

Project description

edentity-metabarcoding-pipeline

alt text

Table of Contents

Brief on Vsearch

Vsearch is a metabarcoding pipeline for illumina/AVITI paired-end data. More details can be found at vsearch github

Vsearch publication: https://doi.org/10.7717/peerj.2584

Technical implementation of this pipeline is inspired by APSCALE; please cite them if you use this pipeline.

Usage of this workflow

This workflow can run on:

- Conda

- Docker 

- Galaxy

Using Conda

Install Requirements

Install conda or miniconda

Ensure (mini)conda is installed on your system. Information on installing miniconda can be found here

Steps to run edentity-metabarcoding-pipeline

1 Clone this repo
git clone https://gitlab.com/naturalis/bii/bioinformatics/edentity/pipelines/edentity-metabarcoding-pipeline.git && cd edentity-metabarcoding-pipeline/

2 Install snakemake conda environment from yaml file
conda env create -n snakemake -f workflow/envs/snakemake.yaml
3 Activate snakemake conda environment
conda activate snakemake
4 Run the workflow: parameters used here are only for example; replace them with params specific to your project.
snakemake -p --profile workflow/profile/ \
    --config forward_primer=AAACTCGTGCCAGCCACC \
    reverse_primer=GGGTATCTAATCCCAGTTTG \
    raw_data_dir=/path/to/your/raw_data/ \
    work_dir=/path/to/your/work_directory \
    min_length=200 max_length=600 

Explain parameters and where more info can be found. Link to validation schema.

Using Docker

Install Requirements

1. Apptainer:

Install apptainer

2. Run the workflow:
snakemake -p --profile workflow/profile/ \
    --config forward_primer=AAACTCGTGCCAGCCACC \
    reverse_primer=GGGTATCTAATCCCAGTTTG \
    raw_data_dir=/path/to/your/raw_data/ \
    work_dir=/path/to/your/work_directory \
    min_length=200 max_length=600 --use-apptainer

Deploying to Galaxy

Prerequisites

Ensure you have access to a Galaxy instance where you have administrative privileges or the ability to install tools and workflows.

Steps to Deploy

1. Clone the Galaxy branch of this repository

Clone galaxy branch of this pipeline into your Galaxy tools directory (for Naturalis clone into: /data/galaxy/local_tools/)


git clone -b galaxy git@gitlab.com:naturalis/bii/bioinformatics/edentity/pipelines/edentity-metabarcoding-pipeline.git

2. Configure Galaxy tools xml

Edit your Galaxy tool configuration xml file to include the edentity-galaxy-pipeline.xml file located at the root directory of this repository.

For example to add this pipeline to your galaxy instance: Open galaxy/config/tool_config.xml and add the lines below.

<section id="metabarcoding-pipeline" name="Metabarcoding Pipelines">
    <tool file="edentity-metabarcoding-pipeline/edentity-galaxy-pipeline.xml"/>
</section>

NB:

  • Ensure you paste the above lines within <toolbox> </toolbox> section in the galaxy/config/tool_config.xml
  • Ensure paths are correctly referenced depending on where you cloned the pipeline
  • Some useful tips on adding custom tools on galaxy can be found here
3. Restart Galaxy

Restart your Galaxy instance to load the new tool configuration.

4. Running the pipeline on Galaxy:

Example on how to run this pipeline on Galaxy is available here

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

edentity-1.4.3.tar.gz (75.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

edentity-1.4.3-py3-none-any.whl (33.3 kB view details)

Uploaded Python 3

File details

Details for the file edentity-1.4.3.tar.gz.

File metadata

  • Download URL: edentity-1.4.3.tar.gz
  • Upload date:
  • Size: 75.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for edentity-1.4.3.tar.gz
Algorithm Hash digest
SHA256 a42f33ad64a5bbba866d80962dc8dd3a340dbfb3cc09c1a2dc04a2e74d57bfc0
MD5 bc03c06f50a630f77cdc0abe5742b0af
BLAKE2b-256 5cd835c3c9d2410910457a73951f39ea303cd73977b4c1818acadf0ad1129f2a

See more details on using hashes here.

File details

Details for the file edentity-1.4.3-py3-none-any.whl.

File metadata

  • Download URL: edentity-1.4.3-py3-none-any.whl
  • Upload date:
  • Size: 33.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.17

File hashes

Hashes for edentity-1.4.3-py3-none-any.whl
Algorithm Hash digest
SHA256 ef62022ed2e2e2452189ff8755a17c41895792f8d6ebd54f6f058836c7351e5a
MD5 0b6c91ca49fb6435c7f4679248e8e2eb
BLAKE2b-256 acf1aab96deabb50c5cb156ec4e9eed2c49928539d66e93d0723db8edbad48a6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page