Advanced Pipeline for Simple yet Comprehensive AnaLysEs of DNA metabarcoding data

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

apscale

Advanced Piepline for Simple yet Comprehensive AnaLysEs of DNA metabarcoding data

Introduction

Apscale is a metabarcoding pipeline that handles the most common tasks in in metabarcoding pipelines like paired-end merging, primer trimming, quality filtering, otu clustering and denoising. It uses a simple command line interface and is configured via a single configuration file. It automatically uses the available ressources on the machine it runs on while still providing the option to use less if desired.

Installation

Apscale can be installed on all common operating systems (Windows, Linux, MacOS). Apscale requires Python 3.7 or higher and can be easily installed via pip in any command line:

pip install apscale

To update apscale run:

pip install --upgrade apscale

Further dependencies

Apscale calls vsearch for multiple modules. It should be installed and be in PATH to be executed from anywhere on the system.

Check the vsearch Github page for further info:

https://github.com/torognes/vsearch

Support for compressed files with zlib is necessary. For Unix based systems this is shipped with vsearch, for Windows the zlib.dll can be downloaded via:

zlib for Windows

The dll has to be in the same folder as the vsearch executable. If you need help with adding a folder to PATH in windows please take a look at the first answer on this stackoverflow issue:

How to add a folder to PATH Windows

To check if everything is correctly set up please type this into your command line:

vsearch --version

It should return a message similar to this:

vsearch v2.19.0_win_x86_64, 31.9GB RAM, 24 cores
https://github.com/torognes/vsearch

Rognes T, Flouri T, Nichols B, Quince C, Mahe F (2016)
VSEARCH: a versatile open source tool for metagenomics
PeerJ 4:e2584 doi: 10.7717/peerj.2584 https://doi.org/10.7717/peerj.2584

Compiled with support for gzip-compressed files, and the library is loaded.
zlib version 1.2.5, compile flags 65
Compiled with support for bzip2-compressed files, but the library was not found.

How to use

Create a new apscale project

Apscale is oranized in projects with the following structure:

C:\USERS\DOMINIK\DESKTOP\EXAMPLE_PROJECT
â”œâ”€â”€â”€1_raw data
â”‚   â””â”€â”€â”€data
â”œâ”€â”€â”€2_demultiplexing
â”‚   â””â”€â”€â”€data
â”œâ”€â”€â”€3_PE_merging
â”‚   â””â”€â”€â”€data
â”œâ”€â”€â”€4_primer_trimming
â”‚   â””â”€â”€â”€data
â”œâ”€â”€â”€5_quality_filtering
â”‚   â””â”€â”€â”€data
â”œâ”€â”€â”€6_dereplication_pooling
â”‚   â””â”€â”€â”€data
â”‚       â”œâ”€â”€â”€dereplication
â”‚       â””â”€â”€â”€pooling
â”œâ”€â”€â”€7_otu_clustering
â”‚   â””â”€â”€â”€data
â””â”€â”€â”€8_denoising
    â””â”€â”€â”€data

A new project can be initialized with the command:

apscale --create_project NAME

If you prefer to have your data all in one place you can paste the raw data into 1_raw_data/data. Demultiplexing won't be handled by Apscale because there are to many different tagging systems out there at the moment. If you are using inline barcodes you can take a look at https://github.com/DominikBuchner/demultiplexer. If you are already starting with demultiplexed data please paste them into 2_demultiplexing/data.

Configuring the settings

Associated with every project Apscale with generate an Excel sheet in the project folder called "Settings.xlsx". It is divided into a seperate sheet for every module and a 0_general_settings tab.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

1.7.1

Apr 2, 2024

1.7.0

Apr 2, 2024

1.6.3

Feb 21, 2023

1.6.2

Jan 10, 2023

1.6.1

Jan 6, 2023

1.6.0

Jan 6, 2023

1.5.6

Dec 7, 2022

1.5.5

Apr 5, 2022

1.5.4

Feb 28, 2022

1.5.3

Feb 25, 2022

1.5.2

Feb 24, 2022

1.5.1

Feb 22, 2022

1.5.0

Feb 22, 2022

1.4.2

Feb 11, 2022

1.4.1

Feb 11, 2022

1.4.0

Feb 11, 2022

1.3.8

Feb 11, 2022

1.3.7

Feb 10, 2022

1.3.6

Feb 9, 2022

1.3.5

Feb 9, 2022

1.3.4

Feb 9, 2022

1.3.3

Feb 9, 2022

1.3.2

Feb 5, 2022

1.3.1

Feb 4, 2022

1.3.0

Feb 3, 2022

1.2.2

Feb 3, 2022

1.2.1

Feb 1, 2022

1.2.0

Jan 31, 2022

1.1.1

Jan 28, 2022

1.1.0

Jan 27, 2022

1.0.8

Jan 25, 2022

1.0.7

Jan 19, 2022

1.0.6

Jan 12, 2022

This version

1.0.5

Jan 12, 2022

1.0.4

Jan 10, 2022

1.0.3

Jan 10, 2022

1.0.2

Jan 10, 2022

1.0.1

Jan 10, 2022

1.0.0

Jan 10, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

apscale-1.0.5.tar.gz (13.6 kB view hashes)

Uploaded Jan 12, 2022 Source

Built Distribution

apscale-1.0.5-py3-none-any.whl (20.9 kB view hashes)

Uploaded Jan 12, 2022 Python 3

Hashes for apscale-1.0.5.tar.gz

Hashes for apscale-1.0.5.tar.gz
Algorithm	Hash digest
SHA256	`fa622eb3563bb3064ae296fbf31daa0d4d26938057bbb3b62758d4e0ca7346e4`
MD5	`62b19aec85dad9e621c6e8d9acbe7341`
BLAKE2b-256	`32bff945128dbac36c933f45dd991eeb1749ef6367f178324f85973078e32d22`

Hashes for apscale-1.0.5-py3-none-any.whl

Hashes for apscale-1.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`82fe062690b24f0a635b954dabcb7c4e1674325da3d93ab85f8979a35741c9b9`
MD5	`b7d3ffd280f822cd9b33a52a7fdf242c`
BLAKE2b-256	`4a44cf0b647d989d80b589745a41b9eca8e57e3e42b9d29e152acb760eaebc9a`