YOLO Microbiome Analysis System

Project description

YaMAS (YOLO Microbiome Analysis System)

YaMAS is a package designed to easily download DNA datasets from the NCBI SRA and ENA website. It is developed by the YOLO lab team, and is designed to be simple, efficient, and easy to use for non-programmers.

Dependencies

Before proceeding with the installation and execution of YaMAS, please ensure that you have a clean environment set up on your system, with all dependencies installed. To create one, follow the steps below:

Create a new qiime2 environment using conda. Make sure you name it 'qiime2'.
Download the SRA-toolkit and Entrez packages to the environment.
Download the metaphlan package. Make sure the database works properly before proceeding.
Exporting a 16S project requires a downloaded classifier file.
Get YaMAS ready.

You are now ready to run and install YaMAS in the newly created and activated qiime2 environment.

Installation

To install YaMAS, you can use pip:

pip install YMS

Getting Started- NCBI SRA

YaMAS provides an easy-to-use interface in the terminal.
To download a project from NCBI SRA, use the one of the following templates:

Get YaMAS ready

yamas --ready <operating_system_type>

Arguments:

operating_system_type: Ubuntu/CentOS

Pay attention to the output of the command.
If the environment is ready, you will need to run one more command.
If not, follow the output guidelines.

16S/18S dataset

yamas --download PRJEB01234 --type 16S/18S

To export an OTU (Operational Taxonomic Unit), taxonomy, and phylogeny tree for a single project, use the following command:

yamas --export <project_path> <data_type> <start> <end> <classifier_file> <threads>

Arguments:

project_path: path to the project directory (created by YaMAS in the previous step).
data_type: choose one of the following types: 16S / 18S
classifier_file: path to the trained classifier file.
start & end: choose graph edges.
threads: specifies the number of threads to use for parallel processing, which can speed up the export process (default is 12).

Shotgun dataset

yamas --download PRJEB01234 --type Shotgun

Continue data downloading

Continue downloading project after downloading SRA before converting to .fastq.
Use the following command:

yamas --continue_from_fastq <dataset_id> <project_path> <data_type>

Arguments:

project_path: path to the project directory (created by YaMAS, if you started downloading data in the past).
data_type: choose one of the following types: 16S / 18S / Shotgun

Continue downloading project after downloading SRA and converting them to .fastq.
Use the following command:

yamas --continue_from <dataset_id> <project_path> <data_type>

Arguments:

project_path: path to the project directory (created by YaMAS, if you started downloading data in the past).
data_type: choose one of the following types: 16S / 18S / Shotgun

Getting Started- ENA

YaMAS provides an easy-to-use interface in the terminal.
To download a project from NCBI SRA, use the one of the following templates:

Get YaMAS ready

yamas --ready <operating_system_type>

Arguments:

operating_system_type: Ubuntu/CentOS

Pay attention to the output of the command.
If the environment is ready, you will need to run one more command.
If not, follow the output guidelines.

16S/18S dataset

yamas --qiita <preprocessed_fastq_path> <metadata_path> <data_type>

Arguments:
All can be found in https://qiita.ucsd.edu/

Where preprocessed fastq can be found?
Click the study description --> in the graph click on 'demultiplexed' --> scroll down and download 'preprocessed fastq' --> rename the file to be: "forward.fastq.gz"
Where metadata can be found? Click the study description --> download 'Prep info' --> rename the file to be: "metadata.tsv"
The new data will be created in the folder of the fastq and metadata, so it is recommended to be organized.

To export an OTU (Operational Taxonomic Unit), taxonomy, and phylogeny tree for a single project, use the following command:

yamas --export <project_path> <data_type> <start> <end> <classifier_file> <threads>

Arguments:

project_path: path to the project directory (created by YaMAS in the previous step).
data_type: choose one of the following types: 16S / 18S
classifier_file: path to the trained classifier file.
start & end: choose graph edges.
threads: specifies the number of threads to use for parallel processing, which can speed up the export process (default is 12).

Arguments and configurations

config: You can add a configuration file in order to save the data in a different folder, and change other configurations.
verbose: To get more information about a downloading process, use the verbose option (this is highly recommended).
Listing more than one project will download them one by one into different folders.

Project details

Release history Release notifications | RSS feed

1.2.11

May 6, 2024

1.2.10

May 6, 2024

1.2.9

May 6, 2024

1.2.8

May 6, 2024

1.2.7

Feb 19, 2024

1.2.6

Jan 11, 2024

1.2.5

Jan 11, 2024

This version

1.2.4

Dec 23, 2023

1.2.3

Nov 21, 2023

1.2.2

Nov 20, 2023

1.2.1

Nov 14, 2023

1.2.0

Nov 7, 2023

1.1.50

Nov 7, 2023

1.1.7

Nov 7, 2023

1.1.6

Nov 7, 2023

1.1.5

Nov 7, 2023

1.1.4

Nov 7, 2023

1.1.3

Oct 31, 2023

1.1.2

Oct 31, 2023

1.1.1

Oct 31, 2023

1.1.0

Oct 31, 2023

1.0.8

Oct 31, 2023

1.0.6

Oct 31, 2023

1.0.5

Oct 30, 2023

1.0.4

Sep 7, 2023

1.0.0

Jun 6, 2023

0.99

May 25, 2023

0.71

May 1, 2023

0.63

Apr 2, 2023

0.62

Apr 2, 2023

0.61

Apr 2, 2023

0.4

Mar 30, 2023

0.3

Mar 28, 2023

0.2

Mar 28, 2023

0.1

Mar 28, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

YMS-1.2.4.tar.gz (13.8 kB view hashes)

Uploaded Dec 23, 2023 Source

Built Distribution

YMS-1.2.4-py3-none-any.whl (16.5 kB view hashes)

Uploaded Dec 23, 2023 Python 3

Hashes for YMS-1.2.4.tar.gz

Hashes for YMS-1.2.4.tar.gz
Algorithm	Hash digest
SHA256	`c55707d64590c6e675e82bbb0724aafb16458c88d8a27f3765bc435c5f53c3c6`
MD5	`5680e5c94fa5b3ca42009bb0eb9ce422`
BLAKE2b-256	`855f6a28abe7e3a992bdc93872a29656c7fbf510a88e9a1aca8e629ca2ec6593`

Hashes for YMS-1.2.4-py3-none-any.whl

Hashes for YMS-1.2.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`32b013ef2e2fe2321706a581cbe9e9e52e5972bdf32f7fa788d85f817bd4a3ac`
MD5	`84b8642b3c80e3edf89625b6bc1cbf88`
BLAKE2b-256	`25286c526a4323751790c46a4608e0da16a1cfd03a9efb7ccabff58741d9d8d7`