Metagenomics toolkit
Project description
Metagenomics toolkit enables scientists to download all of the sample metadata for a given study or sequence to a single csv file.
Install metagenomics toolkit
pip install -U mg-toolkit
Usage
$ mg-toolkit -h
usage: mg-toolkit [-h] [-V] [-d]
{original_metadata,sequence_search,bulk_download} ...
Metagenomics toolkit
--------------------
positional arguments:
{original_metadata,sequence_search,bulk_download}
original_metadata Download original metadata.
sequence_search Search non-redundant protein database using HMMER
bulk_download Download result files in bulks for an entire study.
optional arguments:
-h, --help show this help message and exit
-V, --version print version information
-d, --debug print debugging information
Examples
Download metadata:
$ mg-toolkit original_metadata -a ERP001736
Search non-redundant protein database using HMMER and fetch metadata:
$ mg-toolkit sequence_search -seq test.fasta -db full evalue --incE 0.02
Databases:
- full - Full length sequences (default)
- all - All sequences
- partial - Partial sequences
How to bulk download result files for an entire study?
$ mg-toolkit bulk_download -h
usage: mg-toolkit bulk_download [-h] -a ACCESSION [-o OUTPUT_PATH]
[-v {1.0,2.0,3.0,4.0,4.1}]
[-g {sequence_data,functional_analysis,taxonomic_analysis,taxonomic_analysis_ssu,taxonomic_analysis_lsu,stats,non_coding_rna}]
optional arguments:
-h, --help show this help message and exit
-a ACCESSION, --accession ACCESSION
Provide the study/project accession of your interest,
e.g. ERP001736, SRP000319. The study must be publicly
available in MGnify.
-o OUTPUT_PATH, --output_path OUTPUT_PATH
Location of the output directory, where the
downloadable files are written to. DEFAULT: CWD
-v {1.0,2.0,3.0,4.0,4.1}, --version {1.0,2.0,3.0,4.0,4.1}
Specify the version of the pipeline you are interested
in. Lets say your study of interest has been analysed
with multiple version, but you are only interested in
a particular version then used this option to filter
down the results by the version you interested in.
DEFAULT: Downloads all versions
-g {sequence_data,functional_annotations,taxonomic_annotations,taxonomic_annot_ssu,taxonomic_annot_lsu,stats,non_coding_rna}, --result_group {sequence_data,functional_annotations,taxonomic_annotations,taxonomic_annot_ssu,taxonomic_annot_lsu,stats,non_coding_rna}
Provide a single result group if needed. Supported
result groups are: [sequence_data (all version),
functional_annotations (all version),
taxonomic_annotations (1.0-3.0), taxonomic_annot_ssu
(>=4.0), taxonomic_annot_lsu (>=4.0), stats,
non_coding_rna (>=4.0) DEFAULT: Downloads all result
groups if not provided. (default: None).
How to download all files for a given study accession?
$ mg-toolkit -d bulk_download -a ERP009703
How to download results of a specific version for given study accession?
$ mg-toolkit -d bulk_download -a ERP009703 -v 4.0
How to download specific result file groups (e.g. functional annotations only) for given study accession?
$ mg-toolkit -d bulk_download -a ERP009703 -g functional_annotations
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
mg-toolkit-0.4.1.tar.gz
(13.1 kB
view details)
Built Distribution
File details
Details for the file mg-toolkit-0.4.1.tar.gz
.
File metadata
- Download URL: mg-toolkit-0.4.1.tar.gz
- Upload date:
- Size: 13.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e9eb99a928a2f4067b9bbae4888d0c101f42c31c2ed73c661de21e795d34c540 |
|
MD5 | 314fdd888403941f537e5dd6889bf926 |
|
BLAKE2b-256 | cac1a55d3798917fee0fc6da8870f9be5a461d2f911576d6910300393b1eb6e3 |
File details
Details for the file mg_toolkit-0.4.1-py3-none-any.whl
.
File metadata
- Download URL: mg_toolkit-0.4.1-py3-none-any.whl
- Upload date:
- Size: 12.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7e476b5adb9863926fd0104be38dcb4e0734fc4a55c310e90367ac697a51e6cb |
|
MD5 | 95e9fe446b22d3e2b38371a64944e618 |
|
BLAKE2b-256 | 6638d368d27b1480a62953ee78fe8920cc415387cdb449b98720c71a43a23059 |