A script for updating a local directory with genomes from NCBI assembly files
Project description
A script that downloads genomes for a selected genus, species, or subspecies, based on an NCBI assembly file.
It can also be used to update a local directory from a specified YAML file.
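The selection step described above can be pictured with a small sketch. This is not the script's actual code; it only assumes that the NCBI assembly file is a tab-separated table with the columns `assembly_accession`, `taxid`, `species_taxid`, `organism_name` and `ftp_path`. The function name `select_rows` and every row in `SAMPLE` are invented for illustration.

```python
import csv

# Made-up stand-in for an NCBI assembly summary file (tab-separated).
# Accessions, taxids, organism names and URLs are placeholders, not real records.
SAMPLE = "\n".join([
    "## placeholder comment line",
    "#assembly_accession\ttaxid\tspecies_taxid\torganism_name\tftp_path",
    "FAKE_0001\t9001\t9000\tBrucella exampleii strain A\thttps://example.invalid/FAKE_0001",
    "FAKE_0002\t9002\t9000\tBrucella exampleii strain B\thttps://example.invalid/FAKE_0002",
    "FAKE_0003\t9101\t9100\tOthergenus sampleus\thttps://example.invalid/FAKE_0003",
])


def select_rows(text, genus=None, species_taxid=None, taxid=None):
    """Return the rows matching every filter that was given (hypothetical helper)."""
    # Drop blank lines and '##' comment lines, then strip the '#' from the header.
    lines = [ln for ln in text.splitlines() if ln and not ln.startswith("##")]
    lines[0] = lines[0].lstrip("# ")
    reader = csv.DictReader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)

    selected = []
    for row in reader:
        # The genus is taken to be the first word of the organism name.
        if genus and row["organism_name"].split(" ")[0].lower() != genus.lower():
            continue
        if species_taxid and row["species_taxid"] != str(species_taxid):
            continue
        if taxid and row["taxid"] != str(taxid):
            continue
        selected.append(row)
    return selected


if __name__ == "__main__":
    for row in select_rows(SAMPLE, genus="Brucella"):
        print(row["assembly_accession"], row["ftp_path"])
```

The three filters mirror the -g, -s and -t options listed below; how the script itself combines them is not documented here, so the "all given filters must match" rule is an assumption.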
usage: genome_downloader.py [-h] [-f FOLDER] [-o OUTPUT] [-p N] [-i INPUT] [-domain Bacteria] [-g GENUS] [-s SPECIES_TAXID] [-t TAXID] [-d] [-u]
optional arguments:
  -h, --help            show this help message and exit
  -o OUTPUT, folder, --output FOLDER
                        The folder which files will be downloaded to
  -name NEWNAME, --newname NEWNAME
                        What the yaml file should be named
  -p N, --parallel N    Use N processes to download files (default = 1)
  -i INPUT, --input-file-path INPUT
                        Local yaml file to update
  -domain {archaea,bacteria,fungi,invertebrate,plant,protozoa,vertebrate_mammalian,vertebrate_other,viral},
  --domain {archaea,bacteria,fungi,invertebrate,plant,protozoa,vertebrate_mammalian,vertebrate_other,viral}
                        The domain to download from
  -g GENUS, --genus GENUS
                        Specify the target genus
  -s SPECIES_TAXID, --species-taxid SPECIES_TAXID
                        Used to specify a target species_taxid
  -t TAXID, --taxid TAXID
                        Used to specify a target taxid
  -d, --download        Use to only download files for all unique isolates in
                        a yaml file
  -u, --update-yaml     Use to only update local yaml file
Example: to download all Brucella genomes with 10 processes into the folder Brucella-genomes:
genome_update -g Brucella -p 10 -o Brucella-genomes
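When the download is driven from another Python program, the same command line can be assembled from the options listed above. The sketch below is only an illustration: `build_command` and its parameter names are hypothetical, the file name `brucella.yaml` is made up, and combining -i with -u to refresh an existing yaml file is an assumption about how those two options are meant to be used together.

```python
import shlex


def build_command(genus=None, species_taxid=None, taxid=None, processes=1,
                  output=None, input_yaml=None,
                  download_only=False, update_yaml_only=False):
    """Assemble a genome_update argument list (hypothetical helper)."""
    cmd = ["genome_update"]
    if genus:
        cmd += ["-g", genus]
    if species_taxid is not None:
        cmd += ["-s", str(species_taxid)]
    if taxid is not None:
        cmd += ["-t", str(taxid)]
    if processes != 1:  # 1 is the documented default, so it is left out
        cmd += ["-p", str(processes)]
    if output:
        cmd += ["-o", output]
    if input_yaml:
        cmd += ["-i", input_yaml]
    if download_only:
        cmd.append("-d")
    if update_yaml_only:
        cmd.append("-u")
    return cmd


if __name__ == "__main__":
    # Reproduces the example above.
    cmd = build_command(genus="Brucella", processes=10, output="Brucella-genomes")
    print(shlex.join(cmd))  # genome_update -g Brucella -p 10 -o Brucella-genomes

    # Assumed usage: only refresh a local yaml file (file name is made up).
    print(shlex.join(build_command(input_yaml="brucella.yaml", update_yaml_only=True)))

    # To actually run it (requires genome_update to be installed and on PATH):
    # import subprocess
    # subprocess.run(cmd, check=True)
```

Passing the arguments as a list rather than a single string avoids shell quoting problems when folder names contain spaces.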
Source Distribution
genome_update-0.1.3.4.tar.gz (5.3 kB)
Built Distribution
Hashes for genome_update-0.1.3.4-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 0543d9b2b4af3f8fb0ddf95a2193aabcfd2c05a72f8cd05ece3cb91a9e679481
MD5 | 03e616fa41eb3220b213fa7ff75121fc
BLAKE2b-256 | f85c3c0eadfa2288ab9b239b5bc5d2bb86a05873156e553614278e8ac60ba001