Download genome files from the NCBI FTP server.
Project description
This is a set of scripts that focuses on the actual genome downloading.
Installation
pip install ncbi-genome-download
Alternatively, clone this repository from GitHub, then run (in a Python virtual environment):
pip install .
If this fails on older versions of Python, try updating your pip tool first:
pip install --upgrade pip
and then rerun the ncbi-genome-download install.
Usage
To download all bacterial RefSeq genomes in GenBank format from NCBI, run the following:
ncbi-genome-download bacteria
If you’re on a reasonably fast connection, you might want to try running multiple downloads in parallel:
ncbi-genome-download bacteria --parallel 4
To download all fungal GenBank genomes from NCBI in GenBank format, run:
ncbi-genome-download --section genbank fungi
To download all viral RefSeq genomes in FASTA format, run:
ncbi-genome-download --format fasta viral
To download only completed bacterial RefSeq genomes in GenBank format, run:
ncbi-genome-download --assembly-level complete bacteria
To download only bacterial reference genomes from RefSeq in GenBank format, run:
ncbi-genome-download --refseq-category reference bacteria
To download bacterial RefSeq genomes of the genus Streptomyces, run:
ncbi-genome-download --genus Streptomyces bacteria
Note: This is only a simple string match on the organism name provided by NCBI.
Because of that, you can also pass a longer string to download genomes of a certain species:
ncbi-genome-download --genus "Streptomyces coelicolor" bacteria
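Why the species trick works can be sketched in a few lines of Python. The snippet assumes the match is a prefix test on the organism name; the function name `matches_genus` and the sample names are illustrative only, and the tool's actual matching code may differ.

```python
def matches_genus(organism_name, genus):
    """Return True if the organism name starts with the given string.

    Assumption: the "simple string match" is modelled here as a prefix test.
    """
    return organism_name.startswith(genus)


# Illustrative organism names, not taken from a real NCBI listing.
names = [
    "Streptomyces coelicolor A3(2)",
    "Streptomyces griseus subsp. griseus NBRC 13350",
    "Escherichia coli str. K-12 substr. MG1655",
]

# A bare genus matches every organism whose name begins with it.
genus_hits = [n for n in names if matches_genus(n, "Streptomyces")]

# A longer string narrows the same match down to a single species.
species_hits = [n for n in names if matches_genus(n, "Streptomyces coelicolor")]
```

Since nothing here checks taxonomy, a misspelled or differently capitalized name would simply match nothing in this sketch.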
To download bacterial RefSeq genomes based on their NCBI species taxonomy ID, run:
ncbi-genome-download --species-taxid 562 bacteria
Note: The above command will download all RefSeq genomes belonging to Escherichia coli.
To download a specific bacterial RefSeq genome based on its NCBI taxonomy ID, run:
ncbi-genome-download --taxid 511145 bacteria
Note: The above command will download the RefSeq genome belonging to Escherichia coli str. K-12 substr. MG1655.
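The difference between the two ID options can be illustrated with a small filter over toy records. This is a sketch under assumptions: the field names `taxid` and `species_taxid`, the helper `filter_records`, and the sample rows are chosen for illustration and are not the tool's internal data model.

```python
# Illustrative records; each genome carries its own taxid plus the taxid of its species.
records = [
    {"organism_name": "Escherichia coli str. K-12 substr. MG1655",
     "taxid": "511145", "species_taxid": "562"},
    {"organism_name": "Escherichia coli O157:H7 str. Sakai",
     "taxid": "386585", "species_taxid": "562"},
    {"organism_name": "Streptomyces coelicolor A3(2)",
     "taxid": "100226", "species_taxid": "1902"},
]


def filter_records(rows, species_taxid=None, taxid=None):
    """Keep rows that match every ID filter that was supplied (hypothetical helper)."""
    selected = []
    for row in rows:
        if species_taxid is not None and row["species_taxid"] != str(species_taxid):
            continue
        if taxid is not None and row["taxid"] != str(taxid):
            continue
        selected.append(row)
    return selected


# Species-level ID: every E. coli record in the toy list is kept.
by_species = filter_records(records, species_taxid=562)

# Strain-level ID: only the MG1655 record is kept.
by_taxid = filter_records(records, taxid=511145)
```

In this sketch the species-level filter is the broader one, which mirrors the two notes above: 562 selects all Escherichia coli genomes, while 511145 selects a single strain.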
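To also create a human-readable directory structure in parallel to mirroring the layout used by NCBI, run:
ncbi-genome-download --human-readable bacteria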
To get an overview of all options, run:
ncbi-genome-download --help
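When the command line is driven from a script, the options shown above can be assembled programmatically. The following is a minimal sketch; `build_command` is a hypothetical helper, and it assumes that the options can be combined freely and that each keyword maps to the long option of the same name with underscores turned into hyphens.

```python
import shlex


def build_command(group, **options):
    """Build an argv list for ncbi-genome-download (hypothetical helper).

    Keyword names map to long options, e.g. assembly_level -> --assembly-level.
    A value of True adds a bare flag such as --human-readable.
    """
    argv = ["ncbi-genome-download"]
    for name, value in options.items():
        flag = "--" + name.replace("_", "-")
        if value is True:
            argv.append(flag)
        elif value is not None and value is not False:
            argv.extend([flag, str(value)])
    # The taxonomic group is the positional argument and goes last.
    argv.append(group)
    return argv


cmd = build_command("bacteria", assembly_level="complete", format="fasta", parallel=4)

# shlex.join quotes arguments that contain spaces, such as a species name.
printable = shlex.join(build_command("bacteria", genus="Streptomyces coelicolor"))
print(printable)
```

The resulting list can be handed to `subprocess.run(cmd, check=True)`; passing a list rather than a shell string avoids quoting problems with values that contain spaces.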
As a method
import ncbi_genome_download as ngd
ngd.download()
Note: To specify a taxonomic group, like bacteria, use the group keyword.
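A call with the `group` keyword might look like the sketch below. The wrapper `download_group` and its `downloader` parameter are hypothetical, added only so the call can be dry-run without the package installed and without network access; only `ngd.download` and the `group` keyword come from the text above.

```python
def download_group(group, downloader=None):
    """Call the package's download() for one taxonomic group.

    `downloader` is a hypothetical hook for swapping in a stand-in during
    a dry run; by default the real ngd.download is used.
    """
    if downloader is None:
        # Imported lazily so the dry run below works without the package.
        import ncbi_genome_download as ngd
        downloader = ngd.download
    return downloader(group=group)


# Dry run with a stand-in that only records the keyword arguments it receives.
calls = []
download_group("bacteria", downloader=lambda **kwargs: calls.append(kwargs))
```

Calling `download_group("bacteria")` without the stand-in would start a real download, which for a whole taxonomic group can be very large.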
License