taxoniq-accession-db: NCBI nucleotide accession index for taxoniq
Project description
Taxoniq: Taxon Information Query - fast, offline querying of NCBI Taxonomy and related data
Taxoniq is a Python and command-line interface to the NCBI Taxonomy database and selected data sources that cross-reference it.
Taxoniq's features include:
- Pre-computed indexes updated monthly from NCBI, WoL and cross-referenced databases
- Completely offline operation: all indexes are bundled with the package; no network calls are made when querying
- A CLI capable of JSON I/O, batch processing and streaming of inputs for ease of use and pipelining in shell scripts
- A stable, well-documented, type-hinted Python API (Python 3.6 and higher is supported)
- Comprehensive testing and continuous integration
- An intuitive interface with useful defaults
- Compactness, readability, and extensibility
The Taxoniq package bundles an indexed, compressed copy of the NCBI taxonomy database files, the related accession2taxid database, the WoL database, and relevant information from other databases. Accessions which appear in NCBI BLAST nt/nr databases (and by inclusion, RefSeq) are indexed so that given a taxon ID, accession ID, or taxon name, you can quickly retrieve the taxon's rank, lineage, description, citations, representative RefSeq IDs, LCA information, evolutionary distance, and more, as described in the Cookbook section below.
Installation
pip3 install taxoniq
Synopsis
t = taxoniq.Taxon(9606)
assert t.scientific_name == "Homo sapiens"
assert t.common_name == "human"
assert t.ranked_lineage == [taxoniq.Taxon(scientific_name='Homo sapiens'),
taxoniq.Taxon(scientific_name='Homo'),
taxoniq.Taxon(scientific_name='Hominidae'),
taxoniq.Taxon(scientific_name='Primates'),
taxoniq.Taxon(scientific_name='Mammalia'),
taxoniq.Taxon(scientific_name='Chordata'),
taxoniq.Taxon(scientific_name='Metazoa'),
taxoniq.Taxon(scientific_name='Eukaryota')]
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file taxoniq-accession-db-0.0.1.tar.gz
.
File metadata
- Download URL: taxoniq-accession-db-0.0.1.tar.gz
- Upload date:
- Size: 54.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.22.0 setuptools/53.0.0 requests-toolbelt/0.9.1 tqdm/4.56.2 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 95508df8e95fa9ec08b29c9675fbfa6cc847b062ed23cdb6087b64e31f736239 |
|
MD5 | fab43a4514ca339fa5a8bfb3fd9d121e |
|
BLAKE2b-256 | c377ec164bfbf4e35bc5ecff17dc9ca4f0bc4fd0b7c8f8a84a7c5565351919cc |
File details
Details for the file taxoniq_accession_db-0.0.1-py3-none-any.whl
.
File metadata
- Download URL: taxoniq_accession_db-0.0.1-py3-none-any.whl
- Upload date:
- Size: 54.1 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.22.0 setuptools/53.0.0 requests-toolbelt/0.9.1 tqdm/4.56.2 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c386d5d48ac077531ebde83a82201f7c501202ce7f8c7142316676afbd40ab6c |
|
MD5 | 10fafdfb357f63669466a09ff3cf29d2 |
|
BLAKE2b-256 | c55e2b0f6c48bcd543a0259a9981eadb6568c84c0f6fa282c13b23df1bf6ce91 |