Easily gather data from across NCBI databases

These details have not been verified by PyPI

Project links

Homepage

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

NCBI Datasets

NCBI Datasets is a new resource that lets you easily gather data from across NCBI databases.

Find and download sequence, annotation, and metadata for genes and genomes using our command-line tools or web interface.

NCBI Datasets tools are under active development. Submit feedback by creating a GitHub issue or you may contact NCBI directly with your questions, comments or feature requests.

Install the Datasets command-line tools

Download and install the NCBI Datasets command-line tools, datasets and dataformat:

conda install -c conda-forge ncbi-datasets-cli

For other ways to install, see our command-line tool quickstart.

Use the Datasets command-line tools

Use datasets to download biological sequence data across all domains of life from NCBI.

Use dataformat to convert metadata included as part of the data package from JSON Lines format to other formats.

Examples:

Use datasets to download a genome data package for the human reference genome, GRCh38:

datasets download genome taxon human --reference --filename human-reference.zip

Use dataformat to extract selected fields of metadata from the downloaded data package for the human reference genome, GRCh38:

dataformat tsv genome --package human-reference.zip --fields organism-name,assminfo-name,assminfo-accession,assminfo-submitter 
Organism name	Assembly Name	Assembly Accession	Assembly Submitter
Homo sapiens	GRCh38.p13	GCF_000001405.39	Genome Reference Consortium

The schematic below outlines the available commands for the datasets command-line tool: Datasets CLI schematic

Download large numbers of genomes

Download large numbers of genomes by first downloading a dehydrated zip archive and then getting the data in three steps.

Download the dehydrated zip archive
Unzip the downloaded zip archive
Rehydrate to get the data

Try this example for the human reference genome:

Download the dehydrated zip archive
datasets download genome accession GCF_000001405.39 --dehydrated --filename human_GRCh38_dataset.zip
Unzip the downloaded zip archive
unzip human_GRCh38_dataset.zip -d my_human_dataset
Rehydrate to get the data
datasets rehydrate --directory my_human_dataset/

For more information, see how to download large genome data packages.

Datasets data packages

NCBI Datasets provides sequence, annotation, metadata and other biological data as an NCBI Datasets Data Package zip archive.

We currently offer three types of data package: a gene data package, a genome data package, and a specialized SARS-CoV-2 data package.

Datasets data reports

NCBI Datasets data packages include data report files that contain metadata about the requested records. Data report schemas describe each type of data report, including the available fields, with descriptions and examples.

Jupyter notebooks

Jupyter notebooks provide a way to explore our command-line tool, python package and API.

Python library

Programmers may be interested in trying the NCBI Datasets python library to interact with the NCBI Datasets REST API. Note that we are offering limited support for the python package at this time.

NOTE: We recommend avoiding the use of this package in an environment also containing the following packages:

google-cloud-bigquery
pandas-gbq

We have observed some dependency issues with these packages such that some environments may fail to interact properly with python BigQuery API endpoints.

Project details

These details have not been verified by PyPI

Project links

Homepage

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

16.6.1

Apr 29, 2024

16.6.0

Feb 26, 2024

16.5.0

Feb 21, 2024

16.4.9

Feb 20, 2024

16.4.8

Feb 16, 2024

16.4.7

Feb 13, 2024

16.4.5

Feb 5, 2024

16.4.4

Jan 30, 2024

16.4.3

Jan 25, 2024

16.3.0

Jan 12, 2024

16.2.0

Jan 9, 2024

16.1.2

Jan 8, 2024

16.1.1

Jan 5, 2024

16.0.0

Dec 20, 2023

15.34.0

Dec 18, 2023

15.33.0

Dec 14, 2023

15.32.0

Dec 12, 2023

15.31.3

Dec 11, 2023

15.31.2

Dec 4, 2023

15.31.1

Dec 1, 2023

15.31.0

Nov 30, 2023

15.30.0

Nov 27, 2023

15.29.0

Nov 16, 2023

15.28.0

Nov 14, 2023

15.27.1

Nov 3, 2023

15.27.0

Nov 2, 2023

15.26.0

Nov 1, 2023

15.25.0

Oct 27, 2023

15.24.0

Oct 12, 2023

15.23.0

Oct 5, 2023

15.22.0

Oct 3, 2023

15.21.1

Sep 28, 2023

15.21.0

Sep 26, 2023

15.20.2

Sep 26, 2023

15.20.0

Sep 21, 2023

15.19.1

Sep 18, 2023

15.19.0

Sep 15, 2023

15.18.0

Sep 11, 2023

15.17.0

Sep 8, 2023

15.16.3

Sep 7, 2023

15.16.2

Sep 6, 2023

15.16.1

Sep 1, 2023

15.16.0

Sep 1, 2023

15.15.0

Aug 31, 2023

15.14.0

Aug 30, 2023

15.13.1

Aug 21, 2023

15.13.0

Aug 17, 2023

15.12.0

Aug 2, 2023

15.11.0

Jul 24, 2023

15.10.0

Jul 17, 2023

15.9.1

Jul 12, 2023

15.9.0

Jul 7, 2023

15.8.0

Jul 6, 2023

15.7.0

Jun 30, 2023

15.6.1

Jun 21, 2023

15.6.0

Jun 14, 2023

15.5.0

Jun 12, 2023

15.4.0

Jun 9, 2023

15.3.1

Jun 7, 2023

15.3.0

Jun 6, 2023

15.2.0

May 31, 2023

15.1.0

May 26, 2023

15.0.0

May 24, 2023

14.29.1

May 19, 2023

14.29.0

May 17, 2023

14.28.0

May 12, 2023

14.27.0

May 8, 2023

14.26.0

May 2, 2023

14.25.2

May 1, 2023

14.25.1

Apr 25, 2023

14.24.0

Apr 21, 2023

14.23.0

Apr 20, 2023

14.22.1

Apr 13, 2023

14.22.0

Apr 10, 2023

14.21.0

Apr 7, 2023

14.20.0

Apr 3, 2023

14.19.0

Mar 28, 2023

14.18.0

Mar 21, 2023

14.17.0

Mar 16, 2023

14.16.0

Mar 9, 2023

14.15.0

Mar 7, 2023

14.14.0

Mar 2, 2023

14.13.3

Mar 1, 2023

14.13.2

Feb 23, 2023

14.13.1

Feb 21, 2023

14.13.0

Feb 10, 2023

14.12.0

Feb 6, 2023

14.11.2

Feb 3, 2023

14.11.1

Jan 31, 2023

14.11.0

Jan 30, 2023

14.10.0

Jan 30, 2023

14.9.0

Jan 25, 2023

14.8.1

Jan 24, 2023

14.7.0

Jan 10, 2023

14.6.5

Jan 3, 2023

14.6.4

Dec 27, 2022

14.6.3

Dec 22, 2022

14.6.2

Dec 19, 2022

14.6.1

Dec 13, 2022

14.6.0

Dec 6, 2022

14.5.1

Dec 5, 2022

14.5.0

Dec 2, 2022

14.4.0

Nov 21, 2022

14.3.0

Nov 16, 2022

14.2.2

Nov 1, 2022

14.2.1

Oct 27, 2022

14.2.0

Oct 26, 2022

14.1.2

Oct 25, 2022

14.1.1

Oct 24, 2022

14.1.0

Oct 21, 2022

14.0.1

Oct 13, 2022

14.0.0

Oct 12, 2022

13.43.2

Oct 7, 2022

13.43.1

Oct 6, 2022

13.43.0

Oct 5, 2022

13.42.2

Oct 4, 2022

13.42.1

Oct 3, 2022

13.42.0

Oct 1, 2022

13.41.0

Sep 27, 2022

13.40.0

Sep 16, 2022

13.39.2

Sep 12, 2022

13.39.1

Sep 9, 2022

13.39.0

Sep 8, 2022

13.38.0

Sep 6, 2022

13.37.2

Sep 2, 2022

13.37.1

Sep 1, 2022

13.36.0

Aug 25, 2022

13.35.0

Aug 11, 2022

13.34.0

Aug 10, 2022

13.33.0

Aug 9, 2022

13.32.0

Aug 8, 2022

13.31.0

Aug 2, 2022

13.30.2

Jul 22, 2022

13.30.1

Jul 21, 2022

13.30.0

Jul 19, 2022

13.29.1

Jul 13, 2022

13.28.2

Jul 12, 2022

13.28.1

Jul 7, 2022

13.28.0

Jul 5, 2022

13.27.0

Jun 28, 2022

13.26.0

Jun 24, 2022

13.25.1

Jun 21, 2022

13.25.0

Jun 16, 2022

13.24.4

Jun 16, 2022

13.24.3

Jun 14, 2022

13.24.2

Jun 11, 2022

13.24.1

Jun 10, 2022

13.24.0

Jun 8, 2022

13.23.0

Jun 3, 2022

13.22.0

May 26, 2022

13.21.0

May 23, 2022

13.20.1

May 20, 2022

13.20.0

May 18, 2022

13.19.0

May 18, 2022

13.18.2

May 11, 2022

13.18.1

May 10, 2022

13.18.0

May 10, 2022

13.17.0

May 6, 2022

13.16.0

May 5, 2022

13.15.0

May 2, 2022

13.14.0

Apr 28, 2022

13.13.0

Apr 27, 2022

13.12.0

Apr 25, 2022

13.11.0

Apr 20, 2022

13.10.1

Apr 11, 2022

13.10.0

Apr 5, 2022

13.8.0

Apr 1, 2022

13.7.0

Mar 29, 2022

13.6.0

Mar 23, 2022

13.5.1

Mar 9, 2022

13.4.0

Mar 4, 2022

13.3.0

Feb 23, 2022

13.2.0

Feb 17, 2022

13.1.0

Feb 15, 2022

13.0.0

Feb 11, 2022

12.32.0

Jan 31, 2022

12.31.0

Jan 28, 2022

12.30.0

Jan 25, 2022

12.27.1

Jan 14, 2022

12.26.1

Jan 11, 2022

12.26.0

Jan 10, 2022

This version

12.25.0

Dec 28, 2021

12.24.0

Dec 22, 2021

12.23.0

Dec 14, 2021

12.22.3

Dec 14, 2021

12.22.2

Dec 13, 2021

12.22.1

Dec 10, 2021

12.22.0

Dec 7, 2021

12.21.0

Dec 4, 2021

12.20.1

Nov 20, 2021

12.20.0

Nov 16, 2021

12.19.0

Nov 10, 2021

12.18.0

Nov 5, 2021

12.17.2

Oct 26, 2021

12.17.0

Oct 22, 2021

12.16.2

Oct 20, 2021

12.16.1

Oct 18, 2021

12.16.0

Oct 14, 2021

12.15.0

Oct 7, 2021

12.14.2

Sep 28, 2021

12.14.1

Sep 24, 2021

12.14.0

Sep 23, 2021

12.13.2

Sep 20, 2021

12.13.1

Sep 17, 2021

12.13.0

Sep 16, 2021

12.11.0

Sep 7, 2021

12.10.2

Aug 31, 2021

12.10.1

Aug 31, 2021

12.10.0

Aug 30, 2021

12.9.1

Aug 25, 2021

12.9.0

Aug 24, 2021

12.6.0

Jul 22, 2021

12.5.0

Jul 21, 2021

12.4.0

Jul 15, 2021

12.3.0

Jul 8, 2021

12.2.0

Jul 2, 2021

12.1.1

Jun 28, 2021

12.1.0

Jun 25, 2021

12.0.2

Jun 23, 2021

12.0.1

Jun 22, 2021

12.0.0

Jun 21, 2021

11.32.1

Jun 16, 2021

11.32.0

Jun 15, 2021

11.31.1

Jun 15, 2021

11.31.0

Jun 12, 2021

11.30.0

Jun 11, 2021

11.29.0

Jun 9, 2021

11.28.0

Jun 8, 2021

11.27.1

Jun 4, 2021

11.27.0

Jun 3, 2021

11.25.1

Jun 1, 2021

11.25.0

May 28, 2021

11.24.0

May 27, 2021

11.23.0

May 25, 2021

11.22.0

May 14, 2021

11.21.2

May 12, 2021

11.21.1

May 12, 2021

11.21.0

May 11, 2021

11.20.1

May 11, 2021

11.20.0

May 10, 2021

11.19.0

May 8, 2021

11.17.1

May 5, 2021

11.17.0

May 5, 2021

11.16.4

May 4, 2021

11.16.3

May 3, 2021

11.16.2

Apr 30, 2021

11.16.1

Apr 29, 2021

11.16.0

Apr 27, 2021

11.15.3

Apr 26, 2021

11.15.2

Apr 23, 2021

11.15.1

Apr 23, 2021

11.15.0

Apr 23, 2021

11.14.1

Apr 20, 2021

11.14.0

Apr 20, 2021

11.13.6

Apr 16, 2021

11.13.5

Apr 16, 2021

11.13.4

Apr 16, 2021

11.13.3

Apr 15, 2021

11.13.2

Apr 14, 2021

11.13.1

Apr 14, 2021

11.12.0

Apr 9, 2021

11.11.5

Apr 7, 2021

11.11.4

Apr 6, 2021

11.11.3

Apr 6, 2021

11.11.2

Apr 5, 2021

11.11.1

Apr 2, 2021

11.11.0

Apr 2, 2021

11.10.0

Apr 1, 2021

11.9.0

Apr 1, 2021

11.8.1

Mar 25, 2021

11.8.0

Mar 22, 2021

11.7.0

Mar 19, 2021

11.6.0

Mar 18, 2021

11.5.1

Mar 17, 2021

11.4.2

Mar 15, 2021

11.4.1

Mar 12, 2021

11.4.0

Mar 11, 2021

11.3.0

Mar 11, 2021

11.2.0

Mar 10, 2021

11.1.2

Mar 9, 2021

11.1.1

Mar 6, 2021

11.1.0

Mar 2, 2021

11.0.0

Feb 26, 2021

10.28.1

Feb 17, 2021

10.28.0

Feb 16, 2021

10.27.2

Feb 10, 2021

10.27.1

Feb 3, 2021

10.27.0

Feb 1, 2021

10.26.0

Jan 28, 2021

10.25.0

Jan 22, 2021

10.24.0

Jan 15, 2021

10.23.0

Jan 11, 2021

10.22.2

Jan 8, 2021

10.22.1

Jan 8, 2021

10.22.0

Jan 7, 2021

10.21.0

Jan 5, 2021

10.20.0

Dec 29, 2020

10.19.0

Dec 22, 2020

10.18.1

Dec 18, 2020

10.18.0

Dec 18, 2020

10.17.1

Dec 14, 2020

10.17.0

Dec 11, 2020

10.16.0

Dec 11, 2020

10.15.1

Dec 9, 2020

10.15.0

Dec 8, 2020

10.14.0

Dec 5, 2020

10.13.0

Dec 4, 2020

10.12.3

Dec 2, 2020

10.12.2

Dec 2, 2020

10.12.1

Dec 1, 2020

10.12.0

Dec 1, 2020

10.11.0

Nov 24, 2020

10.10.0

Nov 23, 2020

10.9.0

Nov 13, 2020

10.8.0

Nov 12, 2020

10.7.1

Nov 9, 2020

10.5.1

Nov 2, 2020

10.5.0

Oct 29, 2020

10.4.0

Oct 28, 2020

10.3.0

Oct 28, 2020

10.2.0

Oct 27, 2020

10.1.3

Oct 23, 2020

10.1.2

Oct 22, 2020

10.1.0

Oct 22, 2020

10.0.0

Oct 14, 2020

9.1.0

Oct 9, 2020

9.0.0

Oct 1, 2020

8.2.1

Sep 29, 2020

8.2.0

Sep 29, 2020

8.1.0

Sep 28, 2020

8.0.0

Sep 16, 2020

7.6.0

Sep 9, 2020

7.5.1

Sep 4, 2020

7.5.0

Sep 4, 2020

7.4.1

Sep 2, 2020

7.4.0

Sep 2, 2020

7.3.0

Sep 1, 2020

7.2.1

Sep 1, 2020

7.2.0

Aug 31, 2020

7.1.0

Aug 28, 2020

7.0.0

Aug 27, 2020

6.4.0

Aug 18, 2020

6.3.0

Aug 11, 2020

6.2.1

Aug 10, 2020

6.2.0

Aug 10, 2020

6.1.0

Aug 6, 2020

5.0.0

Aug 4, 2020

4.2.0

Jul 28, 2020

4.1.2

Jul 27, 2020

4.1.1

Jul 24, 2020

4.0.1

Jul 17, 2020

3.65.4

Jul 14, 2020

3.64.1

Jul 1, 2020

3.59.2

Jul 1, 2020

3.59.1

Jun 22, 2020

3.57.2

Jun 17, 2020

3.57.1

Jun 15, 2020

3.55.3

Jun 10, 2020

3.54.5

Jun 4, 2020

3.54.2

Jun 3, 2020

3.54.1

Jun 2, 2020

3.53.1

Jun 1, 2020

3.52.3

May 22, 2020

3.52.2

May 22, 2020

3.52.1

May 22, 2020

3.49.2

May 19, 2020

3.49.1

May 19, 2020

3.46.1

May 15, 2020

3.44.2

May 14, 2020

3.44.1

May 13, 2020

3.32.2

Apr 29, 2020

3.32.1

Apr 27, 2020

3.30.0

Apr 22, 2020

3.26.1

Apr 14, 2020

3.26.0

Apr 14, 2020

3.25.1

Apr 9, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ncbi-datasets-pylib-12.25.0.tar.gz (277.5 kB view hashes)

Uploaded Dec 28, 2021 Source

Hashes for ncbi-datasets-pylib-12.25.0.tar.gz

Hashes for ncbi-datasets-pylib-12.25.0.tar.gz
Algorithm	Hash digest
SHA256	`d1c3b49b22f445759f2b3517373a80ec9a0dc89e75b470dab2cb103cbc08d648`
MD5	`4e6d355d1a4071152fc508a1cea9cb79`
BLAKE2b-256	`7b19a5a8472ceb2e2fbae8fd517b3ccab20867b8a981b0cb535c63b099a20762`