Skip to main content

A Comprehensive Biological Data Downloader from authoritative public databases like NCBI, Catalogue Of Life (COL), GBIF, BOLD, and more for any taxa.

Project description

Project Logo

biodumpy: A Comprehensive Biological Data Downloader

PyPI - Version PyPI - Status PyPI - License PyPI - Downloads

Overview

biodumpy is a powerful and versatile Python package designed to simplify the process of retrieving biological information from several public databases. With biodumpy, researchers can easily download and manage data from multiple sources, ensuring access to the most up to date and comprehensive biological information available.

Note: This package is currently under development.

Key Features

biodumpy offers dedicated modules for each supported database, with each module featuring functions specifically designed for retrieving information from its respective source. The modules implemented so far are:

  • BOLD
  • COL
  • GBIF
  • iNaturalist
  • IUCN
  • NCBI
  • OBIS
  • ZooBank

This list can be expanded, thus suggestions and feedback are greatly appreciated.

Main functionalities and workflow

Before using biodumpy, users need to install the package in their Python environment with the following command:

pip install biodumpy

Usage

To simplify the use of biodumpy, we create a general structure common among the modules:

  1. Load the package. Import biodumpy into your Python environment.
  2. Load the desired modules. Import one or more specific modules needed to retrieve the data.
  3. Set up the configuration of one or more modules. Configure the biodumpy function/s with the required parameters.
  4. Start the download. Execute the function to begin retrieving the data.

Here, we provide two examples illustrating the general structure of a biodumpy script:

In detail, we described:

  • Single Module Example: This example demonstrates how to use a single biodumpy module (for example, GBIF).
  • Multiple Modules Example: This example shows how to use multiple biodumpy modules (for example, GBIF and IUCN).

Example N.1

    # Import biodumpy package
    from biodumpy import Biodumpy

    # Import GBIF module
    from biodumpy.inputs import GBIF

    # Create a list of taxa
    taxa = [
        'Alytes muletensis (Sanchíz & Adrover, 1979)', 
        'Bufotes viridis (Laurenti, 1768)',
        'Hyla meridionalis Boettger, 1874', 
        'Anax imperator Leach, 1815'
    ]

    # Set the Biodumpy function with the specific parameters
    bdp = Biodumpy([GBIF(bulk=False, accepted_only=True)])

    # Start the download
    bdp.start(taxa, output_path='YOUR_OUTPUT_PATH/downloads/{date}/{module}/{name}')

Example N.2

    # Import biodumpy package
    from biodumpy import Biodumpy

    # Import GBIF and IUCN modules
    from biodumpy.inputs import GBIF, IUCN

    api_key = 'YOUR_IUCN_API_KEY'

    # Create a list of taxa
    taxa = [
        'Alytes muletensis', 
        'Bufotes viridis', 
        'Hyla meridionalis', 
        'Anax imperator'
    ]

    # Set the Biodumpy functions with the specific parameters
    bdp = Biodumpy([GBIF(bulk=False, accepted_only=True),
                    IUCN(api_key=api_key, bulk=True, region=['global'])])

    # Start the download
    bdp.start(taxa, output_path='./downloads/{date}/{module}/{name}')

Documentation and Support

For detailed documentation and tutorials, please visit the biodumpy Read the Docs documentation.

Contribution

biodumpy is an open-source project, and contributions are welcome! If you have ideas for new features, bug fixes, or improvements, please submit an issue or pull request in our GitHub repository or contact with the support team at t.cancellario@uib.eu.

License

biodumpy is licensed under the GNU GENERAL PUBLIC LICENSE. See the LICENSE file for more details.

Acknowledgments

The project was supported by MCIN with funding from the European Union—NextGenerationEU (PRTR-C17.I1) and the Government of the Balearic Islands.


logo_cbb

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

biodumpy-0.1.3.tar.gz (19.9 kB view details)

Uploaded Source

Built Distribution

biodumpy-0.1.3-py3-none-any.whl (24.4 kB view details)

Uploaded Python 3

File details

Details for the file biodumpy-0.1.3.tar.gz.

File metadata

  • Download URL: biodumpy-0.1.3.tar.gz
  • Upload date:
  • Size: 19.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for biodumpy-0.1.3.tar.gz
Algorithm Hash digest
SHA256 11d5eb6301100f341187c1a179ef8176b77fba774501cbb256ceab39a6e3ef9b
MD5 8b26d7e07cd980397a5083cd7089c0c6
BLAKE2b-256 fb9a5dd50af30d3bd29d7590bc3b1ca4c213c93371a2a80834a6344a31559e7b

See more details on using hashes here.

File details

Details for the file biodumpy-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: biodumpy-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 24.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for biodumpy-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 2732a5acae348fbe091465607aa36690cb6a4439c414221a62900deacfcf696a
MD5 639f8fa427a6ddccec30ac898df2ec4a
BLAKE2b-256 f36aab6c21c807d2f823dfa6a7b90f7d5617f9fcb983e7efd49414eeb281a466

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page