Skip to main content

This module allows to download and analyse metadata as well as data from tabular PANGAEA (https://www.pangaea.de) datasets. Usage: import pangaeapy.pandataset as pd ds= pd.PanDataSet(787140) print(ds.title) print(ds.data.head()) Please visit the github project page to see more documentation and some examples: https://github.com/pangaea-data-publisher/pangaeapy

Project description

DOI

pangaeapy - a Python module to access and analyse PANGAEA data

Background

pangaea

PANGAEA (https://www.pangaea.de) is one of the world's largest archives of this kind offering essential data services such as data curation, long-term data archiving and data publication. PANGAEA hosts about 400,000 datasets comprising around 17.5 billion individual measurements (Aug. 2020) and observations which have been collected during more than 240 international research projects. The system is open to any project, institution or individual scientist using, archiving or publishing research data.

Since the programming languages Python and R have become increasingly important for scientific data analysis in recent years, we have developed 'pangaeapy' a new, custom Python module that considerably simplifies typical data science tasks.

Given a DOI, pangaeapy uses PANGAEA’s web services to automatically load PANGAEA metadata into a dedicated python object and tabular data into a Python Data Analysis Library (pandas) DataFrame with a mere call of a specialized function. This makes it possible to integrate PANGAEA data with data from a large number of sources and formats (Excel, NetCDF, etc.) and to carry out data analyses within a suitable computational environment such as Jupyter notebooks in a uniform manner.

Installation

  • Source code from github
    • pip install git+https://github.com/pangaea-data-publisher/pangaeapy.git
  • Wheels for Python from PyPI
    • pip install pangaeapy

Usage

import pangaeapy.pandataset as pd

ds = pd.PanDataSet(787140)
print(ds.title)
print(ds.data.head())

Examples

Please take a look at the example Jupyter Notebooks which you can find in the 'examples' folder

Documentation

https://github.com/pangaea-data-publisher/pangaeapy/blob/master/docs/pandataset.md

Running the tests

The tests ar located in the test directory. You can run them by executing pytest or tox in the root directory.

Cite as

Robert Huber, Egor Gordeev, Markus Stocker, Aarthi Balamurugan, & Uwe Schindler (2020). pangaeapy - a Python module to access and analyse PANGAEA data. Zenodo. http://doi.org/10.5281/zenodo.4013940.

DOI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pangaeapy-1.0.17.tar.gz (48.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pangaeapy-1.0.17-py3-none-any.whl (48.2 kB view details)

Uploaded Python 3

File details

Details for the file pangaeapy-1.0.17.tar.gz.

File metadata

  • Download URL: pangaeapy-1.0.17.tar.gz
  • Upload date:
  • Size: 48.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for pangaeapy-1.0.17.tar.gz
Algorithm Hash digest
SHA256 70486662f1445ad8d46355094d580e37796259952508e7a91dd23c1c67f7061d
MD5 05c3f6f4c54be464986a63744966e917
BLAKE2b-256 ade62aa26bafb130da72700af2270a8a7fa4843b5ba51f5f241370d20dc95c7d

See more details on using hashes here.

File details

Details for the file pangaeapy-1.0.17-py3-none-any.whl.

File metadata

  • Download URL: pangaeapy-1.0.17-py3-none-any.whl
  • Upload date:
  • Size: 48.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for pangaeapy-1.0.17-py3-none-any.whl
Algorithm Hash digest
SHA256 acae7c5c0b39270bb4907b36a84c9b48d9f946e46e6d0ef537e431f8e6b68a98
MD5 fa2cfb44074af917d543b64db3ca85c6
BLAKE2b-256 c3674e301c5a8bc8ce2ac05fa1d77351fa18691c861a512f6a4f83de65aedafe

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page