This module allows to download and analyse metadata as well as data from tabular PANGAEA (https://www.pangaea.de) datasets. Usage: import pangaeapy.pandataset as pd ds= pd.PanDataSet(787140) print(ds.title) print(ds.data.head()) Please visit the github project page to see more documentation and some examples: https://github.com/pangaea-data-publisher/pangaeapy
Project description
pangaeapy - a Python module to access and analyse PANGAEA data
Background
PANGAEA (https://www.pangaea.de) is one of the world's largest archives of this kind offering essential data services such as data curation, long-term data archiving and data publication. PANGAEA hosts about 400,000 datasets comprising around 17.5 billion individual measurements (Aug. 2020) and observations which have been collected during more than 240 international research projects. The system is open to any project, institution or individual scientist using, archiving or publishing research data.
Since the programming languages Python and R have become increasingly important for scientific data analysis in recent years, we have developed 'pangaeapy' a new, custom Python module that considerably simplifies typical data science tasks.
Given a DOI, pangaeapy uses PANGAEA’s web services to automatically load PANGAEA metadata into a dedicated python object and tabular data into a Python Data Analysis Library (pandas) DataFrame with a mere call of a specialized function. This makes it possible to integrate PANGAEA data with data from a large number of sources and formats (Excel, NetCDF, etc.) and to carry out data analyses within a suitable computational environment such as Jupyter notebooks in a uniform manner.
Installation
- Source code from github
pip install git+https://github.com/pangaea-data-publisher/pangaeapy.git
- Wheels for Python from PyPI
pip install pangaeapy
Usage
import pangaeapy.pandataset as pd
ds = pd.PanDataSet(787140)
print(ds.title)
print(ds.data.head())
Examples
Please take a look at the example Jupyter Notebooks which you can find in the 'examples' folder
Documentation
https://github.com/pangaea-data-publisher/pangaeapy/blob/master/docs/pandataset.md
Running the tests
The tests ar located in the test
directory. You can run them by executing
pytest
or tox
in the root directory.
Cite as
Robert Huber, Egor Gordeev, Markus Stocker, Aarthi Balamurugan, & Uwe Schindler (2020). pangaeapy - a Python module to access and analyse PANGAEA data. Zenodo. http://doi.org/10.5281/zenodo.4013940.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for pangaeapy-1.0.20-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ee83615d937b93e23f48461726a4b75338d47eff05ebb276f64bea40c2478d89 |
|
MD5 | 14629eb4a665fbf539e87c81c82255e4 |
|
BLAKE2b-256 | f94d77795b96140a63197a4c6d95f4c46d36ada22a4db05d85c5706afec0407a |