Provides utility functions for accessing the data repository for Project Pythia examples and notebooks


pythia-datasets

Data repository with sample data for the Pythia Foundations book.

Sample data sets

These files are used as sample data in Pythia Foundations and are downloaded by the pythia_datasets package:

NARR_19930313_0000.nc
enso_data.csv
jan-17-co-asos.txt.xz
CESM2_sst_data.nc
CESM2_grid_variables.nc
daymet_v4_precip_sept_2013.nc

Adding new datasets

The scope of this data collection is to serve examples for Pythia Foundations. If you are adding new content to Foundations that requires a new dataset file, please follow these steps:

Add the dataset file to the data/ directory
From the command line, run python make_registry.py to update the registry file at pythia_datasets/registry.txt
Commit and push your changes to GitHub
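
To illustrate what the registry update step involves, here is a minimal sketch of a script that records one line per file in data/. The actual contents of make_registry.py are not shown in this README, so the function names `sha256_of` and `build_registry` and the `<filename> <sha256>` line format are assumptions made for illustration only.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Return the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_registry(data_dir, registry_path):
    """Write one '<filename> <sha256>' line per file in data_dir.

    The line format is an assumption; the real registry may differ.
    """
    data_dir = Path(data_dir)
    lines = [
        f"{p.name} {sha256_of(p)}"
        for p in sorted(data_dir.iterdir())
        if p.is_file()
    ]
    Path(registry_path).write_text("\n".join(lines) + "\n")
    return lines
```

Under these assumptions, calling `build_registry("data", "pythia_datasets/registry.txt")` from the repository root would regenerate the registry after a new file is added.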

Using datasets in notebooks and/or scripts

Ensure the pythia_datasets package is installed in your environment

python -m pip install pythia-datasets

# or

python -m pip install git+https://github.com/ProjectPythia/pythia-datasets

Import DATASETS and inspect the registry to find out which datasets are available

In [1]: from pythia_datasets import DATASETS

In [2]: DATASETS.registry_files
Out[2]: ['jan-17-co-asos.txt.xz', 'NARR_19930313_0000.nc']

To fetch a data file of interest, use the .fetch method and provide the filename of the data file. This will:
- download and cache the file if it is not already cached
- return the local path to the file
```
In [4]: filepath = DATASETS.fetch('jan-17-co-asos.txt.xz')

In [5]: filepath
Out[5]: '/Users/abanihi/Library/Caches/pythia-datasets/jan-17-co-asos.txt.xz'
```
Once you have access to the local filepath, you can then use it to load your dataset into pandas or xarray or your package of choice:
```
In [6]: import pandas as pd

In [7]: df = pd.read_csv(filepath)
```
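
Because the sample files mix NetCDF (.nc) and text formats, it can help to choose the reader from the file extension. The sketch below is one way to do that. `pick_loader` and `load` are hypothetical helpers, not part of pythia_datasets, and the sketch assumes every non-NetCDF file is delimited text that `pandas.read_csv` can parse (pandas infers compression from extensions such as .xz by default).

```python
from pathlib import Path


def pick_loader(filepath):
    """Return 'xarray' for NetCDF files and 'pandas' for everything else."""
    return "xarray" if Path(filepath).suffix == ".nc" else "pandas"


def load(filepath, **kwargs):
    """Open filepath with the library chosen by pick_loader.

    Extra keyword arguments are passed through to the underlying reader.
    """
    if pick_loader(filepath) == "xarray":
        import xarray as xr  # imported lazily so pandas-only users need not install it

        return xr.open_dataset(filepath, **kwargs)
    import pandas as pd

    return pd.read_csv(filepath, **kwargs)
```

For example, `load(DATASETS.fetch('CESM2_sst_data.nc'))` would return an xarray Dataset, while `load(DATASETS.fetch('enso_data.csv'))` would return a pandas DataFrame.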

Changing the default data cache location

The default cache location (where the data are saved on your local system) depends on the operating system. You can use the locate() function to identify it:

from pythia_datasets import locate

locate()

The location can be overridden by setting the PYTHIA_DATASETS_DIR environment variable to the desired destination.
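
As a sketch, the variable can also be set from Python rather than from the shell. `set_cache_dir` is a hypothetical helper, not part of the package. The sketch assumes the variable is read when pythia_datasets is imported, so the helper should be called before that import.

```python
import os
from pathlib import Path


def set_cache_dir(path):
    """Point PYTHIA_DATASETS_DIR at `path`, creating the directory if needed."""
    target = Path(path).expanduser().resolve()
    target.mkdir(parents=True, exist_ok=True)
    # Assumption: pythia_datasets reads this variable at import time.
    os.environ["PYTHIA_DATASETS_DIR"] = str(target)
    return target
```

A typical call would be `set_cache_dir("~/pythia-cache")` followed by `from pythia_datasets import DATASETS`. Calling `locate()` afterwards is a quick way to check whether the override took effect.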

Release history

2026.2.20 (Feb 20, 2026, this version)
2025.2.20 (Feb 20, 2026)
2021.9.21 (Sep 21, 2021)
2021.9.13 (Sep 13, 2021)
2021.6.11 (Jun 11, 2021)

Download files


Source Distributions

No source distribution files are available for this release.

Built Distribution


pythia_datasets-2025.2.20-py3-none-any.whl (9.3 kB)

Uploaded Feb 20, 2026 Python 3

File details

Details for the file pythia_datasets-2025.2.20-py3-none-any.whl.

File metadata

Download URL: pythia_datasets-2025.2.20-py3-none-any.whl
Upload date: Feb 20, 2026
Size: 9.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pythia_datasets-2025.2.20-py3-none-any.whl
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `d53437125b313297614a70b7c14dedc7c2b739b44f2a4cb9e6d74706e72d7215` |
| MD5 | `fc7defb5fb0f7e2b5aec43e6eb2aea78` |
| BLAKE2b-256 | `27e7ff70853ff0af28b9ccd9bc38b721458affeb1d7aab40ef69c8ef00183a34` |
