Data and statistical models for biomarker shedding.

Project description

Shedding Hub

The Shedding Hub collates data and statistical models for biomarker shedding (such as viral RNA or drug metabolites) in different human specimens (such as stool or sputum samples). The project aims to develop wastewater-based epidemiology into a quantitative, reliable epidemiological monitoring tool.

Datasets are extracted from appendices, figures, and supplementary materials of peer-reviewed studies. Each dataset is stored as a .yaml file and validated against our data schema to verify its integrity.
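To illustrate what this kind of validation involves, here is a minimal sketch in plain Python. It is not the project's schema: the `REQUIRED_FIELDS` mapping and the `find_problems` helper are hypothetical, and the only fields checked, `title` and `doi`, are the two that appear in the example output for Wölfel et al. (2020). The schema in the repository remains the authority on what a valid dataset looks like.

```python
# Hypothetical stand-in for a schema: field name -> expected Python type.
# The real Shedding Hub schema is more detailed and is not reproduced here.
REQUIRED_FIELDS = {"title": str, "doi": str}


def find_problems(dataset: dict) -> list[str]:
    """Return human-readable problems; an empty list means the checks passed."""
    problems = []
    for field, expected_type in REQUIRED_FIELDS.items():
        if field not in dataset:
            problems.append(f"missing required field: {field}")
        elif not isinstance(dataset[field], expected_type):
            problems.append(f"field {field} should be {expected_type.__name__}")
    return problems


# A record with the two fields shown in the Wölfel et al. (2020) example.
record = {
    "title": "Virological assessment of hospitalized patients with COVID-2019",
    "doi": "10.1038/s41586-020-2196-x",
}
print(find_problems(record))           # passes both checks
print(find_problems({"title": 2020}))  # wrong type and a missing field
```

Returning a list of problems rather than raising on the first one lets a contributor see everything that needs fixing in a single pass.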

📊 Getting the Data

You can obtain the data by downloading it from GitHub. We also provide a Python package so you can download the most recent data directly in your code or pin a specific version of the data for reproducible analysis. Install the package by running pip install shedding-hub from the command line. The example below downloads the data from Wölfel et al. (2020) as of commit 259ca0d.

>>> import shedding_hub as sh

>>> sh.load_dataset('woelfel2020virological', ref='259ca0d')
{'title': 'Virological assessment of hospitalized patients with COVID-2019',
 'doi': '10.1038/s41586-020-2196-x',
 ...}
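Pinning `ref` is what makes an analysis repeatable, so it can help to keep the dataset name and commit together in one place. The sketch below shows one way to do that. The helpers `load_pinned` and `format_citation` and the offline `fake_loader` are hypothetical names introduced for illustration; only `sh.load_dataset(name, ref=...)` and the `title` and `doi` keys come from the example above. The stand-in loader is there so the snippet runs without network access.

```python
from typing import Callable, Optional


def format_citation(dataset: dict) -> str:
    """Build a one-line label from the two fields shown in the example output."""
    return f"{dataset['title']} (doi:{dataset['doi']})"


def load_pinned(
    name: str, ref: str, loader: Optional[Callable[..., dict]] = None
) -> dict:
    """Load a dataset at a fixed commit so repeated runs request the same data."""
    if loader is None:
        # Imported lazily so the helpers can be tried without the package installed.
        import shedding_hub as sh

        loader = sh.load_dataset
    return loader(name, ref=ref)


def fake_loader(name: str, ref: str) -> dict:
    """Offline stand-in for sh.load_dataset (assumption: used for the demo only)."""
    return {
        "title": "Virological assessment of hospitalized patients with COVID-2019",
        "doi": "10.1038/s41586-020-2196-x",
    }


dataset = load_pinned("woelfel2020virological", ref="259ca0d", loader=fake_loader)
print(format_citation(dataset))
```

Dropping the `loader=fake_loader` argument makes `load_pinned` fall back to the real package call.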

🤝 Contributing

Thank you for contributing your data to the Shedding Hub and supporting wastewater-based epidemiology! If you hit a bump along the road, create a new issue and we'll sort it out together.

We use pull requests to add and update data, allowing for review and quality assurance. Learn more about the general workflow here. To contribute your data, follow these easy steps (if you're already familiar with pull requests, steps 2 and 3 are for you):

1. Create a fork of the Shedding Hub repository by clicking here and clone the fork to your computer. You only have to do this once.
2. Create a new my_cool_study/my_cool_study.yaml file in the data directory and populate it with your data. See here for a comprehensive example from Wölfel et al. (2020). A minimal example for studies with a single analyte (e.g., SARS-CoV-2 RNA concentration in stool samples) is available here, and a minimal example for studies with multiple analytes (e.g., crAssphage RNA concentration in stool samples and caffeine metabolites in urine) is available here.
3. Optionally, if you have a recent version of Python installed, you can validate your data to ensure it has the right structure before contributing it to the Shedding Hub.
   - Run pip install -r requirements.txt from the command line to install all the Python packages you need.
   - Run pytest from the command line to validate all datasets, including the one you just created.
4. Create a new branch by running git checkout -b my_cool_study. Branches let you isolate changes you are making to the data, e.g., if you're simultaneously working on adding multiple studies–much appreciated! You should create a new branch from the main branch for each dataset you contribute; see here for more information.
5. Add your changes by running git add data/my_cool_study/my_cool_study.yaml and commit them by running git commit -m "Add data from Someone et al. (20xx).". Feel free to pick another commit message if you prefer.
6. Push the dataset to your fork by running git push origin my_cool_study. This will send the data to GitHub, and the output of the command will include a line Create a pull request for 'my_cool_study' on GitHub by visiting: https://github.com/[your-username]/shedding-hub/pull/new/my_cool_study. Click on the link and follow the next steps to create a new pull request.
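The file layout used above (`data/my_cool_study/my_cool_study.yaml`) is easy to get wrong when adding several studies at once. The sketch below is a small check that can be run before committing. It assumes that the directory name and the file name are always the same study identifier, which is inferred from the example path rather than stated as a rule. The helper names `expected_dataset_path` and `follows_layout` are hypothetical.

```python
from pathlib import Path


def expected_dataset_path(study: str, data_dir: Path = Path("data")) -> Path:
    """Return data/<study>/<study>.yaml, the layout used in the steps above."""
    return data_dir / study / f"{study}.yaml"


def follows_layout(path: Path, data_dir: Path = Path("data")) -> bool:
    """Check that a file sits at data/<study>/<study>.yaml (assumed convention)."""
    return path == expected_dataset_path(path.stem, data_dir)


print(expected_dataset_path("my_cool_study"))
print(follows_layout(Path("data/my_cool_study/my_cool_study.yaml")))  # True
print(follows_layout(Path("data/my_cool_study/results.yaml")))        # False
```

This only checks where the file lives. The pytest run in the optional validation step is what checks the file's contents.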

Congratulations, you've just created your first pull request to contribute a new dataset! We'll now review the changes you've made to make sure everything looks good. Once any questions have been resolved, we'll merge your changes into the repository. You've just contributed your first dataset to help make wastewater-based epidemiology a more quantitative public health monitoring tool–thank you!

Project details

This version

0.1.0

Sep 30, 2024

Download files

Source Distribution

shedding_hub-0.1.0.tar.gz (6.9 kB)

Uploaded Sep 30, 2024 Source

Built Distribution

shedding_hub-0.1.0-py3-none-any.whl (7.9 kB)

Uploaded Sep 30, 2024 Python 3

File details

Details for the file shedding_hub-0.1.0.tar.gz.

File metadata

Download URL: shedding_hub-0.1.0.tar.gz
Upload date: Sep 30, 2024
Size: 6.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for shedding_hub-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`fcdcf7d30484925c909a23899089e93f943bb74c6561fc700e350b4e2c6c85de`
MD5	`47af8abe8ecbbbbb39b5a83de72f6c77`
BLAKE2b-256	`fcae580da75c5caea0843565f2aef327185b0ce44171deeab73e5694f68502bf`

File details

Details for the file shedding_hub-0.1.0-py3-none-any.whl.

File metadata

Download URL: shedding_hub-0.1.0-py3-none-any.whl
Upload date: Sep 30, 2024
Size: 7.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for shedding_hub-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`adc750b87cb55213ba72cfd0c07b0c2dfc17ec7907a2f40a71cc7cad0b63a5b9`
MD5	`7fb2ff110174898553b4ea01a746e07b`
BLAKE2b-256	`1475ea6e21cae2656cc2fae8bbec981ec5765517ec655e2115c864cad316d017`