Data and statistical models for biomarker shedding.
Project description
Shedding Hub
The Shedding Hub collates data and statistical models for biomarker shedding (such as viral RNA or drug metabolites) in different human specimen (such as stool or sputum samples). Developing wastewater-based epidemiology into a quantitative, reliable epidemiological monitoring tool motivates the project.
Datasets are extracted from appendices, figures, and supplementary materials of peer-reviewed studies. Each dataset is stored as a .yaml
file and validated against our data schema to verify its integrity.
📊 Getting the Data
You can obtain the data by downloading it from GitHub. We also provide a convenient Python package so you can download the most recent data directly in your code or obtain a specific version of the data for reproducible analysis. Install the package by running pip install shedding-hub
from the command line. The example below downloads the data from Wölfel et al. (2020) as of the commit 259ca0d
.
>>> import shedding_hub as sh
>>> sh.load_dataset('woelfel2020virological', ref='259ca0d')
{'title': 'Virological assessment of hospitalized patients with COVID-2019',
'doi': '10.1038/s41586-020-2196-x',
...}
🤝 Contributing
Thank you for contributing your data to the Shedding Hub and supporting wastewater-based epidemiology! If you hit a bump along the road, create a new issue and we'll sort it out together.
We use pull requests to add and update data, allowing for review and quality assurance. Learn more about the general workflow here. To contribute your data, follow these easy steps (if you're already familiar with pull requests, steps 2 and 3 are for you):
- Create a fork of the Shedding Hub repository by clicking here and clone the fork to your computer. You only have to do this once.
- Create a new
my_cool_study/my_cool_study.yaml
file in thedata
directory and populate it with your data. See here for a comprehensive example from Wölfel et al. (2020). A minimal example for studies with a single analyte (e.g., SARS-CoV-2 RNA concentration in stool samples) is available here, and a minimal example for studies with multiple analytes (e.g., crAssphage RNA concentration in stool samples and caffeine metabolites in urine) is available here. - Optionally, if you have a recent version of Python installed, you can validate your data to ensure it has the right structure before contributing it to the Shedding Hub.
- Run
pip install -r requirements.txt
from the command line to install all the Python packages you need. - Run
pytest
from the command line to validate all datasets, including the one you just created.
- Run
- Create a new branch by running
git checkout -b my_cool_study
. Branches let you isolate changes you are making to the data, e.g., if you're simultaneously working on adding multiple studies–much appreciated! You should create a new branch from themain
branch for each dataset you contribute; see here for more information. - Add your changes by running
git add data/my_cool_study/my_cool_study.yaml
and commit them by runninggit commit -m "Add data from Someone et al. (20xx)."
. Feel free to pick another commit message if you prefer. - Push the dataset to your fork by running
git push origin my_cool_study
. This will send the data to GitHub, and the output of the command will include a lineCreate a pull reuqest for 'my_cool_study' on GitHub by visiting: https://github.com/[your-username]/shedding-hub/pull/new/my_cool_study
. Click on the link and follow the next steps to create a new pull request.
Congratulations, you've just created your first pull request to contribute a new dataset! We'll now review the changes you've made to make sure everything looks good. Once any questions have been resolved, we'll merge your changes into the repository. You've just contributed your first dataset to help make wastewater-based epidemiology a more quantitative public health monitoring tool–thank you!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file shedding_hub-0.1.0.tar.gz
.
File metadata
- Download URL: shedding_hub-0.1.0.tar.gz
- Upload date:
- Size: 6.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | fcdcf7d30484925c909a23899089e93f943bb74c6561fc700e350b4e2c6c85de |
|
MD5 | 47af8abe8ecbbbbb39b5a83de72f6c77 |
|
BLAKE2b-256 | fcae580da75c5caea0843565f2aef327185b0ce44171deeab73e5694f68502bf |
File details
Details for the file shedding_hub-0.1.0-py3-none-any.whl
.
File metadata
- Download URL: shedding_hub-0.1.0-py3-none-any.whl
- Upload date:
- Size: 7.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | adc750b87cb55213ba72cfd0c07b0c2dfc17ec7907a2f40a71cc7cad0b63a5b9 |
|
MD5 | 7fb2ff110174898553b4ea01a746e07b |
|
BLAKE2b-256 | 1475ea6e21cae2656cc2fae8bbec981ec5765517ec655e2115c864cad316d017 |