No project description provided
Project description
The Curator helps you define pipelines for transforming dirty data into consumable databases.
Usage
from thecurator import Curator
# Paths to files describing different tables
table_descriptions = ['patient.yml', 'lab.yml']
curator = Curator(sqlalchemy_engine, table_descriptions)
# Transform a pandas DataFrame according to the descriptions
curator.transform_df('patient', patient_df)
# Transform a dictionary array according to the descriptions
curator.transform_dicts('patient', patient_dicts)
# Transform and insert a according to the descriptions
curator.insert_dicts('lab', lab_dicts)
See the tests. More coming soon…
Development
Install development requirements pip install -r dev-requirements.txt
Make changes
Run the tests pytest tests
Release
https://packaging.python.org/tutorials/distributing-packages/ gpg –detach-sign -a dist/package-1.0.1.tar.gz twine upload dist/*
twine upload –repository-url https://test.pypi.org/legacy/ dist/*
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
The Curator-0.1.2.tar.gz
(5.2 kB
view hashes)
Built Distribution
Close
Hashes for The_Curator-0.1.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6f9dd93ede6807792a2dc80fa9cd1de4e7674226138723199130fbf42cfb9b0f |
|
MD5 | 2bedfd5b6075ff787bac52be9f75fb88 |
|
BLAKE2b-256 | 246015ff6621de037a97e0a81a7ce5201bc73b83e5737f338ce99dd6f8c21bfc |