Skip to main content

No project description provided

Project description

Build Status The Curator on PyPI

The Curator helps you define pipelines for transforming dirty data into consumable databases.


from thecurator import Curator

# Paths to files describing different tables
table_descriptions = ['patient.yml', 'lab.yml']
curator = Curator(sqlalchemy_engine, table_descriptions)

# Transform a pandas DataFrame according to the descriptions
curator.transform_df('patient', patient_df)

# Transform a dictionary array according to the descriptions
curator.transform_dicts('patient', patient_dicts)

# Transform and insert a dictionary array according to the descriptions
curator.insert_dicts('lab', lab_dicts)

See the tests for more examples. More coming soon…


  • Install development requirements pip install -r dev-requirements.txt
  • Make changes
  • Run the tests pytest tests
  • See the Makefile for other useful commands

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Filename, size & hash SHA256 hash help File type Python version Upload date
thecurator-0.2.1-py3-none-any.whl (7.2 kB) Copy SHA256 hash SHA256 Wheel py3
thecurator-0.2.1.tar.gz (5.7 kB) Copy SHA256 hash SHA256 Source None

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page