Python framework for transforming tabulated data with visual relationships into tidy data
Project description
Tidychef
Tidychef is a python framework to enable “data extraction for humans” via simple python beginner friendly "recipes". It aims at allowing users to easily transform tabulated data sources that use visual relationships (human readable only data) into simple machine readable "tidy data" in a repeatable way.
i.e: it allows you to reliably turn something that looks like this:
into something that looks like this:
Note: image cropped for reasons of practicality.
Currently supported input formats are xls
, xlsx
, ods
and csv
. Though users can add additional formats relatively easily and without a codebase change being necessary.
Tidychef is designed to allow even novice python users or analysts to quickly become productive but also has an advanced feature set and is designed to be readily and easily extended (adding new source of tabulated data, your own use case specific methods and filters and domain specific validation etc are all possible and documented in detail).
In depth training material, examples and technical documentation can be found here.
Installation
pip install tidychef
Acknowledgements
Tidychef is directly inspired by the python package databaker created by The Sensible Code Company in partnership with the United Kingdoms Office For National Statistics.
While I liked databaker and successfully worked with it on multiple ETL projects over the course of almost a decade, this software should be considered the culmination of that work and the lessons learned from that time.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file tidychef-0.1.3.tar.gz
.
File metadata
- Download URL: tidychef-0.1.3.tar.gz
- Upload date:
- Size: 53.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.2.2 CPython/3.7.13 Darwin/21.6.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 09eadbc61cc4c9c3f4c88a7464ff141c4bdfa8da3932965c4181dfce8de766fc |
|
MD5 | 348aeff65c7cb389bb7d304b48b6f664 |
|
BLAKE2b-256 | bea4c3d861a91a87efaecee5ca8be4001e4fdaaacea0fe10f3c7c2212e35a766 |
File details
Details for the file tidychef-0.1.3-py3-none-any.whl
.
File metadata
- Download URL: tidychef-0.1.3-py3-none-any.whl
- Upload date:
- Size: 85.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.2.2 CPython/3.7.13 Darwin/21.6.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cccdd418258301f4de5b6fd086aadc9a84feea2a9e6dfb07ba5c3004c537ef1c |
|
MD5 | 322b814e066c04084abc6f96fe6cd0d3 |
|
BLAKE2b-256 | 5e35a39cbdc05f591b8646d96b71ddd7b14bef7152927169b8282eda8fd8e4e6 |