Python framework for transforming tabulated data with visual relationships into tidy data

Project description

Tidychef

Tidychef is a Python framework that enables “data extraction for humans” via simple, beginner-friendly Python "recipes". It aims to let users easily and repeatably transform tabulated data sources that rely on visual relationships (data that only humans can readily read) into simple, machine-readable "tidy data".

That is, it allows you to reliably turn something that looks like this:

into something that looks like this:

Note: image cropped for reasons of practicality.
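
To make the idea concrete, here is a minimal sketch in plain Python of what "visual relationships" versus "tidy data" means. It deliberately does not use the tidychef API; the grid contents, the column names and the to_tidy helper are all made up for illustration.

```python
from typing import Dict, List

# A small "human readable" table: what each number means depends on the
# year header above it and the region label to its left.
# The values are invented for illustration.
grid: List[List[str]] = [
    ["",      "2021", "2022"],
    ["North", "10",   "12"],
    ["South", "7",    "9"],
]


def to_tidy(table: List[List[str]]) -> List[Dict[str, str]]:
    """Hypothetical helper: emit one self-describing record per observation."""
    header = table[0]
    tidy: List[Dict[str, str]] = []
    for row in table[1:]:
        region = row[0]
        for col_index in range(1, len(row)):
            tidy.append(
                {
                    "region": region,            # label found to the left
                    "year": header[col_index],   # label found above
                    "value": row[col_index],     # the observation itself
                }
            )
    return tidy


tidy_rows = to_tidy(grid)
for record in tidy_rows:
    print(record)
```

Each output record carries its own region and year, so a program no longer needs to know where the value sat on the page.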

Currently supported input formats are xls, xlsx, ods and csv, though users can add further formats relatively easily and without needing to change the codebase.

Tidychef is designed to let even novice Python users and analysts become productive quickly. It also has an advanced feature set and is designed to be easily extended: adding new sources of tabulated data, your own use-case-specific methods and filters, domain-specific validation and so on are all possible and documented in detail.

In depth training material, examples and technical documentation can be found here.

Installation

pip install tidychef
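
After installing, one way to confirm the package is visible to your interpreter is to query its installed metadata. This is a small sketch using only the standard library (importlib.metadata, which assumes Python 3.8 or later); the installed_version helper is a name chosen here for illustration and is not part of tidychef.

```python
from importlib import metadata
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of a package, or None if it is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


version = installed_version("tidychef")
if version is None:
    print("tidychef is not installed in this environment")
else:
    print(f"tidychef {version} is installed")
```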

Acknowledgements

Tidychef is directly inspired by the Python package databaker, created by The Sensible Code Company in partnership with the United Kingdom's Office for National Statistics.

I liked databaker and worked with it successfully on multiple ETL projects over the course of almost a decade; this software should be considered the culmination of that work and of the lessons learned during that time.

Download files

Source Distribution

tidychef-0.1.3.tar.gz (53.9 kB)

Built Distribution

tidychef-0.1.3-py3-none-any.whl (85.1 kB, Python 3)
