
A tool for data profiling and data validation


Deirokay


Deirokay (dejɾo'kaj) is a tool for data profiling and data validation.

Deirokay separates document parsing from validation logic, so that you can write statements about your data without worrying about whether the file has been parsed properly.
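
The separation described above can be illustrated with a minimal sketch in plain Python. It uses only the standard library and is not Deirokay's API: the names `parse_rows` and `check_statements`, the option mapping, and the statement types are all hypothetical, chosen only to show parsing and validation as two independent steps.

```python
import csv
import io

# Hypothetical parsing options: one converter per column.
# This is NOT Deirokay's option format, only an illustration.
OPTIONS = {"name": str, "age": int}


def parse_rows(text, options):
    """Parsing step: turn raw CSV text into typed rows."""
    reader = csv.DictReader(io.StringIO(text))
    return [
        {col: convert(row[col]) for col, convert in options.items()}
        for row in reader
    ]


def check_statements(rows, statements):
    """Validation step: evaluate each statement against already-typed rows."""
    results = []
    for st in statements:
        values = [row[st["column"]] for row in rows]
        if st["type"] == "unique":
            ok = len(set(values)) == len(values)
        elif st["type"] == "min":
            ok = all(v >= st["value"] for v in values)
        else:
            raise ValueError(f"unknown statement type: {st['type']}")
        results.append((st, ok))
    return results


raw = "name,age\nAna,31\nBruno,27\n"
rows = parse_rows(raw, OPTIONS)

# Statements only refer to typed values; they never deal with raw strings.
statements = [
    {"column": "name", "type": "unique"},
    {"column": "age", "type": "min", "value": 18},
]

for st, ok in check_statements(rows, statements):
    print(st["column"], st["type"], "pass" if ok else "fail")
```

Because the statements operate on typed rows, the same statements could be reused for a different file format by swapping only the parsing step. Deirokay's actual parser options and statement syntax are described in its documentation.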

You can use Deirokay for:

  • Data parsing from files (CSV, Parquet, Excel, or any other pandas-compatible format);
  • Data validation, via Deirokay Statements;
  • Data profiling, which generates Deirokay Statements automatically based on an existing file. You may use these statements later against new documents to make sure the validation still holds for new data.
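
The profiling idea in the last item can be sketched as follows. This is a simplified illustration in plain Python under stated assumptions, not Deirokay's implementation: `profile_rows`, `holds`, and the statement dictionaries are hypothetical names and formats invented for this example.

```python
def profile_rows(rows):
    """Derive simple statements from a reference dataset (hypothetical format)."""
    statements = []
    for col in rows[0]:
        values = [row[col] for row in rows]
        # If every value is distinct, record a uniqueness statement.
        if len(set(values)) == len(values):
            statements.append({"column": col, "type": "unique"})
        # For numeric columns, record the observed value range.
        if all(isinstance(v, (int, float)) for v in values):
            statements.append(
                {"column": col, "type": "range",
                 "min": min(values), "max": max(values)}
            )
    return statements


def holds(rows, st):
    """Check whether one generated statement still holds for the given rows."""
    values = [row[st["column"]] for row in rows]
    if st["type"] == "unique":
        return len(set(values)) == len(values)
    if st["type"] == "range":
        return all(st["min"] <= v <= st["max"] for v in values)
    raise ValueError(f"unknown statement type: {st['type']}")


# Reference data that is assumed to be correct.
reference = [
    {"sku": "A1", "price": 9.5},
    {"sku": "B2", "price": 12.0},
    {"sku": "C3", "price": 9.5},
]
generated = profile_rows(reference)

# A later batch with a duplicated SKU.
new_batch = [
    {"sku": "D4", "price": 11.0},
    {"sku": "D4", "price": 10.0},
]
failed = [st for st in generated if not holds(new_batch, st)]
print(failed)
```

Here the reference data yields a uniqueness statement for `sku` and a range statement for `price`; the new batch violates only the first. Automatically generated statements like these are a starting point and usually need manual review before being used as validation rules.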

Installation

Install Deirokay using pip:

pip install Deirokay

To include the optional dependencies for AWS S3, install:

pip install Deirokay[s3]

Documentation

Please read the docs.

Contributing

Check our contributing guidelines.
