Skip to main content

Data-quality framework

Project description

Hello, welcome to Contessa!

Contessa is a Data Quality library that provides you an easy way to define, execute and store quality rules for your data.

Instead of writing a lot of sql queries that look almost exactly the same, we’re aiming for more pragmatic approach - define rules programatically. This enables much more flexibility for the user and also for us as the creators of the lib.

We implement new Rules (incrementally) that should reflect Data Quality domain. From the start these are simple rules like - NOT_NULL, GT (greater than) etc. We want to build on these simple rules and provide more complex Data Quality checkers out-of-the-box.

Goals:

  • be database agnostic (to a reasonable degree), so you will define checks against any database (e.g. mysql vs. postgres) in the same way

  • automatize data quality results e.g. from postgres table to Datadog dashboard

  • programmatic approach to data-quality definition, which leads to:

    • dynamic composition of rules in a simple script using db or any 3rd party tool - e.g. take all tables, create NOT_NULl rule for all of them for each integer column

    • users can use special rules for data if needed, if not, they can go with generic solutions

    • automatizable testable parts of definitions when needed

  • easier maintenance when number of checks scales too fast :)

Full docs here https://contessa.readthedocs.io/en/latest

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

contessa-0.2.12.tar.gz (34.0 kB view details)

Uploaded Source

Built Distribution

contessa-0.2.12-py3-none-any.whl (47.5 kB view details)

Uploaded Python 3

File details

Details for the file contessa-0.2.12.tar.gz.

File metadata

  • Download URL: contessa-0.2.12.tar.gz
  • Upload date:
  • Size: 34.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.5.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.7.1

File hashes

Hashes for contessa-0.2.12.tar.gz
Algorithm Hash digest
SHA256 22a29ba20b9afc7347eb28b6475126ea44882ab88dafd97c1e967e9a1ae70d13
MD5 135451118dc9b2792e8603424b8f28a5
BLAKE2b-256 997c609387f1771930b8041a23133c63ce522178b7219925a5f39fe634df4f5a

See more details on using hashes here.

File details

Details for the file contessa-0.2.12-py3-none-any.whl.

File metadata

  • Download URL: contessa-0.2.12-py3-none-any.whl
  • Upload date:
  • Size: 47.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.5.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.7.1

File hashes

Hashes for contessa-0.2.12-py3-none-any.whl
Algorithm Hash digest
SHA256 9030e7b080936306f302f6ec0a14f904d991a99cfb0442c0ba0902690d6ce234
MD5 ac3d41ea84dc62d4f838625a476a2c1b
BLAKE2b-256 5ef029ced3412a0ea61e6e80fb7014950be6a5c1f07b0f0dac8f40e464c04e15

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page