Skip to main content

Fully-controllable error generation for tabular data.

Project description

tab_err

tab_err is an implementation of a tabular data error model that disentangles error mechanism and error type. It generalizes the formalization of missing values, implying that missing values are only one of many possible error type implemented here. tab_err gives the user full control over the error generation process and allows to model realistic errors with complex dependency structures.

The building blocks are ErrorMechanisms, ErrorTypes, and ErrorModels. ErrorMechanism defines where the incorrect cells are and model realistic dependency structures and ErrorType describes in which way the value is incorrect. Together they build a ErrorModel that can be used to perturb existing data with realistic errors.

This repository offers (soon) three APIs, low-level, mid-level and high-level.

Examples

For details and examples please check out our Getting Started Notebook.

Where to get it

The source code is currently hosted on GitHub at: https://github.com/calgo-lab/tab_err

Binary installers for the latest released version are available at the Python Package Index (PyPI).

pip install tab-err

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tab_err-0.2.0.tar.gz (23.5 kB view details)

Uploaded Source

Built Distribution

tab_err-0.2.0-py3-none-any.whl (38.8 kB view details)

Uploaded Python 3

File details

Details for the file tab_err-0.2.0.tar.gz.

File metadata

  • Download URL: tab_err-0.2.0.tar.gz
  • Upload date:
  • Size: 23.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for tab_err-0.2.0.tar.gz
Algorithm Hash digest
SHA256 069c6b3d422382fd6be1cb4b0eddf3432a347aff07d9c9db972ed577f5e4e52a
MD5 55b2462ea939cab4dfa43399a564181a
BLAKE2b-256 4f8472570d39ae3b0d24802994fe3fb0b2e9c3d4ec4040cbc75e94cfbb38e1b9

See more details on using hashes here.

Provenance

The following attestation bundles were made for tab_err-0.2.0.tar.gz:

Publisher: publish.yaml on calgo-lab/tab_err

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file tab_err-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: tab_err-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 38.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for tab_err-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 63b45f04cc488ea8ecaf4dfb54a9172c8bb644ada486614893e0c7ab65ac4c6d
MD5 5a34d8a5d1ad8db5454299951ea09b2f
BLAKE2b-256 40a6249217a25ace948677a8d1b246aede26b9427f13939617dde5f86143217b

See more details on using hashes here.

Provenance

The following attestation bundles were made for tab_err-0.2.0-py3-none-any.whl:

Publisher: publish.yaml on calgo-lab/tab_err

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page