Badgers: bad data generators

badgers is a Python library for generating bad data, or more precisely for augmenting existing data with data quality deficits such as outliers, missing values, and noise. It is built on a simple API and provides a set of generator objects that produce data quality deficits from existing data.

A word of caution: badgers is still at an early stage of development. Although the core structure of the package and the generate(X,y) signature are not expected to change, some API details (such as attribute names) are likely to change.

The full documentation is hosted here: https://fraunhofer-iese.github.io/badgers/.

For a quick start, install badgers with pip:

pip install badgers

Import badgers like any other library and start using it:

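from sklearn.datasets import make_blobs
from badgers.generators.tabular_data.noise import GaussianNoiseGenerator

# Create a clean toy dataset: X holds the features, y the cluster labels
X, y = make_blobs()
# Configure a generator that adds Gaussian noise to the features
trf = GaussianNoiseGenerator(noise_std=0.5)
# generate() returns the transformed features and labels
Xt, yt = trf.generate(X, y)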
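
To illustrate the generate(X, y) pattern without installing anything, here is a minimal standard-library sketch. ToyGaussianNoiseGenerator and its seed parameter are hypothetical names invented for this illustration. They are not part of badgers and say nothing about how the library implements its generators. The sketch only shows the general idea: take existing data, return a degraded copy together with the labels, and leave the input untouched.

```python
import random


class ToyGaussianNoiseGenerator:
    """Hypothetical stand-in (NOT part of badgers) that mimics generate(X, y)."""

    def __init__(self, noise_std=0.5, seed=None):
        self.noise_std = noise_std
        # A private random generator keeps runs reproducible when a seed is given
        self.rng = random.Random(seed)

    def generate(self, X, y):
        # Add zero-mean Gaussian noise to every feature value.
        # A new nested list is built, so the caller's X is not modified.
        Xt = [
            [value + self.rng.gauss(0.0, self.noise_std) for value in row]
            for row in X
        ]
        # Labels pass through unchanged (returned as a copy)
        return Xt, list(y)


# Tiny hand-written dataset: three samples with two features each
X = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
y = [0, 1, 1]

Xt, yt = ToyGaussianNoiseGenerator(noise_std=0.5, seed=0).generate(X, y)
```

Returning a new pair (Xt, yt) rather than modifying X in place makes it easy to compare the clean and the degraded data side by side.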

More examples are available in the tutorials section.

The API documentation is also available in the API section.

Interested developers will find relevant information in the CONTRIBUTING.md page.

Download files

Source Distribution

badgers-0.0.9.tar.gz (2.6 MB)

Uploaded Source

Built Distribution

badgers-0.0.9-py3-none-any.whl (32.1 kB)

Uploaded Python 3

File details

Details for the file badgers-0.0.9.tar.gz.

File metadata

  • Download URL: badgers-0.0.9.tar.gz
  • Upload date:
  • Size: 2.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for badgers-0.0.9.tar.gz
Algorithm     Hash digest
SHA256        1636f297a125b6d1ea5bcfd72b92d0b761e4a858446b07e7d7cd2140c98e887c
MD5           6b7911e96d0fdb01cbe26c4adb7bcfae
BLAKE2b-256   ba82abee75d66de1ff84eb83a3311113384968970e12198d2596d45fb3bc32e9
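
A published digest is only useful if it is compared against the downloaded file. The following sketch shows one way to do that with Python's standard hashlib module. The helper names sha256_of_file and matches_published_hash are made up for this example, and the location of the downloaded file is an assumption.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Reading in chunks avoids loading a large archive into memory at once
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published value (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, matches_published_hash("badgers-0.0.9.tar.gz", "1636f297a125b6d1ea5bcfd72b92d0b761e4a858446b07e7d7cd2140c98e887c") would check a locally downloaded copy of the source distribution against the SHA256 value listed above, assuming the file sits in the current directory.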

File details

Details for the file badgers-0.0.9-py3-none-any.whl.

File metadata

  • Download URL: badgers-0.0.9-py3-none-any.whl
  • Upload date:
  • Size: 32.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for badgers-0.0.9-py3-none-any.whl
Algorithm     Hash digest
SHA256        2ebefe66552a72381eac82e689e53bf8ee1f38979781cab8ab1c3aba5c47e7cd
MD5           d0b8f36db1938c6b606ef3740e9a7e0f
BLAKE2b-256   8ec7fc9dc9114cda75768a490a6222c81c30b94bb395961d22a388c60a06755f
