Badgers: bad data generators
badgers is a Python library for generating bad data; more precisely, it augments existing data with data quality deficits such as outliers, missing values, and noise. It is built on a simple API and provides a set of generator objects that create these deficits from existing data.
A word of caution: badgers is still in an early development stage. Although the core structure of the package and the generate(X, y) signature are not expected to change, some API details (such as attribute names) are likely to change.
The full documentation is hosted here: https://fraunhofer-iese.github.io/badgers/.
For a quick start, install badgers with pip:
pip install badgers
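Import badgers like any other library and start using it:
from sklearn.datasets import make_blobs
from badgers.generators.tabular_data.noise import GaussianNoiseGenerator

# Create a small synthetic dataset to start from
X, y = make_blobs()
# Configure a generator that adds Gaussian noise
trf = GaussianNoiseGenerator(noise_std=0.5)
# Generate the noisy data from the original data
Xt, yt = trf.generate(X, y)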
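To make the generate(X, y) contract concrete, here is a minimal, dependency-free sketch of what a noise generator with that signature might look like. It is not the badgers implementation: the class name `SketchGaussianNoiseGenerator`, the `seed` parameter, and the choice to return the labels unchanged are all assumptions made for illustration, and plain Python lists stand in for the arrays used in the quick start.

```python
import random


class SketchGaussianNoiseGenerator:
    """Hypothetical stand-in for illustration; NOT the badgers implementation."""

    def __init__(self, noise_std=0.1, seed=None):
        # Standard deviation of the zero-mean Gaussian noise added to each value
        self.noise_std = noise_std
        # A private random source so results can be reproduced via `seed` (assumed parameter)
        self._rng = random.Random(seed)

    def generate(self, X, y):
        # Build a new table in which every feature value is perturbed by Gaussian noise;
        # the input X is left untouched.
        Xt = [
            [value + self._rng.gauss(0.0, self.noise_std) for value in row]
            for row in X
        ]
        # Assumption: labels are passed through unchanged.
        return Xt, y


X = [[0.0, 1.0], [2.0, 3.0], [4.0, 5.0]]
y = [0, 1, 1]

generator = SketchGaussianNoiseGenerator(noise_std=0.5, seed=0)
Xt, yt = generator.generate(X, y)

print(len(Xt), len(Xt[0]))  # prints: 3 2 (same shape as X)
print(yt == y)              # prints: True
```

The point of the sketch is the shape of the interface: a generator is configured once, then takes existing data and returns a degraded copy alongside the labels.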
More examples are available in the tutorials section.
The API documentation is also available in the API section.
Interested developers will find relevant information in the CONTRIBUTING.md page.
File details
Details for the file badgers-0.0.9.tar.gz.
File metadata
- Download URL: badgers-0.0.9.tar.gz
- Upload date:
- Size: 2.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest
---|---
SHA256 | 1636f297a125b6d1ea5bcfd72b92d0b761e4a858446b07e7d7cd2140c98e887c
MD5 | 6b7911e96d0fdb01cbe26c4adb7bcfae
BLAKE2b-256 | ba82abee75d66de1ff84eb83a3311113384968970e12198d2596d45fb3bc32e9
File details
Details for the file badgers-0.0.9-py3-none-any.whl.
File metadata
- Download URL: badgers-0.0.9-py3-none-any.whl
- Upload date:
- Size: 32.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest
---|---
SHA256 | 2ebefe66552a72381eac82e689e53bf8ee1f38979781cab8ab1c3aba5c47e7cd
MD5 | d0b8f36db1938c6b606ef3740e9a7e0f
BLAKE2b-256 | 8ec7fc9dc9114cda75768a490a6222c81c30b94bb395961d22a388c60a06755f
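The digests listed above can be used to check that a downloaded file is intact. The following is a small sketch using only the Python standard library; the helper names `sha256_of_file` and `matches_published_digest` and the local file path are assumptions for illustration, not part of badgers.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Compare the file's digest with a published one, ignoring case and surrounding whitespace."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Digest copied from the SHA256 row for badgers-0.0.9.tar.gz
EXPECTED_SDIST_SHA256 = "1636f297a125b6d1ea5bcfd72b92d0b761e4a858446b07e7d7cd2140c98e887c"

# Assumed local download location; adjust to wherever the archive was saved
archive = Path("badgers-0.0.9.tar.gz")
if archive.exists():
    print("digest matches:", matches_published_digest(archive, EXPECTED_SDIST_SHA256))
else:
    print("archive not found; download it first")
```

The same approach works for the wheel by substituting its file name and SHA256 digest.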