Skip to main content

a tool for quickly generating dummy data

Project description

DataBuilder

Have you ever needed some dummy data to demonstrate some basic data analysis / machine learning topics?

DataBuilder can save you time by creating customized dummy data sets within minutes.


Installation

pip install databuilder

Quick Example

import databuilder as db

# make a dummy dataset about "our employees"
config = {
    'fields': {
        'empID':        db.ID(),
        'first_name':   db.Name(first_only=True),
        'last_name':    db.Name(last_only=True),
        'department':   db.Group(["Sales", "Acct", "Mktg", "IT"]),
        'salary':       db.NormalDist(50000, 10000),
        'hire_date':    db.Date("1990-01-01", "2020-12-31")
    }
}

# create a Pandas DataFrame with 
# the fields defined in `config`
df = db.create_df(config, n=200)

print(df.head(2))
#
#   Example output:
#         empID first_name last_name department  salary  hire_date
#      0      1      Frank      Ward         IT   69210 2004-05-05
#      1      2    Barbara    George       Mktg   46744 2019-05-20

Complete Usage Guide

Detailed docs on how to use DataBuilder can be found in the docs/ folder of this repo (or click here)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

databuilder-0.0.2.tar.gz (7.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

databuilder-0.0.2-py3-none-any.whl (8.8 kB view details)

Uploaded Python 3

File details

Details for the file databuilder-0.0.2.tar.gz.

File metadata

  • Download URL: databuilder-0.0.2.tar.gz
  • Upload date:
  • Size: 7.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.0

File hashes

Hashes for databuilder-0.0.2.tar.gz
Algorithm Hash digest
SHA256 aaf815c9785aee408ebfccd97087a9695e130b5d5139f30ff3f32c1bca54dea2
MD5 54a68616c91f09ff9fe7500afb60cb2c
BLAKE2b-256 8a33f73835d5a929179feb490cb5688229993aec6aea7fc9447c307b4e22c208

See more details on using hashes here.

File details

Details for the file databuilder-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: databuilder-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 8.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.0

File hashes

Hashes for databuilder-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 c1955e8bc193b403789b583ff25b1384053cbbe1681af4a45c05bb79c0f95efc
MD5 dc1f72743d5a07e2e895024dedc3caa1
BLAKE2b-256 75d6a0dc4f359973efeacb6f0cae31139a8278d117f7bddc29d3c52e00d0f003

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page