Skip to main content

A Python package to create synthetic data with realistic statistical properties.

Project description

boot-dummy

A Python package to create synthetic data with realistic statistical properties.

Overview

boot-dummy allows you to:

  1. Generate synthetic customer data with predefined rules
  2. Load datasets from various file formats (CSV, Excel, JSON, Parquet)
  3. Generate enhanced descriptive statistics for datasets
  4. Create synthetic data that matches the statistical properties of existing datasets

Installation

pip install boot_dummy

Quick Start

Generate synthetic customer data

from boot_dummy import GenerateData

# Generate 1000 customer records
generator = GenerateData(num_records=1000)
customer_data = generator.generate()

# View the data
print(customer_data.head())

Generate data based on an existing dataset

from boot_dummy import GenerateData
import pandas as pd

# Option 1: Using a file path
result = GenerateData.generate_from_dataset(
    dataset_path='your_dataset.csv',
    num_records=500
)

# Option 2: Using an existing DataFrame
df = pd.read_csv('your_dataset.csv')
result = GenerateData.generate_from_dataset(
    dataset=df,
    num_records=500
)

# Access the components
original_data = result['original_data']
statistics = result['stats']
synthetic_data = result['synthetic_data']

Workflow

The typical workflow with boot-dummy is:

  1. Load or generate initial data

    • Use GenerateData.generate() to create synthetic customer data, or
    • Use GenerateData.load_dataset() to load data from a file
  2. Analyze the data

    • Use GenerateData.enhanced_describe() to get detailed statistics
  3. Generate new data based on statistics

    • Use GenerateData.generate_from_stats() to create new data with similar properties
  4. All-in-one approach

    • Use GenerateData.generate_from_dataset() to perform all steps in one call

Examples

See the examples directory for complete usage examples.

Resources

This package was inspired by and borrows from:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

boot_dummy-0.2.0.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

boot_dummy-0.2.0-py3-none-any.whl (6.0 kB view details)

Uploaded Python 3

File details

Details for the file boot_dummy-0.2.0.tar.gz.

File metadata

  • Download URL: boot_dummy-0.2.0.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.29

File hashes

Hashes for boot_dummy-0.2.0.tar.gz
Algorithm Hash digest
SHA256 8f57b189864fada1268a6e081fb1921ee55f13e0ef63d5846847568669ea0ac6
MD5 51e683faef17959c27cb033c2c70d1c4
BLAKE2b-256 e9d73f10b4487f8b946b8968c65ecfd09c2a20e021ba85c7ea9731e1728677e6

See more details on using hashes here.

File details

Details for the file boot_dummy-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: boot_dummy-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.29

File hashes

Hashes for boot_dummy-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 45610607f82790544ca420db37d5a20efd78c31b22b9a8f963fbac90f36f3ea6
MD5 cf4c0aef7e4d4975018d280c3b3ac75e
BLAKE2b-256 946368681e783cedccd0bfbe1551451c4b94ad5491ba6fae5ffc9fbc9e621020

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page