Skip to main content

A Python package to create synthetic data with realistic statistical properties.

Project description

boot-dummy

A Python package to create synthetic data with realistic statistical properties.

Overview

boot-dummy allows you to:

  1. Generate synthetic customer data with predefined rules
  2. Load datasets from various file formats (CSV, Excel, JSON, Parquet)
  3. Generate enhanced descriptive statistics for datasets
  4. Create synthetic data that matches the statistical properties of existing datasets

Installation

pip install boot_dummy

Quick Start

Generate synthetic customer data

from boot_dummy import GenerateData

# Generate 1000 customer records
generator = GenerateData(num_records=1000)
customer_data = generator.generate()

# View the data
print(customer_data.head())

Generate data based on an existing dataset

from boot_dummy import GenerateData
import pandas as pd

# Option 1: Using a file path
result = GenerateData.generate_from_dataset(
    dataset_path='your_dataset.csv',
    num_records=500
)

# Option 2: Using an existing DataFrame
df = pd.read_csv('your_dataset.csv')
result = GenerateData.generate_from_dataset(
    dataset=df,
    num_records=500
)

# Access the components
original_data = result['original_data']
statistics = result['stats']
synthetic_data = result['synthetic_data']

Workflow

The typical workflow with boot-dummy is:

  1. Load or generate initial data

    • Use GenerateData.generate() to create synthetic customer data, or
    • Use GenerateData.load_dataset() to load data from a file
  2. Analyze the data

    • Use GenerateData.enhanced_describe() to get detailed statistics
  3. Generate new data based on statistics

    • Use GenerateData.generate_from_stats() to create new data with similar properties
  4. All-in-one approach

    • Use GenerateData.generate_from_dataset() to perform all steps in one call

Examples

See the examples directory for complete usage examples.

Resources

This package was inspired by and borrows from:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

boot_dummy-0.2.1.tar.gz (5.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

boot_dummy-0.2.1-py3-none-any.whl (6.0 kB view details)

Uploaded Python 3

File details

Details for the file boot_dummy-0.2.1.tar.gz.

File metadata

  • Download URL: boot_dummy-0.2.1.tar.gz
  • Upload date:
  • Size: 5.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.29

File hashes

Hashes for boot_dummy-0.2.1.tar.gz
Algorithm Hash digest
SHA256 d6f1bd686366acca190675dd1b9e9f290ff58fd1f7eb68c223d1dbb061b9f68b
MD5 77e9ae23809f0b5c8b75dec02a3cee5f
BLAKE2b-256 efcd226d9e5b75a32f01c2c8e8ead1fa2ac10b1bb2713868fd793df36c45761e

See more details on using hashes here.

File details

Details for the file boot_dummy-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: boot_dummy-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.5.29

File hashes

Hashes for boot_dummy-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3d310c5ccfe24fd3d4c3d423e9a9d200410bb3a0a28f9d751b7335b513b407bf
MD5 a984bfd61e100306b782a486093cdb14
BLAKE2b-256 b493013e90d75673472402c186dd976c59ac945c13f2baea3d8407c222324786

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page