Rand Engine v2. Package with some methods to generate random data in different formats. Great to mock data while testing or developing.

These details have been verified by PyPI

Project links

Repository

GitHub Statistics

Maintainers

marco_menezes

These details have not been verified by PyPI

Project description

Rand Engine

High-performance synthetic data generation for testing, development, and prototyping.

A Python library for generating millions of rows of realistic synthetic data through declarative specifications. Built on NumPy and Pandas for maximum performance.

🔥 What's New in v0.6.1

✅ Constraints System: Primary Keys (PK) and Foreign Keys (FK) for referential integrity between specs
✅ Composite Keys: Support for multi-column primary and foreign keys
✅ Watermarks: Temporal windows for realistic time-based relationships
✅ Enhanced Validation: Educational error messages with examples
✅ Logging System: Transparent logging with Python's built-in logger
✅ Windows Support: Full cross-platform compatibility (Linux, macOS, Windows)

📖 Complete documentation: CONSTRAINTS.md | EXAMPLES.md

📦 Installation

pip install rand-engine

🎯 Who Is This For?

Data Engineers: Test ETL/ELT pipelines without production data dependencies
QA Engineers: Generate realistic datasets for load and integration testing
Data Scientists: Mock data during model development and validation
Backend Developers: Populate development and staging environments
BI Professionals: Create demos and POCs without exposing sensitive data

🚀 Quick Start

1. Use Pre-Built Examples (Fastest Way)

Get started immediately with ready-to-use specifications:

from rand_engine import DataGenerator, RandSpecs

# Generate 10,000 customer records
rand_spec_example = RandSpecs.customers()
df_customers = DataGenerator(rand_spec_example, seed=42).size(10000).get_df()
print(df_customers.head()) # output is a pandas DataFrame

Output:

   customer_id       name  age                    email  is_active  account_balance
0    C00000001  John Smith   42    john.smith@email.com       True         15432.50
1    C00000002  Jane Brown   28   jane.brown@email.com       True          8721.33
2    C00000003   Bob Wilson   56   bob.wilson@email.com      False         42156.89
3    C00000004  Alice Davis   33  alice.davis@email.com       True         23400.12
4    C00000005   Tom Miller   49   tom.miller@email.com       True         31245.67

Test Available Pre-Built Specs:

from rand_engine import RandSpecs

builtin_rand_specs = [
  RandSpecs.customers(),    # Customer profiles (6 fields)
  RandSpecs.products(),     # Product catalog (6 fields)
  RandSpecs.orders(),       # Order records with currency/country (6 fields)
  RandSpecs.invoices(),     # Invoice records (6 fields)
  RandSpecs.shipments(),    # Shipping data with carrier/destination (6 fields)

  # 💰 Financial
  RandSpecs.transactions(), # Financial transactions (6 fields)

  # 👥 HR & People
  RandSpecs.employees(),    # Employee records with dept/level/role (6 fields)
  RandSpecs.users(),        # Application users (6 fields)

  # 🔧 IoT & Systems
  RandSpecs.devices(),      # IoT device data with status/priority (6 fields)
  RandSpecs.events()       # Event logs (6 fields)
]
for rand_spec in builtin_rand_specs:
  df = DataGenerator(rand_spec, seed=42).size(10**6).get_df()
  print(df)

**Complete Example:**

```python
from rand_engine import DataGenerator, RandSpecs


# Export to files
_ = (
  DataGenerator(RandSpecs.customers()).write \
    .size(100000)
    .format("parquet")
    .mode("overwrite")
    .option("numFiles", 5)
    .option("compression", "snappy")
    .save("./customers.parquet")
)

2. Create Custom Specifications

Build your own specs for specific use cases:

from rand_engine import DataGenerator

# Simple specification
spec = {
    "user_id": {
        "method": "unique_ids",
        "kwargs": {"strategy": "zint", "length": 8}
    },
    "age": {
        "method": "integers",
        "kwargs": {"min": 18, "max": 65}
    },
    "salary": {
        "method": "floats",
        "kwargs": {"min": 30000.0, "max": 150000.0, "round": 2}
    }
}

df = DataGenerator(spec, seed=42).size(10**7).get_df()
print(df)

📚 Core Generation Methods

Method	Description	Example Use Case
unique_ids	Unique identifiers	User IDs, order numbers
integers	Random integers	Ages, quantities, counts
floats	Random decimals	Prices, weights, measurements
floats_normal	Normal distribution	Heights, temperatures, scores
booleans	True/False with probability	Active flags, feature toggles
distincts	Random selection	Categories, statuses, types
distincts_prop	Weighted selection	Product mix, user tiers
unix_timestamps	Date/time values	Created dates, event times

Simple Example:

spec = {
    "product_id": {"method": "unique_ids", "kwargs": {"strategy": "zint"}},
    "price": {"method": "floats", "kwargs": {"min": 9.99, "max": 999.99, "round": 2}},
    "category": {"method": "distincts", "kwargs": {"distincts": ["Electronics", "Clothing", "Food"]}},
    "in_stock": {"method": "booleans", "kwargs": {"true_prob": 0.85}}
}

df_products = DataGenerator(spec).size(10**6).get_df()
print(df_products)

🎨 Real-World Use Cases

E-commerce with Referential Integrity (3 Levels)

These examples demonstrate generating related datasets with Primary Key (PK) and Foreign Key (FK) constraints to maintain referential integrity.

In background, Rand Engine uses a shared checkpoint database to track generated keys and ensure relationships are valid. At this point, it can use DuckDB or SQLite for this purpose.

from rand_engine import DataGenerator

# Use shared checkpoint database for referential integrity

# Level 1: Categories (PK)
spec_categories = lambda: {
    "category_id": dict(method="unique_ids", kwargs={"strategy": "zint", "length": 4}),
    "category_name": dict(method="distincts", kwargs={"distincts": ["Electronics", "Books", "Clothing"]}),
    "constraints": {
        "category_pk": dict(
            name="category_pk",
            tipo="PK",
            fields=["category_id VARCHAR(4)"]
        )
    }
}

# Level 2: Products (FK → categories, PK for orders)
spec_products = lambda: {
    "product_id": dict(method="unique_ids", kwargs={"strategy": "zint", "length": 8}),
    "product_name": dict(method="distincts", kwargs={"distincts": [f"Product {i}" for i in range(100)]}),
    "price": dict(method="floats", kwargs={"min": 10.0, "max": 1000.0, "round": 2}),
    "constraints": {
        "product_pk": dict(
            name="product_pk", 
            tipo="PK",
            fields=["product_id VARCHAR(8)"]
        ),
        "category_fk": dict(
            name="category_pk",
            tipo="FK",
            fields=["category_id"],
            watermark=60)
    }
}

# Level 3: Orders (FK → products)
spec_orders = lambda:{
    "order_id": dict(method="unique_ids", kwargs={"strategy": "uuid4"}),
    "quantity": dict(method="integers", kwargs={"min": 1, "max": 10}),
    "total": dict(method="floats", kwargs={"min": 10.0, "max": 5000.0, "round": 2}),
    "constraints": {
        "product_fk": dict(
            name="product_pk",
            tipo="FK",
            fields=["product_id"],
            watermark=120
        )
    }
}

df_cat = DataGenerator(spec_categories).size(10).get_df()
print(df_cat)

df_prod = DataGenerator(spec_products).size(100).get_df()
print(df_prod)

df_orders = DataGenerator(spec_orders).size(1000).get_df()
print(df_orders)

Testing ETL Pipelines

from rand_engine import DataGenerator, RandSpecs

# Generate source data
source_df = DataGenerator(RandSpecs.transactions(), seed=42).size(1_000_000).get_df()

# Export to staging
source_df.to_parquet("staging/transactions.parquet")

# Run your ETL pipeline
# ...

# Generate more data for incremental loads
incremental_df = DataGenerator(RandSpecs.transactions()).size(10_000).get_df()

Load Testing APIs

import requests
from rand_engine import DataGenerator, RandSpecs

# Generate test users
stream = DataGenerator(RandSpecs.users()).stream_dict(min_throughput=10, max_throughput=50)

for user in stream:
    response = requests.post("https://api.example.com/users", json=user)
    print(f"Created user {user['user_id']}: {response.status_code}")

Populating Development Databases

from rand_engine import DataGenerator, RandSpecs
from rand_engine.integrations._duckdb_handler import DuckDBHandler

# Generate data
customers = DataGenerator(RandSpecs.customers()).size(10_000).get_df()
orders = DataGenerator(RandSpecs.orders()).size(50_000).get_df()

# Insert into database
db = DuckDBHandler("dev_database.duckdb")
db.create_table("customers", "customer_id VARCHAR(10) PRIMARY KEY")
db.insert_df("customers", customers, pk_cols=["customer_id"])
db.create_table("orders", "order_id VARCHAR(10) PRIMARY KEY")
db.insert_df("orders", orders, pk_cols=["order_id"])
db.close()

QA Testing with Edge Cases

from rand_engine import DataGenerator

# Mix of normal and edge cases
spec = {
    "value": {"method": "floats", "kwargs": {"min": -999999.99, "max": 999999.99, "round": 2}},
    "status": {"method": "distincts", "kwargs": {"distincts": ["active", "deleted", "suspended", "pending"]}},
    "edge_case": {"method": "booleans", "kwargs": {"true_prob": 0.05}}  # 5% edge cases
}

test_data = DataGenerator(spec, seed=789).size(1000).get_df()
edge_cases = test_data[test_data['edge_case'] == True]

🔥 Advanced Features

🔗 Constraints & Referential Integrity ⭐ NEW

The most powerful feature of v0.6.1! Create realistic datasets with proper Primary Key/Foreign Key relationships.

from rand_engine import DataGenerator

# 1. Create CATEGORIES (Primary Key)
spec_categories = {
    "category_id": {"method": "unique_ids", "kwargs": {"strategy": "zint", "length": 4}},
    "category_name": {"method": "distincts", "kwargs": {"distincts": ["Electronics", "Books", "Clothing"]}},
    "constraints": {
        "category_pk": {
            "name": "category_pk",
            "tipo": "PK",
            "fields": ["category_id VARCHAR(4)"]
        }
    }
}

# Generate categories
df_categories = (
    DataGenerator(spec_categories, seed=42)
    .checkpoint(":memory:")
    .size(10)
    .get_df()
)

# 2. Create PRODUCTS (Foreign Key → categories)
spec_products = {
    "product_id": {"method": "unique_ids", "kwargs": {"strategy": "zint", "length": 8}},
    "product_name": {"method": "distincts", "kwargs": {"distincts": [f"Product {i}" for i in range(100)]}},
    "price": {"method": "floats", "kwargs": {"min": 10.0, "max": 1000.0, "round": 2}},
    "constraints": {
        "category_fk": {
            "name": "category_pk",
            "tipo": "FK",
            "fields": ["category_id"],
            "watermark": 60  # Reference records from last 60 seconds
        }
    }
}

# Generate products
df_products = (
    DataGenerator(spec_products, seed=42)
    .checkpoint(":memory:")
    .size(1000)
    .get_df()
)

# ✅ RESULT: 100% referential integrity
# All products reference valid categories
print(f"Valid integrity: {set(df_products['category_id']).issubset(set(df_categories['category_id']))}")
# Output: Valid integrity: True

Key Features:

Primary Keys (PK): Create checkpoint tables with generated records
Foreign Keys (FK): Reference values from PK checkpoint tables
Composite Keys: Multi-column PKs and FKs (e.g., client_id + client_type)
Watermarks: Temporal windows for realistic time-based relationships
DuckDB/SQLite: Checkpoint tables stored in memory or disk

📖 Complete guide with 3-level examples: CONSTRAINTS.md

Correlated Columns

Generate related data (device → OS, product → status, etc.):

# Example: orders() spec includes correlated currency & country
orders = DataGenerator(RandSpecs.orders()).size(1000).get_df()

# Result: 
# order_id  amount  currency  country
#       001  100.50      USD       US
#       002   85.30      EUR       DE
#       003  120.75      GBP       UK

Weighted Distributions

# Example: products() uses weighted categories
products = DataGenerator(RandSpecs.products()).size(10000).get_df()

# Result distribution:
# Electronics: ~40%
# Clothing: ~30%  
# Food: ~20%
# Books: ~10%

Streaming Generation

from rand_engine import DataGenerator, RandSpecs

# Generate continuous data stream
stream = DataGenerator(RandSpecs.events()).stream_dict(
    min_throughput=5,   # Minimum records/second
    max_throughput=15   # Maximum records/second
)

for event in stream:
    # Each record includes automatic timestamp_created
    print(f"[{event['timestamp_created']}] Event: {event['event_type']}")
    # Send to Kafka, Kinesis, etc.

Multiple Export Formats

from rand_engine import DataGenerator, RandSpecs

spec = RandSpecs.transactions()

# CSV with compression
DataGenerator(spec).write.size(100000).format("csv").option("compression", "gzip").save("data.csv.gz")

# Parquet with Snappy
DataGenerator(spec).write.size(1000000).format("parquet").option("compression", "snappy").save("data.parquet")

# JSON
DataGenerator(spec).write.size(50000).format("json").save("data.json")

Reproducible Data

from rand_engine import DataGenerator, RandSpecs

# Same seed = identical data
df1 = DataGenerator(RandSpecs.customers(), seed=42).size(1000).get_df()
df2 = DataGenerator(RandSpecs.customers(), seed=42).size(1000).get_df()

assert df1.equals(df2)  # True - perfect reproducibility

🗂️ Export & Integration

File Formats

from rand_engine import DataGenerator, RandSpecs

generator = DataGenerator(RandSpecs.orders())

# CSV
generator.write.size(10000).format("csv").save("orders.csv")

# Parquet (recommended for large datasets)
generator.write.size(1000000).format("parquet").save("orders.parquet")

# JSON
generator.write.size(5000).format("json").save("orders.json")

# Multiple files (partitioned)
generator.write.size(1000000).option("numFiles", 10).format("parquet").save("orders/")

Writing Modes: Batch vs Streaming

rand_engine supports two distinct writing modes:

Batch Mode (.write): Generate all data at once

# Single file
DataGenerator(spec).write \
    .size(10000) \
    .format("parquet") \
    .option("compression", "snappy") \
    .save("output/data.parquet")

# Multiple files (parallel processing)
DataGenerator(spec).write \
    .size(1000000) \
    .option("numFiles", 5) \
    .format("parquet") \
    .save("output/data.parquet")
# Creates: part_uuid1.parquet, part_uuid2.parquet, ...

Streaming Mode (.writeStream): Continuous generation over time

# Stream for 1 hour, new file every minute
DataGenerator(spec).writeStream \
    .size(500) \
    .format("json") \
    .option("compression", "gzip") \
    .option("timeout", 3600) \
    .trigger(frequency=60) \
    .start("output/events")
# Creates 60 files over 1 hour

Compression Support:

CSV/JSON: gzip, bz2, zip, xz
Parquet: snappy (default), gzip, zstd, lz4, brotli

📖 Complete guide with examples: WRITING_FILES.md

Database Integration

DuckDB:

from rand_engine import DataGenerator, RandSpecs
from rand_engine.integrations._duckdb_handler import DuckDBHandler

# Generate data
df = DataGenerator(RandSpecs.employees()).size(10000).get_df()

# Insert into DuckDB
db = DuckDBHandler("analytics.duckdb")
db.create_table("employees", "employee_id VARCHAR(10) PRIMARY KEY")
db.insert_df("employees", df, pk_cols=["employee_id"])

# Query
result = db.select_all("employees")
print(result.head())

db.close()

SQLite:

from rand_engine.integrations._sqlite_handler import SQLiteHandler

db = SQLiteHandler("test.db")
db.create_table("users", "user_id VARCHAR(10) PRIMARY KEY")
db.insert_df("users", df, pk_cols=["user_id"])
db.close()

📖 Exploring Available Specs

Want to see what's inside each pre-built spec?

from rand_engine import RandSpecs
import json

# View any spec structure
spec = RandSpecs.customers()
print(json.dumps(spec, indent=2))

# Output shows all fields and generation methods:
# {
#   "customer_id": {
#     "method": "unique_ids",
#     "kwargs": {"strategy": "zint", "prefix": "C"}
#   },
#   "name": {
#     "method": "distincts",
#     "kwargs": {"distincts": ["John Smith", "Jane Brown", ...]}
#   },
#   ...
# }

Try different specs:

# See all available specs
print(RandSpecs.products())
print(RandSpecs.transactions())
print(RandSpecs.devices())
print(RandSpecs.events())

Each spec demonstrates different generation techniques - use them as templates for your own custom specs!

🛠️ Creating Custom Specs

Basic Template

from rand_engine import DataGenerator

my_spec = {
    "id": {
        "method": "unique_ids",
        "kwargs": {"strategy": "zint"}
    },
    "name": {
        "method": "distincts",
        "kwargs": {"distincts": ["Alice", "Bob", "Charlie"]}
    },
    "value": {
        "method": "floats",
        "kwargs": {"min": 0.0, "max": 100.0, "round": 2}
    }
}

df = DataGenerator(my_spec).size(1000).get_df()

Spec Validation

Enable validation to catch errors early:

invalid_spec = {
    "age": {
        "method": "integers"  # Missing required "min" and "max"
    }
}

try:
    generator = DataGenerator(invalid_spec, validate=True)
except Exception as e:
    print(e)
    # ❌ Column 'age': Missing required parameter 'min'
    #    Correct example:
    #    {
    #        "age": {
    #            "method": "integers",
    #            "kwargs": {"min": 18, "max": 65}
    #        }
    #    }

Validates:

Required parameters for each method
Constraints structure (PK/FK, fields, watermark)
Data types and ranges
Provides educational error messages with examples

🏗️ Architecture

Design Philosophy

Declarative: Specify what you want, not how to generate it
Performance: Built on NumPy for vectorized operations (millions of rows/second)
Simplicity: Pre-built examples for immediate use
Extensibility: Easy to create custom specifications

Public API

from rand_engine import DataGenerator, RandSpecs

# That's it! Simple and clean.

All internal modules (prefixed with _) are implementation details.

🧪 Quality & Testing

236 tests passing (20 new constraint tests in v0.6.1)
Comprehensive coverage of all generation methods
Validated on millions of generated records
Battle-tested in production ETL pipelines
Constraint validation with 100% integrity checks

# Run tests
pytest

# Run constraint tests only
pytest tests/test_8_consistency.py -v

# With coverage report
pytest --cov=rand_engine --cov-report=html

💡 Tips & Best Practices

For Data Engineers

Use seed parameter for reproducible test data
Export to Parquet with compression for large datasets
Use streaming mode for continuous data generation
Leverage constraints for multi-table data generation with referential integrity
Use .checkpoint(":memory:") for in-memory databases or .checkpoint("path/to/db.duckdb") for persistence

For QA Engineers

Start with pre-built specs (RandSpecs)
Use validation mode (validate=True) during development
Generate edge cases with low probability booleans
Create multiple test datasets with different seeds
Test PK/FK relationships with constraints for realistic scenarios

Performance Tips

Generate data in batches for optimal memory usage
Use Parquet format for large datasets (10x smaller than CSV)
Enable compression for file exports
Reuse DataGenerator instances when generating multiple datasets
Use watermarks to control FK relationship size (avoid loading entire checkpoint tables)

Constraints Best Practices

Use composite keys for complex relationships (e.g., client_id + client_type)
Set appropriate watermarks (60-3600 seconds) based on data freshness requirements
Use in-memory databases (:memory:) for testing, disk-based for production
Generate PK specs before FK specs to ensure checkpoint tables exist
Validate integrity with set operations: set(fk_values).issubset(set(pk_values))

📖 50+ production-ready examples: EXAMPLES.md

📄 Requirements

Python: >= 3.10
numpy: >= 2.1.1
pandas: >= 2.2.2
faker: >= 28.4.1 (optional, for realistic names/addresses)
duckdb: >= 1.1.0 (optional, for constraints with DuckDB)
sqlite3: Built-in Python (for constraints with SQLite)

📚 Documentation

EXAMPLES.md: 50+ production-ready examples (1,600+ lines)
CONSTRAINTS.md: Complete guide to PK/FK system (900+ lines)
API_REFERENCE.md: Full method reference
LOGGING.md: Logging configuration guide

📞 Support

Issues: GitHub Issues
Discussions: GitHub Discussions
Email: marcourelioreislima@gmail.com

📄 License

MIT License - see LICENSE file for details.

🌟 Star History

If you find this project useful, consider giving it a ⭐ on GitHub!

Built with ❤️ for Data Engineers, QA Engineers, and the entire data community

Project details

These details have been verified by PyPI

Project links

Repository

GitHub Statistics

Maintainers

marco_menezes

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.6.4rc1 pre-release

Nov 5, 2025

0.6.3

Nov 1, 2025

0.6.3rc3 pre-release

Nov 1, 2025

0.6.3rc2 pre-release

Nov 1, 2025

0.6.3rc1 pre-release

Oct 30, 2025

0.6.2

Oct 26, 2025

0.6.2rc1 pre-release

Oct 26, 2025

This version

0.6.1

Oct 24, 2025

0.6.1rc4 pre-release

Oct 24, 2025

0.6.1rc3 pre-release

Oct 24, 2025

0.6.1rc2 pre-release

Oct 23, 2025

0.6.1rc1 pre-release

Oct 22, 2025

0.6.0

Oct 19, 2025

0.6.0rc2 pre-release

Oct 19, 2025

0.6.0rc1 pre-release

Oct 19, 2025

0.5.5

Oct 17, 2025

0.5.5rc2 pre-release

Oct 17, 2025

0.5.5rc1 pre-release

Oct 17, 2025

0.5.4rc1 pre-release

Oct 13, 2025

0.5.3

Oct 12, 2025

0.5.2rc1 pre-release

Oct 12, 2025

0.5.1rc1 pre-release

Oct 11, 2025

0.4.7

Oct 11, 2025

0.4.5

Sep 23, 2025

0.4.4

Sep 23, 2025

0.4.3

Sep 23, 2025

0.4.2

Sep 23, 2025

0.4.1

Sep 23, 2025

0.4.0

Sep 23, 2025

0.3.14

Sep 18, 2025

0.3.13

Sep 18, 2025

0.3.12

Sep 18, 2025

0.3.11

Sep 18, 2025

0.3.9

Sep 18, 2025

0.3.8

Sep 18, 2025

0.3.7

Sep 9, 2025

0.3.5

Feb 2, 2025

0.3.3

Dec 1, 2024

0.2.0

Oct 27, 2024

0.1.1

Sep 24, 2024

0.0.3

Jun 23, 2022

0.0.2

Apr 11, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rand_engine-0.6.1.tar.gz (40.1 kB view details)

Uploaded Oct 24, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rand_engine-0.6.1-py3-none-any.whl (44.5 kB view details)

Uploaded Oct 24, 2025 Python 3

File details

Details for the file rand_engine-0.6.1.tar.gz.

File metadata

Download URL: rand_engine-0.6.1.tar.gz
Upload date: Oct 24, 2025
Size: 40.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rand_engine-0.6.1.tar.gz
Algorithm	Hash digest
SHA256	`e25633f786b480308b19acfd1e384df991c8fa454fa42555eebafe23addd413c`
MD5	`d68c2d5a097f80deca2be92b7ac0863e`
BLAKE2b-256	`f46eb21694b725ceb7571bc99d16fbafe4aa0fe50f4dd8e396592ed0035f27b2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for rand_engine-0.6.1.tar.gz:

Publisher: auto_tag_publish_master.yml on marcoaureliomenezes/rand_engine

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rand_engine-0.6.1.tar.gz
- Subject digest: e25633f786b480308b19acfd1e384df991c8fa454fa42555eebafe23addd413c
- Sigstore transparency entry: 637474382
- Sigstore integration time: Oct 24, 2025
Source repository:
- Permalink: marcoaureliomenezes/rand_engine@653f9e11777e372f7212e0540d3828888262ff72
- Branch / Tag: refs/heads/master
- Owner: https://github.com/marcoaureliomenezes
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: auto_tag_publish_master.yml@653f9e11777e372f7212e0540d3828888262ff72
- Trigger Event: pull_request

File details

Details for the file rand_engine-0.6.1-py3-none-any.whl.

File metadata

Download URL: rand_engine-0.6.1-py3-none-any.whl
Upload date: Oct 24, 2025
Size: 44.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for rand_engine-0.6.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b79a371d15ff4c78030e834b37e28a47803847aa275b80a3bbef24a211c930d0`
MD5	`b3c13f376b89147269974500714ee93a`
BLAKE2b-256	`d765ccfdbc5e6a87204607df1c1fa3970c0f29aa1ea166572cf02b204b8388c6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for rand_engine-0.6.1-py3-none-any.whl:

Publisher: auto_tag_publish_master.yml on marcoaureliomenezes/rand_engine

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rand_engine-0.6.1-py3-none-any.whl
- Subject digest: b79a371d15ff4c78030e834b37e28a47803847aa275b80a3bbef24a211c930d0
- Sigstore transparency entry: 637474385
- Sigstore integration time: Oct 24, 2025
Source repository:
- Permalink: marcoaureliomenezes/rand_engine@653f9e11777e372f7212e0540d3828888262ff72
- Branch / Tag: refs/heads/master
- Owner: https://github.com/marcoaureliomenezes
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: auto_tag_publish_master.yml@653f9e11777e372f7212e0540d3828888262ff72
- Trigger Event: pull_request

rand-engine 0.6.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

Rand Engine

🔥 What's New in v0.6.1

📦 Installation

🎯 Who Is This For?

🚀 Quick Start

1. Use Pre-Built Examples (Fastest Way)

2. Create Custom Specifications

📚 Core Generation Methods

🎨 Real-World Use Cases

E-commerce with Referential Integrity (3 Levels)

Testing ETL Pipelines

Load Testing APIs

Populating Development Databases

QA Testing with Edge Cases

🔥 Advanced Features

🔗 Constraints & Referential Integrity ⭐ NEW

Correlated Columns

Weighted Distributions

Streaming Generation

Multiple Export Formats

Reproducible Data

🗂️ Export & Integration

File Formats

Writing Modes: Batch vs Streaming

Database Integration

📖 Exploring Available Specs

🛠️ Creating Custom Specs

Basic Template

Spec Validation

🏗️ Architecture

Design Philosophy

Public API

🧪 Quality & Testing

💡 Tips & Best Practices

For Data Engineers

For QA Engineers

Performance Tips

Constraints Best Practices

📄 Requirements

📚 Documentation

📞 Support

📄 License

🌟 Star History

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance