A data transformation library for flattening complex nested structures into tabular formats while preserving hierarchical relationships

Transmog

Transform nested data into flat tables with a simple, intuitive API.

Overview

Transmog transforms nested JSON data into flat, tabular formats while preserving relationships between parent and child records.

Key Features:

Simple one-function API with smart defaults
Multiple output formats (JSON, CSV, Parquet)
Automatic relationship preservation
Memory-efficient streaming for large datasets
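
To make the idea of relationship-preserving flattening concrete, here is a minimal pure-Python sketch. It is not Transmog's implementation: the helper name `flatten_record`, the use of `uuid4` for `_id`, and the child-table naming scheme are assumptions chosen to resemble the examples in this README.

```python
import uuid


def flatten_record(record, name, separator="_", parent_id=None, tables=None):
    """Flatten one nested dict into rows grouped by table name (illustrative only)."""
    tables = {} if tables is None else tables
    row = {"_id": str(uuid.uuid4())}
    if parent_id is not None:
        row["_parent_id"] = parent_id
    # Register the row first so parent tables appear before their children.
    tables.setdefault(name, []).append(row)

    def walk(value, prefix):
        for key, item in value.items():
            path = f"{prefix}{separator}{key}" if prefix else key
            if isinstance(item, dict):
                walk(item, path)  # nested object: fold its keys into this row
            elif isinstance(item, list) and item and all(isinstance(x, dict) for x in item):
                # Array of objects: each element becomes a row in a child table.
                child_table = f"{name}{separator}{path}"
                for child in item:
                    flatten_record(child, child_table, separator, row["_id"], tables)
            else:
                row[path] = item  # scalars and simple arrays stay on the row

    walk(record, "")
    return tables


tables = flatten_record(
    {"user": {"name": "Alice"}, "tags": ["premium"], "orders": [{"id": 101}, {"id": 102}]},
    name="customer",
)
print(list(tables))  # ['customer', 'customer_orders']
```

Each child row carries the parent's `_id` in `_parent_id`, which is what allows the flat tables to be joined back together later.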

Quick Start

pip install transmog

import transmog as tm

# Transform nested data into flat tables
data = {"product_id": "PROD-123", "name": "Gaming Laptop", "specs": {"cpu": "i7", "ram": "16GB"}}
result = tm.flatten(data, name="products")

# Access flattened data in memory (list of dicts)
print(result.main)
# [{'product_id': 'PROD-123', 'name': 'Gaming Laptop', 'specs_cpu': 'i7', 'specs_ram': '16GB'}]

# Save to files in different formats
result.save("products.csv")        # Single CSV file
result.save("products.parquet")    # Single Parquet file
result.save("products.json")       # Single JSON file (only main table)

Example: Nested JSON to Multiple Tables

In the default smart mode, arrays of simple values stay inline as native arrays, while arrays of objects are split out into child tables:

data = {
    "user": {"name": "Alice", "email": "alice@example.com"},
    "tags": ["premium", "verified"],  # Simple array - kept as native array
    "orders": [  # Complex array - exploded to child table
        {"id": 101, "amount": 99.99, "items": ["laptop", "mouse"]},
        {"id": 102, "amount": 45.50, "items": ["keyboard"]}
    ]
}

result = tm.flatten(data, name="customer")

# Main table - flattened user data with native arrays
print(result.main)
# [
#   {
#     'user_name': 'Alice',
#     'user_email': 'alice@example.com',
#     'tags': ['premium', 'verified'],  # Native array!
#     '_id': 'a1b2c3d4-e5f6-4789-abc1-23456789def0'
#   }
# ]

# Complex arrays become separate tables with parent references
print(result.tables["customer_orders"])
# [
#   {'id': '101', 'amount': '99.99', 'items': ['laptop', 'mouse'], '_parent_id': 'a1b2c3d4...', '_id': 'b2c3d4...'},
#   {'id': '102', 'amount': '45.50', 'items': ['keyboard'], '_parent_id': 'a1b2c3d4...', '_id': 'c3d4...'}
# ]

# Access all tables in memory
print(f"Created {len(result.all_tables)} tables:")
print(list(result.all_tables.keys()))
# ['customer', 'customer_orders']  (in smart mode, 'items' stays inline as a native array)

# Save to different formats for analysis
result.save("analytics/", "csv")       # CSV files for database import
result.save("warehouse/", "parquet")   # Parquet files for data warehouse
result.save("api/", "json")           # JSON files for web applications
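
Because every child row stores its parent's `_id` in `_parent_id`, the tables can be re-joined with ordinary Python. The sketch below works on hand-written lists of dicts shaped like the output above. The shortened IDs are placeholders, and `join_to_parent` is a hypothetical helper, not part of Transmog.

```python
def join_to_parent(parents, children, id_field="_id", parent_id_field="_parent_id"):
    """Inner-join child rows to their parent rows on the parent reference."""
    by_id = {row[id_field]: row for row in parents}
    joined = []
    for child in children:
        parent = by_id.get(child[parent_id_field])
        if parent is None:
            continue  # orphaned child: no matching parent row
        # Prefix parent fields so they cannot collide with child fields.
        merged = {f"parent_{k}": v for k, v in parent.items() if k != id_field}
        merged.update(child)
        joined.append(merged)
    return joined


customers = [{"user_name": "Alice", "_id": "a1"}]
orders = [
    {"id": "101", "amount": "99.99", "_parent_id": "a1", "_id": "b2"},
    {"id": "102", "amount": "45.50", "_parent_id": "a1", "_id": "c3"},
]

rows = join_to_parent(customers, orders)
print(rows[0]["parent_user_name"], rows[0]["id"])  # Alice 101
```

The same join can be done in SQL or a dataframe library once the tables are saved. The dictionary lookup here just shows that the parent reference is all that is needed.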

Key Options:

Custom field separators: separator="."
Use existing IDs: id_field="customer_id"
Error handling: errors="skip"
File processing: tm.flatten_file("data.json")

Advanced Options

For more control over the flattening process:

result = tm.flatten(
    data,
    name="products",
    # Naming options
    separator=".",              # Use dots: user.name instead of user_name
    nested_threshold=3,         # Simplify deeply nested field names
    # ID management
    id_field="sku",            # Use existing field as primary ID
    parent_id_field="_parent",  # Customize parent reference field name
    add_timestamp=True,         # Add processing timestamp to records
    # Array handling (default is "smart")
    arrays="separate",         # Extract all arrays to child tables (vs "smart", "inline", "skip")
    # Data processing
    preserve_types=True,       # Keep original data types (not strings)
    skip_null=False,           # Include null values in output
    skip_empty=False,          # Include empty strings/lists
    # Performance tuning
    batch_size=5000,           # Process more records per batch
    low_memory=True,           # Optimize for memory usage over speed
)
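
The `arrays` option decides what happens to each list met during flattening. As a rough illustration of the distinction smart mode draws in the earlier example (simple arrays kept inline, arrays of objects moved to child tables), here is a hypothetical classifier. The function name and the exact rule are assumptions for illustration; Transmog's real heuristics may differ, and the "inline" and "skip" modes are not modelled.

```python
def array_destination(values, mode="smart"):
    """Return 'inline' or 'child_table' for a list, under an assumed rule set."""
    if mode == "separate":
        return "child_table"  # every array is extracted
    if mode == "smart":
        # Assumed rule: any dict or nested list makes the array "complex".
        is_complex = any(isinstance(v, (dict, list)) for v in values)
        return "child_table" if is_complex else "inline"
    raise ValueError(f"mode not covered by this sketch: {mode!r}")


print(array_destination(["premium", "verified"]))             # inline
print(array_destination([{"id": 101, "amount": 99.99}]))      # child_table
print(array_destination(["premium"], mode="separate"))        # child_table
```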

Documentation

Complete documentation is available at scottdraper8.github.io/transmog.

Contributing

For contribution guidelines, development setup, and coding standards, see the Contributing Guide in the documentation.

License

MIT License
