A declarative, type-safe Python DSL for mapping complex nested JSON to relational database schemas

Project description

`etielle`: Declarative JSON-to-Relational Mapping in Python

etielle is a simple, powerful Python library for reshaping nested JSON data, typically from an API, into relational tables that fit your database schema. Think of etielle as a “JSON extractor” that you program with clear instructions: “Go here in the JSON, pull this data, and put it in that table.” The library’s name is a play on ETL (“Extract, Transform, Load”), which is the technical term for this set of operations.

Repository: Promptly-Technologies-LLC/etielle
PyPI: etielle
Python: ≥ 3.13

Why Use `etielle`? (For Beginners)

JSON data from APIs (Application Program Interfaces—web services that typically return JSON) is often deeply nested and requires complicated parsing. etielle helps by:

Declaring what you want: Write Python code to describe your tables and how to fill them.
Traversing nested structures: Walk through arrays-within-dictionaries-within-arrays to any arbitrary depth.
Performing arbitrary transformations: Use the provided functions to perform common operations (like getting the key or index of the current item or its parent), or define your own.
Building relationships: Link records across your different output tables and emit ORM relationships or foreign keys.
Being beginner-friendly: Everything is type-safe (Python checks your types), composable (build complex things from simple pieces), and easy to debug.

Learning Path

Start here: Follow the Quick Start example below to see basic mapping
Understand the pieces: Read Core Concepts to learn about Context, Transforms, and TraversalSpec
Go deeper: Explore the detailed examples for nesting and merging
Advanced features: Check out the docs/ folder for instance emission, relationships, and more

Installation

We recommend using uv for faster installs, but pip works too.

With uv (Recommended for Speed)

For your project:

uv add etielle

For one-off use:

uv pip install etielle

With pip

pip install etielle

Optional: SQLAlchemy adapter

If you plan to bind relationships and flush via SQLAlchemy in one go, install the optional extra:

uv add "etielle[sqlalchemy]"

Optional: SQLModel adapter

If you plan to bind relationships and flush via SQLModel in one go, install the optional extra:

uv add "etielle[sqlmodel]"

Quick Start: Your First Mapping

Let’s start with a simple example. Suppose you have this JSON:

import json

data = {
  "users": [
    {"id": "u1", "name": "Alice", "posts": [{"id": "p1", "title": "Hello"}, {"id": "p2", "title": "World"}]},
    {"id": "u2", "name": "Bob", "posts": []}
  ]
}

We want two tables: “users” (id, name) and “posts” (id, user_id, title).

Here’s the code:

from etielle.core import MappingSpec, TraversalSpec, TableEmit, Field  # Core building blocks
from etielle.transforms import get, get_from_parent  # Functions to pull data from JSON
from etielle.executor import run_mapping  # The engine that runs everything

# A TraversalSpec tells etielle how to walk through your JSON. Think of it as
# giving directions: "Start at the 'users' key, then loop through each item in that array."

# Traverse users array
users_traversal = TraversalSpec(
    path=["users"],  # Path to the array
    mode="auto",  # Iterate automatically based on container
    emits=[
        # The join_keys identify each unique row—like a primary key in a database.
        # Rows with matching keys will be merged together.
        TableEmit(
            table="users",
            join_keys=[get("id")],  # Unique key for the row
            fields=[
                Field("id", get("id")),
                Field("name", get("name"))
            ]
        )
    ]
)

# This second traversal is nested: first we navigate to each user,
# then for each user we go into their posts array using inner_path.
posts_traversal = TraversalSpec(
    path=["users"],
    mode="auto",
    inner_path=["posts"],  # Nested path inside each user
    inner_mode="auto",
    emits=[
        TableEmit(
            table="posts",
            join_keys=[get("id")],
            fields=[
                Field("id", get("id")),
                Field("user_id", get_from_parent("id")),  # Link to parent user
                Field("title", get("title"))
            ]
        )
    ]
)

spec = MappingSpec(traversals=[users_traversal, posts_traversal])
result = run_mapping(data, spec)

# result is a dict: {"users": MappingResult, "posts": MappingResult}
# Each MappingResult has .instances (a dict keyed by join_keys)
# Let's convert to simple lists for display:
out = {table: list(mr.instances.values()) for table, mr in result.items()}
print(json.dumps(out, indent=2))

{
  "users": [
    {
      "id": "u1",
      "name": "Alice"
    },
    {
      "id": "u2",
      "name": "Bob"
    }
  ],
  "posts": [
    {
      "id": "p1",
      "user_id": "u1",
      "title": "Hello"
    },
    {
      "id": "p2",
      "user_id": "u1",
      "title": "World"
    }
  ]
}

Congrats! You’ve mapped your first JSON.

Core Concepts: Breaking It Down

Let’s explain the building blocks like you’re learning for the first time.

1. Context: Your “Location” in the JSON

Imagine traversing a JSON tree—Context is your GPS:

root: The entire JSON.
node: The current spot (e.g., a user object).
path: Directions to get here (e.g., (“users”, 0)).
parent: The previous spot (for looking “up”).
key/index: If in a dict/list, the current key or index.
slots: A notepad for temporary notes.

Contexts are created automatically as you traverse and are immutable (unchangeable) for safety.

2. Transforms: Smart Data Extractors

Transforms are like mini-functions that pull values from Context. They’re “lazy”—they don’t run until needed, and they adapt to the current Context.

Examples:

get("name"): Get “name” from current node → "Alice" when node is {"name": "Alice"}
get_from_parent("id"): Get “id” from parent context → "u1" when processing a post under user u1
index(): Current list position → 0 for first item, 1 for second, etc.
concat(literal("user_"), get("id")): Combine strings → "user_u1"

Full list in the Cheatsheet below.

3. TraversalSpec: How to Walk the JSON

This says: “Start here, then go deeper if needed, and do this for each item.”

path: Starting path (list of strings, e.g., [“users”]).
mode: Iteration mode for the outer container: “auto” (default), “items”, or “single”.
inner_path: Optional deeper path (e.g., [“posts”] for nesting).
inner_mode: Iteration mode for the inner container: “auto” (default), “items”, or “single”.
emits: What tables to create from each item.

You can have multiple Traversals in one MappingSpec—they run independently.

Here’s a visual representation of how traversals work:

JSON structure:
root
└── users []                    ← path=["users"]
    ├── [0] {"id": "u1", ...}
    │   └── posts []            ← inner_path=["posts"]
    │       ├── [0] {"id": "p1", "title": "Hello"}
    │       └── [1] {"id": "p2", "title": "World"}
    └── [1] {"id": "u2", ...}

4. TableEmit and Fields: Building Your Tables

table: Name of the table.
fields: List of Field(name, transform) – columns and how to compute them.
join_keys: List of transforms for unique row IDs (like primary keys). Same keys across traversals merge rows.

5. Executor: Running It All

run_mapping(json_data, spec) executes everything and returns a dict of tables.

Detailed Examples

Example 1: Composite Keys for Merging Data

Merge user info from two parts of JSON:

spec = MappingSpec(traversals=[
    TraversalSpec(  # Basic user data
        path=["users"],
        mode="auto",
        emits=[TableEmit(
            table="users",
            join_keys=[get("id")],
            fields=[Field("id", get("id")), Field("name", get("name"))]
        )]
    ),
    TraversalSpec(  # Add email from another section
        path=["profiles"],
        mode="auto",
        emits=[TableEmit(
            table="users",  # Same table!
            join_keys=[get("user_id")],  # Matches previous keys
            fields=[Field("email", get("email"))]
        )]
    )
])

Rows with matching keys merge: e.g., add “email” to existing user row.

Example 2: Deep Nesting (Arbitrary Depth)

No limit to depth—use longer inner_path. The depth parameter controls how many levels up to look:

get_from_parent("id") or depth=1 → immediate parent
get_from_parent("id", depth=2) → grandparent
get_from_parent("id", depth=3) → great-grandparent

spec = MappingSpec(traversals=[
    TraversalSpec(
        path=["servers"],
        mode="auto",
        inner_path=["channels", "messages", "reactions"],  # 3 levels deep!
        inner_mode="auto",
        emits=[TableEmit(
            table="reactions",
            join_keys=[get_from_parent("id", depth=3), get_from_parent("id", depth=2), get_from_parent("id"), get("id")],
            fields=[
                Field("server_id", get_from_parent("id", depth=3)),
                Field("channel_id", get_from_parent("id", depth=2)),
                Field("message_id", get_from_parent("id")),
                Field("reaction", get("emoji"))
            ]
        )]
    )
])

Transform Cheatsheet

get(path): From current node (dot notation or list, e.g., “user.name” or [“user”, 0]).
get_from_parent(path, depth=1): From ancestor.
get_from_root(path): From top-level JSON.
key(): Current dict key.
index(): Current list index.
literal(value): Constant value.
concat(*parts): Join strings.
format_id(*parts, sep="_"): Join non-empty parts with separator.
coalesce(*transforms): First non-None value.
len_of(inner): Length of a list/dict/string.

Pro Tip: Transforms are lazy—they run in the “context” of where they’re used, making them super flexible.

Transforms compose naturally:

user_key = concat(literal("user_"), get("id"))           # "user_123"
full_name = concat(get("first"), literal(" "), get("last"))  # "Alice Smith"

Common Mistakes

Empty results?
- Check your path matches the JSON structure exactly
- Verify the data type at that path matches expectations
Missing parent data?
- Check the depth parameter in get_from_parent()
- Ensure the parent context exists in your traversal
Duplicate or missing rows?
- Verify join_keys are unique for each row
- Check that join_keys don’t contain None values (these rows are skipped)

Advanced Topics

Lazy Evaluation: Transforms don’t compute until executed, adapting to the current spot in JSON.
Custom Transforms: Define your own functions that take Context and return values.
Row Merging Rules: Last write wins for duplicate fields; missing keys skip rows.
Field selectors: Type-safe field references. See Field selectors.
Instance emission: Build Pydantic/TypedDict/ORM instances directly instead of dicts. See Instance emission.
Merge policies: Sum/append/min/max instead of overwrite when multiple traversals update the same field. See Merge policies.
Error reporting: Per-key diagnostics in results. See Error reporting.
Relationships without extra round trips: Bind in-memory, flush once. See Relationships and SQLAlchemy adapter.
Performance: Efficient for large JSON; traversals are independent.

Roadmap Ideas

Database integrations (e.g., SQLAlchemy).
More examples and benchmarks.
Visual mapping tools.

Glossary

Context: Your current position while traversing the JSON tree
Transform: A function that extracts values from a Context
Traversal: Instructions for walking through part of the JSON
Emit: Creating a table row from the current context
Join keys: Values that uniquely identify a row (like primary keys)
Depth: How many parent levels to traverse upward

License

MIT

Need help? Open an issue on GitHub!

Project details

Release history Release notifications | RSS feed

3.6.1

Dec 14, 2025

3.6.0

Dec 14, 2025

3.5.2

Dec 12, 2025

3.5.1

Dec 4, 2025

3.5.0

Dec 3, 2025

3.4.0

Dec 3, 2025

3.3.0

Dec 3, 2025

3.2.0

Dec 2, 2025

3.1.0

Dec 2, 2025

3.0.0

Dec 2, 2025

2.6.0

Dec 2, 2025

2.5.0

Dec 1, 2025

2.4.0

Dec 1, 2025

2.3.2

Nov 29, 2025

2.3.1

Nov 29, 2025

2.3.0

Nov 29, 2025

2.2.1

Nov 25, 2025

2.2.0

Oct 23, 2025

This version

2.1.0

Oct 20, 2025

2.0.0

Oct 19, 2025

1.4.0

Oct 15, 2025

1.3.0

Oct 14, 2025

1.2.0

Oct 14, 2025

1.1.0

Oct 14, 2025

1.0.6

Oct 14, 2025

0.2.0

Oct 14, 2025

0.0.0

Oct 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

etielle-2.1.0.tar.gz (93.2 kB view details)

Uploaded Oct 20, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

etielle-2.1.0-py3-none-any.whl (19.8 kB view details)

Uploaded Oct 20, 2025 Python 3

File details

Details for the file etielle-2.1.0.tar.gz.

File metadata

Download URL: etielle-2.1.0.tar.gz
Upload date: Oct 20, 2025
Size: 93.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for etielle-2.1.0.tar.gz
Algorithm	Hash digest
SHA256	`b4c325f51cea8ec972810bc646bc7fa43c3e7340db5ade586e8fadf3108d864b`
MD5	`347007022999352185fd81df73a500a7`
BLAKE2b-256	`123fabbd70d8d7d3a6c5b07bc1af893f11b2688f7746bcc7eb15e7bb5a44206b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for etielle-2.1.0.tar.gz:

Publisher: release.yml on Promptly-Technologies-LLC/etielle

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: etielle-2.1.0.tar.gz
- Subject digest: b4c325f51cea8ec972810bc646bc7fa43c3e7340db5ade586e8fadf3108d864b
- Sigstore transparency entry: 623149235
- Sigstore integration time: Oct 20, 2025
Source repository:
- Permalink: Promptly-Technologies-LLC/etielle@a9bbee9e204b834602271cdf453d5e60088574d7
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Promptly-Technologies-LLC
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@a9bbee9e204b834602271cdf453d5e60088574d7
- Trigger Event: workflow_run

File details

Details for the file etielle-2.1.0-py3-none-any.whl.

File metadata

Download URL: etielle-2.1.0-py3-none-any.whl
Upload date: Oct 20, 2025
Size: 19.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for etielle-2.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8b2cb2aa5291a7a9f32af9b62753d6524b29aa3f694f0f679895b1737f6529b1`
MD5	`a160c53dc63294b010d6af4a92bf86a0`
BLAKE2b-256	`302303c1fc948dc3131abb4b50c5bc0045a05ad1f1178cdb608cba08dc73b9c2`

See more details on using hashes here.

Provenance

The following attestation bundles were made for etielle-2.1.0-py3-none-any.whl:

Publisher: release.yml on Promptly-Technologies-LLC/etielle

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: etielle-2.1.0-py3-none-any.whl
- Subject digest: 8b2cb2aa5291a7a9f32af9b62753d6524b29aa3f694f0f679895b1737f6529b1
- Sigstore transparency entry: 623149246
- Sigstore integration time: Oct 20, 2025
Source repository:
- Permalink: Promptly-Technologies-LLC/etielle@a9bbee9e204b834602271cdf453d5e60088574d7
- Branch / Tag: refs/heads/main
- Owner: https://github.com/Promptly-Technologies-LLC
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@a9bbee9e204b834602271cdf453d5e60088574d7
- Trigger Event: workflow_run

etielle 2.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

etielle: Declarative JSON-to-Relational Mapping in Python

Why Use etielle? (For Beginners)

Learning Path

Installation

With uv (Recommended for Speed)

With pip

Optional: SQLAlchemy adapter

Optional: SQLModel adapter

Quick Start: Your First Mapping

Core Concepts: Breaking It Down

1. Context: Your “Location” in the JSON

2. Transforms: Smart Data Extractors

3. TraversalSpec: How to Walk the JSON

4. TableEmit and Fields: Building Your Tables

5. Executor: Running It All

Detailed Examples

Example 1: Composite Keys for Merging Data

Example 2: Deep Nesting (Arbitrary Depth)

Transform Cheatsheet

Common Mistakes

Advanced Topics

Roadmap Ideas

Glossary

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

`etielle`: Declarative JSON-to-Relational Mapping in Python

Why Use `etielle`? (For Beginners)