A Tabular Helper API library that maps and joins dict-like data into row dicts using dotted-path projection.
Project description
tha-map-runner
A small Python library that joins a list of row dicts with a lookup source on a single key, projecting values into flat row columns via a mapping config.
Supports left, inner, and anti joins — all with dotted-path projection on the source side.
Install
pip install tha-map-runner
Quick start
from tha_map_runner import ThaMap
rows = [
{"Org BK": "school-001", "Start Date": "08/15"},
{"Org BK": "school-002", "Start Date": "08/16"},
]
api_response = [
{"sourcedId": "school-001", "name": "Lincoln Elementary", "parent": {"sourcedId": "dist-A"}},
{"sourcedId": "school-002", "name": "Roosevelt Middle", "parent": {"sourcedId": "dist-A"}},
]
mapper = ThaMap()
enriched = mapper.enrich_rows(
rows=rows,
source=api_response,
mapping={
"Org Name": "name",
"Parent BK": "parent.sourcedId",
},
row_key="Org BK",
source_key="sourcedId",
)
How it works
- Builds an index of
sourceonsource_key— O(n+m), no nested loops - For each row, looks up a match by
row[row_key] - Walks dotted paths (
"parent.sourcedId") into the matched source entry - Projects resolved values into new columns on a copy of the row
- Returns a new list — input is never mutated
Rows whose row status is in skip_statuses are passed through unchanged.
API
ThaMap
ThaMap()
mapper.enrich_rows()
mapper.enrich_rows(
rows, # list of row dicts
source, # list of dicts to join against
mapping, # {"output_column": "dotted.path"}
row_key, # column name in rows to match on
source_key, # field in source to match on
*,
how="left", # "left" | "inner" | "anti"
on_no_match="skip", # "skip" | "error" | "blank" (left only)
allow_empty_source=False, # if True, empty source is not an error
skip_statuses=["error", "warning"],# rows with these statuses are passed through
) -> list[dict]
Results are also stored in mapper.rows.
how
| Value | Behaviour |
|---|---|
"left" |
All rows kept; unmatched rows handled by on_no_match |
"inner" |
Only matched rows kept; mapping applied |
"anti" |
Only unmatched rows kept; no mapping applied |
Rows whose row status is in skip_statuses are always passed through unchanged, regardless of how.
on_no_match (left join only)
| Value | Behaviour |
|---|---|
"skip" |
Row is returned unchanged — no new columns added |
"error" |
row status="error", message=..., mapping columns set to "" |
"blank" |
Mapping columns set to "", row status untouched |
skip_statuses
By default, rows already marked row status="error" or row status="warning" are passed through without processing. Override with any list:
mapper.enrich_rows(..., skip_statuses=["error"]) # only skip errors
mapper.enrich_rows(..., skip_statuses=["error", "pending"]) # custom statuses
mapper.enrich_rows(..., skip_statuses=[]) # process every row regardless
Composing with tha-csv-runner
from tha_csv_runner import ThaCSV
from tha_map_runner import ThaMap
import requests
runner = ThaCSV()
runner.read("Step 1 of 2", "input.csv", ["Org BK"])
api_response = requests.get(api_url).json()
mapper = ThaMap()
enriched = mapper.enrich_rows(
rows=runner.rows,
source=api_response,
mapping={"Org Name": "name", "District": "parent.sourcedId"},
row_key="Org BK",
source_key="sourcedId",
)
runner.write("Step 2 of 2", "output.csv", rows=enriched)
Alternatives
This library is intentionally limited in scope — it handles one specific pattern: left-joining row dicts against a lookup list and projecting values via dotted paths. For more general needs:
- pandas —
DataFrame.merge()covers join operations with far more flexibility (inner, outer, multi-key, aggregations) - glom — powerful dotted-path access and transformation for arbitrarily nested Python data structures
- jmespath — JSON path-style queries for extracting values from nested dicts
Choose this library when you're already working with tha-* row dicts and want to join them against a lookup list in one call — no DataFrame conversion, join and projection in a single step.
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tha_map_runner-0.2.0.tar.gz.
File metadata
- Download URL: tha_map_runner-0.2.0.tar.gz
- Upload date:
- Size: 34.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8404923fed5587740f1589d80249eb826c202cfa04bc0282951204fb5227905e
|
|
| MD5 |
aa3f891cc79f5d4d4da403a92c6aecc8
|
|
| BLAKE2b-256 |
f557c63614766a9f137323fd06ffee29573282fb1c4c5625b399ec7214f5faa9
|
Provenance
The following attestation bundles were made for tha_map_runner-0.2.0.tar.gz:
Publisher:
publish.yml on tha-guy-nate/tha-map-runner
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
tha_map_runner-0.2.0.tar.gz -
Subject digest:
8404923fed5587740f1589d80249eb826c202cfa04bc0282951204fb5227905e - Sigstore transparency entry: 1563949518
- Sigstore integration time:
-
Permalink:
tha-guy-nate/tha-map-runner@dd94054a8297f562397ef7309a61b1385a451eac -
Branch / Tag:
refs/tags/v0.2.0 - Owner: https://github.com/tha-guy-nate
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@dd94054a8297f562397ef7309a61b1385a451eac -
Trigger Event:
push
-
Statement type:
File details
Details for the file tha_map_runner-0.2.0-py3-none-any.whl.
File metadata
- Download URL: tha_map_runner-0.2.0-py3-none-any.whl
- Upload date:
- Size: 6.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fa6f35d0fd191b1d9e475a48ee2a5e112485afe181c90d30c390b17aa3ea0a08
|
|
| MD5 |
94464c1fed17f04f44ce35ebe11b4474
|
|
| BLAKE2b-256 |
4c6748e59c32465364b6effc615f1bc750bfb8a7a9a8fc1022f255932558ca86
|
Provenance
The following attestation bundles were made for tha_map_runner-0.2.0-py3-none-any.whl:
Publisher:
publish.yml on tha-guy-nate/tha-map-runner
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
tha_map_runner-0.2.0-py3-none-any.whl -
Subject digest:
fa6f35d0fd191b1d9e475a48ee2a5e112485afe181c90d30c390b17aa3ea0a08 - Sigstore transparency entry: 1563949544
- Sigstore integration time:
-
Permalink:
tha-guy-nate/tha-map-runner@dd94054a8297f562397ef7309a61b1385a451eac -
Branch / Tag:
refs/tags/v0.2.0 - Owner: https://github.com/tha-guy-nate
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@dd94054a8297f562397ef7309a61b1385a451eac -
Trigger Event:
push
-
Statement type: