A library that facilitates mapping and aggregating objects from one model to another

PyTransmuter

PyTransmuter is a Python library designed for efficient data mapping and aggregation from source models to target models, leveraging the power of Pydantic for data validation and transformation. It simplifies the process of transforming data structures, making it an essential tool for applications requiring data normalization, transformation, and aggregation.

Features

  • Generic Mapping: Utilizes generic typing to map data between different model structures.
  • Flexible Aggregation: Supports complex data aggregation strategies, including grouping, sorting, and custom aggregation functions.
  • Self-Inspection: Incorporates self-inspection capabilities for resolving callables related to class instances.
  • Pydantic Integration: Leverages Pydantic models for input and output validation, ensuring data integrity.
  • Customizable Transformations: Allows defining custom field transformations using callable functions or lambdas.

Installation

To install the library, run the following command in your terminal:

pip install py-transmuter

Usage of the Mapper

Basic Setup

Start by defining your source and target Pydantic models that represent the structure of your input and output data:

from pydantic import BaseModel

class SourceModel(BaseModel):
    id: int
    temperature_celsius: float
    humidity_percentage: int

class TargetModel(BaseModel):
    id: int
    temperature_fahrenheit: float
    humidity_proportion: float

Implementing a Mapper

Define a mapper by inheriting from BaseModelMapper, specifying how each field in the source model maps to the target model:

from py_transmuter.pydantic.mapper import BaseModelMapper

class MyMapper(BaseModelMapper[TargetModel, SourceModel]):
    mapping = {
        "id": "id",
        "temperature_fahrenheit": ("temperature_celsius", lambda c: c * 9 / 5 + 32),
        "humidity_proportion": ("humidity_percentage", lambda h: h / 100.0),
    }
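The transformation callables in the mapping are ordinary Python lambdas, so you can sanity-check them in isolation before wiring them into a mapper (the names below are just for illustration, not part of the library):

```python
# The same conversion logic used in the mapping above, checked standalone.
celsius_to_fahrenheit = lambda c: c * 9 / 5 + 32
percentage_to_proportion = lambda h: h / 100.0

print(celsius_to_fahrenheit(25))     # 77.0
print(percentage_to_proportion(50))  # 0.5
```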

Mapping Data

Use your mapper to transform data from the source model to the target model:

source_data = SourceModel(id=1, temperature_celsius=25, humidity_percentage=50)

mapper = MyMapper()
target_data = mapper.map(source_data)

print(target_data)
# Output: TargetModel(id=1, temperature_fahrenheit=77.0, humidity_proportion=0.5)

Mapping Lists of Data

BaseModelMapper also supports mapping lists of data from source models to target models:

source_list = [
    SourceModel(id=1, temperature_celsius=25, humidity_percentage=50),
    SourceModel(id=2, temperature_celsius=20, humidity_percentage=60),
]

mapped_list = mapper.map_list(source_list)

for item in mapped_list:
    print(item)
# Output:
# TargetModel(id=1, temperature_fahrenheit=77.0, humidity_proportion=0.5)
# TargetModel(id=2, temperature_fahrenheit=68.0, humidity_proportion=0.6)

Usage of the Aggregator

Basic Setup

First, define your source and target Pydantic models:

from pydantic import BaseModel

class SourceModel(BaseModel):
    id: int
    first_name: str
    last_name: str

class TargetModel(BaseModel):
    full_names: list[str]

Aggregating Data

To aggregate data from a list of SourceModel instances into a list of TargetModel instances, define an aggregator class by inheriting from BaseModelAggregator:

from py_transmuter.pydantic.aggregator import BaseModelAggregator

class MyAggregator(BaseModelAggregator[TargetModel, SourceModel]):
    mappings = {"full_names": lambda data: f"{data.first_name} {data.last_name}"}

Using the Aggregator

Once your aggregator is defined, you can use it to aggregate data as follows:

# Sample data
source_data = [
    SourceModel(id=1, first_name="Jane", last_name="Doe"),
    SourceModel(id=2, first_name="John", last_name="Doe"),
]

# Aggregating data
aggregator = MyAggregator()
target_data = aggregator.aggregate(source_data)

print(target_data)
# Output: [TargetModel(full_names=['Jane Doe', 'John Doe'])]
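Since no grouping is specified, all source items fall into a single group, and the "full_names" mapping collects one value per element. As a rough mental model (not the library's implementation), this is equivalent to a plain-Python list comprehension:

```python
# Plain-Python sketch of the "full_names" mapping: one value per element,
# collected into a list for the single group.
people = [
    {"first_name": "Jane", "last_name": "Doe"},
    {"first_name": "John", "last_name": "Doe"},
]
full_names = [f"{p['first_name']} {p['last_name']}" for p in people]
print(full_names)  # ['Jane Doe', 'John Doe']
```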

Advanced Usage of the Aggregator

For more complex scenarios, py-transmuter allows for advanced data transformation capabilities, including grouping, sorting, and using custom functions for mappings and aggregations. Here’s an elaborate example that showcases these features.

Scenario

Imagine you have a dataset of measurements taken by different sensors in a scientific experiment. Each measurement includes the sensor's ID, the timestamp of the measurement, and the measured value. Your goal is to aggregate these measurements by sensor ID and day, calculate the daily average value for each sensor, and sort the results by date.

Source and Target Models

First, define your source model for individual measurements and a target model for the aggregated data:

from pydantic import BaseModel
from datetime import datetime, date

class Measurement(BaseModel):
    sensor_id: int
    timestamp: datetime
    value: float

class DailyAverage(BaseModel):
    sensor_id: int
    date: date
    average_value: float

Aggregator Class

Next, define the aggregator class that specifies how to group measurements, how to calculate the daily averages, and how to sort the results:

from py_transmuter.pydantic.aggregator import BaseModelAggregator
from statistics import mean

class MeasurementAggregator(BaseModelAggregator[DailyAverage, Measurement]):
    # Group by sensor ID and the date part of the timestamp
    group_by = (
        "sensor_id",
        lambda x: x.timestamp.date(),
    )
    
    # Define the aggregation to calculate the average value
    aggregations = {
        "sensor_id": ("sensor_id", lambda ids: ids[0]),
        "date": ("timestamp", lambda stamps: stamps[0].date()),
        "average_value": (
            "value",
            lambda values: mean(values),
        ),
    }

Executing the Aggregation

With the aggregator defined, you can transform a list of Measurement instances into a list of DailyAverage instances, grouped by sensor ID and date, with the daily average value calculated for each group:

# Sample data: a list of measurements
measurements = [
    Measurement(sensor_id=1, timestamp=datetime(2024, 1, 1, 12, 30), value=10),
    Measurement(sensor_id=1, timestamp=datetime(2024, 1, 1, 13, 45), value=20),
    Measurement(sensor_id=2, timestamp=datetime(2024, 1, 1, 14, 15), value=30),
    # More measurements...
]

# Instantiate the aggregator and aggregate the data
aggregator = MeasurementAggregator()
daily_averages = aggregator.aggregate(measurements)

# Output the aggregated data
print(daily_averages)
# [DailyAverage(sensor_id=1, date=date(2024, 1, 1), average_value=15.0), DailyAverage(sensor_id=2, date=date(2024, 1, 1), average_value=30.0), ...]
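For reference, the grouping and per-group aggregation performed above can be sketched in plain Python without the library. This is only a rough equivalent of the declared behavior, not py-transmuter's actual implementation:

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

measurements = [
    {"sensor_id": 1, "timestamp": datetime(2024, 1, 1, 12, 30), "value": 10},
    {"sensor_id": 1, "timestamp": datetime(2024, 1, 1, 13, 45), "value": 20},
    {"sensor_id": 2, "timestamp": datetime(2024, 1, 1, 14, 15), "value": 30},
]

# Group by (sensor_id, date), mirroring the group_by tuple above.
groups = defaultdict(list)
for m in measurements:
    groups[(m["sensor_id"], m["timestamp"].date())].append(m)

# Apply the per-group aggregations: first id, first date, mean of values.
daily_averages = [
    {
        "sensor_id": key[0],
        "date": key[1],
        "average_value": mean(m["value"] for m in group),
    }
    for key, group in groups.items()
]
print(daily_averages)
```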

Using mappings in the Aggregator

An Aggregator can have both aggregations and mappings. The former were explained above; the latter serve a simpler purpose: extract the value of a field from every element in the group and collect the results into a list. For example:

class Child(BaseModel):
    parent: str
    first_name: str
    last_name: str

class Parent(BaseModel):
    name: str
    children: list[str]

class ChildParentAggregator(BaseModelAggregator[Parent, Child]):
    group_by = ('parent',)

    mappings = {"children": lambda child: f"{child.first_name} {child.last_name}"}
    aggregations = {"name": ("parent", lambda names: names[0])}

With this setup, we would then get:

children = [
    Child(parent="Tom Smith", first_name="Paul", last_name="Smith"),
    Child(parent="Anna Lopez", first_name="Tupac", last_name="Towers"),
    Child(parent="Tom Smith", first_name="Laura", last_name="Smith")
]

parents = ChildParentAggregator().aggregate(children)
print(parents)
# [Parent(name="Tom Smith", children=["Paul Smith", "Laura Smith"]), Parent(name="Anna Lopez", children=["Tupac Towers"])]
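The interaction of group_by, mappings, and aggregations above can be pictured with this plain-Python sketch (a mental model only, not the library's code): elements are bucketed by the group key, mappings produce one value per element, and aggregations reduce each group to a single value.

```python
from collections import defaultdict

children = [
    {"parent": "Tom Smith", "first_name": "Paul", "last_name": "Smith"},
    {"parent": "Anna Lopez", "first_name": "Tupac", "last_name": "Towers"},
    {"parent": "Tom Smith", "first_name": "Laura", "last_name": "Smith"},
]

# group_by: bucket elements by the "parent" field.
groups = defaultdict(list)
for child in children:
    groups[child["parent"]].append(child)

parents = [
    {
        # aggregations: reduce the whole group to one value.
        "name": [c["parent"] for c in group][0],
        # mappings: one value per element, collected into a list.
        "children": [f'{c["first_name"]} {c["last_name"]}' for c in group],
    }
    for group in groups.values()
]
print(parents)
```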

Using Self-Inspection

There are many scenarios in which you need values that are only known at runtime, not when declaring the mappings and aggregations of your Mapper or Aggregator class. To support this, both classes can recognize methods that belong to them: you can reference such methods in the static mapping and aggregation definitions, and those methods can access class or instance attributes at runtime.

An example of this would be:

class A(BaseModel):
    id: int

class B(BaseModel):
    id: str

class ABMapper(BaseModelMapper[B, A]):
    def map_id(self, data: A) -> str:
        return str(data.id * self.context["factor"])

    mapping = {"id": map_id}

assert ABMapper(context={"factor": 2}).map(A(id=1)) == B(id="2")

The context attribute is included by default in both the Mapper and the Aggregator, but you can also define the class with any attributes you want and access them at the moment they are needed:

from typing import Any

class Mapper(BaseModelMapper[B, A]):
    attribute: Any

    def __init__(self, attribute: Any, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.attribute = attribute
