A library that facilitates mapping and aggregating objects from one model to another

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Framework
- Pydantic :: 2
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
- Python :: 3.12

Project description

PyTransmuter

PyTransmuter is a Python library designed for efficient data mapping and aggregation from source models to target models. It simplifies the process of transforming data structures, even between classes of different types (this is, you can transform from a BaseModel to a @dataclass if you need to) or even transform data in Python dictionaries.

Features

Generic Mapping: Utilizes generic typing to map data between different model structures; for example, Python's @dataclass, Pydantic's BaseModel, Pydantic's @dataclass or even plain dictionaries.
Flexible Aggregation: Supports complex data aggregation strategies, including grouping, sorting, and custom aggregation functions.
Self-Inspection: Incorporates self-inspection capabilities for resolving callables related to class instances.
Customizable Transformations: Allows defining custom field transformations using callable functions or lambdas.

Installation

To install the library, run the following command in your terminal:

pip install py-transmuter

Usage of the ModelMapper

Basic Setup

Start by defining your source and target models that represent the structure of your input and output data:

from pydantic import BaseModel
from dataclasses import dataclass

@dataclass
class SourceModel:
    id: int
    temperature_celsius: float
    humidity_percentage: int

class TargetModel(BaseModel):
    id: int
    temperature_fahrenheit: float
    humidity_proportion: float

Implementing a ModelMapper

Define a mapper by inheriting from ModelMapper, specifying how each field in the source model maps to the target model:

from py_transmuter.models.mapper import ModelMapper

class MyMapper(ModelMapper[TargetModel, SourceModel]):
    mapping = {
        "id": "id",
        "temperature_fahrenheit": ("temperature_celsius", lambda c: c * 9 / 5 + 32),
        "humidity_proportion": ("humidity_percentage", lambda h: h / 100.0),
    }

Mapping Data

Use your mapper to transform data from the source model to the target model:

source_data = SourceModel(id=1, temperature_celsius=25, humidity_percentage=50)

mapper = MyMapper()
target_data = mapper.map(source_data)

print(target_data)
# Output: TargetModel(id=1, temperature_fahrenheit=77.0, humidity_proportion=0.5)

Mapping Lists of Data

ModelMapper also supports mapping lists of data from source models to target models:

source_list = [
    SourceModel(id=1, temperature_celsius=25, humidity_percentage=50),
    SourceModel(id=2, temperature_celsius=20, humidity_percentage=60),
]

mapped_list = mapper.map_list(source_list)

for item in mapped_list:
    print(item)
# Output: List of TargetModel instances with mapped data

Usage of the ModelAggregator

Basic Setup

First, define your source and target models:

from pydantic import BaseModel
from pydantic.dataclasses import dataclass as pydantic_dataclass

class SourceModel(BaseModel):
    id: int
    first_name: str
    last_name: str

@pydantic_dataclass
class TargetModel:
    full_names: list[str]

Aggregating Data

To aggregate data from a list of SourceModel instances into a list of TargetModel instances, define an aggregator class by inheriting from ModelAggregator:

from py_transmuter.models.aggregator import ModelAggregator

class MyAggregator(ModelAggregator[TargetModel, SourceModel]):
    mappings = {"full_names": lambda data: f"{data.first_name} {data.last_name}"}

Using the ModelAggregator

Once your aggregator is defined, you can use it to aggregate data as follows:

# Sample data
source_data = [
    SourceModel(id=1, first_name="Jane", last_name="Doe"),
    SourceModel(id=2, first_name="John", last_name="Doe"),
]

# Aggregating data
aggregator = MyAggregator()
target_data = aggregator.aggregate(source_data)

print(target_data)
# Output: [TargetModel(full_names=['Jane Doe', 'John Doe'])]

Advanced Usage of the ModelAggregator

For more complex scenarios, py-transmuter allows for advanced data transformation capabilities, including grouping, sorting, and using custom functions for mappings and aggregations. Here’s an elaborate example that showcases these features.

Scenario

Imagine you have a dataset of measurements taken by different sensors in a scientific experiment. Each measurement includes the sensor's ID, the timestamp of the measurement, and the measured value. Your goal is to aggregate these measurements by sensor ID and day, calculate the daily average value for each sensor, and sort the results by date.

Source and Target Models

First, define your source model for individual measurements and a target model for the aggregated data:

from pydantic import BaseModel
from datetime import datetime, date

class Measurement(BaseModel):
    sensor_id: int
    timestamp: datetime
    value: float

class DailyAverage(BaseModel):
    sensor_id: int
    date: date
    average_value: float

ModelAggregator Class

Next, define the aggregator class that specifies how to group measurements, how to calculate the daily averages, and how to sort the results:

from py_transmuter.models.aggregator import ModelAggregator
from statistics import mean

class MeasurementAggregator(ModelAggregator[DailyAverage, Measurement]):
    # Group by sensor ID and the date part of the timestamp
    group_by = (
        "sensor_id",
        lambda x: x.timestamp.date(),
    )
    
    # Define the aggregation to calculate the average value
    aggregations = {
        "sensor_id": ("sensor_id", lambda ids: ids[0]),
        "date": ("timestamp", lambda stamps: stamps[0].date()),
        "average_value": (
            "value",
            lambda values: mean(values),
        ),
    }

Executing the aggregation

With the aggregator defined, you can transform a list of Measurement instances into a list of DailyAverage instances, grouped by sensor ID and date, with the daily average value calculated for each group:

# Sample data: a list of measurements
measurements = [
    Measurement(sensor_id=1, timestamp=datetime(2024, 1, 1, 12, 30), value=10),
    Measurement(sensor_id=1, timestamp=datetime(2024, 1, 1, 13, 45), value=20),
    Measurement(sensor_id=2, timestamp=datetime(2024, 1, 1, 14, 15), value=30),
    # More measurements...
]

# Instantiate the aggregator and aggregate the data
aggregator = MeasurementAggregator()
daily_averages = aggregator.aggregate(measurements)

# Output the aggregated data
print(daily_averages)
# [DailyAverage(sensor_id=1, date=date(2024, 1, 1), average_value=15), DailyAverage(sensor_id=2, date=date(2024, 1, 1), value=30), ...]

Using mappings in the ModelAggregator

An Aggregator can have both aggregation and mappings; the former were explained above, but the latter serve a way simpler purpose: simply extract the value of field for every element in the group and store it in a list; for example:

class Child(BaseModel):
    parent: str
    first_name: str
    last_name: str

class Parent(BaseModel):
    name: str
    children: list[str]

class ChildParentAggregator(ModelAggregator[Parent, Child]):
    group_by = ('parent',)

    mappings = {"children": lambda child: f"{child.first_name} {child.last_name}"}
    aggregations = {"name": ("parent": lambda names: names[0])}

With this setup, we would then get:

children = [
    Child(parent="Tom Smith", first_name="Paul", last_name="Smith"),
    Child(parent="Anna Lopez", first_name="Tupac", last_name="Towers"),
    Child(parent="Tom Smith", first_name="Laura", last_name="Smith")
]

parents = ChildParentAggregator().aggregate(children)
print(parents)
# [Parent(name="Tom Smith", children=["Paul Smith", "Laura Smith"]), Parent(name="Anna Lopez", children=["Tupac Towers"])]

Using self inspection

There are many scenarios in which we need values that are known only in runtime and not when declaring the mappings and aggregations for our ModelMapper or ModelAggregator class; for this, both these classes are capable of identifying methods that "belong to them" and it is possible to add them to the static definitions of mappings and aggregations, and can later access class or instance attributes in runtime.

An example of this would be:

@dataclass
class A:
    id: int

@dataclass
class B:
    id: str

class ABMapper(ModelMapper[B, A]):
    def map_id(self, data: A) -> str:
        return str(data.id * self.context["factor"])

    mapping = {"id": map_id}

assert ABMapper(context={"factor": 2}).map(A(id=1)) == B(id="2")

The context attribute is included by default in both the ModelMapper and the ModelAggregator, but you could define the class as you wish (with any attributes you'd want) and access them in the moment they are needed:

class Mapper(ModelMapper[B, A]):
    attribute: Any

    def __init__(self, attribute: Any, *args, **kwargs):
        super().__init__(*args, **kwargs)
        self.attribute = attribute

Dictionary Mapper and Aggregator

Very similarly to what we can do with the ModelMapper and ModelAggregator, we can create a DictionaryMapper and DictionaryAggregator. These have the same functionalities we have seen above but allow for mapping and aggregation of dictionaries of type dict[Any, Any] (not restricted to string keys or JSON formats).

An example of this would be:

from py_transmuter.dictionaries.mapper import DictionaryMapper

class Mapper(DictionaryMapper):
    mapping = {
        "id": 1,
        ("march", 14): date(2021, 3, 14),
    }

mapper = Mapper()

source = {1: "an_id", date(2021, 3, 14): "Tomorrow is Pi day!"}
mapper.map(source)
# {"id": "an_id", ("march, 14"): "Tomorrow is Pi day!"}

This seems quite odd, but who knows, maybe you're the one who finds it useful!

Project details

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Framework
- Pydantic :: 2
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language
- Python :: 3.12

Release history Release notifications | RSS feed

This version

0.2.0

Apr 16, 2024

0.1.2

Mar 29, 2024

0.1.1

Mar 29, 2024

0.1.0

Mar 29, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py_transmuter-0.2.0.tar.gz (14.2 kB view details)

Uploaded Apr 16, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

py_transmuter-0.2.0-py3-none-any.whl (15.4 kB view details)

Uploaded Apr 16, 2024 Python 3

File details

Details for the file py_transmuter-0.2.0.tar.gz.

File metadata

Download URL: py_transmuter-0.2.0.tar.gz
Upload date: Apr 16, 2024
Size: 14.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.0.0 CPython/3.12.0

File hashes

Hashes for py_transmuter-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`438a7992d7652986f1164ea817c001e239ae4a915fd0e7da89c6890c4d3d2fea`
MD5	`27c0fbcdbc1802ac713223760065ab34`
BLAKE2b-256	`e8e45e736a0d617524688ea4d0e2f084a6ccdf9ce04cd7d21207c38f6d6cdd8b`

See more details on using hashes here.

File details

Details for the file py_transmuter-0.2.0-py3-none-any.whl.

File metadata

Download URL: py_transmuter-0.2.0-py3-none-any.whl
Upload date: Apr 16, 2024
Size: 15.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.0.0 CPython/3.12.0

File hashes

Hashes for py_transmuter-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3df8523348e731bdf7bb4ebb213e6edc81f3763bdf5ee6d1ebcf4a7881af1622`
MD5	`86f49e9330ed7b7bf51acc4e92a9eb35`
BLAKE2b-256	`bca4119a165e44c5f663d534b497150486066239ccd2dc3cd8af5229f15ed98f`

See more details on using hashes here.

py-transmuter 0.2.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

PyTransmuter

Features

Installation

Usage of the ModelMapper

Basic Setup

Implementing a ModelMapper

Mapping Data

Mapping Lists of Data

Usage of the ModelAggregator

Basic Setup

Aggregating Data

Using the ModelAggregator

Advanced Usage of the ModelAggregator

Scenario

Source and Target Models

ModelAggregator Class

Executing the aggregation

Using mappings in the ModelAggregator

Using self inspection

Dictionary Mapper and Aggregator

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes