Create nested data structures easily.
NOTICE: this package supports pydantic v2 only. If you want to use it with pydantic v1, please use pydantic-resolve instead.
Introduction
Building related data has always been troublesome, whether through an ORM or by constructing it manually, especially when you need to combine data from multiple kinds of sources.
For example, suppose I want to provide a blog list where each blog carries its 10 most recent comments, 5 edit histories, and author info, however:
- blogs and comments are stored in the DB
- edit histories are stored as files
- author details are provided by some 3rd party user service module.
(This scenario is merely hypothetical.)
```mermaid
---
title: data from different sources
---
erDiagram
    Blog ||--o{ Comment : has
    Blog ||--o{ EditHistory : has
    Blog ||--|| Author : belongs_to
    Blog {
        binary uuid
        binary author_id
        string title
        string content
        source db_table_blog
    }
    Comment {
        binary uuid
        binary blog_id
        string content
        binary author_id
        source db_table_comment
    }
    EditHistory {
        binary uuid
        binary blog_id
        string content
        source provider_oss
    }
    Author {
        binary uuid
        string name
        string email
        source provider_azure_user
    }
```
pydantic-resolve provides a unified approach to stitching together various data sources; all you need is to define a DataLoader for each data source.
```python
class Blog(BaseModel):
    id: int
    name: str
    author_id: str

    # 1 : 1
    author: Optional[Author] = None
    def resolve_author(self, user_loader=LoaderDepend(UserDataLoader)):
        return user_loader.load(self.author_id)  # service: api handler

    # 1 : n
    comments: List[Comment] = []
    def resolve_comments(self, comment_loader=LoaderDepend(CommentDataLoader)):
        return comment_loader.load(self.id)  # service: db handler

    # 1 : n
    edit_histories: List[EditHistory] = []
    def resolve_edit_histories(self, history_loader=LoaderDepend(EditHistoryDataLoader)):
        return history_loader.load(self.id)  # service: file handler
```
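Each loader batches the per-blog lookups into a single call per source. A minimal sketch of the 1:1 author case, with a hypothetical `fetch_users_by_ids` standing in for the real user service:

```python
import asyncio

# stand-in for a real user service call (hypothetical)
async def fetch_users_by_ids(ids):
    store = {1: dict(id=1, name='Alice'), 2: dict(id=2, name='Bob')}
    return [store[i] for i in ids if i in store]

async def users_batch_load_fn(author_ids):
    """Return one author (or None) per requested id, in the same order."""
    users = await fetch_users_by_ids(author_ids)
    by_id = {u['id']: u for u in users}
    # DataLoader contract: the output list must align 1:1 with the input keys
    return [by_id.get(aid) for aid in author_ids]

print(asyncio.run(users_batch_load_fn([2, 99])))
# → [{'id': 2, 'name': 'Bob'}, None]
```

Keeping the output aligned with the input keys (including `None` for misses) is what lets the DataLoader dispatch each result back to the right caller.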
In addition, it can perform extra calculations for you after the data is resolved.
```python
class Blog(BaseModel):
    ...
    comments_count: int = 0
    def post_comments_count(self):
        return len(self.comments)
```
Once the schema is defined, you only need to query for the base data (blogs); pydantic-resolve will load all the related data for you.
```python
blogs = await query_blogs()
blogs = [Blog(**blog) for blog in blogs]
blogs = await Resolver().resolve(blogs)
return blogs
```
Install
```shell
pip install pydantic2-resolve
```
Demo
Assume we have 3 tables: departments, teams and members, with 1:N relationships from left to right.
```mermaid
erDiagram
    Department ||--o{ Team : has
    Team ||--o{ Member : has
    Department {
        int id
        string name
    }
    Team {
        int id
        int department_id
        string name
    }
    Member {
        int id
        int team_id
        string name
    }
```
```python
departments = [
    dict(id=1, name='INFRA'),
    dict(id=2, name='DevOps'),
    dict(id=3, name='Sales'),
]
teams = [
    dict(id=1, department_id=1, name="K8S"),
    dict(id=2, department_id=1, name="MONITORING"),
    # ...
    dict(id=10, department_id=2, name="Operation"),
]
members = [
    dict(id=1, team_id=1, name="Sophia"),
    # ...
    dict(id=19, team_id=10, name="Emily"),
    dict(id=20, team_id=10, name="Ella")
]
```
We want to generate nested JSON based on these 3 tables; the output should look like:
```json
{
    "departments": [
        {
            "id": 1,
            "name": "INFRA",
            "teams": [
                {
                    "id": 1,
                    "name": "K8S",
                    "members": [
                        {
                            "id": 1,
                            "name": "Sophia"
                        }
                    ]
                }
            ]
        }
    ]
}
```
We will show how to do it with pydantic-resolve in 4 steps:
- prepare the table records
- define dataloaders
- define pydantic schemas that use the dataloaders (no N+1 queries)
- resolve
```python
import json
import asyncio
from collections import defaultdict
from typing import List
from pydantic import BaseModel
from pydantic2_resolve import Resolver, LoaderDepend, build_list

# 0. prepare table records
departments = [
    dict(id=1, name='INFRA'),
    dict(id=2, name='DevOps'),
    dict(id=3, name='Sales'),
]
teams = [
    dict(id=1, department_id=1, name="K8S"),
    dict(id=2, department_id=1, name="MONITORING"),
    dict(id=3, department_id=1, name="Jenkins"),
    dict(id=5, department_id=2, name="Frontend"),
    dict(id=6, department_id=2, name="Bff"),
    dict(id=7, department_id=2, name="Backend"),
    dict(id=8, department_id=3, name="CAT"),
    dict(id=9, department_id=3, name="Account"),
    dict(id=10, department_id=3, name="Operation"),
]
members = [
    dict(id=1, team_id=1, name="Sophia"),
    dict(id=2, team_id=1, name="Jackson"),
    dict(id=3, team_id=2, name="Olivia"),
    dict(id=4, team_id=2, name="Liam"),
    dict(id=5, team_id=3, name="Emma"),
    dict(id=6, team_id=4, name="Noah"),
    dict(id=7, team_id=5, name="Ava"),
    dict(id=8, team_id=6, name="Lucas"),
    dict(id=9, team_id=6, name="Isabella"),
    dict(id=10, team_id=6, name="Mason"),
    dict(id=11, team_id=7, name="Mia"),
    dict(id=12, team_id=8, name="Ethan"),
    dict(id=13, team_id=8, name="Amelia"),
    dict(id=14, team_id=9, name="Oliver"),
    dict(id=15, team_id=9, name="Charlotte"),
    dict(id=16, team_id=10, name="Jacob"),
    dict(id=17, team_id=10, name="Abigail"),
    dict(id=18, team_id=10, name="Daniel"),
    dict(id=19, team_id=10, name="Emily"),
    dict(id=20, team_id=10, name="Ella")
]

# 1. define dataloaders
# visit https://github.com/syrusakbary/aiodataloader to learn how to define a DataLoader
async def teams_batch_load_fn(department_ids):
    """return teams grouped by department_id"""
    # in a real app the rows would come from a service, e.g.
    # _teams = await team_service.batch_query_by_department_ids(department_ids)
    _teams = [t for t in teams if t['department_id'] in department_ids]
    dct = defaultdict(list)
    for team in _teams:
        dct[team['department_id']].append(team)
    return [dct.get(did, []) for did in department_ids]

async def members_batch_load_fn(team_ids):
    """return members grouped by team_id"""
    _members = [m for m in members if m['team_id'] in team_ids]
    return build_list(_members, team_ids, lambda m: m['team_id'])  # helper func

# 2. define pydantic schemas
class Member(BaseModel):
    id: int
    name: str

class Team(BaseModel):
    id: int
    name: str

    members: List[Member] = []
    def resolve_members(self, loader=LoaderDepend(members_batch_load_fn)):
        return loader.load(self.id)

    member_count: int = 0
    def post_member_count(self):
        return len(self.members)

class Department(BaseModel):
    id: int
    name: str

    teams: List[Team] = []
    def resolve_teams(self, loader=LoaderDepend(teams_batch_load_fn)):
        return loader.load(self.id)

    member_count: int = 0
    def post_member_count(self):
        return sum(team.member_count for team in self.teams)

class Result(BaseModel):
    departments: List[Department] = []
    def resolve_departments(self):
        return departments

# 3. resolve
async def main():
    result = Result()
    data = await Resolver().resolve(result)
    print(json.dumps(data.model_dump(), indent=4))

asyncio.run(main())
```
Then we get the output (only the first item shown, for demonstration):
```json
{
    "departments": [
        {
            "id": 1,
            "name": "INFRA",
            "member_count": 5,
            "teams": [
                {
                    "id": 1,
                    "name": "K8S",
                    "member_count": 2,
                    "members": [
                        {
                            "id": 1,
                            "name": "Sophia"
                        },
                        {
                            "id": 2,
                            "name": "Jackson"
                        }
                    ]
                },
                {
                    "id": 2,
                    "name": "MONITORING",
                    "member_count": 2,
                    "members": [
                        {
                            "id": 3,
                            "name": "Olivia"
                        },
                        {
                            "id": 4,
                            "name": "Liam"
                        }
                    ]
                },
                {
                    "id": 3,
                    "name": "Jenkins",
                    "member_count": 1,
                    "members": [
                        {
                            "id": 5,
                            "name": "Emma"
                        }
                    ]
                }
            ]
        }
    ]
}
```
More cases
For more cases, such as:
- how to filter members
- how to do post-calculations after resolving
- and so on
please read the following demos:
```shell
cd examples
python -m readme_demo.0_basic
python -m readme_demo.1_filter
python -m readme_demo.2_post_methods
python -m readme_demo.3_context
python -m readme_demo.4_loader_instance
python -m readme_demo.5_subset
python -m readme_demo.6_mapper
python -m readme_demo.7_single
```
API
Resolver(loader_filters, loader_instances, ensure_type, annotation_class, context)
- loader_filters: dict. Provides extra query filters along with the loader key. reference: 6_sqlalchemy_loaderdepend_global_filter.py, L55, L59
- loader_instances: dict. Provides pre-created loader instances, which can prime data into the loader cache. reference: test_20_loader_instance.py, L62, L63
- ensure_type: bool. If True, resolve methods are required to be annotated. reference: test_13_check_wrong_type.py
- annotation_class: class. If you have `from __future__ import annotations` and pydantic raises an error, use this config to update forward refs. reference: test_25_parse_to_obj_for_pydantic_with_annotation.py, L39
- context: dict. Context can carry settings into every resolver method.

```python
class Earth(BaseModel):
    humans: List[Human] = []
    def resolve_humans(self, context):
        return [dict(name=f'man-{i}') for i in range(context['count'])]

earth = await Resolver(context={'count': 10}).resolve(earth)
```
LoaderDepend(loader_fn)
- loader_fn: a subclass of DataLoader, or a batch_load_fn (see aiodataloader for details). Declares a dataloader dependency; pydantic-resolve will take care of the dataloader's lifecycle.
build_list(rows, keys, fn), build_object(rows, keys, fn)
- rows: list, the query result
- keys: list, the keys passed into batch_load_fn
- fn: lambda, defines how to get the grouping key from a row
Helper functions that generate the return value required by batch_load_fn; read the code for details. reference: test_utils.py, L32
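A behavior-equivalent sketch of `build_list` (the real implementation lives in the package; this only illustrates the contract):

```python
from collections import defaultdict

def build_list(rows, keys, fn):
    """Group rows by the key extracted by fn, aligned with the order of keys."""
    grouped = defaultdict(list)
    for row in rows:
        grouped[fn(row)].append(row)
    # batch_load_fn contract: one list per requested key; missing keys get []
    return [grouped.get(key, []) for key in keys]

members = [
    dict(id=1, team_id=1, name='Sophia'),
    dict(id=2, team_id=1, name='Jackson'),
    dict(id=3, team_id=2, name='Olivia'),
]
# keys 1 and 2 get their members; key 3 gets an empty list
print(build_list(members, [1, 2, 3], lambda m: m['team_id']))
```

`build_object` works the same way but returns a single row (or None) per key instead of a list, which suits 1:1 relations.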
mapper(param)
- param: a pydantic class, a dataclass, or a lambda. pydantic-resolve will apply the function given to mapper after the inner future is resolved; it exposes an interface to change the return schema, even from the same dataloader. If param is a class, it will try to transform the value automatically. reference: test_16_mapper.py
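For the lambda case, the behavior can be sketched as a decorator that awaits the resolved value and then applies the transform (a simplified illustration, not the library's actual code; `BlogService` and its method are hypothetical):

```python
import asyncio

def mapper(fn):
    """Sketch: transform a resolve result after its future settles."""
    def decorator(resolve_method):
        async def wrapped(*args, **kwargs):
            value = resolve_method(*args, **kwargs)
            # the resolve method may return a coroutine/future from a dataloader
            if asyncio.iscoroutine(value) or asyncio.isfuture(value):
                value = await value
            return fn(value)
        return wrapped
    return decorator

class BlogService:
    @mapper(lambda rows: [r['title'] for r in rows])
    def resolve_titles(self):
        # pretend this came from a dataloader future
        return asyncio.sleep(0, result=[dict(title='intro'), dict(title='faq')])

print(asyncio.run(BlogService().resolve_titles()))  # → ['intro', 'faq']
```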
ensure_subset(base_class)
- base_class: class. Raises an exception if the decorated class has a field that does not exist in base_class. reference: test_2_ensure_subset.py
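The check can be sketched with plain annotations (not the library's actual implementation, which works on pydantic models):

```python
def ensure_subset(base_class):
    """Sketch: every annotated field on the decorated class must exist on base_class."""
    def wrapper(cls):
        extra = set(cls.__annotations__) - set(base_class.__annotations__)
        if extra:
            raise AttributeError(f'fields not present in {base_class.__name__}: {extra}')
        return cls
    return wrapper

class Base:
    id: int
    name: str
    email: str

@ensure_subset(Base)
class Subset:  # ok: declares a subset of Base's fields
    id: int
    name: str

try:
    @ensure_subset(Base)
    class Bad:
        id: int
        age: int  # not on Base, so this raises
except AttributeError as e:
    print(e)
```

This guards against a trimmed-down schema silently drifting away from the source model it mirrors.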
Run FastAPI example
```shell
poetry shell
cd examples
uvicorn fastapi_demo.main:app
# http://localhost:8000/docs#/default/get_tasks_tasks_get
```
Unittest
```shell
poetry run python -m unittest  # or
poetry run pytest              # or
poetry run tox
```
Coverage
```shell
poetry run coverage run -m pytest
poetry run coverage report -m
```