Pydantic discriminators for polymorphic models
Project description
pydantic-discriminator
Welcome to pydantic-discriminator! This is a small utility library that adds support for discriminator-based polymorphism to pydantic.
[!CAUTION] This library is cursed 💀 and was condemned by the old ones. I am trying to make it as safe as possible, but integrating this functionality into pydantic as an external library can be very hacky expecially after release 2. I warned you, proceed at your own risk.
[!NOTE] Currently tested with 100% test coverage on every possible combination of:
- Python 3.9, 3.10, 3.11 and 3.12
- Pydantic 1.10, 2.0, 2.1, 2.3, 2.4, 2.5
Please fill this repository with issues if you find any bugs or have any suggestions.
📦Installation
You can install pydantic-discriminator with pip:
pip install pydantic-discriminator
The only requirement is pydantic, which is automatically installed with this library. No additional dependencies will be installed in your environment.
💡What does it do?
😡The problem
[!IMPORTANT] The following example can be pretty long to read, but to some extent it is necessary to understand the problem that this library solves (or at least tries to solve).
Let's say you have a class hierarchy that looks like this:
classDiagram
class Shape {
+ x: float
+ y: float
}
class Circle {
+ radius : float
}
class Hexagon {
+ radius : float
}
class Rectangle {
+ width : float
+ height : float
}
Shape <|-- Circle
Shape <|-- Hexagon
Shape <|-- Rectangle
class Container {
+ shapes : list[Shape]
}
Container --> Shape
Let's implement it with pydantic:
class Shape(BaseModel):
x: float
y: float
class Circle(Shape):
radius: float
class Hexagon(Shape):
radius: float
class Rectangle(Shape):
width: float
height: float
class Container(BaseModel):
shapes: list[Shape]
[!CAUTION] The code above is completely broken. Nothing will work. Keep reading to find out why.
Now, let's write a program that uses this class hierarchy:
my_data = {
"shapes": [
{"x": 0, "y": 0, "radius": 1}, # This is a Circle
{"x": 1, "y": 2, "radius": 1}, # This is a Hexagon (because I said so)
{"x": 5, "y": 3, "width": 1, "height": 1}, # This is a Rectangle
]
}
cont = Container.model_validate(my_data)
print(cont)
>>> shapes=[Shape(x=0.0, y=0.0), Shape(x=0.0, y=0.0)]
Disappointing, isn't it? We lost all the information about the shapes 😩. This is actually expected behaviour, because pydantic doesn't know that a Shape can be either a Circle, an Hexagon or a Rectangle. We just tell him that it is a list of Shape, and that's it, we get a list of Shape.
[!WARNING] A very bad smell is coming from the fact that
CircleandHexagonhave the same fields. Pydantic will never be able to tell them apart. This won't normally be a problem for any type system, like python's, but it is a problem for pydantic, because their serialization is ambiguous.
😕The "Union" solution
How should we handle this situation? As far as I know, we must sacrifice the Object-Oriented approach and use Union types.
Let's rewrite our class hierarchy, applying the following changes:
- All classes have a
typefield that is used as a discriminator, and must be set to a hardcoded value, in the form of a string literal. They must all be different. - In the
Containerclass, replace theShapehint with aUnionhint that contains all the possible shapes.
class Shape(BaseModel):
type: Literal["shape"] = "shape"
x: float
y: float
class Circle(Shape):
type: Literal["circle"] = "circle"
radius: float
class Hexagon(Shape):
type: Literal["hexagon"] = "hexagon"
radius: float
class Rectangle(Shape):
type: Literal["rectangle"] = "rectangle"
width: float
height: float
class Container(BaseModel):
shapes: list[Circle | Hexagon | Rectangle]
Let's also update the client program:
my_data = {
"shapes": [
{"type": "circle", "x": 0, "y": 0, "radius": 1},
{"type": "hexagon", "x": 1, "y": 2, "radius": 1},
{"type": "rectangle", "x": 5, "y": 3, "width": 1, "height": 1},
]
}
cont = Container.model_validate(my_data)
print(cont)
>>> shapes=[Circle(type='circle', x=0.0, y=0.0, radius=1.0), Hexagon(type='hexagon', x=1.0, y=2.0, radius=1.0), Rectangle(type='rectangle', x=5.0, y=3.0, width=1.0, height=1.0)]
It works! Yay! 🎉
But... something is not right.
[!WARNING] What if a new class
Triangleis added to the hierarchy? We must remember to add it to theUniontype inContainer.
[!CAUTION] What if we want to add the
Triangleclass to the hierarchy, but theContainerclass is defined in a different library? We can't, unless we do some radioactive monkey patching. ☢️
[!WARNING] Moreover, the
Uniontype is not very readable, and it will completely mess up every type hint in theContainerclass. The IDE will complain, the type checker will complain, and you will too. 😡
The pydantic-discriminator solution
This library provides a solution to this problem by using a modified BaseModel class that can handle this situation. No more Union types, no more monkey patching, no more type checker errors.
[!NOTE] All the pydantic features should be preserved. The new base class just adds some additional functionality.
Let's go back to the original class hierarchy, but applying the following changes:
- The
Shapeclass is now aDiscriminatedBaseModelclass. - All classes have a class keyword argument
discriminatorthat is used as a discriminator, and must be set to a hardcoded value, in the form of a string literal. They must all be different.
from pydantic_discriminator import DiscriminatedBaseModel
class Shape(DiscriminatedBaseModel):
x: float
y: float
class Circle(Shape, discriminator="circle"):
radius: float
class Hexagon(Shape, discriminator="hexagon"):
radius: float
class Rectangle(Shape, discriminator="rectangle"):
width: float
height: float
class Container(BaseModel):
shapes: list[Shape]
>>> shapes=[Circle(type_='circle', x=0.0, y=0.0, radius=1.0), Hexagon(type_='hexagon', x=1.0, y=2.0, radius=1.0), Rectangle(type_='rectangle', x=5.0, y=3.0, width=1.0, height=1.0)]
It works too! Yay! 🎉
[!NOTE] The code is now much more clean and readable. It is basically the same as the original code, with the addition of the
discriminatorkeyword argument.
[!NOTE] Adding a new class to the hierarchy is now as easy as adding a new class to the hierarchy. No need to modify the
Containerclass. The new class can also be located in different modules or libraries, as long as it is imported somewhere in the program and thediscriminatorkeyword argument is set correctly.
[!NOTE] The IDE and the type checker will be happy too. 😊
Under the hood, what happens is that the DiscriminatedBaseModel class will automatically add a type_ (aliased to type to avoid potential conflicts with python keywords) field to the model, and whenever a model of the hierarchy is created, it will look for the correct class to instantiate among the registered subclasses, which is the one whose discriminator keyword argument matches the value of the type_ field.
Classes are registered automatically when they are defined in a tree structure, so there is no need to do anything else.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pydantic_discriminator-0.1.0.tar.gz.
File metadata
- Download URL: pydantic_discriminator-0.1.0.tar.gz
- Upload date:
- Size: 11.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.12.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d2adac6bc37fab0e2977259669d61c9a61fce6685effaf7f7e986462f09e556f
|
|
| MD5 |
5150aef36eba3135f832927966b1087f
|
|
| BLAKE2b-256 |
203b76f3b9a386d3c21adff0a2dc7c24fb09315a997991fdebc4741df4124710
|
File details
Details for the file pydantic_discriminator-0.1.0-py3-none-any.whl.
File metadata
- Download URL: pydantic_discriminator-0.1.0-py3-none-any.whl
- Upload date:
- Size: 9.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.12.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6cc065fea789ed7895121d43b8394b0010e74aeac71738e3f9a8f0078e17ef5c
|
|
| MD5 |
9028354c902b4ded8f1dfcbe2cb0cdd0
|
|
| BLAKE2b-256 |
1ab5d4b730b9df1344d4b4dd7cc068c700d9073ac051f7a053a89c5de53e18d5
|