Convention based, effortless serialization and deserialization
Reason this release was yanked:
breaks 3.10
Project description
Convention based, effortless serialization and deserialization
uniserde can convert Python classes to/from JSON and BSON without any input
from your side. Simply define the classes, and the library does the rest.
Define your types as classes with type annotations, and call one of uniserde's
serialization/deserialization functions:
from uniserde import Serde
from datetime import datetime, timezone
from dataclasses import dataclass
from bson import ObjectId
@dataclass
class Person(Serde):
id: ObjectId
name: str
birth_date: datetime
betty = Person(
id=ObjectId(),
name="Betty",
birth_date=datetime(year=1988, month=12, day=1, tzinfo=timezone.utc),
)
print(betty.as_json())
This will print a dictionary similar to this one
{
'id': '62bc6c77792fc617c52499d0',
'name': 'Betty',
'birthDate': '1988-12-01T00:00:00+00:00'
}
You can easily convert this to a string using Python's built-in json module if
that's what you need.
API
The API is extremely simple. Functions/Classes you might be interested in are:
-
as_json,as_bsonGiven a class with type annotations, these create a JSON/BSON like dictionary. You can feed those into functions like
json.dump, or use them as is. -
from_json,from_bsonGiven a JSON/BSON like dictionary and Python type, these will instantiate the corresponding Python class. Raise
SerdeErrorif the values are invalid. -
Serdeis a helper class you can optionally apply to your models. It adds the convenience functionsas_json,as_bson,from_json, andfrom_bsondirectly to the models. -
Sometimes a class simply acts as a type-safe base, but you really just want to serialize the children of that class. In that case you can decorate the class with
@as_child. This will store an additionaltypefield in the result, so the correct child class can be instantiated when deserializing. -
as_mongodb_schemaautomatically creates JSON schemas compatible with MongoDB from models -
Custom serialization / deserialization can be achieved by inheriting from the
Serdeclass and overriding theas_json,as_bson,from_json,from_bsonand/oras_mongodb_schemamethods. -
The library also exposes a couple handy type definitions:
Jsonable,Bsonable-- Any type which can occur in a JSON / BSON file respectively, i.e. (bool, int, float, ...)JsonDoc,BsonDoc-- A dictionary mapping strings toJsonables /Bsonable
Lazy Deserialization
Normally, serialization happens all at once: You tell uniserde to create a
class instance from a JSON, uniserde processes all of the fields and returns
the finished class.
This works great, but can be wasteful if you are working with large documents
and only need to access few fields. To help with this, you can pass lazy=True
when deserializing any object. uniserde will then hold off deserializing
fields until they are accessed for the first time, saving precious processing
time.
A word of caution: Data is validated as it is deserialized. Since lazy deserialization defers work until the data is accessed, this means any data you don't access also won't be validated. Thus, lazy serialization can be a very powerful tool for speeding up interactions with large objects, but you should only use when you are absolutely certain the data is correct. (For example because you have just fetched the object from your own, trusted, database.)
Types & Conventions
The library tries to stick to the naming conventions used by the target formats:
- names in JSON are written in lowerCamelCase, as is convention in JavaScript
- BSON uses the same conventions as JSON
- Python class names must be in UpperCamelCase
- Python fields must be in all_lower_case
- Python enum values must be in ALL_UPPER_CASE
JSON
| Python | JSON | Notes |
|---|---|---|
bool |
bool |
|
int |
float |
|
float |
float |
|
str |
str |
|
tuple |
list |
|
list |
list |
|
set |
list |
|
Optional |
value or None |
|
Any |
as-is | The value is kept unchanged, without any checks. |
Literal[str] |
str |
|
enum.Enum |
str |
Enum values are mapped to their name (NOT value!) |
enum.Flag |
list[str] |
Each flag is encoded the same way a regular enum.Enum value would. |
| custom class | dict |
Each attribute is stored as key, in lowerCamelCase. If marked with as_child, an additional type field is added. |
bytes |
str |
base64 encoded |
datetime.datetime |
str |
as ISO 8601 - with timezone. Naïve datetimes are intentionally not supported. Do yourself a favor and don't use them. |
datetime.timedelta |
float |
duration, in seconds |
dict[str, ...] |
dict |
|
bson.ObjectId |
str |
|
pathlib.Path |
str |
Paths are made absolute before serialization. |
uuid.UUID |
str |
BSON
BSON mostly uses the same conventions as JSON, with just a few changes:
| Python | BSON | Notes |
|---|---|---|
| custom class | dict |
Same as JSON, but any fields named id are renamed to _id to match MongoDB. |
bytes |
bytes |
|
datetime.datetime |
datetime.datetime |
Serialization requires a timezone be set. Deserialization imputes UTC, to match MongoDB. |
bson.ObjectId |
bson.ObjectId |
(Note that UUIDs are also serialized to str in BSON, just like in JSON. This
is to avoid the mess that is MongoDB's mapping UUIDs to blobs.)
MongoDB Schema Generation
If you are working with MongoDB you will come to appreciate the automatic schema
generation. Calling uniserde.as_mongodb_schema on any supported class will
return a MongoDB compatible JSON schema without hassle.
For example, here's the result of uniserde.as_mongodb_schema(Person) with the
Person class above:
{
'type': 'object',
'properties': {
'_id': {
'bsonType': 'objectId'
},
'name': {
'type': 'string'
},
'birthDate': {
'bsonType': 'date'
}
},
'additionalProperties': False,
'required': [
'_id',
'name',
'birthDate'
]
}
TODO
- Support for
Unionis currently very limited. Really onlyOptionalis supported (which Python internally maps toUnion) Literalcurrently only supports strings- Extend
as_child, to allow marking some classes as abstract. i.e. their parents/children can be serialized, but not those classes themselves - Being able to specify additional limitations to fields would be nice:
- must match regex
- minimum / maximum
- custom validation functions
- more Unit tests (custom de-serializers!?)
- Add more examples to the README
- show custom serializers/deserializers
- recommended usage
- regression tracking
- calling
uniserde.serializeon non-classes causes problems, because the serializationas_typeis guessed incorrectly. e.g.[1, 2, 3]will be incorrectly serialized aslistrather thanlist[int].
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file uniserde-0.3.15.tar.gz.
File metadata
- Download URL: uniserde-0.3.15.tar.gz
- Upload date:
- Size: 23.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d1cf8e72359f9f9150a33bf105e497cebc98bd428b21e1e06356a5e72f7b5fb6
|
|
| MD5 |
83719232424259f5d0cf2785c9236bb9
|
|
| BLAKE2b-256 |
f7c397c4833e153d3058f59b716208886bc173a47efd1203d33bcdfa746dccd0
|
File details
Details for the file uniserde-0.3.15-py3-none-any.whl.
File metadata
- Download URL: uniserde-0.3.15-py3-none-any.whl
- Upload date:
- Size: 25.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.12.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ce53044fcf47cd41ca8b9af85710aeec9e963c3046782d14f8b634bfad1602f8
|
|
| MD5 |
45ef6c98e6cd67136c4c9d1161c0d4a4
|
|
| BLAKE2b-256 |
16504745f7cd14b427f105ff4848fa260b778456b8bb88484d750f6a192cb31c
|