Skip to main content

Custom BaseModel class and type-hints for using Pydantic within TeamTomo.

Project description

teamtomo-basemodel

Custom Pydantic classes and type-hints for building more complex data classes within TeamTomo.

Package Usage

The BaseModelTeamTomo should be used when making sub-classes which should inherit the basic parsing/serialization functionality within Pydantic. Currently, the following methods are implemented:

  • to_json/to_yaml - Export the model fields to a JSON/YAML file (filepath passed as argument).
  • from_json/from_yaml - Instantiate a new model from contents in a JSON/YAML file (filepath passed as argument).
  • get_schema - Alias for model_json_schema method of the Pydantic basemodel class.

Basic example

These derived classes define their own fields and validation methods. For example, the following is an example derived class.

from pydantic import Field

from teamtomo_basemodel import BaseModelTeamTomo


class CustomModel(BaseModelTeamTomo):
    """An example custom model with three fields."""

    some_label: str
    some_value: float = Field(ge=0.0)  # will raise Validation error if less than 0.0
    list_int: list[int] = Field(description="A list of integers")


my_model = CustomModel(some_label="Example", some_value=42.0, list_int=[1, 2, 3, 4, 5])

# Export the model to a YAML file
my_model.to_yaml("custom_model.yaml")

# Create new model from the YAML file
new_model = CustomModel.from_yaml("custom_model.yaml")

# Check if the two models are equal
print(my_model == new_model)  # prints 'True'

Inspecting the contents of the custom_model.yaml file, we see the following.

list_int:
- 1
- 2
- 3
- 4
- 5
some_label: Example
some_value: 42.0

Obtaining model schema

The JSON model schema for a Pydantic model is used to define and validate the structure, its data types, and other constraints of the represented data. This ensures consistency across data types, and we expose the get_schema method for this purpose.

from pprint import pprint

# Using the same 'CustomModel' class defined above
my_schema = my_model.get_schema()

print(type(my_schema))
# <class 'dict'>

pprint(my_schema)
# {'additionalProperties': False,
#  'description': 'An example custom model with three fields.',
#  'properties': {'list_int': {'description': 'A list of integers',
#                              'items': {'type': 'integer'},
#                              'title': 'List Int',
#                              'type': 'array'},
#                 'some_label': {'title': 'Some Label', 'type': 'string'},
#                 'some_value': {'minimum': 0.0,
#                                'title': 'Some Value',
#                                'type': 'number'}},
#  'required': ['some_label', 'some_value', 'list_int'],
#  'title': 'CustomModel',
#  'type': 'object'}

Using included type-hints

Some data structures used by custom data classes may be serialized elsewhere (e.g., DataFrames using csv files) and should not be serialized by the class. For this reason, we also include helpful type-hints which tell Pydantic to ignore these fields while still exposing them to the user. The currently included type-hints are:

  • ExcludedTensor
  • ExcludedNumpyArray
  • ExcludedDataFrame

As an example, we can define another model which uses the field df_path to load in a DataFrame; the df_path field is serialized, but the DataFrame itself is not.

import pandas as pd
from teamtomo_basemodel import BaseModelTeamTomo, ExcludedDataFrame


class CustomModelWithDF(BaseModelTeamTomo):
    """An example custom model with a DataFrame field."""

    df_path: str
    df: ExcludedDataFrame
    
    def __init__(self, **data):
        super().__init__(**data)
        self.df = pd.read_csv(self.df_path) # Load the DataFrame from the specified path


# First, write an example DataFrame to a CSV file
df_example = pd.DataFrame({"column1": [1, 2, 3], "column2": ["a", "b", "c"]})
df_example.to_csv("example_df.csv", index=False)

# Now, create an instance of the custom model with the DataFrame
custom_model_with_df = CustomModelWithDF(df_path="example_df.csv")
print(custom_model_with_df.df)
#    column1 column2
# 0        1       a
# 1        2       b
# 2        3       c

# Print the contents of the model
print(custom_model_with_df.model_dump())
# {'df_path': 'example_df.csv'}

Installation

TeamTomo basemodel is available on the Python package index and can be installed via

pip install teamtomo-basemodel

From source

If you'd rather install the package from source, say for development purposes, use the following commands for cloning then installing a local editable version of the package

git clone https://github.com/teamtomo/teamtomo-basemodel.git
cd teamtomo-basemodel
pip install -e .

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

teamtomo_basemodel-0.1.1.tar.gz (10.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

teamtomo_basemodel-0.1.1-py3-none-any.whl (6.2 kB view details)

Uploaded Python 3

File details

Details for the file teamtomo_basemodel-0.1.1.tar.gz.

File metadata

  • Download URL: teamtomo_basemodel-0.1.1.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for teamtomo_basemodel-0.1.1.tar.gz
Algorithm Hash digest
SHA256 098fd8b627a3c1864447517d8e8fa97751fcf7a4bfb8dafe0baa81aa2c51359c
MD5 13ca988e7ffef7d5face93f400037c58
BLAKE2b-256 de00e404f48a165d99b8283c506562e943c80739893d680aa847178f760e955a

See more details on using hashes here.

Provenance

The following attestation bundles were made for teamtomo_basemodel-0.1.1.tar.gz:

Publisher: ci.yml on teamtomo/teamtomo-basemodel

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file teamtomo_basemodel-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for teamtomo_basemodel-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 898ec9e69198a34b0c5eebca76baae6c5cc34b168bcdb54b064d18d53ab858f4
MD5 2b05d92b41551df012f271bc6db2302c
BLAKE2b-256 2b202e82d450f0340ef4f122c1450f8841840618add8f980383a60df575bda04

See more details on using hashes here.

Provenance

The following attestation bundles were made for teamtomo_basemodel-0.1.1-py3-none-any.whl:

Publisher: ci.yml on teamtomo/teamtomo-basemodel

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page