Call an Open Inference server using pydantic models defining your model inputs and outputs.

Maintainers

bunny-therapist

Project description

pydantic-open-inference

What is it?

Provides a client-side Python interface for calling Open Inference servers. Define pydantic models for the model inputs and outputs, and they are automatically converted to and from the Open Inference protocol, removing the need for boilerplate code that handles raw tensors and their shapes.

Features

✅ Simple, type-safe API toward an open inference server using user-defined pydantic models
✅ Automatic shape inference from pydantic models
✅ Automatic datatype mapping (int→INT64, float→FP32, str→BYTES)
✅ Datatype override for fine-grained control of model inputs
✅ Automatically request only a subset of model outputs by omitting fields in your pydantic model
✅ Take advantage of the full power of pydantic to validate and type-coerce your model calls
✅ Multi-input/output models - each input/output maps to a pydantic field
✅ Custom timeout configuration - configurable per model
✅ Support calling inference for versioned and unversioned models
✅ Error handling with custom exceptions - handle, e.g., specific error codes from the inference server differently
✅ Fully typed - use, e.g., mypy and its pydantic plugin to type check your code
✅ Thread safe

Installation

Installation using pip:

pip install pydantic-open-inference

Adding it to your project dependencies using uv:

uv add pydantic-open-inference

How to use it

Assume we have an inference server serving an Open Inference V2 REST API at http://localhost:8080 and that the server contains a model named example_model, which has the following model inputs:

name	shape	datatype	description	example data
text	[1]	BYTES	A single string of text.	`["This is a text"]`

and the following model outputs:

name	shape	datatype	description	example data
entities	[-1, 4]	BYTES	A list of entities extracted from the text. Each entity is made up of (in order) a label (either "order-id" or "invoice-id"), a confidence score, a start index for the entity in the text, and an end index.	`[["order-id", "0.9", "23", "30"],["invoice-id","0.8","34","46"]]`

We then define a pydantic model for the inputs, inheriting from InputsBaseModel. Each model input name needs to correspond to a field in the pydantic model, and the type of the field needs to be compatible with the shape and datatype of the input. In this case, there is just a single input named "text" that is supposed to hold a single text, so we simply define:

from pydantic_open_inference import InputsBaseModel

class ExampleModelInputs(InputsBaseModel):
    text: str

In the same way, we define a pydantic model for the outputs, inheriting from OutputsBaseModel. There is one model output named "entities", so that will be the only field we define. There can be any number of entities (corresponding to the -1 in the shape), so we define it to be a list. A list of what? Here we have some choices. We could simply define entities: list[list[str]] and get each entity as a list of strings, e.g., ["order-id", "0.98", "12", "24"], but this is not very useful to us, so we instead define a named tuple with ordered, named, and typed fields:

from typing import NamedTuple
from enum import StrEnum
from pydantic_open_inference import OutputsBaseModel

class Label(StrEnum):
    ORDER_ID = "order-id"
    INVOICE_ID = "invoice-id"

class Entity(NamedTuple):
    label: Label
    score: float
    start: int
    end: int

class ExampleModelOutputs(OutputsBaseModel):
    entities: list[Entity]

Note that we could also have used typing.Literal instead of enum.StrEnum, and even added further validation, e.g., that the score is between 0.0 and 1.0 and that the start index is not negative. Pydantic offers many possibilities, but for now, let us stick with this.
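As a rough sketch of what such stricter validation could look like, the following uses typing.Literal for the label and a pydantic field validator for the score and start index. It assumes pydantic v2 and uses a plain pydantic.BaseModel named CheckedOutputs as a stand-in, on the assumption that an OutputsBaseModel subclass accepts validators in the same way; CheckedOutputs and check_ranges are names made up for this illustration.

```python
from typing import Literal, NamedTuple

from pydantic import BaseModel, ValidationError, field_validator


class Entity(NamedTuple):
    label: Literal["order-id", "invoice-id"]  # typing.Literal instead of StrEnum
    score: float
    start: int
    end: int


class CheckedOutputs(BaseModel):  # hypothetical stand-in for an OutputsBaseModel subclass
    entities: list[Entity]

    @field_validator("entities")
    @classmethod
    def check_ranges(cls, entities: list[Entity]) -> list[Entity]:
        # Runs after pydantic has coerced each row into an Entity.
        for entity in entities:
            if not 0.0 <= entity.score <= 1.0:
                raise ValueError(f"score {entity.score} is outside [0.0, 1.0]")
            if entity.start < 0:
                raise ValueError(f"start index {entity.start} is negative")
        return entities


# String values, as a BYTES output would deliver them, are coerced to float/int.
ok = CheckedOutputs(entities=[("order-id", "0.9", "23", "30")])
print(ok.entities[0])

# A score above 1.0 is rejected by the validator.
try:
    CheckedOutputs(entities=[("order-id", "1.7", "23", "30")])
except ValidationError as error:
    rejected = True
    print(error.errors()[0]["msg"])
else:
    rejected = False
```

Keeping the range checks in one validator keeps the Entity tuple itself a plain description of the row layout.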

We now create a RemoteModel instance. The RemoteModel class represents a model in the inference server, in this case our "example_model". Instantiation requires the URL of the inference server, the name of the model, and the pydantic models we defined for the input and output:

from pydantic_open_inference import RemoteModel

INFERENCE_SERVER_URL = "http://localhost:8080"

example_model = RemoteModel(
    server_url=INFERENCE_SERVER_URL,
    model_name="example_model",
    inputs_model=ExampleModelInputs,
    outputs_model=ExampleModelOutputs,
)

In order to call the inference server, we simply call the RemoteModel.infer method, giving it an instance of ExampleModelInputs, and get back an instance of ExampleModelOutputs:

example_input = ExampleModelInputs(text="Your order id is 123456.")
example_output = example_model.infer(example_input)
# ExampleModelOutputs(
#    entities=[
#        Entity(label=Label.ORDER_ID, score=0.95, start=17, end=23),
#    ]
# )

We have thus not only made a type-safe and validated call to the inference server, we have also gotten the response back in a useful format! We never had to bother with handling the shape of the data, parsing it in row-major order, etc., as all of that is handled under the hood!
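For comparison, here is a minimal sketch of the kind of manual work this replaces, using only the standard library. It builds an Open Inference V2 request body for the "text" input and regroups a flat, row-major "entities" output into rows of four values. The helpers build_request_body and rows_from_flat are made up for this illustration and are not part of pydantic-open-inference, and sample_response is hand-written rather than real server output.

```python
import json


def build_request_body(text: str) -> dict:
    """Build a V2 inference request body for the single "text" input."""
    return {
        "inputs": [
            {"name": "text", "shape": [1], "datatype": "BYTES", "data": [text]}
        ],
        "outputs": [{"name": "entities"}],
    }


def rows_from_flat(data: list[str], row_length: int) -> list[list[str]]:
    """Regroup flat row-major data into rows of row_length items."""
    if len(data) % row_length != 0:
        raise ValueError("data length is not a multiple of the row length")
    return [data[i:i + row_length] for i in range(0, len(data), row_length)]


# Hand-written sample shaped like a V2 response (an assumption, not real output).
sample_response = {
    "model_name": "example_model",
    "outputs": [
        {
            "name": "entities",
            "shape": [1, 4],
            "datatype": "BYTES",
            "data": ["order-id", "0.95", "17", "23"],
        }
    ],
}

request_body = build_request_body("Your order id is 123456.")
print(json.dumps(request_body, indent=2))

# Manual parsing: regroup the flat data, then convert each column by hand.
output = sample_response["outputs"][0]
rows = rows_from_flat(output["data"], output["shape"][-1])
entities = [
    (label, float(score), int(start), int(end))
    for label, score, start, end in rows
]
print(entities)
```

In the V2 REST protocol, such a body is sent with a POST request to /v2/models/example_model/infer on the server; the sketch stops short of the network call so that it runs without a server.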

The only time you need to think about "shape" is when defining the fields of the pydantic models. Use types like, e.g., list, tuple, and typing.NamedTuple to type the pydantic-model fields so that they map sensibly to their corresponding model input/output. Each input/output name must match a pydantic field, and the shape must correspond to the structure of the field, e.g., list[tuple[int, int]] for shape [-1, 2], or tuple[tuple[float, float], tuple[float, float]] for shape [2, 2], etc.
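To make the correspondence between nesting and shape concrete, here is a small standard-library sketch that derives a shape from a nested value and flattens it in row-major order. It illustrates the concept only and is not how the library is implemented; infer_shape and flatten are hypothetical helpers. Note that a bare scalar yields an empty shape in this sketch, whereas the scalar "text" input in the example above has shape [1], a detail the sketch does not model.

```python
def infer_shape(value) -> list[int]:
    """Return the shape of a regularly nested list/tuple structure."""
    if not isinstance(value, (list, tuple)):
        return []  # a scalar contributes no dimension
    if len(value) == 0:
        return [0]
    inner_shapes = [infer_shape(item) for item in value]
    if any(shape != inner_shapes[0] for shape in inner_shapes):
        raise ValueError("ragged nesting cannot be described by a single shape")
    return [len(value), *inner_shapes[0]]


def flatten(value) -> list:
    """Flatten a nested list/tuple structure in row-major order."""
    if not isinstance(value, (list, tuple)):
        return [value]
    return [leaf for item in value for leaf in flatten(item)]


pairs = [(1, 2), (3, 4), (5, 6)]   # a value of type list[tuple[int, int]]
matrix = ((0.1, 0.2), (0.3, 0.4))  # tuple[tuple[float, float], tuple[float, float]]

print(infer_shape(pairs), flatten(pairs))
print(infer_shape(matrix), flatten(matrix))
```

The list of three pairs matches shape [-1, 2] with the variable dimension resolved to 3, and the fixed nested tuple matches shape [2, 2].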

The "datatype" of the model inputs is automatically set based on the innermost type of the corresponding field in the InputsBaseModel subclass, according to:

python type	datatype
bool	BOOL
int	INT64
float	FP32
str	BYTES

However, you can always override the datatype of the model inputs to whatever you want using pydantic_open_inference.DatatypeOverride; see its docstring for more information. The datatypes of the model outputs are disregarded; all that matters is that the data we got back from the model can be mapped to the defined OutputsBaseModel subclass.
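The default mapping in the table above can be pictured with the following standard-library sketch, which walks a type annotation down to its innermost type and looks up the datatype. This is an illustration of the idea under the assumption that all elements share one innermost type; it is not the library's implementation, and DEFAULT_DATATYPES, innermost_type, and default_datatype are names invented here.

```python
from typing import get_args

# Default mapping from the table above.
DEFAULT_DATATYPES = {bool: "BOOL", int: "INT64", float: "FP32", str: "BYTES"}


def innermost_type(annotation):
    """Follow the first type argument until a non-generic type remains."""
    args = get_args(annotation)
    while args:
        annotation = args[0]
        args = get_args(annotation)
    return annotation


def default_datatype(annotation) -> str:
    """Look up the default datatype for the innermost type of an annotation."""
    return DEFAULT_DATATYPES[innermost_type(annotation)]


print(default_datatype(str))
print(default_datatype(list[tuple[int, int]]))
print(default_datatype(list[float]))
```

Because only the first type argument is followed, the sketch does not handle tuples that mix element types.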

When an inference call to a model is made, only the outputs for which you have defined fields in your OutputsBaseModel subclass are requested. So if you are not interested in an output, simply do not define a pydantic field for it.
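At the protocol level, a V2 inference request can carry an optional "outputs" list naming the outputs to return. The sketch below shows how such a list could be derived from the field names of a class. It uses plain classes as stand-ins for OutputsBaseModel subclasses; requested_outputs and the "language" output are hypothetical and only serve the illustration.

```python
from typing import get_type_hints


class AllOutputs:  # stand-in for a model with two outputs
    entities: list[list[str]]
    language: str  # hypothetical second output


class EntitiesOnly:  # same model, but "language" is left out
    entities: list[list[str]]


def requested_outputs(outputs_cls: type) -> list[dict[str, str]]:
    """Build the "outputs" section of a V2 request from annotated field names."""
    return [{"name": field_name} for field_name in get_type_hints(outputs_cls)]


print(requested_outputs(AllOutputs))
print(requested_outputs(EntitiesOnly))
```

Omitting the "language" field shrinks the request to the single "entities" output.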

Examples

See the examples directory for more examples.

Release history

version	release date
1.3.2	Mar 17, 2026
1.3.1	Mar 17, 2026
1.3.0	Mar 17, 2026
1.2.0	Mar 12, 2026
1.1.0	Mar 11, 2026 (this version)
1.0.0	Nov 19, 2025
