clearml-serving

clearml-serving - Model-Serving Orchestration and Repository Solution

These details have not been verified by PyPI

Project links

Homepage

Project description

ClearML Serving - ML-Ops made easy

`clearml-serving`
Model-Serving Orchestration and Repository Solution

clearml-serving is a command line utility for the flexible orchestration of your model deployment.
clearml-serving can make use of a variety of serving engines (Nvidia Triton, OpenVino Model Serving, KFServing) setting them up for serving wherever you designate a ClearML Agent or on your ClearML Kubernetes cluster

Features:

Spin serving engines on your Kubernetes cluster or ClearML Agent machine from CLI
Full usage & performance metrics integrated with ClearML UI
Multi-model support in a single serving engine container
Automatically deploy new model versions
Support Canary model releases
Integrates to ClearML Model Repository
Deploy & upgrade endpoints directly from ClearML UI
Programmatic interface for endpoint/versions/metric control

Installing ClearML Serving

Setup your ClearML Server or use the Free tier Hosting
Connect your ClearML Worker(s) to your ClearML Server (see ClearML Agent / Kubernetes integration)
Install clearml-serving (Note: clearml-serving is merely a control utility, it does not require any resources for actual serving)

pip install clearml-serving

Using ClearML Serving

Clearml-Serving will automatically serve published models from your ClearML model repository, so the first step is getting a model into your ClearML model repository.
Background: When using clearml in your training code, any model stored by your python code is automatically registered (and, optionally, uploaded) to the model repository. This auto-magic logging is key for continuous model deployment.
To learn more on training models and the ClearML model repository, see the ClearML documentation

Training a toy model with Keras (about 2 minutes on a laptop)

The main goal of clearml-serving is to seamlessly integrate with the development process and the model repository. This is achieved by combining ClearML's auto-magic logging which creates and uploads models directly from the python training code, with accessing these models as they are automatically added into the model repository using the ClearML Server's REST API and its pythonic interface.
Let's demonstrate this seamless integration by training a toy Keras model to classify images based on the MNIST dataset. Once we have a trained model in the model repository we will serve it using clearml-serving.

We'll also see how we can retrain another version of the model, and have the model serving engine automatically upgrade to the new model version.

Keras mnist toy train example (single epoch mock training):

install tensorflow (and of course cleamrl)
```
pip install "tensorflow>2" clearml
```
Execute the training code
```
cd examples/keras
python keras_mnist.py
```
Notice: The only required integration code with clearml are the following two lines:
```
from clearml import Task
task = Task.init(project_name="examples", task_name="Keras MNIST serve example", output_uri=True)
```
This call will make sure all outputs are automatically logged to the ClearML Server, this includes: console, Tensorboard, cmdline arguments, git repo etc.
It also means any model stored by the code will be automatically uploaded and logged in the ClearML model repository.
Review the models in the ClearML web UI:
Go to the "Projects" section of your ClearML server (free hosted or self-deployed).
in the "examples" project, go to the Models tab (model repository).
We should have a model named "Keras MNIST serve example - serving_model".
Once a model-serving service is available, Right-clicking on the model and selecting "Publish" will trigger upgrading the model on the serving engine container.

Next we will spin the Serving Service and the serving-engine

Serving your models

In order to serve your models, clearml-serving will spawn a serving service which stores multiple endpoints and their configuration, collects metric reports, and updates models when new versions are published in the model repository.
In addition, a serving engine is launched, which is the container actually running the inference engine.
(Currently supported engines are Nvidia-Triton, coming soon are Intel OpenVIno serving-engine and KFServing)

Now that we have a published model in the ClearML model repository, we can spin a serving service and a serving engine.

Starting a Serving Service:

Create a new serving instance.
This is the control plane Task, we will see all its configuration logs and metrics in the "serving" project. We can have multiple serving services running in the same system.
In this example we will make use of Nvidia-Triton engines.

clearml-serving triton --project "serving" --name "serving example"

Add models to the serving engine with specific endpoints.
Reminder: to view your model repository, login to your ClearML account, go to "examples" project and review the "Models" Tab

clearml-serving triton --endpoint "keras_mnist"  --model-project "examples" --model-name "Keras MNIST serve example - serving_model"

Launch the serving service.
The service will be launched on your "services" queue, which by default runs services on the ClearML server machine.
(Read more on services queue here)
We set our serving-engine to launch on the "default" queue,

clearml-serving launch --queue default

Optional: If you do not have a machine connected to your ClearML cluster, either read more on our Kubernetes integration, or spin a bare-metal worker and connect it with your ClearML Server.
clearml-serving is leveraging the orchestration capabilities of ClearML to launch the serving engine on the cluster.
Read more on the ClearML Agent orchestration module here
If you have not yet setup a ClearML worker connected to your clearml account, you can do this now using:
```
pip install clearml-agent
clearml-agent daemon --docker --queue default --detached
```

We are done! To test the new served model, you can curl to the new endpoint:

curl <serving-engine-ip>:8000/v2/models/keras_mnist/versions/1

Notice: If we re-run our keras training example and publish a new model in the repository, the engine will automatically update to the new model.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.3.5

Mar 27, 2025

1.3.4

Dec 18, 2024

1.3.3

Dec 17, 2024

1.3.2

Dec 16, 2024

1.3.1

Dec 12, 2024

1.3.0

Apr 12, 2023

1.2.0

Oct 8, 2022

1.1.0

Sep 2, 2022

1.0.0

Jun 6, 2022

0.9.0

Mar 21, 2022

0.3.3

Jun 5, 2021

This version

0.3.2

May 13, 2021

0.3.1

Apr 12, 2021

0.0.0a0 pre-release

Mar 20, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

clearml_serving-0.3.2-py3-none-any.whl (16.6 kB view details)

Uploaded May 13, 2021 Python 3

File details

Details for the file clearml_serving-0.3.2-py3-none-any.whl.

File metadata

Download URL: clearml_serving-0.3.2-py3-none-any.whl
Upload date: May 13, 2021
Size: 16.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.20.1 setuptools/41.0.1 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.5.2

File hashes

Hashes for clearml_serving-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`35bbadb65f658facd3fc8126422cbe5ae2b20176c365483fbb916017ce86c363`
MD5	`bf861b38080414a61febe233ba3d3835`
BLAKE2b-256	`ec8ee8ce80941c2e50e1e13a35043888b920d0807a95965f15c43523e07b7f93`

See more details on using hashes here.

clearml-serving 0.3.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

`clearml-serving`
Model-Serving Orchestration and Repository Solution

Installing ClearML Serving

Using ClearML Serving

Training a toy model with Keras (about 2 minutes on a laptop)

Keras mnist toy train example (single epoch mock training):

Serving your models

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

clearml-serving 0.3.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

clearml-serving Model-Serving Orchestration and Repository Solution

Installing ClearML Serving

Using ClearML Serving

Training a toy model with Keras (about 2 minutes on a laptop)

Keras mnist toy train example (single epoch mock training):

Serving your models

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes

`clearml-serving`
Model-Serving Orchestration and Repository Solution