bentoml

An open-source ML model serving and model management framework

These details have not been verified by PyPI

Project links

Project description

BentoML is an open-source framework for high-performance ML model serving.

What does BentoML do?

Create API endpoints that serve trained models with just a few lines of code
Support for all major machine learning training frameworks
High-performance online API serving with adaptive micro-batching support
Model registry for teams, providing a web UI dashboard and CLI/API access
Flexible deployment orchestration with DevOps best practices baked in, supporting Docker, Kubernetes, Kubeflow, Knative, AWS Lambda, SageMaker, Azure ML, GCP and more

👉 To follow development updates and discussion, join the BentoML mailing list and the BentoML Slack community.

Why BentoML

Getting Machine Learning models into production is hard. Data Scientists are not experts in building production services or in DevOps best practices. The trained models produced by a Data Science team are hard to test and hard to deploy. This often leads to a time-consuming and error-prone workflow, in which a pickled model or weights file is handed over to a software engineering team.

BentoML is an end-to-end solution for model serving, making it possible for Data Science teams to build production-ready model serving endpoints, with common DevOps best practices and performance optimizations baked in.

Check out the Frequently Asked Questions page to see how BentoML compares to TensorFlow Serving, Clipper, AWS SageMaker, MLflow, etc.

Getting Started

BentoML requires Python 3.6 or above. Install it with pip:

pip install bentoml

A minimal prediction service in BentoML looks something like this:

# https://github.com/bentoml/BentoML/blob/master/guides/quick-start/iris_classifier.py
from bentoml import env, artifacts, api, BentoService
from bentoml.adapters import DataframeInput
from bentoml.artifact import SklearnModelArtifact

@env(auto_pip_dependencies=True)
@artifacts([SklearnModelArtifact('model')])
class IrisClassifier(BentoService):

    @api(input=DataframeInput())
    def predict(self, df):
        # Optional pre-processing, post-processing code goes here
        return self.artifacts.model.predict(df)

This code defines a prediction service that bundles a scikit-learn model and provides an API that expects input data in the form of a pandas.DataFrame. The user-defined API function predict defines how the input dataframe will be processed and used for inference with the bundled scikit-learn model. BentoML also supports other API input types such as ImageInput, JsonInput and more.

The following code trains a scikit-learn model and packages the trained model with the IrisClassifier class defined above. It then saves the IrisClassifier instance to disk in the BentoML SavedBundle format:

# https://github.com/bentoml/BentoML/blob/master/guides/quick-start/main.py
from sklearn import svm
from sklearn import datasets

from iris_classifier import IrisClassifier

if __name__ == "__main__":
    # Load training data
    iris = datasets.load_iris()
    X, y = iris.data, iris.target

    # Model Training
    clf = svm.SVC(gamma='scale')
    clf.fit(X, y)

    # Create an iris classifier service instance
    iris_classifier_service = IrisClassifier()

    # Pack the newly trained model artifact
    iris_classifier_service.pack('model', clf)

    # Save the prediction service to disk for model serving
    saved_path = iris_classifier_service.save()

By default, BentoML stores SavedBundle files under the ~/bentoml directory. Users can also customize BentoML to use a different directory or cloud storage like AWS S3 and MinIO, via BentoML's model management component YataiService, which provides advanced model management features including a dashboard web UI:

[Image: BentoML YataiService Bento Repository Page]

[Image: BentoML YataiService Bento Details Page]

Learn more about using YataiService for model management and try out the Web UI here.

The BentoML SavedBundle directory contains all the code, data and configs required to deploy the model. To start a REST API model server with the IrisClassifier SavedBundle, use the bentoml serve command:

bentoml serve IrisClassifier:latest

The IrisClassifier model is now served at localhost:5000. Use the curl command to send a prediction request:

curl -i \
  --header "Content-Type: application/json" \
  --request POST \
  --data '[[5.1, 3.5, 1.4, 0.2]]' \
  http://localhost:5000/predict
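The same request can also be built from Python. The sketch below uses only the standard library and assumes the API server started by bentoml serve is listening at localhost:5000 as described above. The helper names build_predict_request and send_predict_request are hypothetical and not part of BentoML, and the assumption that the response body is JSON is ours.

```python
import json
import urllib.request


def build_predict_request(rows, url="http://localhost:5000/predict"):
    """Build a POST request equivalent to the curl command above.

    `rows` is a list of feature rows, e.g. [[5.1, 3.5, 1.4, 0.2]].
    Illustrative helper; not part of the BentoML API.
    """
    body = json.dumps(rows).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def send_predict_request(rows, url="http://localhost:5000/predict"):
    """Send the request and decode the response, assuming it is JSON.

    Needs a running API server; raises urllib.error.URLError otherwise.
    """
    request = build_predict_request(rows, url)
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


# Build (but do not send) the request so it can be inspected offline.
request = build_predict_request([[5.1, 3.5, 1.4, 0.2]])
print(request.get_method(), request.full_url)
print(request.data.decode("utf-8"))
```

With the server running, send_predict_request([[5.1, 3.5, 1.4, 0.2]]) would send the request and return the decoded prediction.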

The BentoML API server also provides a web UI for accessing predictions and debugging the server. Visit http://localhost:5000 in the browser and use the web UI to send a prediction request.

BentoML provides a convenient way to containerize the model API server with Docker:

1. Find where the SavedBundle directory is created in the file system:
- The saved path is returned by the iris_classifier_service.save() call
- The saved path is printed to stdout when saving: INFO - BentoService bundle 'IrisClassifier:20200121114004_360ECB' saved to: ...
- Use the bentoml get IrisClassifier:latest command to view all the metadata, including the saved path
2. Run docker build in the SavedBundle directory, which contains a generated Dockerfile. This builds a Docker image containing the IrisClassifier API server

# If jq command not found, install jq (the command-line JSON processor) here: https://stedolan.github.io/jq/download/
saved_path=$(bentoml get IrisClassifier:latest -q | jq -r ".uri.uri")

# Build the docker image
docker build -t iris-classifier $saved_path

# Start a container with the image built above
docker run -p 5000:5000 iris-classifier
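Where jq is not available, the saved path can be extracted with Python instead. This is a minimal sketch that assumes, as the jq filter above implies, that bentoml get IrisClassifier:latest -q prints a JSON document with the path stored at uri.uri. The helper names and the sample JSON are hypothetical; real output contains many more fields.

```python
import json
import subprocess


def extract_saved_path(metadata_json):
    """Return the value at .uri.uri, mirroring `jq -r ".uri.uri"`."""
    metadata = json.loads(metadata_json)
    return metadata["uri"]["uri"]


def get_saved_path(bento_tag="IrisClassifier:latest"):
    """Run `bentoml get <tag> -q` and return the saved bundle path.

    Requires the bentoml CLI on PATH and a previously saved bundle.
    """
    result = subprocess.run(
        ["bentoml", "get", bento_tag, "-q"],
        stdout=subprocess.PIPE,
        check=True,
        universal_newlines=True,
    )
    return extract_saved_path(result.stdout)


# Hypothetical, heavily trimmed sample of the command's JSON output.
sample_output = '{"name": "IrisClassifier", "uri": {"uri": "/tmp/example/IrisClassifier"}}'
print(extract_saved_path(sample_output))
```

The value returned by get_saved_path() can then be passed to docker build in place of $saved_path.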

This Docker image makes it possible to deploy a BentoML SavedBundle to container orchestration platforms such as Kubeflow, Knative and Kubernetes, which provide advanced model deployment features such as auto-scaling, A/B testing, scale-to-zero, canary rollout and multi-armed bandit.

BentoML can also deploy a SavedBundle directly to cloud services such as AWS Lambda or AWS SageMaker with the bentoml CLI command. For a list of all deployment options with BentoML, check out the BentoML deployment guides.

Documentation

BentoML full documentation: https://docs.bentoml.org/

Quick Start Guide: https://docs.bentoml.org/en/latest/quickstart.html
Core Concepts: https://docs.bentoml.org/en/latest/concepts.html
Deployment Guides: https://docs.bentoml.org/en/latest/deployment/index.html
API References: https://docs.bentoml.org/en/latest/api/index.html
Frequently Asked Questions: https://docs.bentoml.org/en/latest/faq.html

Examples

Visit the bentoml/gallery repository for more examples and tutorials.

FastAI

Pet Image Classification - Google Colab | nbviewer | source
Salary Range Prediction - Google Colab | nbviewer | source

Scikit-Learn

Sentiment Analysis - Google Colab | nbviewer | source

PyTorch

Fashion MNIST - Google Colab | nbviewer | source
CIFAR-10 Image Classification - Google Colab | nbviewer | source

TensorFlow Keras

Fashion MNIST - Google Colab | nbviewer | source
Text Classification - Google Colab | nbviewer | source
Toxic Comment Classifier - Google Colab | nbviewer | source

TensorFlow 2.0

tf.Function model - Google Colab | nbviewer | source
Fashion MNIST - Google Colab | nbviewer | source
Movie Review Sentiment with BERT - Google Colab | nbviewer | source

XGBoost

Titanic Survival Prediction - Google Colab | nbviewer | source
League of Legends Win Prediction - Google Colab | nbviewer | source

LightGBM

Titanic Survival Prediction - Google Colab | nbviewer | source

H2O

Loan Default Prediction - Google Colab | nbviewer | source
Prostate Cancer Prediction - Google Colab | nbviewer | source

FastText

Text Classification - Google Colab | nbviewer | source

ONNX

Scikit-Learn Iris Classifier - Google Colab | nbviewer | source
ResNet50 Image recognition (ONNX model zoo) - Google Colab | nbviewer | source

Deployment guides:

End-to-end deployment management with BentoML
- BentoML AWS Lambda Deployment Guide
- BentoML AWS SageMaker Deployment Guide
Deployment guides for open-source platforms:
Deployment guides for Cloud service providers:

Contributing

Have questions or feedback? Post a new GitHub issue or join the discussion in our Slack channel.

Want to help build BentoML? Check out our contributing guide and the development guide.

Releases

BentoML is under active development and is evolving rapidly. It is currently a Beta release, and APIs may change in future releases.

Read more about the latest features and changes in BentoML from the releases page.

Usage Tracking

By default, BentoML collects anonymous usage data using Amplitude. It only collects the BentoML library's own actions and parameters; no user or model data is collected. Here is the code that does it.

This helps the BentoML team understand how the community is using the tool and what to build next. You can easily opt out of usage tracking with either of the following:

# From terminal:
bentoml config set usage_tracking=false

# From Python:
import bentoml
bentoml.config().set('core', 'usage_tracking', 'False')

License

Apache License 2.0

Release history

1.4.35

Feb 3, 2026

1.4.34

Jan 26, 2026

1.4.33

Jan 12, 2026

1.4.32

Jan 9, 2026

1.4.31 yanked

Dec 23, 2025

1.4.30

Nov 27, 2025

1.4.29

Nov 17, 2025

1.4.28

Oct 29, 2025

1.4.27

Oct 20, 2025

1.4.26

Oct 10, 2025

1.4.25

Sep 24, 2025

1.4.24

Sep 17, 2025

1.4.23

Sep 5, 2025

1.4.22

Aug 27, 2025

1.4.21

Aug 14, 2025

1.4.20

Aug 13, 2025

1.4.19

Jul 29, 2025

1.4.18

Jul 24, 2025

1.4.17

Jun 30, 2025

1.4.16

Jun 16, 2025

1.4.15

May 23, 2025

1.4.14

May 20, 2025

1.4.13

May 9, 2025

1.4.12

Apr 29, 2025

1.4.11

Apr 22, 2025

1.4.10

Apr 17, 2025

1.4.9 yanked

Apr 17, 2025

Reason this release was yanked:

bad release

1.4.8

Apr 8, 2025

1.4.7

Mar 28, 2025

1.4.6

Mar 25, 2025

1.4.5

Mar 14, 2025

1.4.4

Mar 14, 2025

1.4.3

Mar 6, 2025

1.4.2

Feb 28, 2025

1.4.1

Feb 25, 2025

1.4.0

Feb 20, 2025

1.4.0a2 pre-release

Feb 14, 2025

1.4.0a1 pre-release

Feb 14, 2025

1.3.22

Feb 13, 2025

1.3.21

Feb 4, 2025

1.3.20

Jan 17, 2025

1.3.19

Jan 6, 2025

1.3.18

Dec 27, 2024

1.3.17

Dec 24, 2024

1.3.16

Dec 12, 2024

1.3.15

Dec 2, 2024

1.3.14

Nov 21, 2024

1.3.13

Nov 18, 2024

1.3.12

Nov 14, 2024

1.3.11

Nov 8, 2024

1.3.10

Oct 29, 2024

1.3.9

Oct 16, 2024

1.3.8

Oct 11, 2024

1.3.7

Sep 25, 2024

1.3.6

Sep 25, 2024

1.3.5

Sep 10, 2024

1.3.4.post1

Sep 6, 2024

1.3.3

Aug 23, 2024

1.3.2

Aug 14, 2024

1.3.1

Aug 2, 2024

1.3.0

Jul 19, 2024

1.3.0a3 pre-release

Jul 18, 2024

1.3.0a2 pre-release

Jul 16, 2024

1.3.0a1 pre-release

Jul 12, 2024

1.2.20

Jul 12, 2024

1.2.19

Jun 25, 2024

1.2.18

Jun 14, 2024

1.2.17

Jun 3, 2024

1.2.16

May 16, 2024

1.2.15

May 9, 2024

1.2.14

May 8, 2024

1.2.13

May 6, 2024

1.2.12

Apr 19, 2024

1.2.11

Apr 12, 2024

1.2.10

Apr 8, 2024

1.2.9

Mar 22, 2024

1.2.8

Mar 20, 2024

1.2.7

Mar 14, 2024

1.2.6

Mar 9, 2024

1.2.5

Mar 5, 2024

1.2.4

Feb 20, 2024

1.2.3

Feb 20, 2024

1.2.2

Feb 5, 2024

1.2.1

Feb 3, 2024

1.2.1a1 pre-release

Feb 3, 2024

1.2.0

Feb 2, 2024

1.2.0rc1 pre-release

Jan 31, 2024

1.2.0a7 pre-release

Jan 30, 2024

1.2.0a6 pre-release

Jan 26, 2024

1.2.0a5 pre-release

Jan 20, 2024

1.2.0a4 pre-release

Jan 19, 2024

1.2.0a3 pre-release

Jan 19, 2024

1.2.0a2 pre-release

Jan 18, 2024

1.2.0a1 pre-release

Jan 17, 2024

1.2.0a0 pre-release

Jan 9, 2024

1.1.11

Dec 28, 2023

1.1.10

Nov 20, 2023

1.1.9

Nov 7, 2023

1.1.8

Nov 3, 2023

1.1.7

Oct 12, 2023

1.1.6

Sep 8, 2023

1.1.5

Sep 1, 2023

1.1.4

Aug 29, 2023

1.1.3

Aug 24, 2023

1.1.2

Aug 22, 2023

1.1.1

Aug 1, 2023

1.1.0

Jul 24, 2023

1.0.25

Jul 20, 2023

1.0.24

Jul 19, 2023

1.0.23

Jun 29, 2023

1.0.22

Jun 12, 2023

1.0.21

Jun 6, 2023

1.0.20

May 9, 2023

1.0.19

Apr 26, 2023

1.0.18

Apr 14, 2023

1.0.17

Apr 6, 2023

1.0.16

Mar 14, 2023

1.0.15

Feb 15, 2023

1.0.14

Feb 8, 2023

1.0.13

Jan 19, 2023

1.0.12

Dec 8, 2022

1.0.11

Dec 7, 2022

1.0.10

Nov 9, 2022

1.0.9

Nov 9, 2022

1.0.8

Nov 1, 2022

1.0.7

Oct 3, 2022

1.0.6 yanked

Sep 27, 2022

Reason this release was yanked:

A critical module import issue has been introduced in version 1.0.6. Please see release note https://github.com/bentoml/BentoML/releases/tag/v1.0.7

1.0.5

Aug 30, 2022

1.0.4

Aug 26, 2022

1.0.3

Aug 8, 2022

1.0.2

Jul 29, 2022

1.0.0

Jul 13, 2022

1.0.0rc3 pre-release

Jul 1, 2022

1.0.0rc2 pre-release

Jun 22, 2022

1.0.0rc1 pre-release

Jun 8, 2022

1.0.0rc0 pre-release

May 30, 2022

1.0.0a7 pre-release

Apr 6, 2022

1.0.0a6 pre-release

Mar 7, 2022

1.0.0a5 pre-release

Mar 1, 2022

1.0.0a4 pre-release

Feb 15, 2022

1.0.0a3 pre-release

Jan 28, 2022

1.0.0a2 pre-release

Jan 20, 2022

1.0.0a1 pre-release

Dec 14, 2021

1.0.0.dev1 pre-release yanked

Dec 13, 2021

1.0.0.dev0 pre-release yanked

Dec 13, 2021

0.13.2

Jul 20, 2022

0.13.1

Jul 13, 2021

0.13.0

Jun 16, 2021

0.12.1

Apr 5, 2021

0.12.0

Mar 22, 2021

0.11.0

Jan 14, 2021

0.11.dev0 pre-release

Jan 14, 2021

0.10.1

Dec 10, 2020

0.10.0

Dec 7, 2020

0.9.2

Oct 17, 2020

0.9.1

Oct 1, 2020

0.9.0

Sep 25, 2020

0.9.0rc0 pre-release

Sep 21, 2020

0.8.6

Aug 25, 2020

0.8.5

Aug 11, 2020

0.8.4

Aug 7, 2020

0.8.3

Jul 6, 2020

This version

0.8.2 yanked

Jun 26, 2020

0.8.1

Jun 15, 2020

0.8.0 yanked

Jun 15, 2020

0.7.8

May 27, 2020

0.7.7

May 18, 2020

0.7.6

May 15, 2020

0.7.5

May 7, 2020

0.7.4

Apr 30, 2020

0.7.3

Apr 14, 2020

0.7.2

Apr 3, 2020

0.7.1

Apr 3, 2020

0.7.0

Apr 3, 2020

0.6.3

Mar 5, 2020

0.6.2

Feb 11, 2020

0.6.1

Jan 24, 2020

0.6.0

Jan 23, 2020

0.5.8

Jan 8, 2020

0.5.7

Jan 6, 2020

0.5.6

Dec 22, 2019

0.5.5

Dec 20, 2019

0.5.4

Dec 19, 2019

0.5.3

Nov 28, 2019

0.5.2

Nov 26, 2019

0.5.1

Nov 26, 2019

0.5.0

Nov 21, 2019

0.4.9

Nov 11, 2019

0.4.8

Oct 24, 2019

0.4.7

Oct 17, 2019

0.4.5

Oct 17, 2019

0.4.4

Oct 14, 2019

0.4.3

Oct 9, 2019

0.4.2

Sep 25, 2019

0.4.1

Sep 17, 2019

0.4.0

Sep 17, 2019

0.3.4

Aug 7, 2019

0.3.3

Aug 7, 2019

0.3.1

Jul 25, 2019

0.3.0

Jul 17, 2019

0.2.2

Jul 10, 2019

0.2.1

Jun 24, 2019

0.2.0

May 21, 2019

0.1.2

May 1, 2019

0.1.1

Apr 25, 2019

0.0.9

Apr 18, 2019

0.0.8.post1

Apr 12, 2019

0.0.8

Apr 10, 2019

0.0.7

Apr 4, 2019

0.0.7.dev0 pre-release

Apr 10, 2019

0.0.6a0 pre-release

Apr 2, 2019

0.0.5

Apr 2, 2019

0.0.3

Jan 16, 2019

0.0.2

Jan 16, 2019

0.0.1

Jan 15, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

BentoML-0.8.2.tar.gz (2.7 MB view details)

Uploaded Jun 26, 2020 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

BentoML-0.8.2-py3-none-any.whl (3.0 MB view details)

Uploaded Jun 26, 2020 Python 3

File details

Details for the file BentoML-0.8.2.tar.gz.

File metadata

Download URL: BentoML-0.8.2.tar.gz
Upload date: Jun 26, 2020
Size: 2.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191203 requests-toolbelt/0.9.1 tqdm/4.41.0 CPython/3.7.5

File hashes

Hashes for BentoML-0.8.2.tar.gz
Algorithm	Hash digest
SHA256	`30bb76484ca3b625e4e411e75e4362ae28653dba0739afcb592b1012c34fdfa0`
MD5	`11207859d8e948ea27b3892b7e6be3da`
BLAKE2b-256	`450d762cf45599d5e2aab5ed24f57b30225789230c48fc2f18822ac6ad7682b3`

See more details on using hashes here.

File details

Details for the file BentoML-0.8.2-py3-none-any.whl.

File metadata

Download URL: BentoML-0.8.2-py3-none-any.whl
Upload date: Jun 26, 2020
Size: 3.0 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191203 requests-toolbelt/0.9.1 tqdm/4.41.0 CPython/3.7.5

File hashes

Hashes for BentoML-0.8.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`adeaaba3648b590ef2fb79da391dd7f2d488f83b5b64b27c0027e65b71de3ad1`
MD5	`59441b61bb5661dd0a57a655467c48c9`
BLAKE2b-256	`501b080d5f918d0922f6f2622d440870846d9c73f19484293649acdfd403da0b`

See more details on using hashes here.
