BentoML: Package and Deploy Your Machine Learning Models

These details have not been verified by PyPI

Project links

Project description

BentoML

From a model in ipython notebook to production ready API service in 5 minutes.

project status build status pypi status python versions

BentoML is a python library for packaging and deploying machine learning models. It does two things without changing your model training workflow:

Standardize how to package your ML model for production, including its preprocessing/feature-fetching code, dependencies and configurations.
Easily distribute your ML model as PyPI package, API Server(in a Docker Image) , command line tool or Spark/Flink UDF.

Installation
Getting Started
Examples
More About BentoML
Releases and Contributing
License

Installation

python versions pypi status

pip install bentoml

Verify installation:

bentoml --version

Getting Started

BentoML does not change your training workflow. Let's train a simple scikit-learn model as example:

from sklearn import svm
from sklearn import datasets

clf = svm.SVC(gamma='scale')
iris = datasets.load_iris()
X, y = iris.data, iris.target
clf.fit(X, y)

To package this model with BentoML, you will need to create a new BentoService by subclassing it, and provides artifacts and env definition for it:

%%writefile iris_classifier.py
from bentoml import BentoService, api, env, artifacts
from bentoml.artifact import PickleArtifact
from bentoml.handlers import DataframeHandler

@artifacts([PickleArtifact('model')])
@env(conda_dependencies=["scikit-learn"])
class IrisClassifier(BentoService):

    @api(DataframeHandler)
    def predict(self, df):
        return self.artifacts.model.predict(df)

Now, to save your trained model for prodcution use, simply import your BentoService class and pack it with required artifacts:

from iris_classifier import IrisClassifier

svc = IrisClassifier.pack(model=clf)

svc.save('./saved_bento', version='v0.0.1') # Saving archive to ./saved_bento/IrisClassifier/v0.0.1/

That's it. Now you have created your first BentoArchive. It's a directory containing all the source code, data and configurations files required to run this model in production. There are a few ways you could use this archive:

Serving a BentoArchive via REST API

For exposing your model as a HTTP API endpoint, you can simply use the bentoml serve command:

bentoml serve --archive-path="./saved_bento/IrisClassifier/v0.0.1/"

Note: you must ensure the pip and conda dependencies are available in your python environment when using bentoml serve command. More commonly we recommand using BentoML API server with Docker(see below).

Build API server Docker Image from BentoArchive

You can build a Docker Image for running API server hosting your BentoML archive by using the archive folder as docker build context:

cd ./saved_bento/IrisClassifier/v0.0.1/

docker build -t myorg/iris-classifier .

Next, you can docker push the image to your choice of registry for deployment, or run it locally for development and testing:

docker run -p 5000:5000 myorg/iris-classifier

Loading BentoArchive in Python

import bentoml

bento_svc = bentoml.load('./saved_bento/IrisClassifier/v0.0.1/')
bento_svc.predict(X[0])

BentoML also supports loading an archive from s3 location directly:

bento_svc = bentoml.load('s3://my-bento-svc/iris_classifier/')

Install BentoArchive as PyPI package

First install your exported bentoml service with pip:

pip install ./saved_bento/IrisClassifier/v0.0.1/

Now you can import it and used it as a python module:

from IrisClassifier import IrisClassifier

installed_svc = IrisClassifier()
installed_svc.predict(X[0])

Note that you could also publish your exported BentoService as a PyPI package as a public python package on pypi.org or upload to your organization's private PyPI index:

cd ./saved_bento/IrisClassifier/v0.0.1/

python setup.py sdist upload

Loading BentoArchive from CLI

When pip install a BentoML archive, it also provides you with a CLI tool for accsing your BentoService's apis from command line:

pip install ./saved_bento/IrisClassifier/v0.0.1/

IrisClassifier info

IrisClassifier predict --input='./test.csv'

Alternatively, you can also use the bentoml cli to load and run a BentoArchive directly:

bentoml info ./saved_bento/IrisClassifier/v0.0.1/

bentoml predict ./saved_bento/IrisClassifier/v0.0.1/ --input='./test.csv'

CLI access made it very easy to put your saved BentoArchive into an Airflow DAG, integrate your packaged ML model into testing environment or use it in combination with other shell tools.

Examples

All examples can be found in the BentoML/examples directory.

Quick Start with sklearn
Sentiment Analysis with Scikit-Learn
Text Classification with Tensorflow Keras
Fashion MNIST classification with Pytorch
Fashion MNIST classification with Tensorflow Keras
More examples coming soon!

More About BentoML

We build BentoML because we think there should be a much simpler way for machine learning teams to ship models for production. They should not wait for engineering teams to re-implement their models for production environment or build complex feature pipelines for experimental models.

Our vision is to empower Machine Learning scientists to build and ship their own models end-to-end as production services, just like software engineers do. BentoML is enssentially this missing 'build tool' for Machine Learing projects.

With that in mind, here is the top design goals for BentoML:

Multiple framework support - BentoML should supports a wide range of ML frameworks out-of-the-box including Tensorflow, PyTorch, Scikit-Learn, xgboost and can be easily extended to work with new or custom frameworks.
Best Practice built-in - BentoML users can easily customize telemetrics and logging for their model, and make it easy to integrate with production systems.
Streamlines deployment workflows - BentoML supports deploying models into REST API endpoints with Docker, Kubernetes, AWS EC2, ECS, Google Cloud Platform, AWS SageMaker, and Azure ML.
Custom model runtime - Easily integrate your python code with high-performance model runtime backend(e.g. tf-serving, tensorrt-inference-server) in real-time model serving.

Releases and Contributing

BentoML is under active development. Current version is a beta release, we may change APIs in future releases.

Want to help build BentoML? Check out our contributing documentation.

License

BentoML is GPL-3.0 licensed, as found in the COPYING file.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.4.17

Jun 30, 2025

1.4.16

Jun 16, 2025

1.4.15

May 23, 2025

1.4.14

May 20, 2025

1.4.13

May 9, 2025

1.4.12

Apr 29, 2025

1.4.11

Apr 22, 2025

1.4.10

Apr 17, 2025

1.4.9 yanked

Apr 17, 2025

Reason this release was yanked:

bad release

1.4.8

Apr 8, 2025

1.4.7

Mar 28, 2025

1.4.6

Mar 25, 2025

1.4.5

Mar 14, 2025

1.4.4

Mar 14, 2025

1.4.3

Mar 6, 2025

1.4.2

Feb 28, 2025

1.4.1

Feb 25, 2025

1.4.0

Feb 20, 2025

1.4.0a2 pre-release

Feb 14, 2025

1.4.0a1 pre-release

Feb 14, 2025

1.3.22

Feb 13, 2025

1.3.21

Feb 4, 2025

1.3.20

Jan 17, 2025

1.3.19

Jan 6, 2025

1.3.18

Dec 27, 2024

1.3.17

Dec 24, 2024

1.3.16

Dec 12, 2024

1.3.15

Dec 2, 2024

1.3.14

Nov 21, 2024

1.3.13

Nov 18, 2024

1.3.12

Nov 14, 2024

1.3.11

Nov 8, 2024

1.3.10

Oct 29, 2024

1.3.9

Oct 16, 2024

1.3.8

Oct 11, 2024

1.3.7

Sep 25, 2024

1.3.6

Sep 25, 2024

1.3.5

Sep 10, 2024

1.3.4.post1

Sep 6, 2024

1.3.3

Aug 23, 2024

1.3.2

Aug 14, 2024

1.3.1

Aug 2, 2024

1.3.0

Jul 19, 2024

1.3.0a3 pre-release

Jul 18, 2024

1.3.0a2 pre-release

Jul 16, 2024

1.3.0a1 pre-release

Jul 12, 2024

1.2.20

Jul 12, 2024

1.2.19

Jun 25, 2024

1.2.18

Jun 14, 2024

1.2.17

Jun 3, 2024

1.2.16

May 16, 2024

1.2.15

May 9, 2024

1.2.14

May 8, 2024

1.2.13

May 6, 2024

1.2.12

Apr 19, 2024

1.2.11

Apr 12, 2024

1.2.10

Apr 8, 2024

1.2.9

Mar 22, 2024

1.2.8

Mar 20, 2024

1.2.7

Mar 14, 2024

1.2.6

Mar 9, 2024

1.2.5

Mar 5, 2024

1.2.4

Feb 20, 2024

1.2.3

Feb 20, 2024

1.2.2

Feb 5, 2024

1.2.1

Feb 3, 2024

1.2.1a1 pre-release

Feb 3, 2024

1.2.0

Feb 2, 2024

1.2.0rc1 pre-release

Jan 31, 2024

1.2.0a7 pre-release

Jan 30, 2024

1.2.0a6 pre-release

Jan 26, 2024

1.2.0a5 pre-release

Jan 20, 2024

1.2.0a4 pre-release

Jan 19, 2024

1.2.0a3 pre-release

Jan 19, 2024

1.2.0a2 pre-release

Jan 18, 2024

1.2.0a1 pre-release

Jan 17, 2024

1.2.0a0 pre-release

Jan 9, 2024

1.1.11

Dec 28, 2023

1.1.10

Nov 20, 2023

1.1.9

Nov 7, 2023

1.1.8

Nov 3, 2023

1.1.7

Oct 12, 2023

1.1.6

Sep 8, 2023

1.1.5

Sep 1, 2023

1.1.4

Aug 29, 2023

1.1.3

Aug 24, 2023

1.1.2

Aug 22, 2023

1.1.1

Aug 1, 2023

1.1.0

Jul 24, 2023

1.0.25

Jul 20, 2023

1.0.24

Jul 19, 2023

1.0.23

Jun 29, 2023

1.0.22

Jun 12, 2023

1.0.21

Jun 6, 2023

1.0.20

May 9, 2023

1.0.19

Apr 26, 2023

1.0.18

Apr 14, 2023

1.0.17

Apr 6, 2023

1.0.16

Mar 14, 2023

1.0.15

Feb 15, 2023

1.0.14

Feb 8, 2023

1.0.13

Jan 19, 2023

1.0.12

Dec 8, 2022

1.0.11

Dec 7, 2022

1.0.10

Nov 9, 2022

1.0.9

Nov 9, 2022

1.0.8

Nov 1, 2022

1.0.7

Oct 3, 2022

1.0.6 yanked

Sep 27, 2022

Reason this release was yanked:

A critical module import issue has been introduced in version 1.0.6. Please see release note https://github.com/bentoml/BentoML/releases/tag/v1.0.7

1.0.5

Aug 30, 2022

1.0.4

Aug 26, 2022

1.0.3

Aug 8, 2022

1.0.2

Jul 29, 2022

1.0.0

Jul 13, 2022

1.0.0rc3 pre-release

Jul 1, 2022

1.0.0rc2 pre-release

Jun 22, 2022

1.0.0rc1 pre-release

Jun 8, 2022

1.0.0rc0 pre-release

May 30, 2022

1.0.0a7 pre-release

Apr 6, 2022

1.0.0a6 pre-release

Mar 7, 2022

1.0.0a5 pre-release

Mar 1, 2022

1.0.0a4 pre-release

Feb 15, 2022

1.0.0a3 pre-release

Jan 28, 2022

1.0.0a2 pre-release

Jan 20, 2022

1.0.0a1 pre-release

Dec 14, 2021

1.0.0.dev1 pre-release yanked

Dec 13, 2021

1.0.0.dev0 pre-release yanked

Dec 13, 2021

0.13.2

Jul 20, 2022

0.13.1

Jul 13, 2021

0.13.0

Jun 16, 2021

0.12.1

Apr 5, 2021

0.12.0

Mar 22, 2021

0.11.0

Jan 14, 2021

0.11.dev0 pre-release

Jan 14, 2021

0.10.1

Dec 10, 2020

0.10.0

Dec 7, 2020

0.9.2

Oct 17, 2020

0.9.1

Oct 1, 2020

0.9.0

Sep 25, 2020

0.9.0rc0 pre-release

Sep 21, 2020

0.8.6

Aug 25, 2020

0.8.5

Aug 11, 2020

0.8.4

Aug 7, 2020

0.8.3

Jul 6, 2020

0.8.2 yanked

Jun 26, 2020

0.8.1

Jun 15, 2020

0.8.0 yanked

Jun 15, 2020

0.7.8

May 27, 2020

0.7.7

May 18, 2020

0.7.6

May 15, 2020

0.7.5

May 7, 2020

0.7.4

Apr 30, 2020

0.7.3

Apr 14, 2020

0.7.2

Apr 3, 2020

0.7.1

Apr 3, 2020

0.7.0

Apr 3, 2020

0.6.3

Mar 5, 2020

0.6.2

Feb 11, 2020

0.6.1

Jan 24, 2020

0.6.0

Jan 23, 2020

0.5.8

Jan 8, 2020

0.5.7

Jan 6, 2020

0.5.6

Dec 22, 2019

0.5.5

Dec 20, 2019

0.5.4

Dec 19, 2019

0.5.3

Nov 28, 2019

0.5.2

Nov 26, 2019

0.5.1

Nov 26, 2019

0.5.0

Nov 21, 2019

0.4.9

Nov 11, 2019

0.4.8

Oct 24, 2019

0.4.7

Oct 17, 2019

0.4.5

Oct 17, 2019

0.4.4

Oct 14, 2019

0.4.3

Oct 9, 2019

0.4.2

Sep 25, 2019

0.4.1

Sep 17, 2019

0.4.0

Sep 17, 2019

0.3.4

Aug 7, 2019

0.3.3

Aug 7, 2019

0.3.1

Jul 25, 2019

0.3.0

Jul 17, 2019

0.2.2

Jul 10, 2019

0.2.1

Jun 24, 2019

0.2.0

May 21, 2019

0.1.2

May 1, 2019

0.1.1

Apr 25, 2019

0.0.9

Apr 18, 2019

This version

0.0.8.post1

Apr 12, 2019

0.0.8

Apr 10, 2019

0.0.7

Apr 4, 2019

0.0.7.dev0 pre-release

Apr 10, 2019

0.0.6a0 pre-release

Apr 2, 2019

0.0.5

Apr 2, 2019

0.0.3

Jan 16, 2019

0.0.2

Jan 16, 2019

0.0.1

Jan 15, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

BentoML-0.0.8.post1-py3-none-any.whl (88.5 kB view details)

Uploaded Apr 12, 2019 Python 3

File details

Details for the file BentoML-0.0.8.post1-py3-none-any.whl.

File metadata

Download URL: BentoML-0.0.8.post1-py3-none-any.whl
Upload date: Apr 12, 2019
Size: 88.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/40.6.3 requests-toolbelt/0.8.0 tqdm/4.29.1 CPython/3.7.2

File hashes

Hashes for BentoML-0.0.8.post1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`12ce6b006a82c2bd20a27f9ec28af5f56d43936e655fd6e6a7db20aeda8c53a3`
MD5	`e9aca2bd651973a6b316686a26faebf0`
BLAKE2b-256	`2f903910c95c7aa9c2b18188c3e5f564f02cebd0dff9733719017a8ed4465af1`

See more details on using hashes here.

bentoml 0.0.8.post1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

BentoML

Installation

Getting Started

Serving a BentoArchive via REST API

Build API server Docker Image from BentoArchive

Loading BentoArchive in Python

Install BentoArchive as PyPI package

Loading BentoArchive from CLI

Examples

More About BentoML

Releases and Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes