BentoML: The Unified Model Serving Framework

These details have not been verified by PyPI

Project links

Project description

The Unified Model Serving Framework

BentoML makes it easy to create Machine Learning services that are ready to deploy and scale.

👉 Join our Slack community today!

✨ Looking deploy your ML service quickly? Checkout BentoML Cloud for the easiest and fastest way to deploy your bento.

Getting Started

Documentation - Overview of the BentoML docs and related resources
Tutorial: Intro to BentoML - Learn by doing! In under 10 minutes, you'll serve a model via REST API and generate a docker image for deployment.
Main Concepts - A step-by-step tour for learning main concepts in BentoML
Examples - Gallery of sample projects using BentoML
ML Framework Guides - Best practices and example usages by the ML framework of your choice
Advanced Guides - Learn about BentoML's internals, architecture and advanced features
Need help? Join BentoML Community Slack 💬

Highlights

🍭 Unified Model Serving API

Framework-agnostic model packaging for Tensorflow, PyTorch, XGBoost, Scikit-Learn, ONNX, and many more!
Write custom Python code alongside model inference for pre/post-processing and business logic
Apply the same code for online(REST API or gRPC), offline batch, and streaming inference
Simple abstractions for building multi-model inference pipelines or graphs

🚂 Standardized process for a frictionless transition to production

Build Bento as the standard deployable artifact for ML services
Automatically generate docker images with the desired dependencies
Easy CUDA setup for inference with GPU
Rich integration with the MLOps ecosystem, including Kubeflow, Airflow, MLFlow, Triton

🏹 Scalable with powerful performance optimizations

Adaptive batching dynamically groups inference requests on server-side optimal performance
Runner abstraction scales model inference separately from your custom code
Maximize your GPU and multi-core CPU utilization with automatic provisioning

🎯 Deploy anywhere in a DevOps-friendly way

Streamline production deployment workflow via:
- ☁️ BentoML Cloud: the fastest way to deploy your bento, simple and at scale
- 🦄️ Yatai: Model Deployment at scale on Kubernetes
- 🚀 bentoctl: Fast model deployment on AWS SageMaker, Lambda, ECE, GCP, Azure, Heroku, and more!
Run offline batch inference jobs with Spark or Dask
Built-in support for Prometheus metrics and OpenTelemetry
Flexible APIs for advanced CI/CD workflows

How it works

Save your trained model with BentoML:

import bentoml

saved_model = bentoml.pytorch.save_model(
    "demo_mnist", # model name in the local model store
    model, # model instance being saved
)

print(f"Model saved: {saved_model}")
# Model saved: Model(tag="demo_mnist:3qee3zd7lc4avuqj", path="~/bentoml/models/demo_mnist/3qee3zd7lc4avuqj/")

Define a prediction service in a service.py file:

import numpy as np
import bentoml
from bentoml.io import NumpyNdarray, Image
from PIL.Image import Image as PILImage

mnist_runner = bentoml.pytorch.get("demo_mnist:latest").to_runner()

svc = bentoml.Service("pytorch_mnist", runners=[mnist_runner])

@svc.api(input=Image(), output=NumpyNdarray(dtype="int64"))
def predict(input_img: PILImage):
    img_arr = np.array(input_img)/255.0
    input_arr = np.expand_dims(img_arr, 0).astype("float32")
    output_tensor = mnist_runner.predict.run(input_arr)
    return output_tensor.numpy()

Create a bentofile.yaml build file for your ML service:

service: "service:svc"
include:
  - "*.py"
python:
  packages:
    - numpy
    - torch
    - Pillow

Now, run the prediction service:

bentoml serve

Sent a prediction request:

curl -F 'image=@samples/1.png' http://127.0.0.1:3000/predict_image

Build a Bento and generate a docker image:

$ bentoml build
Successfully built Bento(tag="pytorch_mnist:4mymorgurocxjuqj") at "~/bentoml/bentos/pytorch_mnist/4mymorgurocxjuqj/"

$ bentoml containerize pytorch_mnist:4mymorgurocxjuqj
Successfully built docker image "pytorch_mnist:4mymorgurocxjuqj"

$ docker run -p 3000:3000 pytorch_mnist:4mymorgurocxjuqj
Starting production BentoServer from "pytorch_mnist:4mymorgurocxjuqj" running on http://0.0.0.0:3000

For a more detailed user guide, check out the BentoML Tutorial.

Community

For general questions and support, join the community slack.
To receive release notification, star & watch the BentoML project on GitHub.
To report a bug or suggest a feature request, use GitHub Issues.
To stay informed with community updates, follow the BentoML Blog and @bentomlai on Twitter.

Contributing

There are many ways to contribute to the project:

If you have any feedback on the project, share it under the #bentoml-contributors channel in the community slack.
Report issues you're facing and "Thumbs up" on issues and feature requests that are relevant to you.
Investigate bugs and reviewing other developer's pull requests.
Contributing code or documentation to the project by submitting a GitHub pull request. Check out the Development Guide.
Learn more in the contributing guide.

Contributors

Thanks to all of our amazing contributors!

Usage Reporting

BentoML collects usage data that helps our team to improve the product. Only BentoML's internal API calls are being reported. We strip out as much potentially sensitive information as possible, and we will never collect user code, model data, model names, or stack traces. Here's the code for usage tracking. You can opt-out of usage tracking by the --do-not-track CLI option:

bentoml [command] --do-not-track

Or by setting environment variable BENTOML_DO_NOT_TRACK=True:

export BENTOML_DO_NOT_TRACK=True

License

Apache License 2.0

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.4.36

Mar 6, 2026

1.4.35

Feb 3, 2026

1.4.34

Jan 26, 2026

1.4.33

Jan 12, 2026

1.4.32

Jan 9, 2026

1.4.31 yanked

Dec 23, 2025

1.4.30

Nov 27, 2025

1.4.29

Nov 17, 2025

1.4.28

Oct 29, 2025

1.4.27

Oct 20, 2025

1.4.26

Oct 10, 2025

1.4.25

Sep 24, 2025

1.4.24

Sep 17, 2025

1.4.23

Sep 5, 2025

1.4.22

Aug 27, 2025

1.4.21

Aug 14, 2025

1.4.20

Aug 13, 2025

1.4.19

Jul 29, 2025

1.4.18

Jul 24, 2025

1.4.17

Jun 30, 2025

1.4.16

Jun 16, 2025

1.4.15

May 23, 2025

1.4.14

May 20, 2025

1.4.13

May 9, 2025

1.4.12

Apr 29, 2025

1.4.11

Apr 22, 2025

1.4.10

Apr 17, 2025

1.4.9 yanked

Apr 17, 2025

Reason this release was yanked:

bad release

1.4.8

Apr 8, 2025

1.4.7

Mar 28, 2025

1.4.6

Mar 25, 2025

1.4.5

Mar 14, 2025

1.4.4

Mar 14, 2025

1.4.3

Mar 6, 2025

1.4.2

Feb 28, 2025

1.4.1

Feb 25, 2025

1.4.0

Feb 20, 2025

1.4.0a2 pre-release

Feb 14, 2025

1.4.0a1 pre-release

Feb 14, 2025

1.3.22

Feb 13, 2025

1.3.21

Feb 4, 2025

1.3.20

Jan 17, 2025

1.3.19

Jan 6, 2025

1.3.18

Dec 27, 2024

1.3.17

Dec 24, 2024

1.3.16

Dec 12, 2024

1.3.15

Dec 2, 2024

1.3.14

Nov 21, 2024

1.3.13

Nov 18, 2024

1.3.12

Nov 14, 2024

1.3.11

Nov 8, 2024

1.3.10

Oct 29, 2024

1.3.9

Oct 16, 2024

1.3.8

Oct 11, 2024

1.3.7

Sep 25, 2024

1.3.6

Sep 25, 2024

1.3.5

Sep 10, 2024

1.3.4.post1

Sep 6, 2024

1.3.3

Aug 23, 2024

1.3.2

Aug 14, 2024

1.3.1

Aug 2, 2024

1.3.0

Jul 19, 2024

1.3.0a3 pre-release

Jul 18, 2024

1.3.0a2 pre-release

Jul 16, 2024

1.3.0a1 pre-release

Jul 12, 2024

1.2.20

Jul 12, 2024

1.2.19

Jun 25, 2024

1.2.18

Jun 14, 2024

1.2.17

Jun 3, 2024

1.2.16

May 16, 2024

1.2.15

May 9, 2024

1.2.14

May 8, 2024

1.2.13

May 6, 2024

1.2.12

Apr 19, 2024

1.2.11

Apr 12, 2024

1.2.10

Apr 8, 2024

1.2.9

Mar 22, 2024

1.2.8

Mar 20, 2024

1.2.7

Mar 14, 2024

1.2.6

Mar 9, 2024

1.2.5

Mar 5, 2024

1.2.4

Feb 20, 2024

1.2.3

Feb 20, 2024

1.2.2

Feb 5, 2024

1.2.1

Feb 3, 2024

1.2.1a1 pre-release

Feb 3, 2024

1.2.0

Feb 2, 2024

1.2.0rc1 pre-release

Jan 31, 2024

1.2.0a7 pre-release

Jan 30, 2024

1.2.0a6 pre-release

Jan 26, 2024

1.2.0a5 pre-release

Jan 20, 2024

1.2.0a4 pre-release

Jan 19, 2024

1.2.0a3 pre-release

Jan 19, 2024

1.2.0a2 pre-release

Jan 18, 2024

1.2.0a1 pre-release

Jan 17, 2024

1.2.0a0 pre-release

Jan 9, 2024

1.1.11

Dec 28, 2023

1.1.10

Nov 20, 2023

1.1.9

Nov 7, 2023

1.1.8

Nov 3, 2023

1.1.7

Oct 12, 2023

1.1.6

Sep 8, 2023

1.1.5

Sep 1, 2023

1.1.4

Aug 29, 2023

1.1.3

Aug 24, 2023

1.1.2

Aug 22, 2023

1.1.1

Aug 1, 2023

1.1.0

Jul 24, 2023

1.0.25

Jul 20, 2023

1.0.24

Jul 19, 2023

1.0.23

Jun 29, 2023

1.0.22

Jun 12, 2023

1.0.21

Jun 6, 2023

1.0.20

May 9, 2023

1.0.19

Apr 26, 2023

1.0.18

Apr 14, 2023

1.0.17

Apr 6, 2023

1.0.16

Mar 14, 2023

1.0.15

Feb 15, 2023

1.0.14

Feb 8, 2023

1.0.13

Jan 19, 2023

This version

1.0.12

Dec 8, 2022

1.0.11

Dec 7, 2022

1.0.10

Nov 9, 2022

1.0.9

Nov 9, 2022

1.0.8

Nov 1, 2022

1.0.7

Oct 3, 2022

1.0.6 yanked

Sep 27, 2022

Reason this release was yanked:

A critical module import issue has been introduced in version 1.0.6. Please see release note https://github.com/bentoml/BentoML/releases/tag/v1.0.7

1.0.5

Aug 30, 2022

1.0.4

Aug 26, 2022

1.0.3

Aug 8, 2022

1.0.2

Jul 29, 2022

1.0.0

Jul 13, 2022

1.0.0rc3 pre-release

Jul 1, 2022

1.0.0rc2 pre-release

Jun 22, 2022

1.0.0rc1 pre-release

Jun 8, 2022

1.0.0rc0 pre-release

May 30, 2022

1.0.0a7 pre-release

Apr 6, 2022

1.0.0a6 pre-release

Mar 7, 2022

1.0.0a5 pre-release

Mar 1, 2022

1.0.0a4 pre-release

Feb 15, 2022

1.0.0a3 pre-release

Jan 28, 2022

1.0.0a2 pre-release

Jan 20, 2022

1.0.0a1 pre-release

Dec 14, 2021

1.0.0.dev1 pre-release yanked

Dec 13, 2021

1.0.0.dev0 pre-release yanked

Dec 13, 2021

0.13.2

Jul 20, 2022

0.13.1

Jul 13, 2021

0.13.0

Jun 16, 2021

0.12.1

Apr 5, 2021

0.12.0

Mar 22, 2021

0.11.0

Jan 14, 2021

0.11.dev0 pre-release

Jan 14, 2021

0.10.1

Dec 10, 2020

0.10.0

Dec 7, 2020

0.9.2

Oct 17, 2020

0.9.1

Oct 1, 2020

0.9.0

Sep 25, 2020

0.9.0rc0 pre-release

Sep 21, 2020

0.8.6

Aug 25, 2020

0.8.5

Aug 11, 2020

0.8.4

Aug 7, 2020

0.8.3

Jul 6, 2020

0.8.2 yanked

Jun 26, 2020

0.8.1

Jun 15, 2020

0.8.0 yanked

Jun 15, 2020

0.7.8

May 27, 2020

0.7.7

May 18, 2020

0.7.6

May 15, 2020

0.7.5

May 7, 2020

0.7.4

Apr 30, 2020

0.7.3

Apr 14, 2020

0.7.2

Apr 3, 2020

0.7.1

Apr 3, 2020

0.7.0

Apr 3, 2020

0.6.3

Mar 5, 2020

0.6.2

Feb 11, 2020

0.6.1

Jan 24, 2020

0.6.0

Jan 23, 2020

0.5.8

Jan 8, 2020

0.5.7

Jan 6, 2020

0.5.6

Dec 22, 2019

0.5.5

Dec 20, 2019

0.5.4

Dec 19, 2019

0.5.3

Nov 28, 2019

0.5.2

Nov 26, 2019

0.5.1

Nov 26, 2019

0.5.0

Nov 21, 2019

0.4.9

Nov 11, 2019

0.4.8

Oct 24, 2019

0.4.7

Oct 17, 2019

0.4.5

Oct 17, 2019

0.4.4

Oct 14, 2019

0.4.3

Oct 9, 2019

0.4.2

Sep 25, 2019

0.4.1

Sep 17, 2019

0.4.0

Sep 17, 2019

0.3.4

Aug 7, 2019

0.3.3

Aug 7, 2019

0.3.1

Jul 25, 2019

0.3.0

Jul 17, 2019

0.2.2

Jul 10, 2019

0.2.1

Jun 24, 2019

0.2.0

May 21, 2019

0.1.2

May 1, 2019

0.1.1

Apr 25, 2019

0.0.9

Apr 18, 2019

0.0.8.post1

Apr 12, 2019

0.0.8

Apr 10, 2019

0.0.7

Apr 4, 2019

0.0.7.dev0 pre-release

Apr 10, 2019

0.0.6a0 pre-release

Apr 2, 2019

0.0.5

Apr 2, 2019

0.0.3

Jan 16, 2019

0.0.2

Jan 16, 2019

0.0.1

Jan 15, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bentoml-1.0.12.tar.gz (17.1 MB view details)

Uploaded Dec 8, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

bentoml-1.0.12-py3-none-any.whl (909.7 kB view details)

Uploaded Dec 8, 2022 Python 3

File details

Details for the file bentoml-1.0.12.tar.gz.

File metadata

Download URL: bentoml-1.0.12.tar.gz
Upload date: Dec 8, 2022
Size: 17.1 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.8

File hashes

Hashes for bentoml-1.0.12.tar.gz
Algorithm	Hash digest
SHA256	`cca3e7d882ce83ae0319f7cffd938acfa8a55b41f6206464fbb0f95fdaad6842`
MD5	`3af6c8c1e8f126474f59b19fa54d27ee`
BLAKE2b-256	`0cc274544bb24d1f3927f3d53a38bfd6821ae952dcb4b6bc98f83430d9d6879e`

See more details on using hashes here.

File details

Details for the file bentoml-1.0.12-py3-none-any.whl.

File metadata

Download URL: bentoml-1.0.12-py3-none-any.whl
Upload date: Dec 8, 2022
Size: 909.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.8

File hashes

Hashes for bentoml-1.0.12-py3-none-any.whl
Algorithm	Hash digest
SHA256	`5acb60d70b5a45539a7498a7aed8076494a6eba96c51c439fe852d7f60f08faf`
MD5	`8d27ba07140045cf2f86986929266af2`
BLAKE2b-256	`c30e912e126bfac5c0d7b9ae8cd978ab06afb4144b38112293457e93abb6f5cf`

See more details on using hashes here.

bentoml 1.0.12

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

The Unified Model Serving Framework

Getting Started

Highlights

How it works

Community

Contributing

Contributors

Usage Reporting

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes