bentoml: A framework for machine learning model serving
Project description
🍱 BentoML: The Unified Model Serving Framework
BentoML is an open platform that simplifies ML model deployment and enables you to serve your models at production scale in minutes.
👉 Pop into our Slack community! We're happy to help with any issue you face or even just to meet you and hear what you're working on :)
BentoML version 1.0 is around the corner. For the stable release, see the 0.13-LTS branch. Version 1.0 is under active development; you can be of great help by testing out the preview release, reporting issues, contributing to the documentation, and creating sample gallery projects.
Why BentoML
- The easiest way to turn your ML models into production-ready API endpoints.
- High-performance model serving, all in Python.
- Standardize model packaging and ML service definition to streamline deployment.
- Support all major machine-learning training frameworks.
- Deploy and operate ML serving workload at scale on Kubernetes via Yatai.
Getting Started
- Quickstart guide shows BentoML in action: in under 10 minutes, you'll be able to serve your ML model over an HTTP API endpoint and build a docker image that is ready to be deployed in production (see the sketch after this list).
- Main concepts gives a comprehensive tour of BentoML's components and introduces you to its philosophy. After reading, you will see what drives BentoML's design and know what `bento` and `runner` stand for.
- ML Frameworks lays out best practices and example usage for each ML framework used for training models.
- Advanced Guides showcases advanced features in BentoML, including GPU support, inference graphs, monitoring, and customizing the docker environment.
- Check out other projects from the BentoML team:
- 🦄️ Yatai: Model Deployment at scale on Kubernetes
- 🚀 bentoctl: Fast model deployment on any cloud platform
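As a taste of what the Quickstart covers, here is a minimal sketch of a BentoML 1.0-style service definition. It assumes a scikit-learn model has already been saved under the hypothetical tag iris_clf; the exact runner-creation and invocation calls may differ slightly between 1.0 preview releases.

```python
# service.py -- a minimal sketch of a BentoML 1.0 service definition.
# The model tag "iris_clf:latest" and the service name are hypothetical.
import numpy as np

import bentoml
from bentoml.io import NumpyNdarray

# Load a previously saved scikit-learn model as a Runner
iris_clf_runner = bentoml.sklearn.get("iris_clf:latest").to_runner()

# Declare the service and the runners it depends on
svc = bentoml.Service("iris_classifier", runners=[iris_clf_runner])

# Expose an HTTP API endpoint that accepts and returns NumPy arrays
@svc.api(input=NumpyNdarray(), output=NumpyNdarray())
def classify(input_series: np.ndarray) -> np.ndarray:
    return iris_clf_runner.predict.run(input_series)
```

With a file like this saved as service.py, `bentoml serve service.py:svc` starts a local HTTP server, and `bentoml build` followed by `bentoml containerize` is the usual route to the docker image mentioned in the Quickstart.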
Community
- To report a bug or suggest a feature request, use GitHub Issues.
- For other discussions, use GitHub Discussions.
- To receive release announcements, please join us on Slack.
Contributing
There are many ways to contribute to the project:
- If you have any feedback on the project, share it with the community in this project's GitHub Discussions.
- Report issues you're facing and "Thumbs up" on issues and feature requests that are relevant to you.
- Investigate bugs and review other developers' pull requests.
- Contribute code or documentation to the project by submitting a GitHub pull request. See the development guide.
- See more in the contributing guide.
Usage Reporting
BentoML collects anonymous usage data that helps our team improve the product. Only BentoML's internal API calls and CLI commands are reported. We strip out as much potentially sensitive information as possible, and we will never collect user code, model data, model names, or stack traces.
Here's the code for usage tracking.
You can opt out of usage tracking with the --do-not-track CLI option:

bentoml [command] --do-not-track

or by setting the environment variable BENTOML_DO_NOT_TRACK=True:

export BENTOML_DO_NOT_TRACK=True
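The same opt-out can also be applied from Python by exporting the variable before bentoml is imported; a minimal sketch, assuming only the BENTOML_DO_NOT_TRACK variable documented above:

```python
# Disable BentoML usage reporting for this process by setting the
# BENTOML_DO_NOT_TRACK environment variable before importing bentoml.
import os

os.environ["BENTOML_DO_NOT_TRACK"] = "True"

import bentoml  # imported after the opt-out flag is set
```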
License
Project details
Release history
Download files
- Source Distribution: bentoml-1.0.0a7.tar.gz
- Built Distribution: bentoml-1.0.0a7-py3-none-any.whl
File details
Details for the file bentoml-1.0.0a7.tar.gz.
File metadata
- Download URL: bentoml-1.0.0a7.tar.gz
- Upload date:
- Size: 515.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.12
File hashes
Algorithm | Hash digest
---|---
SHA256 | 6e04e747b8068636b0e82768e195ad2e82a0a167291f48ddf68f36cb3cd353cb
MD5 | 0162b3b91272213f7c489677c903f65e
BLAKE2b-256 | 60ecf21c1c527868ccd70881aa8717b6aaf06bb410991a7c75589e201c4dafa1
File details
Details for the file bentoml-1.0.0a7-py3-none-any.whl.
File metadata
- Download URL: bentoml-1.0.0a7-py3-none-any.whl
- Upload date:
- Size: 602.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.0 CPython/3.8.12
File hashes
Algorithm | Hash digest
---|---
SHA256 | 59f783b5c31803a28f9e4c2248c8691cc400e1472f86516a9fb4a27676c655c6
MD5 | 0e06ee6b60039b5d3365020a9bc32bca
BLAKE2b-256 | c39d5edb5e6e83fb4e1225b053916c52a45ea070398b8884f606c00d5882514c