Skip to main content

Deploy mlflow models as JSON APIs with minimal new code.

Project description

fastapi mlflow

Deploy mlflow models as JSON APIs using FastAPI with minimal new code.

Installation

pip install fastapi-mlflow

For running the app in production, you will also need an ASGI server, such as Uvicorn or Hypercorn.

Install on Apple Silicon (ARM / M1)

If you experience problems installing on a newer generation Apple silicon based device, this solution from StackOverflow before retrying install has been found to help.

brew install openblas gfortran
export OPENBLAS="$(brew --prefix openblas)"

License

Copyright © 2022-26 Autotrader Group plc.

Apache-2.0

Examples

Simple

Create

Create a file main.py containing:

from fastapi_mlflow.applications import build_app
from mlflow.pyfunc import load_model

model = load_model("/Users/me/path/to/local/model")
app = build_app(model)

Run

Run the server with:

uvicorn main:app

Check

Open your browser at http://127.0.0.1:8000/docs

You should see the automatically generated docs for your model, and be able to test it out using the Try it out button in the UI.

Serve multiple models

It should be possible to host multiple models (assuming that they have compatible dependencies...) by leveraging FastAPIs Sub Applications:

from fastapi import FastAPI
from fastapi_mlflow.applications import build_app
from mlflow.pyfunc import load_model

app = FastAPI()

model1 = load_model("/Users/me/path/to/local/model1")
model1_app = build_app(model1)
app.mount("/model1", model1_app)

model2 = load_model("/Users/me/path/to/local/model2")
model2_app = build_app(model2)
app.mount("/model2", model2_app)

Run and Check as above.

Custom routing

If you want more control over where and how the prediction end-point is mounted in your API, you can build the predictor function directly and use it as you need:

from inspect import signature

from fastapi import FastAPI
from fastapi_mlflow.predictors import build_predictor
from mlflow.pyfunc import load_model

model = load_model("/Users/me/path/to/local/model")
predictor = build_predictor(model)
app = FastAPI()
app.add_api_route(
    "/classify",
    predictor,
    response_model=signature(predictor).return_annotation,
    methods=["POST"],
)

Run and Check as above.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fastapi_mlflow-0.8.0.tar.gz (8.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

fastapi_mlflow-0.8.0-py3-none-any.whl (10.4 kB view details)

Uploaded Python 3

File details

Details for the file fastapi_mlflow-0.8.0.tar.gz.

File metadata

  • Download URL: fastapi_mlflow-0.8.0.tar.gz
  • Upload date:
  • Size: 8.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.5

File hashes

Hashes for fastapi_mlflow-0.8.0.tar.gz
Algorithm Hash digest
SHA256 3f7ca2123a8e06fc5cfb9ee2deee66302b5fb45e8cfdb34ae87f178b5b8e3774
MD5 ac11aac77f29876e1fe9562115655ae7
BLAKE2b-256 e5eb75043089baecb47f3ceea0bf2e19fe1ab0a5446eec4e944cf9ba94f9fd86

See more details on using hashes here.

File details

Details for the file fastapi_mlflow-0.8.0-py3-none-any.whl.

File metadata

  • Download URL: fastapi_mlflow-0.8.0-py3-none-any.whl
  • Upload date:
  • Size: 10.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.5

File hashes

Hashes for fastapi_mlflow-0.8.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c997aafbf23db4641ab40dc28845c5dc1f2765bcd7ed3e6641942aaf51d83ebc
MD5 91765538d7e05a5dd7e41a7800c38367
BLAKE2b-256 56c25e53030df2e341bb06a68e1a3b7e58c90a6c6cad1bd1e20d8b2cc9d0c843

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page