The new inference engine for Computer Vision models

These details have not been verified by PyPI

Project description

🚀 What is inference-models?

inference-models is the library to make predictions from computer vision models provided by Roboflow — designed to be fast, reliable, and user-friendly. It offers:

Multi-Backend Support: Run models with PyTorch, ONNX, TensorRT, or Hugging Face backends
Automatic Model Loading: Smart model resolution and backend selection
Minimal Dependencies: Composable extras system for installing only what you need
Behavior-Based Interfaces: Models with similar behavior share consistent APIs; custom models can define their own
Full Roboflow Platform Support: Run any model trained on Roboflow

Visit our documentation for more information.

🛣️ Roadmap

With release 0.19.0, we have reached the first stable release of inference-models and fully integrated the package to inference - our main inference package, making it selectable backend for running predictions from models.

We are still making changes to add new features and models. API should be fairly stable already, but the problems may still occur. If you encounter any issues, please report them.

💻 Installation

CPU installation:

uv pip install inference-models
# or with pip
pip install inference-models

inference-models can be installed with CUDA and TensorRT support - see Installation Guide for more options.

🏃‍➡️ Usage

Pretrained Models

Load and run a pretrained model:

import cv2
import supervision as sv
from inference_models import AutoModel

# Load pretrained model from Roboflow
model = AutoModel.from_pretrained("rfdetr-base")

# Run inference (works with numpy arrays or torch.Tensor)
image = cv2.imread("<path-to-your-image>")
predictions = model(image)

# Use with supervision
annotator = sv.BoxAnnotator()
annotated = annotator.annotate(image, predictions[0].to_supervision())

Your Roboflow Models

Load and run models trained on the Roboflow platform:

import cv2
import supervision as sv
from inference_models import AutoModel

# Load your custom model from Roboflow
model = AutoModel.from_pretrained(
    "<your-project>/<version>",
    api_key="<your-api-key>"  # model access secured with API key
)

# Run inference (works with numpy arrays or torch.Tensor)
image = cv2.imread("<path-to-your-image>")
predictions = model(image)

# Use with supervision
annotator = sv.BoxAnnotator()
annotated = annotator.annotate(image, predictions[0].to_supervision())

🧠 Supported Model Architectures

RFDetr
SAM models family
Vision-Language Models (Florence, PaliGemma, Qwen, SmolVLM, Moondream)
OCR (DocTR, EasyOCR, TrOCR)
YOLO
and many more

For detailed model documentation, see Supported Models.

🔧 Run your local models

Load your own model implementations from a local directory - models with architectures not in the main inference-models package. This is especially valuable for production deployment of custom models.

from inference_models import AutoModel

model = AutoModel.from_pretrained(
    "/path/to/my_custom_model",
    allow_local_code_packages=True
)

See Load Models from Local Packages for complete details on creating custom model packages.

📄 License

The inference-models package is licensed under Apache 2.0. Individual models may have different licenses - see the Supported Models for details.

Ready to get started? Head to the Quick Overview →

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.28.1

May 14, 2026

0.28.0

May 13, 2026

0.28.0rc2 pre-release

May 12, 2026

0.28.0rc1 pre-release

May 12, 2026

0.27.3rc3 pre-release

May 11, 2026

0.27.3rc2 pre-release

May 8, 2026

0.27.3rc1 pre-release

May 5, 2026

0.27.2

Apr 24, 2026

0.27.1

Apr 24, 2026

0.27.0

Apr 23, 2026

0.27.0rc1 pre-release

Apr 23, 2026

0.26.1

Apr 23, 2026

0.26.0

Apr 22, 2026

0.26.0rc1 pre-release

Apr 21, 2026

0.25.2

Apr 22, 2026

0.25.1

Apr 21, 2026

0.25.1rc2 pre-release

Apr 20, 2026

0.25.1rc1 pre-release

Apr 18, 2026

0.25.0 yanked

Apr 17, 2026

Reason this release was yanked:

bug in RF Instant model confidence filtering

0.25.0rc2 pre-release

Apr 15, 2026

0.25.0rc1 pre-release

Apr 13, 2026

0.24.4

Apr 17, 2026

0.24.3

Apr 10, 2026

0.24.2

Mar 31, 2026

0.24.2rc1 pre-release

Mar 31, 2026

0.24.1

Mar 30, 2026

0.24.0

Mar 27, 2026

0.23.0

Mar 26, 2026

This version

0.22.1

Mar 24, 2026

0.22.0

Mar 20, 2026

0.22.0rc1 pre-release

Mar 20, 2026

0.21.1

Mar 19, 2026

0.21.0

Mar 18, 2026

0.21.0rc1 pre-release

Mar 18, 2026

0.20.2

Mar 17, 2026

0.20.2rc1 pre-release

Mar 17, 2026

0.20.1

Mar 12, 2026

0.20.0

Mar 11, 2026

0.19.6rc2 pre-release

Mar 10, 2026

0.19.6rc1 pre-release

Mar 6, 2026

0.19.5

Mar 6, 2026

0.19.5rc1 pre-release

Mar 6, 2026

0.19.4

Mar 6, 2026

0.19.4rc7 pre-release

Mar 6, 2026

0.19.4rc6 pre-release

Mar 5, 2026

0.19.4rc5 pre-release

Mar 5, 2026

0.19.4rc4 pre-release

Mar 5, 2026

0.19.4rc3 pre-release

Mar 5, 2026

0.19.4rc1 pre-release

Mar 4, 2026

0.19.3

Mar 4, 2026

0.19.2

Mar 3, 2026

0.19.2rc1 pre-release

Feb 27, 2026

0.19.1

Feb 23, 2026

0.19.1rc1 pre-release

Feb 23, 2026

0.19.0

Feb 20, 2026

0.18.6rc17 pre-release

Feb 18, 2026

0.18.6rc16 pre-release

Feb 17, 2026

0.18.6rc15 pre-release

Feb 16, 2026

0.18.6rc14 pre-release

Feb 13, 2026

0.18.6rc13 pre-release

Feb 13, 2026

0.18.6rc12 pre-release

Feb 13, 2026

0.18.6rc11 pre-release

Feb 13, 2026

0.18.6rc10 pre-release

Feb 12, 2026

0.18.6rc9 pre-release

Feb 12, 2026

0.18.6rc8 pre-release

Feb 5, 2026

0.18.6rc7 pre-release

Feb 5, 2026

0.18.6rc6 pre-release

Feb 5, 2026

0.18.6rc5 pre-release

Feb 5, 2026

0.18.6rc4 pre-release

Feb 5, 2026

0.18.6rc3 pre-release

Feb 4, 2026

0.18.6rc2 pre-release

Feb 2, 2026

0.18.6rc1 pre-release

Jan 30, 2026

0.18.5

Jan 20, 2026

0.18.4

Jan 12, 2026

0.18.3

Dec 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inference_models-0.22.1.tar.gz (1.7 MB view details)

Uploaded Mar 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

inference_models-0.22.1-py3-none-any.whl (1.8 MB view details)

Uploaded Mar 24, 2026 Python 3

File details

Details for the file inference_models-0.22.1.tar.gz.

File metadata

Download URL: inference_models-0.22.1.tar.gz
Upload date: Mar 24, 2026
Size: 1.7 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for inference_models-0.22.1.tar.gz
Algorithm	Hash digest
SHA256	`5b59ec492ebd914f9fb44f369404ccecc3458e62a365e9088a0d6191ff630130`
MD5	`a56aae1817b19e906c95ad49675cb488`
BLAKE2b-256	`8324d5bfecd77cf41384403e42130b21382c99a249b69c5ff9afd467e3abae37`

See more details on using hashes here.

Provenance

The following attestation bundles were made for inference_models-0.22.1.tar.gz:

Publisher: publish.pypi.inference_exp.yml on roboflow/inference

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: inference_models-0.22.1.tar.gz
- Subject digest: 5b59ec492ebd914f9fb44f369404ccecc3458e62a365e9088a0d6191ff630130
- Sigstore transparency entry: 1174007486
- Sigstore integration time: Mar 24, 2026
Source repository:
- Permalink: roboflow/inference@494de1b12ff0305b750ba3d28c1a512522ba80e1
- Branch / Tag: refs/heads/feature/respect-max-rfdetr-input-size-in-inference-models
- Owner: https://github.com/roboflow
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: self-hosted
- Publication workflow: publish.pypi.inference_exp.yml@494de1b12ff0305b750ba3d28c1a512522ba80e1
- Trigger Event: workflow_dispatch

File details

Details for the file inference_models-0.22.1-py3-none-any.whl.

File metadata

Download URL: inference_models-0.22.1-py3-none-any.whl
Upload date: Mar 24, 2026
Size: 1.8 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for inference_models-0.22.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c0fb56e1a30b7be2a53a6764fdf12f9d056f23ced14786ee7c2fb706444e81aa`
MD5	`61cb6fe31624920afa98099a086efa50`
BLAKE2b-256	`d1a913c35837ecc2b13e0cc1988db0efbd1df80eeaa6456adc4c3d8f2f657381`

See more details on using hashes here.

Provenance

The following attestation bundles were made for inference_models-0.22.1-py3-none-any.whl:

Publisher: publish.pypi.inference_exp.yml on roboflow/inference

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: inference_models-0.22.1-py3-none-any.whl
- Subject digest: c0fb56e1a30b7be2a53a6764fdf12f9d056f23ced14786ee7c2fb706444e81aa
- Sigstore transparency entry: 1174007537
- Sigstore integration time: Mar 24, 2026
Source repository:
- Permalink: roboflow/inference@494de1b12ff0305b750ba3d28c1a512522ba80e1
- Branch / Tag: refs/heads/feature/respect-max-rfdetr-input-size-in-inference-models
- Owner: https://github.com/roboflow
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: self-hosted
- Publication workflow: publish.pypi.inference_exp.yml@494de1b12ff0305b750ba3d28c1a512522ba80e1
- Trigger Event: workflow_dispatch

inference-models 0.22.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

🚀 What is inference-models?

🛣️ Roadmap

💻 Installation

🏃‍➡️ Usage

Pretrained Models

Your Roboflow Models

🧠 Supported Model Architectures

🔧 Run your local models

📄 License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance