The new inference engine for Computer Vision models

These details have not been verified by PyPI

Project description

🚀 What is inference-models?

inference-models is a library for making predictions with computer vision models provided by Roboflow. It is designed to be fast, reliable, and user-friendly, and it offers:

Multi-Backend Support: Run models with PyTorch, ONNX, TensorRT, or Hugging Face backends
Automatic Model Loading: Smart model resolution and backend selection
Minimal Dependencies: Composable extras system for installing only what you need
Behavior-Based Interfaces: Models with similar behavior share consistent APIs; custom models can define their own
Full Roboflow Platform Support: Run any model trained on Roboflow
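
As a rough illustration of what a behavior-based interface can mean in practice, the sketch below writes one helper against the two calls used in the usage examples of this README, `model(image)` and `predictions[0].to_supervision()`. The `Protocol` classes and the `first_detections` helper are hypothetical names introduced here for illustration; the library's real base classes may look different.

```python
from typing import Any, Protocol, Sequence


class SupportsSupervision(Protocol):
    """Hypothetical: any prediction object that can convert itself for supervision."""

    def to_supervision(self) -> Any: ...


class DetectionLike(Protocol):
    """Hypothetical: any model that is callable on an image and returns predictions."""

    def __call__(self, image: Any) -> Sequence[SupportsSupervision]: ...


def first_detections(model: DetectionLike, image: Any) -> Any:
    """Run the model and convert the first prediction, whatever the backend is."""
    predictions = model(image)
    return predictions[0].to_supervision()
```

Because the helper depends only on behavior, any model object exposing these two calls could be passed in without changing the calling code.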

[!NOTE] Roadmap for inference-models

We are still changing the API and adding new features. The API should already be fairly stable, but if you use inference-models in production, pin a specific version and review our roadmap.
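
One way to act on the pinning advice is to install an exact release (for example `pip install "inference-models==0.18.5"`) and check at startup that the environment still matches. The sketch below uses only the standard library; the helper names and the `0.18.5` pin are illustrative assumptions, not part of the package.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(package: str) -> Optional[str]:
    """Return the installed version of `package`, or None if it is not installed."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def matches_pin(package: str, pinned: str) -> bool:
    """True only when `package` is installed at exactly the `pinned` version."""
    return installed_version(package) == pinned


if __name__ == "__main__":
    # "0.18.5" is an example pin; substitute the release you validated.
    if not matches_pin("inference-models", "0.18.5"):
        print("inference-models is missing or differs from the pinned version")
```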

🛣️ Roadmap

We're actively working toward stabilizing inference-models and integrating it into the main inference package. The plan is to:

Stabilize the API - Finalize the core interfaces and ensure backward compatibility
Integrate with inference - Make inference-models available as a selectable backend in the inference package
Production deployment - Enable users to choose between the classic inference backend and the new inference-models backend
Gradual migration - Provide a smooth transition path for existing users

We're sharing this preview to gather valuable community feedback that will help us shape the final release. Your input is crucial in making this the best inference experience possible!

💻 Installation

CPU installation:

uv pip install inference-models
# or with pip
pip install inference-models

inference-models can also be installed with CUDA and TensorRT support; see the Installation Guide for more options.

🏃‍➡️ Usage

Pretrained Models

Load and run a pretrained model:

import cv2
import supervision as sv
from inference_models import AutoModel

# Load pretrained model from Roboflow
model = AutoModel.from_pretrained("rfdetr-base")

# Run inference (works with numpy arrays or torch.Tensor)
image = cv2.imread("<path-to-your-image>")
predictions = model(image)

# Use with supervision
annotator = sv.BoxAnnotator()
annotated = annotator.annotate(image, predictions[0].to_supervision())
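
`cv2.imread` returns `None` instead of raising when a file is missing or cannot be decoded, so a bad path only surfaces later, inside the model call. A small guard like the following fails earlier with a clearer message. `load_image` is a hypothetical helper name, not part of inference-models.

```python
from pathlib import Path


def load_image(path: str):
    """Read an image as a BGR numpy array, failing loudly when it cannot be read."""
    if not Path(path).is_file():
        raise FileNotFoundError(f"No such image file: {path}")

    import cv2  # imported lazily so the path check works without OpenCV

    image = cv2.imread(path)
    if image is None:  # cv2.imread signals failure with None, not an exception
        raise ValueError(f"OpenCV could not decode: {path}")
    return image
```

The returned array can be passed to `model(image)` exactly as in the example above.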

Your Roboflow Models

Load and run models trained on the Roboflow platform:

import cv2
import supervision as sv
from inference_models import AutoModel

# Load your custom model from Roboflow
model = AutoModel.from_pretrained(
    "<your-project>/<version>",
    api_key="<your-api-key>"  # model access secured with API key
)

# Run inference (works with numpy arrays or torch.Tensor)
image = cv2.imread("<path-to-your-image>")
predictions = model(image)

# Use with supervision
annotator = sv.BoxAnnotator()
annotated = annotator.annotate(image, predictions[0].to_supervision())
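
To keep the API key out of source code, it can be read from the environment and passed to `AutoModel.from_pretrained` through the same `api_key` argument shown above. This is a minimal sketch: the variable name `ROBOFLOW_API_KEY` and both helper functions are assumptions made for illustration, so check the package documentation for any environment variables it reads on its own.

```python
import os
from typing import Mapping


def resolve_api_key(
    env: Mapping[str, str] = os.environ,
    name: str = "ROBOFLOW_API_KEY",  # assumed variable name, adjust to your setup
) -> str:
    """Return the API key stored in `env`, or raise if it is missing or blank."""
    key = env.get(name, "").strip()
    if not key:
        raise RuntimeError(f"Set the {name} environment variable before loading the model")
    return key


def load_private_model(model_id: str):
    """Load a Roboflow-hosted model, taking the key from the environment."""
    from inference_models import AutoModel  # imported lazily; requires inference-models

    return AutoModel.from_pretrained(model_id, api_key=resolve_api_key())
```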

🧠 Supported Model Architectures

RF-DETR
SAM model family
Vision-Language Models (Florence, PaliGemma, Qwen, SmolVLM, Moondream)
OCR (DocTR, EasyOCR, TrOCR)
YOLO
and many more

For detailed model documentation, see Supported Models.

🔧 Run your local models

Load your own model implementations from a local directory. This covers models whose architectures are not part of the main inference-models package, and it is especially valuable for production deployment of custom models.

from inference_models import AutoModel

model = AutoModel.from_pretrained(
    "/path/to/my_custom_model",
    allow_local_code_packages=True
)

See Load Models from Local Packages for complete details on creating custom model packages.
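
Judging by its name, `allow_local_code_packages=True` opts in to running code from the given directory, so it seems prudent to confirm that the path points where you expect before loading. The sketch below resolves and checks the directory first; `checked_package_dir` and `load_local_model` are hypothetical helpers, and only the `from_pretrained` call itself comes from the example above.

```python
from pathlib import Path


def checked_package_dir(path: str) -> str:
    """Resolve `path` and make sure it is an existing directory."""
    package_dir = Path(path).expanduser().resolve()
    if not package_dir.is_dir():
        raise NotADirectoryError(f"Model package directory not found: {package_dir}")
    return str(package_dir)


def load_local_model(path: str):
    """Load a model package from a verified local directory."""
    from inference_models import AutoModel  # imported lazily; requires inference-models

    return AutoModel.from_pretrained(
        checked_package_dir(path),
        allow_local_code_packages=True,
    )
```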

📄 License

The inference-models package is licensed under Apache 2.0. Individual models may have different licenses; see Supported Models for details.

Ready to get started? Head to the Quick Overview →

Release history

0.28.1

May 14, 2026

0.28.0

May 13, 2026

0.28.0rc2 pre-release

May 12, 2026

0.28.0rc1 pre-release

May 12, 2026

0.27.3rc3 pre-release

May 11, 2026

0.27.3rc2 pre-release

May 8, 2026

0.27.3rc1 pre-release

May 5, 2026

0.27.2

Apr 24, 2026

0.27.1

Apr 24, 2026

0.27.0

Apr 23, 2026

0.27.0rc1 pre-release

Apr 23, 2026

0.26.1

Apr 23, 2026

0.26.0

Apr 22, 2026

0.26.0rc1 pre-release

Apr 21, 2026

0.25.2

Apr 22, 2026

0.25.1

Apr 21, 2026

0.25.1rc2 pre-release

Apr 20, 2026

0.25.1rc1 pre-release

Apr 18, 2026

0.25.0 yanked

Apr 17, 2026

Reason this release was yanked:

bug in RF Instant model confidence filtering

0.25.0rc2 pre-release

Apr 15, 2026

0.25.0rc1 pre-release

Apr 13, 2026

0.24.4

Apr 17, 2026

0.24.3

Apr 10, 2026

0.24.2

Mar 31, 2026

0.24.2rc1 pre-release

Mar 31, 2026

0.24.1

Mar 30, 2026

0.24.0

Mar 27, 2026

0.23.0

Mar 26, 2026

0.22.1

Mar 24, 2026

0.22.0

Mar 20, 2026

0.22.0rc1 pre-release

Mar 20, 2026

0.21.1

Mar 19, 2026

0.21.0

Mar 18, 2026

0.21.0rc1 pre-release

Mar 18, 2026

0.20.2

Mar 17, 2026

0.20.2rc1 pre-release

Mar 17, 2026

0.20.1

Mar 12, 2026

0.20.0

Mar 11, 2026

0.19.6rc2 pre-release

Mar 10, 2026

0.19.6rc1 pre-release

Mar 6, 2026

0.19.5

Mar 6, 2026

0.19.5rc1 pre-release

Mar 6, 2026

0.19.4

Mar 6, 2026

0.19.4rc7 pre-release

Mar 6, 2026

0.19.4rc6 pre-release

Mar 5, 2026

0.19.4rc5 pre-release

Mar 5, 2026

0.19.4rc4 pre-release

Mar 5, 2026

0.19.4rc3 pre-release

Mar 5, 2026

0.19.4rc1 pre-release

Mar 4, 2026

0.19.3

Mar 4, 2026

0.19.2

Mar 3, 2026

0.19.2rc1 pre-release

Feb 27, 2026

0.19.1

Feb 23, 2026

0.19.1rc1 pre-release

Feb 23, 2026

0.19.0

Feb 20, 2026

0.18.6rc17 pre-release

Feb 18, 2026

0.18.6rc16 pre-release

Feb 17, 2026

0.18.6rc15 pre-release

Feb 16, 2026

0.18.6rc14 pre-release

Feb 13, 2026

0.18.6rc13 pre-release

Feb 13, 2026

0.18.6rc12 pre-release

Feb 13, 2026

0.18.6rc11 pre-release

Feb 13, 2026

0.18.6rc10 pre-release

Feb 12, 2026

0.18.6rc9 pre-release

Feb 12, 2026

0.18.6rc8 pre-release

Feb 5, 2026

0.18.6rc7 pre-release

Feb 5, 2026

0.18.6rc6 pre-release

Feb 5, 2026

0.18.6rc5 pre-release

Feb 5, 2026

0.18.6rc4 pre-release

Feb 5, 2026

0.18.6rc3 pre-release

Feb 4, 2026

0.18.6rc2 pre-release

Feb 2, 2026

0.18.6rc1 pre-release

Jan 30, 2026

This version

0.18.5

Jan 20, 2026

0.18.4

Jan 12, 2026

0.18.3

Dec 24, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

inference_models-0.18.5.tar.gz (262.3 kB)

Uploaded Jan 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

inference_models-0.18.5-py3-none-any.whl (395.0 kB)

Uploaded Jan 20, 2026 Python 3

File details

Details for the file inference_models-0.18.5.tar.gz.

File metadata

Download URL: inference_models-0.18.5.tar.gz
Upload date: Jan 20, 2026
Size: 262.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.12

File hashes

Hashes for inference_models-0.18.5.tar.gz
Algorithm	Hash digest
SHA256	`e48c47a6b4727fd42607bed9be31ededbd2d1e3fa3ac05f6b0b20338ddfcbd90`
MD5	`f8d8215083fea0616e2e599ca1e7cd4a`
BLAKE2b-256	`1377514d34a8a67391ded59be1c56ed85df679bc8d444ac1124fc6f9a5b219fb`

See more details on using hashes here.
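
A downloaded file can be checked against the published SHA256 digest with the standard library alone. The following is a minimal sketch: the function names are illustrative, and the local file path in the usage comment is an assumption about where the download was saved.

```python
import hashlib


def sha256_of_file(path: str, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: str, expected_hex: str) -> bool:
    """Compare a file's digest with a published one, ignoring case and whitespace."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Usage, assuming the sdist was saved to the current directory:
# matches_published_hash(
#     "inference_models-0.18.5.tar.gz",
#     "e48c47a6b4727fd42607bed9be31ededbd2d1e3fa3ac05f6b0b20338ddfcbd90",
# )
```

pip can enforce the same check at install time when a requirements file lists hashes and `--require-hashes` is used.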

File details

Details for the file inference_models-0.18.5-py3-none-any.whl.

File metadata

Download URL: inference_models-0.18.5-py3-none-any.whl
Upload date: Jan 20, 2026
Size: 395.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.7.12

File hashes

Hashes for inference_models-0.18.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`511d721567fdb04679cf65864f615634c0e5e4ee1b1ff12464fa70a3b7415576`
MD5	`199a750222af0a51ec75d5ae3c19c5dd`
BLAKE2b-256	`0cbe0d971913a4287627619484812b85ddd58c653edcc52a4c1e26b83b1d066d`


inference-models 0.18.5
