inference·PyPI

With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference.

These details have not been verified by PyPI

Project description


Make Any Camera an AI Camera

Inference turns any computer or edge device into a command center for your computer vision projects.

🛠️ Self-host your own fine-tuned models
🧠 Access the latest and greatest foundation models (like Florence-2, CLIP, and SAM2)
🤝 Use Workflows to track, count, time, measure, and visualize
👁️ Combine ML with traditional CV methods (like OCR, Barcode Reading, QR, and template matching)
📈 Monitor, record, and analyze predictions
🎥 Manage cameras and video streams
📬 Send notifications when events happen
🛜 Connect with external systems and APIs
🔗 Extend with your own code and models
🚀 Deploy production systems at scale

See Example Workflows for common use cases like detecting small objects with SAHI, multi-model consensus, active learning, reading license plates, blurring faces, background removal, and more.

Time In Zone Workflow Example

🔥 quickstart

Install Docker (and the NVIDIA Container Toolkit for GPU acceleration if you have a CUDA-enabled GPU). Then run:

pip install inference-cli && inference server start --dev

This will pull the proper image for your machine and start it in development mode.
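If the command fails, a quick check that both prerequisites are on your PATH can narrow down the cause. The sketch below is an illustration, not part of Inference: `missing_tools` is a hypothetical helper, and it assumes the Docker CLI is installed as `docker` and that `inference-cli` provides an `inference` executable, as the command above suggests.

```python
import shutil
from typing import Iterable, List


def missing_tools(tools: Iterable[str]) -> List[str]:
    """Return the names from `tools` that cannot be found on PATH."""
    return [tool for tool in tools if shutil.which(tool) is None]


if __name__ == "__main__":
    # Executable names are assumptions for this sketch.
    missing = missing_tools(["docker", "inference"])
    if missing:
        print("Not found on PATH: " + ", ".join(missing))
    else:
        print("docker and inference are both available.")
```

This only confirms that the executables can be found; it does not verify that the Docker daemon is running or that GPU support is configured.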

In development mode, a Jupyter notebook server with a quickstart guide runs on http://localhost:9001/notebook/start. Dive in there for a whirlwind tour of your new Inference Server's functionality!

Now you're ready to connect your camera streams and start building & deploying Workflows in the UI or interacting with your new server via its API.

🛠️ build with Workflows

A key component of Inference is Workflows, composable blocks of common functionality that give models a common interface to make chaining and experimentation easy.

License Plate OCR Workflow Visualization

With Workflows, you can:

Detect, classify, and segment objects in images using state-of-the-art models.
Use Large Multimodal Models (LMMs) to make determinations at any stage in a workflow.
Seamlessly swap out models for a given task.
Chain models together.
Track, count, time, measure, and visualize objects.
Add business logic and extend functionality to work with your external systems.

Workflows allow you to extend simple model predictions to build computer vision micro-services that fit into a larger application or fully self-contained visual agents that run on a video stream.
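To make the idea of chaining blocks behind a common interface concrete, here is a toy sketch in plain Python. It does not use the Workflows API or its block definitions: every function name, the dict-in/dict-out convention, and the hard-coded "detections" are made up for illustration only.

```python
from typing import Any, Callable, Dict, List

# In this toy model, a "block" is any callable that takes a dict and returns a dict.
Block = Callable[[Dict[str, Any]], Dict[str, Any]]


def detect(data: Dict[str, Any]) -> Dict[str, Any]:
    # Stand-in for a detection model: returns fixed, made-up predictions.
    detections = [
        {"label": "car", "confidence": 0.91},
        {"label": "car", "confidence": 0.35},
        {"label": "person", "confidence": 0.80},
    ]
    return {**data, "detections": detections}


def keep_confident(data: Dict[str, Any], threshold: float = 0.5) -> Dict[str, Any]:
    # Business-logic step: drop low-confidence detections.
    kept = [d for d in data["detections"] if d["confidence"] >= threshold]
    return {**data, "detections": kept}


def count_by_label(data: Dict[str, Any]) -> Dict[str, Any]:
    # Analytics step: count the remaining detections per label.
    counts: Dict[str, int] = {}
    for d in data["detections"]:
        counts[d["label"]] = counts.get(d["label"], 0) + 1
    return {**data, "counts": counts}


def run_chain(blocks: List[Block], data: Dict[str, Any]) -> Dict[str, Any]:
    # Because every block shares one interface, steps can be reordered or swapped.
    for block in blocks:
        data = block(data)
    return data


result = run_chain([detect, keep_confident, count_by_label], {"image": "frame-0"})
print(result["counts"])  # {'car': 1, 'person': 1}
```

Swapping `detect` for another callable with the same signature leaves the rest of the chain untouched, which is the property the paragraph above describes.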

Learn more, read the Workflows docs, or start building.

Tutorial: Build an AI-Powered Self-Serve Checkout (created 2 Feb 2025). Make a computer vision app that identifies different pieces of hardware, calculates the total cost, and records the results to a database.
Tutorial: Intro to Workflows (created 6 Jan 2025). Learn how to build and deploy Workflows for common use cases like detecting vehicles, filtering detections, visualizing results, and calculating dwell time on a live video stream.
Tutorial: Build a Smart Parking System (created 27 Nov 2024). Build a smart parking lot management system using Roboflow Workflows! This tutorial covers license plate detection with YOLOv8, object tracking with ByteTrack, and real-time notifications with a Telegram bot.

📟 connecting via api

Once you've installed Inference, your machine is a fully featured CV center. You can use its API to run models and Workflows on images and video streams. By default, the server runs locally on localhost:9001.

To interface with your server via Python, use our SDK:

pip install inference-sdk

Then run an example model comparison Workflow like this:

from inference_sdk import InferenceHTTPClient

client = InferenceHTTPClient(
    api_url="http://localhost:9001", # use local inference server
    # api_key="<YOUR API KEY>" # optional to access your private data and models
)

result = client.run_workflow(
    workspace_name="roboflow-docs",
    workflow_id="model-comparison",
    images={
        "image": "https://media.roboflow.com/workflows/examples/bleachers.jpg"
    },
    parameters={
        "model1": "yolov8n-640",
        "model2": "yolov11n-640"
    }
)

print(result)
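Printing the raw result can be noisy. The sketch below lists which outputs came back for each image. It assumes `run_workflow` returns a list with one dict per input image; `summarize_outputs` and the sample data are hypothetical and do not reflect the real outputs of the `model-comparison` Workflow.

```python
from typing import Any, Dict, List


def summarize_outputs(result: List[Dict[str, Any]]) -> List[List[str]]:
    """Return the sorted output names found in each per-image result."""
    return [sorted(per_image.keys()) for per_image in result]


# Made-up stand-in for a Workflow response (one dict per input image).
sample_result = [
    {"output_b": {"predictions": []}, "output_a": {"predictions": []}},
]

for index, names in enumerate(summarize_outputs(sample_result)):
    print(f"image {index}: {', '.join(names)}")
```

With a live server, pass the `result` from the example above instead of `sample_result`.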

In other languages, use the server's REST API; you can access the API docs for your server at /docs (OpenAPI format) or /redoc (ReDoc format).

Check out the inference_sdk docs to see what else you can do with your new server.
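A simple way to confirm the server is up before sending requests is to fetch the docs page mentioned above. This is a minimal sketch using only the standard library; `docs_reachable` is a hypothetical helper, and it assumes the server answers `/docs` with HTTP 200 once it is ready.

```python
import urllib.error
import urllib.request


def docs_reachable(
    base_url: str = "http://localhost:9001",
    path: str = "/docs",
    timeout: float = 2.0,
) -> bool:
    """Return True if GET base_url + path answers with HTTP 200."""
    url = base_url.rstrip("/") + path
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status == 200
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, or an HTTP error status.
        return False


if __name__ == "__main__":
    if docs_reachable():
        print("Server is responding; API docs are at /docs and /redoc.")
    else:
        print("No response from http://localhost:9001.")
```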

🎥 connect to video streams

The inference server is a video processing beast. You can set it up to run Workflows on RTSP streams, webcam devices, and more. It will handle hardware acceleration, multiprocessing, video decoding and GPU batching to get the most out of your hardware.

This example Workflow watches a stream for frames that CLIP thinks match a text prompt.

from inference_sdk import InferenceHTTPClient
import atexit
import time

max_fps = 4

client = InferenceHTTPClient(
    api_url="http://localhost:9001", # use local inference server
    # api_key="<YOUR API KEY>" # optional to access your private data and models
)

# Start a pipeline on an RTSP stream
result = client.start_inference_pipeline_with_workflow(
    video_reference=["rtsp://user:password@192.168.0.100:554/"],
    workspace_name="roboflow-docs",
    workflow_id="clip-frames",
    max_fps=max_fps,
    workflows_parameters={
        "prompt": "blurry", # change to look for something else
        "threshold": 0.16
    }
)

pipeline_id = result["context"]["pipeline_id"]

# Terminate the pipeline when the script exits
atexit.register(lambda: client.terminate_inference_pipeline(pipeline_id))

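while True:
    result = client.consume_inference_pipeline_result(pipeline_id=pipeline_id)

    if not result["outputs"] or not result["outputs"][0]:
        # still initializing; wait before polling again instead of busy-looping
        time.sleep(1 / max_fps)
        continue

    # Each entry in "outputs" corresponds to one video source; we passed one
    output = result["outputs"][0]
    is_match = output.get("is_match")
    similarity = round(output.get("similarity") * 100, 1)
    print(f"Matches prompt? {is_match} (similarity: {similarity}%)")

    # Poll at roughly the same rate the pipeline produces results
    time.sleep(1 / max_fps)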

Pipeline outputs can be consumed via the API for downstream processing, or the Workflow can be configured to call external services with Notification blocks (like Email or Twilio) or the Webhook block. For more info on video pipeline management, see the Video Processing overview.
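For downstream processing, it can be convenient to separate result parsing from the polling loop. The sketch below reuses the `outputs`, `is_match`, and `similarity` keys from the example above; `describe_match` is a hypothetical helper, and treating a missing `similarity` as "unavailable" is a choice made for this sketch.

```python
from typing import Any, Dict, Optional


def describe_match(result: Dict[str, Any]) -> Optional[str]:
    """Format one consumed pipeline result, or return None if nothing is ready."""
    outputs = result.get("outputs") or []
    if not outputs or not outputs[0]:
        return None  # pipeline is still initializing

    output = outputs[0]
    is_match = output.get("is_match")
    similarity = output.get("similarity")
    if similarity is None:
        return f"Matches prompt? {is_match} (similarity unavailable)"
    return f"Matches prompt? {is_match} (similarity: {round(similarity * 100, 1)}%)"


# Made-up sample shaped like the result in the example above.
print(describe_match({"outputs": [{"is_match": True, "similarity": 0.25}]}))
# Matches prompt? True (similarity: 25.0%)
```

Keeping this logic in a pure function makes it easy to test without a running server or camera.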

If you have a Roboflow account and have linked an API key, you can also remotely monitor and manage your running streams via the Roboflow UI.

🔑 connect to the cloud

Without an API Key, you can access a wide range of pre-trained and foundation models and run public Workflows.

Pass an optional Roboflow API Key to the inference_sdk or API to access additional features enhanced by Roboflow's Cloud platform. When running with an API Key, usage is metered according to Roboflow's pricing tiers.

Feature	Open Access	With API Key (Metered)
Pre-Trained Models	✅	✅
Foundation Models	✅	✅
Video Stream Management	✅	✅
Dynamic Python Blocks	✅	✅
Public Workflows	✅	✅
Private Workflows		✅
Fine-Tuned Models		✅
Universe Models		✅
Active Learning		✅
Serverless Hosted API		✅
Dedicated Deployments		✅
Commercial Model Licensing		Paid
Device Management		Enterprise
Model Monitoring		Enterprise
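One way to keep the API key optional, and out of source code, is to read it from the environment and pass it only when it is set. This is a sketch under assumptions: `build_client_kwargs` is a hypothetical helper, and the environment variable name `ROBOFLOW_API_KEY` is chosen for this example. The `api_url` and `api_key` arguments are the ones shown in the SDK examples above.

```python
import os
from typing import Dict, Mapping, Optional


def build_client_kwargs(
    env: Optional[Mapping[str, str]] = None,
    api_url: str = "http://localhost:9001",
) -> Dict[str, str]:
    """Build keyword arguments for InferenceHTTPClient, adding api_key only if set."""
    env = os.environ if env is None else env
    kwargs = {"api_url": api_url}
    api_key = env.get("ROBOFLOW_API_KEY")  # variable name is an assumption
    if api_key:
        kwargs["api_key"] = api_key
    return kwargs


if __name__ == "__main__":
    try:
        from inference_sdk import InferenceHTTPClient
    except ImportError:
        print("inference-sdk is not installed; run: pip install inference-sdk")
    else:
        client = InferenceHTTPClient(**build_client_kwargs())
        print("Client created.")
```

Without the variable set, the client runs in open-access mode; with it set, the features in the right-hand column become available and usage is metered.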

🌩️ hosted compute

If you don't want to manage your own infrastructure for self-hosting, Roboflow offers two hosted options: an Inference Server via one-click Dedicated Deployments (CPU and GPU machines), billed hourly, or simple models and Workflows via our serverless Hosted API, billed per API call.

We offer a generous free tier to get started.

🖥️ run on-prem or self-hosted

Inference is designed to run on a wide range of hardware from beefy cloud servers to tiny edge devices. This lets you easily develop against your local machine or our cloud infrastructure and then seamlessly switch to another device for production deployment.

The inference server start command attempts to choose the best-performing container for your machine automatically (including GPU acceleration via NVIDIA CUDA when available). Special installation notes and performance tips by device are listed below:

⭐️ New: Enterprise Hardware

For manufacturing and logistics use cases, Roboflow now offers the NVIDIA Jetson-based Flowbox, a ruggedized CV center pre-configured with Inference and optimized for running in secure networks. It has integrated support for machine vision cameras like Basler and Lucid over GigE, supports interfacing with PLCs and HMIs via OPC or MQTT, enables enterprise device management through a DMZ, and comes with the support of our team of computer vision experts to ensure your project is a success.

📚 documentation

Visit our documentation to explore comprehensive guides, detailed API references, and a wide array of tutorials designed to help you harness the full potential of the Inference package.

© license

The core of Inference is licensed under Apache 2.0.

Models are subject to licensing which respects the underlying architecture. These licenses are listed in inference/models. Paid Roboflow accounts include a commercial license for some models (see roboflow.com/licensing for details).

Cloud-connected functionality (like our model and Workflows registries, dataset management, model monitoring, device management, and managed infrastructure) requires a Roboflow account and API key and is metered based on usage.

Enterprise functionality is source-available in inference/enterprise under an enterprise license, and usage in production requires an active Enterprise contract in good standing.

See the "Self Hosting and Edge Deployment" section of the Roboflow Licensing documentation for more information on how Roboflow Inference is licensed.

🏆 contribution

We would love your input to improve Roboflow Inference! Please see our contributing guide to get started. Thank you to all of our contributors! 🙏

Release history

This version

0.51.1

Jun 27, 2025

0.51.0

Jun 20, 2025

0.50.5

Jun 16, 2025

0.50.4

Jun 6, 2025

0.50.4rc2 pre-release

Jun 2, 2025

0.50.4rc1 pre-release

Jun 2, 2025

0.50.3

May 30, 2025

0.50.2

May 29, 2025

0.50.1

May 23, 2025

0.50.0

May 23, 2025

0.49.5

May 16, 2025

0.49.3

May 16, 2025

0.49.2

May 14, 2025

0.49.1

May 9, 2025

0.48.3

May 5, 2025

0.48.1

May 2, 2025

0.48.0

Apr 25, 2025

0.47.0

Apr 18, 2025

0.46.5

Apr 14, 2025

0.46.4

Apr 9, 2025

0.46.3

Apr 9, 2025

0.46.1

Apr 7, 2025

0.46.0

Apr 4, 2025

0.46.0rc2 pre-release

Mar 31, 2025

0.46.0rc1 pre-release

Mar 31, 2025

0.45.3

Apr 2, 2025

0.45.2

Apr 2, 2025

0.45.1

Apr 2, 2025

0.45.0

Mar 28, 2025

0.44.1

Mar 26, 2025

0.44.0

Mar 25, 2025

0.43.0

Mar 21, 2025

0.43.0rc1 pre-release

Mar 21, 2025

0.42.1

Mar 17, 2025

0.41.0

Mar 7, 2025

0.40.0

Feb 26, 2025

0.40.0rc2 pre-release

Feb 24, 2025

0.40.0rc1 pre-release

Feb 21, 2025

0.39.0

Feb 21, 2025

0.38.0

Feb 18, 2025

0.38.0rc1 pre-release

Feb 10, 2025

0.37.1

Feb 11, 2025

0.37.0rc1 pre-release

Feb 7, 2025

0.36.1

Jan 31, 2025

0.36.0

Jan 31, 2025

0.36.0rc1 pre-release

Jan 30, 2025

0.35.0

Jan 24, 2025

0.35.0rc1 pre-release

Jan 20, 2025

0.34.0

Jan 17, 2025

0.34.0rc1 pre-release

Jan 14, 2025

0.33.0

Jan 10, 2025

0.32.0

Dec 20, 2024

0.31.2rc1 pre-release

Dec 19, 2024

0.31.1

Dec 13, 2024

0.31.0

Dec 13, 2024

0.30.0

Dec 11, 2024

0.29.2

Dec 5, 2024

0.29.1

Dec 3, 2024

0.29.0

Nov 29, 2024

0.28.2

Nov 27, 2024

0.28.1

Nov 25, 2024

0.28.0 yanked

Nov 22, 2024

Reason this release was yanked:

Bug in release causing workflows loading to break

0.27.0

Nov 15, 2024

0.26.1

Nov 13, 2024

0.26.0

Nov 8, 2024

0.25.0

Nov 1, 2024

0.24.0

Oct 18, 2024

0.23.0

Oct 11, 2024

0.22.2

Oct 4, 2024

0.22.1

Oct 4, 2024

0.22.0

Oct 4, 2024

0.21.1

Sep 30, 2024

0.21.0

Sep 27, 2024

0.20.1

Sep 24, 2024

0.20.0

Sep 23, 2024

0.19.0

Sep 18, 2024

0.18.1

Sep 6, 2024

0.18.0

Sep 6, 2024

0.17.1

Sep 3, 2024

0.17.0

Aug 30, 2024

0.16.3

Aug 22, 2024

0.16.2

Aug 16, 2024

0.16.0

Aug 9, 2024

0.15.2

Aug 3, 2024

0.15.1

Jul 20, 2024

0.15.0

Jul 18, 2024

0.14.1

Jul 15, 2024

0.14.0

Jul 12, 2024

0.13.0

Jun 26, 2024

0.12.1

Jun 17, 2024

0.12.0

Jun 7, 2024

0.11.2

May 25, 2024

0.11.1

May 23, 2024

0.11.0

May 20, 2024

0.10.0

May 14, 2024

0.9.23

Apr 30, 2024

0.9.22

Apr 18, 2024

0.9.20

Mar 27, 2024

0.9.18

Mar 25, 2024

0.9.17 yanked

Mar 15, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.16 yanked

Mar 11, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.15 yanked

Feb 28, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.15rc1 pre-release yanked

Feb 27, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.14 yanked

Feb 23, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.13 yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.12 yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.12rc3 pre-release yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.12rc1 pre-release yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.11 yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.11rc2 pre-release yanked

Feb 15, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.11rc1 pre-release yanked

Feb 15, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.10 yanked

Feb 13, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.10rc3 pre-release yanked

Feb 12, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.9 yanked

Feb 7, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.9rc23 pre-release yanked

Feb 7, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.8 yanked

Dec 29, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.7 yanked

Dec 20, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.7rc2 pre-release yanked

Dec 18, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.6 yanked

Dec 13, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.5 yanked

Dec 5, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.4 yanked

Oct 27, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.3 yanked

Oct 24, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.2 yanked

Oct 13, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.1 yanked

Oct 9, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.0 yanked

Oct 6, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.9 yanked

Oct 3, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.8 yanked

Sep 27, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.5 yanked

Sep 19, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.4 yanked

Sep 15, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.2 yanked

Sep 7, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.1 yanked

Sep 5, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.0 yanked

Aug 31, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.7.6 yanked

Aug 18, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.7.2 yanked

Aug 15, 2023

Reason this release was yanked:

Wrong Python version denoted.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release. See tutorial on generating distribution archives.

Built Distribution

inference-0.51.1-py3-none-any.whl (2.4 MB)

Uploaded Jun 27, 2025 Python 3

File details

Details for the file inference-0.51.1-py3-none-any.whl.

File metadata

Download URL: inference-0.51.1-py3-none-any.whl
Upload date: Jun 27, 2025
Size: 2.4 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for inference-0.51.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`41e15edcb864145d421ae8a7a844e196fbe408534b6c305b683be4d968d3bf53`
MD5	`4658a1c4b2cebcd4ece0f87e63ec7775`
BLAKE2b-256	`585657bd41c2f7ab61e0025240fd2579f93904a769d6ed3dc81d9f11d8837a13`

See more details on using hashes here.
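To check a downloaded wheel against the SHA256 digest listed above, you can compute the file's hash locally and compare. This is a minimal sketch using Python's standard `hashlib`; `sha256_of` and `matches_digest` are hypothetical helper names, and the file path assumes the wheel was saved to the current directory.

```python
import hashlib
from pathlib import Path
from typing import Union


def sha256_of(path: Union[str, Path], chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path: Union[str, Path], expected_hex: str) -> bool:
    """Compare a file's SHA256 digest to an expected hex string, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    wheel = Path("inference-0.51.1-py3-none-any.whl")  # assumed download location
    expected = "41e15edcb864145d421ae8a7a844e196fbe408534b6c305b683be4d968d3bf53"
    if wheel.exists():
        print("digest OK" if matches_digest(wheel, expected) else "digest MISMATCH")
    else:
        print(f"{wheel} not found; download it first.")
```

Reading in chunks keeps memory use flat regardless of file size.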

Provenance

The following attestation bundles were made for inference-0.51.1-py3-none-any.whl:

Publisher: publish.pypi.yml on roboflow/inference

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: inference-0.51.1-py3-none-any.whl
- Subject digest: 41e15edcb864145d421ae8a7a844e196fbe408534b6c305b683be4d968d3bf53
- Sigstore transparency entry: 253633586
- Sigstore integration time: Jun 27, 2025
Source repository:
- Permalink: roboflow/inference@1eb0dfe0d6562bae2898b6476dcd84e89b689dcb
- Branch / Tag: refs/tags/v0.51.1
- Owner: https://github.com/roboflow
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: self-hosted
- Publication workflow: publish.pypi.yml@1eb0dfe0d6562bae2898b6476dcd84e89b689dcb
- Trigger Event: release
