inference-gpu

With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments using Roboflow Inference.

These details have been verified by PyPI

Maintainers

capjamesg ppeczek-roboflow robolake sachin-roboflow yeldarb

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3

Reason this release was yanked:

Wrong Python version denoted.

Project description

Roboflow Inference banner

👋 hello

Roboflow Inference is an opinionated tool for running inference on state-of-the-art computer vision models. With no prior knowledge of machine learning or device-specific deployment, you can deploy a computer vision model to a range of devices and environments. Inference supports object detection, classification, and instance segmentation models, and running foundation models (CLIP and SAM).

🎥 Inference in action

Check out Inference running on a video of a football game:

https://github.com/roboflow/inference/assets/37276661/121ab5f4-5970-4e78-8052-4b40f2eec173

👩‍🏫 Examples

The /examples directory contains example code for working with and extending inference, including HTTP and UDP client code and an insights dashboard, along with community examples (PRs welcome)!

💻 Why Inference?

Inference provides a scalable method through which you can manage inferences for your vision projects.

Inference is backed by:

A server, so you don’t have to reimplement things like image processing and prediction visualization on every project.
Standardized APIs for computer vision tasks, so switching out the model weights and architecture can be done independently of your application code.
Model architecture implementations, which implement the tensor parsing glue between images and predictions for supervised models that you've fine-tuned to perform custom tasks.
A model registry, so your code can be independent from your model weights & you don't have to re-build and re-deploy every time you want to iterate on your model weights.
Data management integrations, so you can collect more images of edge cases to improve your dataset & model the more it sees in the wild.

And more!

📌 Install pip vs Docker:

pip: Installs inference into your Python environment. Lightweight, good for Python-centric projects.
Docker: Packages inference with its environment. Ensures consistency across setups; ideal for scalable deployments.

💻 install

With ONNX CPU Runtime:

For CPU powered inference:

pip install inference

pip install inference-cpu

With ONNX GPU Runtime:

If you have an NVIDIA GPU, you can accelerate your inference with:

pip install inference-gpu

Without ONNX Runtime:

Roboflow Inference uses Onnxruntime as its core inference engine. Onnxruntime provides an array of different execution providers that can optimize inference on differnt target devices. If you decide to install onnxruntime on your own, install inference with:

pip install inference-core

Alternatively, you can take advantage of some advanced execution providers using one of our published docker images.

Extras:

Some functionality requires extra dependancies. These can be installed by specifying the desired extras during installation of Roboflow Inference.

extra	description
`clip`	Ability to use the core `CLIP` model (by OpenAI)
`gaze`	Ability to use the core `Gaze` model
`http`	Ability to run the http interface
`sam`	Ability to run the core `Segment Anything` model (by Meta AI)

Note: Both CLIP and Segment Anything require pytorch to run. These are included in their respective dependancies however pytorch installs can be highly environment dependant. See the official pytorch install page for instructions specific to your enviornment.

Example install with http dependancies:

pip install inference[http]

🐋 docker

You can learn more about Roboflow Inference Docker Image build, pull and run in our documentation.

Run on x86 CPU:

docker run --net=host roboflow/roboflow-inference-server-cpu:latest

Run on NVIDIA GPU:

docker run --network=host --gpus=all roboflow/roboflow-inference-server-gpu:latest

👉 more docker run options

Run on arm64 CPU:

docker run -p 9001:9001 roboflow/roboflow-inference-server-arm-cpu:latest

Run on NVIDIA GPU with TensorRT Runtime:

docker run --network=host --gpus=all roboflow/roboflow-inference-server-trt:latest

Run on NVIDIA Jetson with JetPack 4.x:

docker run --privileged --net=host --runtime=nvidia roboflow/roboflow-inference-server-trt-jetson:latest

Run on NVIDIA Jetson with JetPack 5.x:

docker run --privileged --net=host --runtime=nvidia roboflow/roboflow-inference-server-trt-jetson-5.1.1:latest

🔥 quickstart

Docker Quickstart:

import requests

dataset_id = "soccer-players-5fuqs"
version_id = "1"
image_url = "https://source.roboflow.com/pwYAXv9BTpqLyFfgQoPZ/u48G0UpWfk8giSw7wrU8/original.jpg"
#Replace ROBOFLOW_API_KEY with your Roboflow API Key
api_key = "ROBOFLOW_API_KEY"
confidence = 0.5

url = f"http://localhost:9001/{dataset_id}/{version_id}"

params = {
    "api_key": api_key,
    "confidence": confidence,
    "image": image_url,
}

res = requests.post(url, params=params)
print(res.json())

pip Quickstart:

After installing via pip, you can run a simple inference using:

from inference.models.utils import get_roboflow_model

model = get_roboflow_model(
    model_id="soccer-players-5fuqs/1", 
    #Replace ROBOFLOW_API_KEY with your Roboflow API Key
    api_key="ROBOFLOW_API_KEY"
)

results = model.infer(image="https://source.roboflow.com/pwYAXv9BTpqLyFfgQoPZ/u48G0UpWfk8giSw7wrU8/original.jpg", confidence=0.5, iou_threshold=0.5)

print(results)

CLIP Quickstart:

You can run inference with OpenAI's CLIP model using:

from inference.models import Clip

model = Clip(
    #Replace ROBOFLOW_API_KEY with your Roboflow API Key
    api_key = "ROBOFLOW_API_KEY"
)

image_url = "https://source.roboflow.com/7fLqS2r1SV8mm0YzyI0c/yy6hjtPUFFkq4yAvhkvs/original.jpg"

embeddings = model.embed_image(image_url)

print(embeddings)

SAM Quickstart:

You can run inference with Meta's Segment Anything model using:

from inference.models import SegmentAnything

model = SegmentAnything(
    #Replace ROBOFLOW_API_KEY with your Roboflow API Key
    api_key = "ROBOFLOW_API_KEY"
)

image_url = "https://source.roboflow.com/7fLqS2r1SV8mm0YzyI0c/yy6hjtPUFFkq4yAvhkvs/original.jpg"

embeddings = model.embed_image(image_url)

print(embeddings)

🏗️ inference process

To standardize the inference process throughout all our models, Roboflow Inference has a structure for processing inference requests. The specifics can be found on each model's respective page, but overall it works like this for most models:

📝 license

The Roboflow Inference code is distributed under an Apache 2.0 license. The models supported by Roboflow Inference have their own licenses. View the licenses for supported models below.

model	license
`inference/models/clip`	MIT
`inference/models/gaze`	MIT, Apache 2.0
`inference/models/sam`	Apache 2.0
`inference/models/vit`	Apache 2.0
`inference/models/yolact`	MIT
`inference/models/yolov5`	AGPL-3.0
`inference/models/yolov7`	GPL-3.0
`inference/models/yolov8`	AGPL-3.0

🚀 enterprise

With a Roboflow Inference Enterprise License, you can access additional Inference features, including:

Server cluster deployment
Device management
Active learning
YOLOv5 and YOLOv8 model sub-license

To learn more, contact the Roboflow team.

📚 documentation

Visit our documentation for usage examples and reference for Roboflow Inference.

🏆 contribution

We would love your input to improve Roboflow Inference! Please see our contributing guide to get started. Thank you to all of our contributors! 🙏

💻 explore more Roboflow open source projects

Project	Description
supervision	General-purpose utilities for use in computer vision projects, from predictions filtering and display to object tracking to model evaluation.
Autodistill	Automatically label images for use in training computer vision models.
Inference (this project)	An easy-to-use, production-ready inference server for computer vision supporting deployment of many popular model architectures and fine-tuned models.
Notebooks	Tutorials for computer vision tasks, from training state-of-the-art models to tracking objects to counting objects in a zone.
Collect	Automated, intelligent data collection powered by CLIP.

Project details

These details have been verified by PyPI

Maintainers

capjamesg ppeczek-roboflow robolake sachin-roboflow yeldarb

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

1.0.0

Feb 20, 2026

1.0.0rc1 pre-release

Feb 6, 2026

0.64.8

Feb 13, 2026

0.64.7

Feb 6, 2026

0.64.6

Jan 30, 2026

0.64.5

Jan 23, 2026

0.64.4

Jan 22, 2026

0.64.3

Jan 16, 2026

0.64.2

Jan 15, 2026

0.64.1

Jan 15, 2026

0.64.0

Jan 15, 2026

0.63.5

Jan 9, 2026

0.63.4

Jan 6, 2026

0.63.3

Jan 3, 2026

0.63.2

Dec 23, 2025

0.63.1

Dec 19, 2025

0.62.5

Dec 12, 2025

0.62.4

Dec 10, 2025

0.62.2

Nov 26, 2025

0.62.2rc1 pre-release

Nov 26, 2025

0.62.1rc1 pre-release

Nov 21, 2025

0.62.0

Nov 19, 2025

0.61.0

Nov 17, 2025

0.60.1rc2 pre-release

Nov 14, 2025

0.60.1rc1 pre-release

Nov 14, 2025

0.60.0

Nov 7, 2025

0.59.1

Oct 31, 2025

0.59.0

Oct 24, 2025

0.58.3

Oct 21, 2025

0.58.2

Oct 13, 2025

0.58.1

Oct 3, 2025

0.58.0

Oct 2, 2025

0.57.4

Oct 1, 2025

0.57.3 yanked

Sep 30, 2025

Reason this release was yanked:

Bug with rfdetr max input resolution

0.57.2

Sep 29, 2025

0.56.0

Sep 19, 2025

0.55.2

Sep 17, 2025

0.55.1

Sep 17, 2025

0.55.0

Sep 13, 2025

0.54.2

Sep 5, 2025

0.54.1

Aug 29, 2025

0.54.0

Aug 29, 2025

0.53.0

Aug 26, 2025

0.52.1

Aug 15, 2025

0.52.0

Aug 11, 2025

0.51.10

Aug 1, 2025

0.51.9

Jul 28, 2025

0.51.7

Jul 23, 2025

0.51.6

Jul 22, 2025

0.51.5

Jul 18, 2025

0.51.4

Jul 16, 2025

0.51.3

Jul 11, 2025

0.51.2

Jul 4, 2025

0.51.1

Jun 27, 2025

0.51.0

Jun 20, 2025

0.50.5

Jun 16, 2025

0.50.4

Jun 6, 2025

0.50.4rc2 pre-release

Jun 2, 2025

0.50.4rc1 pre-release

Jun 2, 2025

0.50.3

May 30, 2025

0.50.2

May 29, 2025

0.50.1

May 23, 2025

0.50.0

May 23, 2025

0.49.5

May 16, 2025

0.49.3

May 16, 2025

0.49.2

May 14, 2025

0.49.1

May 9, 2025

0.48.3

May 5, 2025

0.48.1

May 2, 2025

0.48.0

Apr 25, 2025

0.47.0

Apr 18, 2025

0.46.5

Apr 14, 2025

0.46.4

Apr 9, 2025

0.46.3

Apr 9, 2025

0.46.1

Apr 7, 2025

0.46.0

Apr 4, 2025

0.46.0rc2 pre-release

Mar 31, 2025

0.46.0rc1 pre-release

Mar 31, 2025

0.45.3

Apr 2, 2025

0.45.2

Apr 2, 2025

0.45.1

Apr 2, 2025

0.45.0

Mar 28, 2025

0.44.1

Mar 26, 2025

0.44.0

Mar 25, 2025

0.43.0

Mar 21, 2025

0.43.0rc1 pre-release

Mar 21, 2025

0.42.1

Mar 17, 2025

0.41.0

Mar 7, 2025

0.40.0

Feb 26, 2025

0.40.0rc2 pre-release

Feb 24, 2025

0.40.0rc1 pre-release

Feb 21, 2025

0.39.0

Feb 21, 2025

0.38.0

Feb 18, 2025

0.38.0rc1 pre-release

Feb 10, 2025

0.37.1

Feb 11, 2025

0.37.0rc1 pre-release

Feb 7, 2025

0.36.1

Jan 31, 2025

0.36.0

Jan 31, 2025

0.36.0rc1 pre-release

Jan 30, 2025

0.35.0

Jan 24, 2025

0.35.0rc1 pre-release

Jan 20, 2025

0.34.0

Jan 17, 2025

0.34.0rc1 pre-release

Jan 14, 2025

0.33.0

Jan 10, 2025

0.32.0

Dec 20, 2024

0.31.2rc1 pre-release

Dec 19, 2024

0.31.1

Dec 13, 2024

0.31.0

Dec 13, 2024

0.30.0

Dec 11, 2024

0.29.2

Dec 5, 2024

0.29.1

Dec 3, 2024

0.29.0

Nov 29, 2024

0.28.2

Nov 27, 2024

0.28.1

Nov 25, 2024

0.28.0 yanked

Nov 22, 2024

Reason this release was yanked:

Bug in release causing workflows loading to break

0.27.0

Nov 15, 2024

0.26.1

Nov 13, 2024

0.26.0

Nov 8, 2024

0.25.0

Nov 1, 2024

0.24.0

Oct 18, 2024

0.23.0

Oct 11, 2024

0.22.2

Oct 4, 2024

0.22.1

Oct 4, 2024

0.22.0

Oct 4, 2024

0.21.1

Sep 30, 2024

0.21.0

Sep 27, 2024

0.20.1

Sep 24, 2024

0.20.0

Sep 23, 2024

0.19.0

Sep 18, 2024

0.18.1

Sep 6, 2024

0.18.0

Sep 6, 2024

0.17.1

Sep 3, 2024

0.17.0

Aug 30, 2024

0.16.3

Aug 22, 2024

0.16.2

Aug 16, 2024

0.16.0

Aug 9, 2024

0.15.2

Aug 3, 2024

0.15.1

Jul 20, 2024

0.15.0

Jul 18, 2024

0.14.1

Jul 15, 2024

0.14.0

Jul 12, 2024

0.13.0

Jun 26, 2024

0.12.1

Jun 17, 2024

0.12.0

Jun 7, 2024

0.11.2

May 25, 2024

0.11.1

May 23, 2024

0.11.0

May 20, 2024

0.10.0

May 14, 2024

0.9.23

Apr 30, 2024

0.9.22

Apr 18, 2024

0.9.20

Mar 27, 2024

0.9.18

Mar 25, 2024

0.9.17 yanked

Mar 15, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.16 yanked

Mar 11, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.15 yanked

Feb 28, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.15rc1 pre-release yanked

Feb 27, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.14 yanked

Feb 23, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.13 yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.12 yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.12rc3 pre-release yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.12rc1 pre-release yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.11 yanked

Feb 16, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.11rc2 pre-release yanked

Feb 15, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.11rc1 pre-release yanked

Feb 15, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.10 yanked

Feb 13, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.10rc3 pre-release yanked

Feb 12, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.9 yanked

Feb 7, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.9rc23 pre-release yanked

Feb 7, 2024

Reason this release was yanked:

Wrong Python version denoted.

0.9.8 yanked

Dec 29, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.7 yanked

Dec 20, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.7rc2 pre-release yanked

Dec 18, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.6 yanked

Dec 13, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.5 yanked

Dec 5, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.4 yanked

Oct 27, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.3 yanked

Oct 24, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.2 yanked

Oct 13, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.1 yanked

Oct 9, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.9.0 yanked

Oct 6, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.9 yanked

Oct 3, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.8 yanked

Sep 27, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.5 yanked

Sep 19, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.4 yanked

Sep 15, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.2 yanked

Sep 7, 2023

Reason this release was yanked:

Wrong Python version denoted.

This version

0.8.1 yanked

Sep 5, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.8.0 yanked

Aug 31, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.7.6 yanked

Aug 18, 2023

Reason this release was yanked:

Wrong Python version denoted.

0.7.2 yanked

Aug 15, 2023

Reason this release was yanked:

Wrong Python version denoted.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

inference_gpu-0.8.1-py3-none-any.whl (104.8 kB view details)

Uploaded Sep 5, 2023 Python 3

File details

Details for the file inference_gpu-0.8.1-py3-none-any.whl.

File metadata

Download URL: inference_gpu-0.8.1-py3-none-any.whl
Upload date: Sep 5, 2023
Size: 104.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/4.0.2 CPython/3.11.5

File hashes

Hashes for inference_gpu-0.8.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8abdc607a27c34b488718c9313a397c019a16742ee533235fafd13f4fc9f302d`
MD5	`e410916f4a4576816881e20fd0425774`
BLAKE2b-256	`e0685386501d1f95f704f6de716d6d88749ee1d4980ea73a37f083aac3c4042f`

See more details on using hashes here.

inference-gpu 0.8.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

👋 hello

🎥 Inference in action

👩‍🏫 Examples

💻 Why Inference?

📌 Install pip vs Docker:

💻 install

With ONNX CPU Runtime:

With ONNX GPU Runtime:

Without ONNX Runtime:

Extras:

🐋 docker

🔥 quickstart

🏗️ inference process

📝 license

🚀 enterprise

📚 documentation

🏆 contribution

💻 explore more Roboflow open source projects

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes