A flexible and efficient inference framework

These details have not been verified by PyPI

Project links

Project description

TorchPipe

TorchPipe is an alternative choice for Triton Inference Server, mainly featuring similar functionalities such as Shared-momory, Ensemble, and BLS mechanism.

For serving scenarios, TorchPipe is designed to support multi-instance deployment, pipeline parallelism, adaptive batching, GPU-accelerated operators, and reduced head-of-line (HOL) blocking.It acts as a bridge between lower-level acceleration libraries (e.g., TensorRT, OpenCV, CVCUDA) and RPC frameworks (e.g., Thrift). At its core, it is an engine that enables programmable scheduling.

News

[2026-01-23] 📦 Available on PyPI: pip install torchpipe
[2026-01-04] 🔧 We switched to tvm_ffi to provide clearer C++-Python interaction.

Usage

Below are some usage examples, for more check out the examples.

Initialize and Prepare Pipeline

from torchpipe import pipe
import torch

from torchvision.models.resnet import resnet101

# create some regular pytorch model...
model = resnet101(pretrained=True).eval().cuda()

# create example model
model_path = f"./resnet101.onnx"
x = torch.ones((1, 3, 224, 224)).cuda()
torch.onnx.export(model, x, model_path, opset_version=17,
                    input_names=['input'], output_names=['output'], 
                    dynamic_axes={'input': {0: 'batch_size'},
                                'output': {0: 'batch_size'}})

thread_safe_pipe = pipe({
    "preprocessor": {
        "backend": "S[DecodeTensor,ResizeTensor,CvtColorTensor,SyncTensor]",
        # "backend": "S[DecodeMat,ResizeMat,CvtColorMat,Mat2Tensor,SyncTensor]",
        'instance_num': 2,
        'color': 'rgb',
        'resize_h': '224',
        'resize_w': '224',
        'next': 'model',
    },
    "model": {
        "backend": "SyncTensor[TensorrtTensor]",
        "model": model_path,
        "model::cache": model_path.replace(".onnx", ".trt"),
        "max": '4',
        'batching_timeout': 4,  # ms, timeout for batching
        'instance_num': 2,
        'mean': "123.675, 116.28, 103.53",
        'std': "58.395, 57.120, 57.375",  # merged into trt
    }}
)

Execute

We can execute the returned thread_safe_pipe just like the original PyTorch model, but in a thread-safe manner.

data = {'data': open('/path/to/img.jpg', 'rb').read()}
thread_safe_pipe(data) # <-- this is thread-safe
result = data['result']

Installation

NGC Docker containers (recommended):

test on 25.05, 25.06, 24.05, 23.05

img_name=nvcr.io/nvidia/pytorch:25.05-py3

docker run --rm --gpus all -it --network host \
    -v $(pwd):/workspace/ --ipc=host --ulimit memlock=-1 --ulimit stack=67108864 \
    -w /workspace/ \
    $img_name \
    bash

pip install torchpipe
python -c "import torchpipe"

The backends it introduces will be JIT-compiled and cached.

There are one core backend group(torchpipe_core) and three optional groups (torchpipe_opencv, torchpipe_nvjpeg, and torchpipe_tensorrt) with different dependencies. For details, see here.

Dependencies such as OpenCV and TensorRT can also be provided in the following ways:

providing environment variables:
Users can specify paths via the following environment variables:
OPENCV_INCLUDE, OPENCV_LIB, TENSORRT_INCLUDE, TENSORRT_LIB.

Other installation options

How does it work?

See Basic Usage.

How to add (or override) a backend

WIP

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.24

May 28, 2026

0.2.3

May 27, 2026

0.2.2

May 20, 2026

This version

0.2.1

May 20, 2026

0.2.0

May 20, 2026

0.1.28

May 20, 2026

0.1.27

May 19, 2026

0.1.26

May 18, 2026

0.1.25 yanked

May 18, 2026

0.1.24 yanked

May 18, 2026

0.1.23

Feb 3, 2026

0.1.23a1 pre-release

Feb 3, 2026

0.1.23a0 pre-release

Feb 3, 2026

0.1.22

Jan 23, 2026

0.1.21.post0

Jan 22, 2026

0.1.21

Jan 22, 2026

0.1.19

Jan 18, 2026

0.1.18

Jan 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omniback-0.2.1.tar.gz (1.2 MB view details)

Uploaded May 20, 2026 Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

omniback-0.2.1-py3-none-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl (4.1 MB view details)

Uploaded May 20, 2026 Python 3manylinux: glibc 2.27+ x86-64manylinux: glibc 2.28+ x86-64

omniback-0.2.1-py3-none-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl (3.8 MB view details)

Uploaded May 20, 2026 Python 3manylinux: glibc 2.26+ ARM64manylinux: glibc 2.28+ ARM64

omniback-0.2.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (3.8 MB view details)

Uploaded May 20, 2026 Python 3manylinux: glibc 2.17+ x86-64

omniback-0.2.1-py3-none-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (3.6 MB view details)

Uploaded May 20, 2026 Python 3manylinux: glibc 2.17+ ARM64

File details

Details for the file omniback-0.2.1.tar.gz.

File metadata

Download URL: omniback-0.2.1.tar.gz
Upload date: May 20, 2026
Size: 1.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omniback-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`bffdcb7493e9ba2df859d3bf119b77dcc8027ed445750bbf51b996ff2d0cb1db`
MD5	`c228b7abe03c6b2694963cc547a701f7`
BLAKE2b-256	`377d41b9c74f59046e5c9c98ac8456dc774219f6f191fbc0437b68513489d77d`

See more details on using hashes here.

File details

Details for the file omniback-0.2.1-py3-none-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl.

File metadata

Download URL: omniback-0.2.1-py3-none-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Upload date: May 20, 2026
Size: 4.1 MB
Tags: Python 3, manylinux: glibc 2.27+ x86-64, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omniback-0.2.1-py3-none-manylinux_2_27_x86_64.manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`3eef8046e473f22b9cba72c76dedf1e5a404d19f94b1f237ccd9198698d8544c`
MD5	`d1e95d6e13753ee83aaf1ccc60ba8e4b`
BLAKE2b-256	`5a7726666600d04b4570b22356aa71172102867a02e8c87f9860d744c5172ad1`

See more details on using hashes here.

File details

Details for the file omniback-0.2.1-py3-none-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl.

File metadata

Download URL: omniback-0.2.1-py3-none-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl
Upload date: May 20, 2026
Size: 3.8 MB
Tags: Python 3, manylinux: glibc 2.26+ ARM64, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omniback-0.2.1-py3-none-manylinux_2_26_aarch64.manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`f8e937a84bff172dcd2c95871392c3c79d917daeb9bddafed0bd9533111185fe`
MD5	`48e91771203c5fd49ba23baefdcf63f6`
BLAKE2b-256	`ab8dea8aee6f65c9338798ed8bc97c2ce9793f025c85ef9572b24bab04cef8bf`

See more details on using hashes here.

File details

Details for the file omniback-0.2.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

Download URL: omniback-0.2.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Upload date: May 20, 2026
Size: 3.8 MB
Tags: Python 3, manylinux: glibc 2.17+ x86-64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omniback-0.2.1-py3-none-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm	Hash digest
SHA256	`f0b63b391b1b91a3b6c7bc5873613693e91560802d2d97daef5e4ee6863ba9d1`
MD5	`cff9770c2f169a7d14e58f7bf0eb59e5`
BLAKE2b-256	`ec9317983e417d5a6656bf47bcabc2ba9442afa12727941a2c948b8311c0923f`

See more details on using hashes here.

File details

Details for the file omniback-0.2.1-py3-none-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

Download URL: omniback-0.2.1-py3-none-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Upload date: May 20, 2026
Size: 3.6 MB
Tags: Python 3, manylinux: glibc 2.17+ ARM64
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omniback-0.2.1-py3-none-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm	Hash digest
SHA256	`c7229af8be3ce7e1d4a6151aebe71dc5b3d674fd342f5d97cc86efa92816ddf5`
MD5	`c783d26a88edd0f30f23e55e482abb58`
BLAKE2b-256	`fc1695dfb95f98c817a4364ff5d7186ae31c7ec3a0eefa06ae679fdbbaa46860`

See more details on using hashes here.

omniback 0.2.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TorchPipe

News

Usage

Initialize and Prepare Pipeline

Execute

Installation

How does it work?

How to add (or override) a backend

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distributions

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes

File details

File metadata

File hashes