GPUStack Runner is library for registering runnable accelerated backends and services in GPUStack.

These details have not been verified by PyPI

Project links

Project description

GPUStack Runner

This repository serves as the Docker image pack center for GPUStack Runner. It provides a collection of Dockerfiles to build images for various inference services across different accelerated backends.

Onboard Services

[!TIP]

The list below shows the accelerated backends and inference services available in the latest release. For support of backends or services not shown here, please refer to previous release tags.

Deprecated inference service versions in the latest release are marked with ~~strikethrough~~ formatting. They may still be available in previous releases, and not recommended for new deployments.

Polished inference service versions in the latest release are marked with bold formatting. If they are using in your deployment, it is recommended to pull the latest images and upgrade.

The following table lists the supported accelerated backends and their corresponding inference services with versions.

Ascend CANN

CANN Version (Variant)	MindIE	vLLM	SGLang
8.5 (A3/910C)	`2.3.0`	`0.18.0`, `0.17.0`(rc), `0.16.0`(rc), `0.15.0`(rc), `0.14.1`(rc), `0.13.0`	`0.5.9`, `0.5.8.post1`
8.5 (910B)	`2.3.0`	`0.18.0`, `0.17.0`(rc), `0.16.0`(rc), `0.15.0`(rc), `0.14.1`(rc), `0.13.0`	`0.5.9`, `0.5.8.post1`
8.5 (310P)	`2.3.0`	`0.18.0`, `0.17.0`(rc), `0.16.0`(rc), `0.15.0`(rc), `0.14.1`(rc)
8.3 (A3/910C)	`2.2.rc1`	`0.12.0`(rc), `0.11.0`	`0.5.7`, `0.5.6.post2`
8.3 (910B)	`2.2.rc1`	`0.12.0`(rc), `0.11.0`	`0.5.7`, `0.5.6.post2`
8.3 (310P)	`2.2.rc1`
8.2 (A3/910C)	`2.1.rc2`	`0.10.2`(rc)
8.2 (910B)	`2.1.rc2`	`0.10.2`(rc), `0.10.0`(rc), `0.9.1`
8.2 (310P)	`2.1.rc2`	`0.10.0`(rc), `0.9.1`

Iluvatar CoreX

CoreX Version (Variant)	vLLM
4.2	`0.8.3`

NVIDIA CUDA

[!NOTE]

CUDA 13.0 supports Compute Capabilities: 7.5 8.0+PTX 8.9 9.0 10.0 10.3 12.0+PTX.

CUDA 12.9 supports Compute Capabilities: 7.5 8.0+PTX 8.9 9.0 10.0 10.3 12.0 12.1+PTX.

CUDA 12.8 supports Compute Capabilities: 7.5 8.0+PTX 8.9 9.0 10.0+PTX 12.0+PTX.

CUDA 12.6/12.4 supports Compute Capabilities: 7.5 8.0+PTX 8.9 9.0+PTX.

CUDA Version (Variant)	vLLM	SGLang	VoxBox
13.0	`0.20.1`, `0.19.1`, `0.18.1`
12.9	`0.20.1`, `0.19.1`, `0.18.1`, `0.17.1`, `0.16.0`, `0.15.1`, `0.14.1`, `0.13.0`, `0.12.0`, `0.11.2`	`0.5.9`, `0.5.8.post1`, `0.5.7`, `0.5.6.post2`
12.8	`0.17.1`, `0.16.0`, `0.15.1`, `0.14.1`, `0.13.0`, `0.12.0`, `0.11.2`, `0.10.2`	`0.5.9`, `0.5.8.post1`, `0.5.7`, `0.5.6.post2`, `0.5.5.post3`	`0.0.21`
12.6	`0.15.1`, `0.14.1`, `0.13.0`, `0.12.0`, `0.11.2`, `0.10.2`		`0.0.21`

Hygon DTK

DTK Version (Variant)	vLLM
25.04	`0.11.0`, `0.9.2`, `0.8.5`

T-Head HGGC

HGGC Version (Variant)	vLLM	SGLang
12.3	`0.12.0`, `0.11.1`	`0.5.6`, `0.5.5`

MetaX MACA

MACA Version (Variant)	vLLM	SGLang
3.3	`0.11.2`	`0.5.6`
3.2	`0.10.2`
3.0	`0.9.1`

MThreads MUSA

MUSA Version (Variant)	vLLM	SGLang
4.3.2		`0.5.7`
4.1.0	`0.9.2`

AMD ROCm

[!NOTE]

ROCm 7.1/7.0 supports LLVM targets: gfx908 gfx90a gfx942 gfx950 gfx1030 gfx1100 gfx1101 gfx1200 gfx1201 gfx1150 gfx1151.

ROCm 6.4 supports LLVM targets: gfx908 gfx90a gfx942 gfx1030 gfx1100.

[!WARNING]

ROCm 7.0 vLLM 0.11.2 are reusing the official ROCm 6.4 PyTorch 2.9 wheel package rather than a ROCm 7.0 specific PyTorch build. Although supports ROCm 7.0 in vLLM 0.11.2, gfx1150/gfx1151 are not supported yet.

ROCm 6.4 vLLM 0.13.0 supports gfx903 gfx90a gfx942 only.

ROCm 6.4 SGLang supports gfx942 only.

ROCm 7.0 SGLang supports gfx950 only.

ROCm Version (Variant)	vLLM	SGLang
7.2	`0.20.1`, `0.19.1`
7.1	`0.17.1`
7.0	`0.18.1`, `0.16.0`, `0.15.1`, `0.14.1`, `0.13.0`, `0.12.0`, `0.11.2`	`0.5.9`, `0.5.8.post1`, `0.5.7`, `0.5.6.post2`
6.4	`0.16.0`, `0.15.1`, `0.14.1`, `0.13.0`, `0.12.0`, `0.11.2`, `0.10.2`	`0.5.8.post1`, `0.5.7`, `0.5.6.post2`, `0.5.5.post3`

Directory Structure

The pack skeleton is organized by backend:

pack
├── {BACKEND 1}
│   └── Dockerfile
├── {BACKEND 2}
│   └── Dockerfile
├── {BACKEND 3}
│   └── Dockerfile
├── ...
│   └── Dockerfile
└── {BACKEND N}
    └── Dockerfile

Dockerfile Convention

Each Dockerfile follows these conventions:

Begin with comments describing the package logic in steps and usage of build arguments (ARGs).
Use ARG for all required and optional build arguments. If a required argument is unused, mark it as (PLACEHOLDER).
Use heredoc syntax for RUN commands to improve readability.

Example Dockerfile Structure

# Describe package logic and ARG usage.
#
ARG PYTHON_VERSION=...                                 # REQUIRED
ARG CMAKE_MAX_JOBS=...                                 # REQUIRED
ARG {OTHERS}                                           # OPTIONAL
ARG {BACKEND}_VERSION=...                              # REQUIRED
ARG {BACKEND}_VERSION_EXTRA=...                        # OPTIONAL
ARG {BACKEND}_ARCHS=...                                # REQUIRED
ARG {BACKEND}_{OTHERS}=...                             # OPTIONAL
ARG {SERVICE}_BASE_IMAGE=...                           # REQUIRED
ARG {SERVICE}_VERSION=...                              # REQUIRED
ARG {SERVICE}_{OTHERS}=...                             # OPTIONAL
ARG {SERVICE}_{FRAMEWORK}_VERSION=...                  # REQUIRED
ARG {SERVICE}_{FRAMEWORK}_{OTHERS}=...                 # OPTIONAL

# Stage Bake Runtime
FROM {BACKEND DEVEL IMAGE} AS runtime
SHELL ["/bin/bash", "-eo", "pipefail", "-c"]
ARG TARGETPLATFORM
ARG TARGETOS
ARG TARGETARCH
ARG ...
RUN <<EOF
    # TODO: install runtime dependencies
EOF

# Stage Install Service
FROM {BACKEND}_BASE_IMAGE AS {service}
SHELL ["/bin/bash", "-eo", "pipefail", "-c"]
ARG TARGETPLATFORM
ARG TARGETOS
ARG TARGETARCH
ARG ...
RUN <<EOF
    # TODO: install service and dependencies
EOF

WORKDIR /
ENTRYPOINT [ "tini", "--" ]

Docker Image Naming Convention

The Docker image naming convention is as follows:

Multi-architecture image names: {NAMESPACE}/{REPOSITORY}:{TAG}.
Single-architecture image tags: {BACKEND}{BACKEND_VERSION%.*}[-{BACKEND_VARIANT}]-{SERVICE}{SERVICE_VERSION}-{OS}-{ARCH}.
Multi-architecture image tags: {BACKEND}{BACKEND_VERSION%.*}[-{BACKEND_VARIANT}]-{SERVICE}{SERVICE_VERSION}[-dev].
All names adn tags must be lowercase.

Example

NAMESPACE: gpustack
REPOSITORY: runner

Accelerated Backend	OS/ARCH	Inference Service	Single-Arch Image Name	Multi-Arch Image Name
Ascend CANN 910b	linux/amd64	vLLM	`gpustack/runner:cann8.1-910b-vllm0.9.2-linux-amd64`	`gpustack/runner:cann8.1-910b-vllm0.9.2`
Ascend CANN 910b	linux/arm64	vLLM	`gpustack/runner:cann8.1-910b-vllm0.9.2-linux-arm64`	`gpustack/runner:cann8.1-910b-vllm0.9.2`
NVIDIA CUDA 12.8	linux/amd64	vLLM	`gpustack/runner:cuda12.8-910b-vllm0.9.2-linux-amd64`	`gpustack/runner:cuda12.8-910b-vllm0.9.2`
NVIDIA CUDA 12.8	linux/arm64	vLLM	`gpustack/runner:cuda12.8-910b-vllm0.9.2-linux-arm64`	`gpustack/runner:cuda12.8-910b-vllm0.9.2`

Build and Release Workflow

Build single architecture images for OS/ARCH, e.g. gpustack/runner:cann8.1-910b-vllm0.9.2-linux-amd64.
Combine single-architecture images into a multiple architectures image, e.g. gpustack/runner:cann8.1-910b-vllm0.9.2-dev.
After testing, rename the multi-architecture image to the final tag, e.g. gpustack/runner:cann8.1-910b-vllm0.9.2.

Integration Process

Ingesting a New Accelerated Backend

To add support for a new accelerated backend:

Create a new directory under pack/ named with the new backend.
Add a Dockerfile in the new directory following the Dockerfile Convention.
Update pack.yml, discard.yml and prune.yml to include the new backend in the build matrix.
Update matrix.yml to include the new backend and its variants.
Update _RE_DOCKER_IMAGE in runner.py to recognize the new backend.
[Optional] Update tests if necessary.

Ingesting a New Inference Service

To add support for a new inference service:

Modify the Dockerfile of the relevant backend in pack/{BACKEND}/Dockerfile to include the new service.
Update pack.yml to include the new service in the build matrix.
Update matrix.yml to include the new service.
Update _RE_DOCKER_IMAGE in runner.py to recognize the new service.
[Optional] Update tests if necessary.

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at LICENSE file for details.

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.26.post1

May 12, 2026

0.1.26

May 8, 2026

0.1.25.post7

Mar 13, 2026

0.1.25.post6

Feb 28, 2026

0.1.25.post5

Feb 26, 2026

0.1.25.post4

Feb 13, 2026

0.1.25.post3

Feb 10, 2026

0.1.25.post2

Feb 6, 2026

0.1.25.post1

Feb 5, 2026

0.1.25

Feb 4, 2026

0.1.24.post4

Jan 31, 2026

0.1.24.post3

Jan 30, 2026

0.1.24.post2

Jan 27, 2026

0.1.24.post1

Jan 14, 2026

0.1.24

Jan 14, 2026

0.1.23.post5

Jan 8, 2026

0.1.23.post4

Jan 4, 2026

0.1.23.post3

Jan 4, 2026

0.1.23.post2

Jan 2, 2026

0.1.23.post1

Jan 1, 2026

0.1.23

Jan 1, 2026

0.1.22.post6

Jan 6, 2026

0.1.22.post5

Dec 24, 2025

0.1.22.post4

Dec 23, 2025

0.1.22.post3

Dec 23, 2025

0.1.22.post2

Dec 19, 2025

0.1.22.post1

Dec 15, 2025

0.1.22

Dec 12, 2025

0.1.21.post1

Nov 27, 2025

0.1.21

Nov 24, 2025

0.1.20.post1

Nov 20, 2025

0.1.20

Nov 18, 2025

0.1.19.post1

Nov 14, 2025

0.1.19

Nov 12, 2025

0.1.18

Nov 11, 2025

0.1.17

Nov 9, 2025

0.1.16

Nov 8, 2025

0.1.15.post1

Nov 8, 2025

0.1.15

Nov 8, 2025

0.1.14

Nov 7, 2025

0.1.13

Nov 3, 2025

0.1.12

Oct 28, 2025

0.1.11

Oct 23, 2025

0.1.10

Oct 12, 2025

0.1.9

Oct 11, 2025

0.1.8

Oct 11, 2025

0.1.7

Oct 11, 2025

0.1.6

Oct 10, 2025

0.1.5

Sep 18, 2025

0.1.4

Sep 8, 2025

0.1.3

Sep 8, 2025

0.1.2

Sep 8, 2025

0.1.1

Sep 7, 2025

0.1.0

Sep 1, 2025

0.1.0rc1 pre-release

Aug 28, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpustack_runner-0.1.26.post1.tar.gz (13.4 MB view details)

Uploaded May 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

gpustack_runner-0.1.26.post1-py3-none-any.whl (29.2 kB view details)

Uploaded May 12, 2026 Python 3

File details

Details for the file gpustack_runner-0.1.26.post1.tar.gz.

File metadata

Download URL: gpustack_runner-0.1.26.post1.tar.gz
Upload date: May 12, 2026
Size: 13.4 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.24

File hashes

Hashes for gpustack_runner-0.1.26.post1.tar.gz
Algorithm	Hash digest
SHA256	`32471e283c625fb31ae3257b9c9361e30c27cbf28c887a30f0886370546226ac`
MD5	`88bf8c9a94a5a166dc87d74fe2c64dcb`
BLAKE2b-256	`efad4a61afd6ba04ae78d03481f1b156b6aa412a94b99c884c565e799439e3d4`

See more details on using hashes here.

File details

Details for the file gpustack_runner-0.1.26.post1-py3-none-any.whl.

File metadata

Download URL: gpustack_runner-0.1.26.post1-py3-none-any.whl
Upload date: May 12, 2026
Size: 29.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.8.24

File hashes

Hashes for gpustack_runner-0.1.26.post1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`80828afb7efae4f62278f621995c50f5e60b541c97dfb382a498f9407218721d`
MD5	`c311d9841fefea4b1c2a4b826a0c4789`
BLAKE2b-256	`587ed4ecf3535e4c56d5907b11243330aad5c04b1ec8b9b079ae4fafe2f1ca2a`

See more details on using hashes here.

gpustack-runner 0.1.26.post1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

GPUStack Runner

Agenda

Onboard Services

Ascend CANN

Iluvatar CoreX

NVIDIA CUDA

Hygon DTK

T-Head HGGC

MetaX MACA

MThreads MUSA

AMD ROCm

Directory Structure

Dockerfile Convention

Example Dockerfile Structure

Docker Image Naming Convention

Example

Build and Release Workflow

Integration Process

Ingesting a New Accelerated Backend

Ingesting a New Inference Service

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes