vLLM adapter for a TGIS-compatible grpc server
vllm-tgis-adapter
vLLM adapter for a TGIS-compatible grpc server.
Install
vllm-tgis-adapter is available on PyPI:
pip install vllm-tgis-adapter
python -m vllm_tgis_adapter
HealthCheck CLI
Installing the adapter also installs a gRPC healthcheck CLI that can be used to monitor the status of the gRPC server:
$ grpc_healthcheck
health check...status: SERVING
See usage with:
grpc_healthcheck --help
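For scripted monitoring, the CLI output can be checked programmatically. The following is a minimal sketch, assuming the output format matches the sample line above ("health check...status: SERVING"). The helper names is_serving and check_server are hypothetical and not part of the adapter.

```python
import shutil
import subprocess


def is_serving(output: str) -> bool:
    """Return True if any line of the healthcheck output reports SERVING."""
    for line in output.splitlines():
        # Assumed format: "health check...status: SERVING"
        _, sep, status = line.rpartition("status:")
        if sep and status.strip() == "SERVING":
            return True
    return False


def check_server() -> bool:
    """Run grpc_healthcheck, if it is on PATH, and interpret its output."""
    exe = shutil.which("grpc_healthcheck")
    if exe is None:
        return False  # CLI not installed or not on PATH
    result = subprocess.run([exe], capture_output=True, text=True, check=False)
    # Assumption: the status line may be written to stdout or stderr.
    return is_serving(result.stdout + result.stderr)
```

Calling check_server() returns False when the CLI is missing or the server does not report SERVING, which makes it usable as a simple readiness probe.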
Build
python -m build
pip install dist/*whl
python -m vllm_tgis_adapter
Inference
Running python -m vllm_tgis_adapter starts a gRPC server on port 8033, which can be queried with grpcurl:
bash examples/inference.sh
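Before sending requests, it can help to wait until the server accepts connections. The sketch below only checks that the TCP port is open. It does not speak gRPC, and the helper name wait_for_port and the default host are assumptions for illustration.

```python
import socket
import time


def wait_for_port(host: str = "localhost", port: int = 8033,
                  timeout: float = 30.0) -> bool:
    """Poll until a TCP connection to host:port succeeds or timeout elapses."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            time.sleep(0.5)  # not listening yet; retry shortly
    return False
```

An open port does not guarantee that the model has finished loading, so the healthcheck CLI remains the more reliable readiness signal.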
Docker
Image available at quay.io/opendatahub/vllm, built from opendatahub-io/vllm's Dockerfile.ubi
docker pull quay.io/opendatahub/vllm
Inference
See examples
Contributing
Set up pre-commit for linting/style/misc fixes:
pip install pre-commit
pre-commit install
# to run on all files
pre-commit run --all-files
This project uses nox to manage test automation and uv for venv management:
pip install nox uv
nox --list # list available sessions
nox -s tests-3.10 # run tests session for a specific python version
nox -s build-3.11 # build the wheel package
nox -s lint-3.11 -- --mypy # run linting with type checks
Testing without a GPU
The standard vllm build requires an NVIDIA GPU. When one is not available, it is possible to compile vllm from source with CPU support:
git clone https://github.com/vllm-project/vllm
cd vllm
uv venv
source .venv/bin/activate
export UV_EXTRA_INDEX_URL=https://download.pytorch.org/whl/cpu \
UV_INDEX_STRATEGY=unsafe-best-match
.github/scripts/install_vllm_build_deps.py pyproject.toml
env \
VLLM_TARGET_DEVICE=cpu \
python setup.py bdist_wheel
export VLLM_VERSION_OVERRIDE=$PWD/dist/*whl
# the nox session can now be run with the custom built vllm cpu version
This makes it possible to run the tests on most hardware. Note that the uv extra index URL is required to install the CPU version of torch.
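Depending on the shell, the *whl pattern in the export above may be stored literally rather than expanded to a file name. If a concrete path is needed, a small helper can resolve it. This is a sketch only: the function name find_wheel is hypothetical, and whether the nox session needs an expanded path is an assumption.

```python
import glob
import os


def find_wheel(dist_dir: str = "dist") -> str:
    """Return the absolute path of the single wheel found in dist_dir."""
    wheels = sorted(glob.glob(os.path.join(dist_dir, "*.whl")))
    if len(wheels) != 1:
        # Refuse to guess when zero or several wheels are present.
        raise RuntimeError(f"expected exactly one wheel in {dist_dir!r}, found {len(wheels)}")
    return os.path.abspath(wheels[0])
```

The returned path can then be assigned to VLLM_VERSION_OVERRIDE before running the nox session.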
File details
Details for the file vllm_tgis_adapter-0.9.1.tar.gz.
File metadata
- Download URL: vllm_tgis_adapter-0.9.1.tar.gz
- Upload date:
- Size: 92.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 407fd3f4e5a1d0663fc08a247573b6c934b8986193771422883750a698446f8f |
| MD5 | 27278ca69f2e785644aab55555350699 |
| BLAKE2b-256 | 6f71f503eb7af26a9708f4346661f84134a57cf87457cae66ac61e72113ed354 |
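A downloaded file can be checked against the published digest. The sketch below uses the SHA256 value listed above for the source distribution. The helper names sha256_of and verify are hypothetical, and the local file path is whatever the download was saved as.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for vllm_tgis_adapter-0.9.1.tar.gz
EXPECTED_SHA256 = "407fd3f4e5a1d0663fc08a247573b6c934b8986193771422883750a698446f8f"


def sha256_of(path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected: str = EXPECTED_SHA256) -> bool:
    """Return True if the file's SHA256 digest matches the expected value."""
    return sha256_of(path) == expected.lower()
```

For example, verify("vllm_tgis_adapter-0.9.1.tar.gz") should return True for an intact download of the source distribution.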
Provenance
The following attestation bundles were made for vllm_tgis_adapter-0.9.1.tar.gz:
Publisher: release.yaml on opendatahub-io/vllm-tgis-adapter
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vllm_tgis_adapter-0.9.1.tar.gz
- Subject digest: 407fd3f4e5a1d0663fc08a247573b6c934b8986193771422883750a698446f8f
- Sigstore transparency entry: 612076728
- Sigstore integration time:
- Permalink: opendatahub-io/vllm-tgis-adapter@a0d5363e31e61c8203045bd40b21068d2b02f0a4
- Branch / Tag: refs/tags/0.9.1
- Owner: https://github.com/opendatahub-io
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yaml@a0d5363e31e61c8203045bd40b21068d2b02f0a4
- Trigger Event: release
File details
Details for the file vllm_tgis_adapter-0.9.1-py3-none-any.whl.
File metadata
- Download URL: vllm_tgis_adapter-0.9.1-py3-none-any.whl
- Upload date:
- Size: 51.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 972be3d8d67f4118505c1ec21053d8a0903efc621bbacf8e02ea5a4483cea6e4 |
| MD5 | c069e6c517c0d1cd353b1d51803fea2b |
| BLAKE2b-256 | 9a67df7593afe74cfc54483066eb7b3688111e4575a8998d407cdcc96962b318 |
Provenance
The following attestation bundles were made for vllm_tgis_adapter-0.9.1-py3-none-any.whl:
Publisher: release.yaml on opendatahub-io/vllm-tgis-adapter
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vllm_tgis_adapter-0.9.1-py3-none-any.whl
- Subject digest: 972be3d8d67f4118505c1ec21053d8a0903efc621bbacf8e02ea5a4483cea6e4
- Sigstore transparency entry: 612076736
- Sigstore integration time:
- Permalink: opendatahub-io/vllm-tgis-adapter@a0d5363e31e61c8203045bd40b21068d2b02f0a4
- Branch / Tag: refs/tags/0.9.1
- Owner: https://github.com/opendatahub-io
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yaml@a0d5363e31e61c8203045bd40b21068d2b02f0a4
- Trigger Event: release