vLLM adapter for a TGIS-compatible grpc server
Project description
vllm-tgis-adapter
vLLM adapter for a TGIS-compatible grpc server.
Install
vllm-tgis-adapter is available on PyPi
pip install vllm-tgis-adapter
python -m vllm_tgis_adapter
HealthCheck CLI
Installing the adapter also install a grpc healthcheck cli that can be used to monitor the status of the grpc server:
$ grpc_healtheck
health check...status: SERVING
See usage with
grpc_healthcheck --help
Build
python -m build
pip install dist/*whl
python -m vllm_tgis_adapter
Inference
This will start serving a grpc server on port 8033. This can be queried with grpcurl:
bash examples/inference.sh
Docker
Image available at quay.io/opendatahub/vllm, built from opendatahub-io/vllm's Dockerfile.ubi
docker pull quay.io/opendatahub/vllm
Inference
See examples
Contributing
Set up pre-commit
for linting/style/misc fixes:
pip install pre-commit
pre-commit install
# to run on all files
pre-commit run --all-files
This project uses nox
to manage test automation:
pip install nox
nox --list # list available sessions
nox -s tests-3.10 # run tests session for a specific python version
nox -s build-3.11 # build the wheel package
nox -s lint-3.11 -- --mypy # run linting with type checks
Testing without a GPU
The standard vllm built requires an Nvidia GPU. When this is not available, it is possible to compile vllm
from source with CPU support:
env \
VLLM_CPU_DISABLE_AVX512=true VLLM_TARGET_DEVICE=cpu \
PIP_EXTRA_INDEX_URL=https://download.pytorch.org/whl/cpu \
pip install git+https://github.com/vllm-project/vllm
making it possible to run the tests on most hardware. Please note that the pip
extra index url is required in order to install the torch CPU version.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file vllm_tgis_adapter-0.5.3.tar.gz
.
File metadata
- Download URL: vllm_tgis_adapter-0.5.3.tar.gz
- Upload date:
- Size: 123.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 270d8bbdd98842c933706927441d1ec30cb5a54846a8f700c8e3dfe9480a3780 |
|
MD5 | 6f04bfcaab01b62a4e4d5961080e0716 |
|
BLAKE2b-256 | d0471cbd98ca075f7e3ed16b9be66629d3b03c3af35202c8d49032c26e5f7934 |
File details
Details for the file vllm_tgis_adapter-0.5.3-py3-none-any.whl
.
File metadata
- Download URL: vllm_tgis_adapter-0.5.3-py3-none-any.whl
- Upload date:
- Size: 53.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ebc51978629575b6b549aa683106d1889a1ed70f8f4c770473504494e812bda0 |
|
MD5 | 179e40c8e92e8ee73f018673451215aa |
|
BLAKE2b-256 | 358063c36b27e141b76dba619d7eadb0fdd19bba5818f2d5cad30a48f70b3ed6 |