vLLM adapter for a TGIS-compatible grpc server
Project description
vllm-tgis-adapter
vLLM adapter for a TGIS-compatible grpc server.
Install
vllm-tgis-adapter is available on PyPi
pip install vllm-tgis-adapter
python -m vllm_tgis_adapter
HealthCheck CLI
Installing the adapter also install a grpc healthcheck cli that can be used to monitor the status of the grpc server:
$ grpc_healtheck
health check...status: SERVING
See usage with
grpc_healthcheck --help
Build
python -m build
pip install dist/*whl
python -m vllm_tgis_adapter
Inference
This will start serving a grpc server on port 8033. This can be queried with grpcurl:
bash examples/inference.sh
Docker
Image available at quay.io/dtrifiro/vllm-tgis
docker pull quay.io/dtrifiro/vllm-tgis
Inference
See examples
Contributing
Set up pre-commit
for linting/style/misc fixes:
pip install pre-commit
pre-commit install
# to run on all files
pre-commit run --all-files
This project uses nox
to manage test automation:
pip install nox
nox --list # list available sessions
nox -s tests-3.10 # run tests session for a specific python version
nox -s build-3.11 # build the wheel package
nox -s lint-3.11 -- --mypy # run linting with type checks
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
vllm_tgis_adapter-0.0.5.tar.gz
(40.9 kB
view hashes)
Built Distribution
Close
Hashes for vllm_tgis_adapter-0.0.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 154ddaaf5eefd04dafa6d11b0ff8092710c9f9fb3142224f1f2e3136e43e3fc3 |
|
MD5 | 434717fe9e808e4f1504841d5e3d8c8c |
|
BLAKE2b-256 | 1b29705bc069cb51b583aad8169c6c70f3023215575d59f6cb894d8bb349596a |