Client for the vLLM API with minimal dependencies
Project description
vLLM Client
Overview
Client for the vLLM API with minimal dependencies.
Installation
pip install vllm-client
Examples
See example.py for the following:
- Single generation
- Streaming
- Batch inference
It should work out of the box with a vLLM API server.
Notes
sampling_params.py
needs to be kept in sync with vLLM. It is a simplified version of their class, containing only the code required on client side.
Another programming languages
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
File details
Details for the file vllm_client-0.2.7.0-py3-none-any.whl
.
File metadata
- Download URL: vllm_client-0.2.7.0-py3-none-any.whl
- Upload date:
- Size: 9.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 453773686a504892c556b6a039000f06fa15f5f18939d5bc3bc7ffcfc7a3ec31 |
|
MD5 | 2c443597cd3123b5d87d652f39973f51 |
|
BLAKE2b-256 | 98009c0a6c80c18550f9d54424eeba968640c6becd0fb72334c708ec40f08143 |