happy_vllm is a REST API for vLLM, production ready
Project description
📚 Documentation : https://france-travail.github.io/happy_vllm/
happy_vLLM is a REST API for vLLM which was developed with production in mind. It adds some functionalities to vLLM.
Installation
You can install happy_vLLM using pip:
pip install happy_vllm
Or build it from source:
git clone https://github.com/France-Travail/happy_vllm.git
cd happy_vllm
pip install -e .
Quickstart
Just use the entrypoint happy-vllm
(see arguments for a list of all possible arguments)
happy-vllm --model path_to_model --host 127.0.0.1 --port 5000 --model-name my_model
It will launch the API and you can directly query it for example with
curl 127.0.0.1:5000/v1/info
To get various information on the application or
curl 127.0.0.1:5000/v1/completions -d '{"prompt": "Hey,", "model": "my_model"}'
if you want to generate your first LLM response using happy_vLLM. See endpoints for more details on all the endpoints provided by happy_vLLM.
Deploy with Docker image
A docker image is available from the Github Container Registry :
docker pull ghcr.io/france-travail/happy_vllm:latest
See deploying_with_docker for more details on how to serve happy_vLLM with docker.
Swagger
You can reach the swagger UI at the /docs
endpoint (so for example by default at 127.0.0.1:5000/docs
). You will be displayed all the endpoints and examples on how to use them.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file happy_vllm-1.1.11.tar.gz
.
File metadata
- Download URL: happy_vllm-1.1.11.tar.gz
- Upload date:
- Size: 68.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.8.20
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 46cd5e9a51952428e2d1d24a0acc3462bc9b544a673adbf2019dfd9ea9a19b0c |
|
MD5 | 1a628552329cdcfa7417ef05debeddde |
|
BLAKE2b-256 | 0a064415204439e61ec81f2bb0823908ffa81f2d95966d1fc2ff9473a1fa1f75 |
File details
Details for the file happy_vllm-1.1.11-py3-none-any.whl
.
File metadata
- Download URL: happy_vllm-1.1.11-py3-none-any.whl
- Upload date:
- Size: 67.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.8.20
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3ea8574f036414b91c6fffb3f73481be966cf964449f1475996a3872c13aff75 |
|
MD5 | a1344c05486a2e5d81f4140b85c90956 |
|
BLAKE2b-256 | 6d559c95cae826fc7770a9193407399262d79696b8c8d3e45cee49ad260ebe8b |