vLLM-haystack-adapter
A simple adapter to use a hosted vLLM API in your Haystack pipelines: simply connect your Haystack pipeline to a self-hosted vLLM API server.
Installation
Install the wrapper via pip:

```bash
pip install vllm-haystack
```
Usage
To use the wrapper, the `vLLMInvocationLayer` has to be used. Here is a simple example of how a `PromptNode` can be created with the wrapper:
```python
from haystack.nodes import PromptNode, PromptModel
from vllm_haystack import vLLMInvocationLayer

API = "http://localhost:8000/v1"  # replace this with your API URL

model = PromptModel(model_name_or_path="", invocation_layer_class=vLLMInvocationLayer, max_length=256, api_key="EMPTY", model_kwargs={
    "api_base": API,  # base URL of the OpenAI-compatible vLLM server
    "maximum_context_length": 2048,
})

prompt_node = PromptNode(model_name_or_path=model, top_k=1, max_length=256)
```
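Once created, the `prompt_node` can be queried like any other Haystack `PromptNode`. A minimal usage sketch follows; the prompt itself is only an illustrative example:

```python
# Send a prompt through the adapter to the vLLM server and print the generations.
result = prompt_node("What is the capital of France?")
print(result)
```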
For more configuration examples, take a look at the unit tests.
Hosting a vLLM Server
To create an OpenAI-compatible server via vLLM, you can follow the steps in the Quickstart section of their documentation.
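As a minimal sketch, assuming vLLM is installed and following its quickstart at the time of writing (check the vLLM documentation for the current entrypoint and flags), such a server can be started like this; `facebook/opt-125m` is only a placeholder for whichever model you want to serve:

```bash
# Start an OpenAI-compatible API server (listens on port 8000 by default).
python -m vllm.entrypoints.openai.api_server --model facebook/opt-125m
```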