Sinapsis templates for text, batch, streaming, and multimodal generation with vLLM.
Project description
Sinapsis vLLM
Sinapsis templates for text, batch, streaming, and multimodal generation with vLLM.
🐍 Installation • 🚀 Features • 📚 Usage example • 📙 Documentation • 🔍 License
The sinapsis-vllm package provides Sinapsis templates built on top of vLLM for high-throughput text generation, batch inference, streaming, and vision-language completion.
🐍 Installation
Install using your preferred package manager. We strongly recommend using uv.
Install the base package:
uv pip install sinapsis-vllm --extra-index-url https://pypi.sinapsis.tech
Or with raw pip:
pip install sinapsis-vllm --extra-index-url https://pypi.sinapsis.tech
[!IMPORTANT]
vLLMMultiModalrequires the optionalmultimodalextra:uv pip install sinapsis-vllm[multimodal] --extra-index-url https://pypi.sinapsis.techOr install all optional dependencies:
uv pip install sinapsis-vllm[all] --extra-index-url https://pypi.sinapsis.tech
🚀 Features
Templates Supported
- vLLMTextCompletion: Standard packet-native text generation with vLLM.
- vLLMBatchTextCompletion: Batched generation for multiple conversations in one execution pass.
- vLLMStreamingTextCompletion: Async streaming variant that yields partial packets during generation.
- vLLMMultiModal: Vision-language completion using image packets from
container.images.
🧩 Common Attributes
init_args(vLLMInitArgs, required): Engine configuration such asllm_model_name,tensor_parallel_size,dtype,quantization,trust_remote_code,max_model_len, and memory settings.completion_args(vLLMCompletionArgs, required): Request-time generation parameters such astemperature,top_p,top_k,min_p,max_tokens, penalties, and structured-output settings.cleanup_response(bool, optional): Whether to clean leaked transport or template markers from the final response.reasoning_start_tag/reasoning_end_tag(str | None, optional): Optional tags used to extract reasoning into the packetreasoningfield and keep the final answer inresponse.
[!TIP] Use CLI command
sinapsis info --all-template-namesto show a list with all the available Template names installed with Sinapsis vLLM.
📚 Usage example
The following agent runs one vLLM text-completion step from an LLMConversationPacket.
Config
agent:
name: my_test_agent
templates:
- template_name: InputTemplate
class_name: InputTemplate
attributes: {}
- template_name: LLMConversationInput
class_name: LLMConversationInput
template_input: InputTemplate
attributes:
prompt: "Give three short tips for staying organized during a busy week."
system_prompt: "You are a helpful assistant."
- template_name: vLLMTextCompletion
class_name: vLLMTextCompletion
template_input: LLMConversationInput
attributes:
init_args:
llm_model_name: "Qwen/Qwen3-1.7B"
max_model_len: 4096
dtype: auto
seed: 42
gpu_memory_utilization: 0.9
cpu_offload_gb: 0
max_num_seqs: 8
disable_log_stats: true
completion_args:
temperature: 0.7
top_p: 0.8
top_k: 20
min_p: 0
max_tokens: 2048
seed: 42
📙 Documentation
Documentation for this and other sinapsis packages is available on the sinapsis website
Tutorials for different projects within sinapsis are available at sinapsis tutorials page
🔍 License
This project is licensed under the AGPLv3 license, which encourages open collaboration and sharing. For more details, please refer to the LICENSE file.
For commercial use, please refer to our official Sinapsis website for information on obtaining a commercial license.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file sinapsis_vllm-0.2.0.tar.gz.
File metadata
- Download URL: sinapsis_vllm-0.2.0.tar.gz
- Upload date:
- Size: 23.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
433d4f33a80859f7b219885e0ed7cfc1d51398fa0498ca299e83ecc2b9f31a2e
|
|
| MD5 |
27822fbd104fd4ac6a2ed324b4bb93a3
|
|
| BLAKE2b-256 |
33dea79ccb7f5686cc6b3d10d25af13dd9d367c67b4c258be711542a22a13dbf
|
File details
Details for the file sinapsis_vllm-0.2.0-py3-none-any.whl.
File metadata
- Download URL: sinapsis_vllm-0.2.0-py3-none-any.whl
- Upload date:
- Size: 24.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2d9fa53a65c770541d0fee90a25201b6beeadade0f58c24c6a203525d2e2d16a
|
|
| MD5 |
c811abe5edfc9783f825ec27175f9680
|
|
| BLAKE2b-256 |
f8bceabd05d980ff60813f4baa564f20d3c8fb09a8a912137af53742b36b2bc7
|