Skip to main content

A vLLM plugin to register the MERaLiON-2-10B model architecture with vLLM’s plugin system.

Project description

MERaLiON2 vLLM Plugin

Licence

MERaLiON-Public-Licence-v3

Set up Environment

This vLLM plugin supports vLLM version 0.6.5 ~ 0.7.3 (V0 engine), and 0.8.5 ~ 0.8.5.post1 (V1 engine).

Install the MERaLiON2 vLLM plugin.

pip install vllm-plugin-meralion2

It's strongly recommended to install flash-attn for better memory and gpu utilization.

pip install flash-attn --no-build-isolation

Offline Inference

Refer to offline_example.py for offline inference example.

OpenAI-compatible Serving

Refer to openai_serve_example.sh for openAI-compatible serving example.

To call the server, you can refer to openai_client_example.py.

Alternatively, you can try calling the server with curl, refer to openai_client_curl.sh.

Changelog

0.1.4

  • Fixed multi-audio handling for a single request.
  • Fixed server-side internal failure when multiple requests with different audio chunk counts are batched together.
  • Added more docstrings for better code readability and maintenance.

Full history: see CHANGELOG.md.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_plugin_meralion2-0.1.4.tar.gz (30.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vllm_plugin_meralion2-0.1.4-py3-none-any.whl (21.6 kB view details)

Uploaded Python 3

File details

Details for the file vllm_plugin_meralion2-0.1.4.tar.gz.

File metadata

  • Download URL: vllm_plugin_meralion2-0.1.4.tar.gz
  • Upload date:
  • Size: 30.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.11

File hashes

Hashes for vllm_plugin_meralion2-0.1.4.tar.gz
Algorithm Hash digest
SHA256 49e52bc85c48d3fa0d4c313a9325a63d0445f1a6a31cd993d255ec36b3b72c1b
MD5 af808453aadb98634ad92d91fafa23f7
BLAKE2b-256 7dab8ab629af8cdb99986a88628d57c12f5fa51be44a4445e35964ed772bbcf6

See more details on using hashes here.

File details

Details for the file vllm_plugin_meralion2-0.1.4-py3-none-any.whl.

File metadata

File hashes

Hashes for vllm_plugin_meralion2-0.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 8c7b79982e779033dc956ed4e3aaa81a5371482d27493d24cc245853581072aa
MD5 aba65ba07122add6744d0b1cff69b19d
BLAKE2b-256 606ec81275e428911afb4171e2b004fa3dfc0332807e2aab0613241f5e236748

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page