Skip to main content

A vLLM plugin to register the MERaLiON-2-10B model architecture with vLLM’s plugin system.

Project description

MERaLiON2 vLLM Plugin

Security Publish CodeQL

Licence

MERaLiON-Public-Licence-v3

Set up Environment

This vLLM plugin supports vLLM version 0.6.5 ~ 0.7.3 (V0 engine), and 0.8.5 ~ 0.8.5.post1 (V1 engine).

Install the MERaLiON2 vLLM plugin.

pip install vllm-plugin-meralion2

It's strongly recommended to install flash-attn for better memory and gpu utilization.

pip install flash-attn --no-build-isolation

Offline Inference

Refer to offline_example.py for offline inference example.

OpenAI-compatible Serving

Refer to openai_serve_example.sh for openAI-compatible serving example.

To call the server, you can refer to openai_client_example.py.

Alternatively, you can try calling the server with curl, refer to openai_client_curl.sh.

Changelog

0.1.4

  • Fixed multi-audio handling for a single request.
  • Fixed server-side internal failure when multiple requests with different audio chunk counts are batched together.
  • Added more docstrings for better code readability and maintenance.

Full history: see CHANGELOG.md.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_plugin_meralion2-0.1.5.tar.gz (30.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vllm_plugin_meralion2-0.1.5-py3-none-any.whl (21.7 kB view details)

Uploaded Python 3

File details

Details for the file vllm_plugin_meralion2-0.1.5.tar.gz.

File metadata

  • Download URL: vllm_plugin_meralion2-0.1.5.tar.gz
  • Upload date:
  • Size: 30.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vllm_plugin_meralion2-0.1.5.tar.gz
Algorithm Hash digest
SHA256 786f07185944d03fcb2d1d785d3cd09bde97d9b47144e2694c32d61bd6e6a5fb
MD5 4e2231bd200c7459d6df77109b73cc8d
BLAKE2b-256 31225968319d31b632a47e910102f7fd0de80203f33e470783e631be06403f80

See more details on using hashes here.

Provenance

The following attestation bundles were made for vllm_plugin_meralion2-0.1.5.tar.gz:

Publisher: publish.yml on YingxuH/vllm_plugin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file vllm_plugin_meralion2-0.1.5-py3-none-any.whl.

File metadata

File hashes

Hashes for vllm_plugin_meralion2-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 0c35026b59aa95a0a4188337aa21fa73196aa49d82f3be475a91764bc635b764
MD5 4a0ea447dd3bab8b2ba336c69c4227e3
BLAKE2b-256 f9bc87d46c90ee1c2cafbe6ca903358c1caa27aafd45cbce22e07a3d9d5fb377

See more details on using hashes here.

Provenance

The following attestation bundles were made for vllm_plugin_meralion2-0.1.5-py3-none-any.whl:

Publisher: publish.yml on YingxuH/vllm_plugin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page