Skip to main content

A vLLM plugin to register the MERaLiON-2-10B model architecture with vLLM’s plugin system.

Project description

MERaLiON2 vLLM Plugin

Security (Bandit) Dependency Audit (pip-audit) Dependency Review CodeQL Publish

Licence

MERaLiON-Public-Licence-v3

Set up Environment

This plugin family has three release lines:

  • v0.1.x: compatibility lane for vLLM version 0.6.5 ~ 0.7.3 (V0 engine), and 0.8.5 ~ 0.8.5.post1 (V1 engine).
  • v0.2.x: compatibility lane for vLLM >=0.8.5,<=0.10.0.
  • v0.3.x: compatibility lane for vLLM >=0.12.0,<0.17.0. Targets the V1 engine and handles all internal API changes across the 0.12–0.16 minor series.

See scripts/compatibility/ for the automated version-matrix runner and detailed vLLM + transformers compatibility results.

Install by your vLLM version:

# For vLLM 0.6.5~0.7.3, 0.8.5.
pip install "vllm-plugin-meralion2<0.2"

# For vLLM 0.8.5 ~ 0.10.0
pip install "vllm-plugin-meralion2>=0.2,<0.3"

# For vLLM 0.12.0 ~ 0.16.x
pip install "vllm-plugin-meralion2>=0.3,<0.4"

Attention backend: MERaLiON-2 uses Gemma2's attention logit softcapping which requires the FlashInfer backend. The serve example handles this automatically. See openai_serve_example.sh for details.

Offline Inference

Refer to offline_example.py for offline inference example.

OpenAI-compatible Serving

Refer to openai_serve_example.sh for OpenAI-compatible serving example.

To call the server, you can refer to openai_client_example.py.

Alternatively, you can try calling the server with curl, refer to openai_client_curl.sh.

Full release history

See CHANGELOG.md.

vLLM + transformers compatibility (v0.3.x)

Tested with transformers==4.57.6 on H100 (TP=1). Each cell covers install, unit tests, and full-dataset ASR evaluation.

vLLM transformers install tests ASR eval overall
0.12.0 4.57.6 PASS PASS PASS PASS
0.13.0 4.57.6 PASS PASS PASS PASS
0.14.0 4.57.6 PASS PASS PASS PASS
0.15.0 4.57.6 PASS PASS PASS PASS
0.15.1 4.57.6 PASS PASS PASS PASS
0.16.0 4.57.6 PASS PASS PASS PASS

To reproduce or extend this matrix, see the compatibility matrix runner.

Security and dependency scanning

The repository uses separate workflows so each scan has a clear purpose:

  • Security (Bandit SAST) (.github/workflows/security.yml): static security linting of project Python source (bandit -r src).
  • CodeQL (.github/workflows/codeql.yml): semantic code scanning for Python + GitHub Actions security issues.
  • Dependency Audit (pip-audit) (.github/workflows/dependency-audit.yml): installed dependency vulnerability scanning.
  • Dependency Review (PR) (.github/workflows/dependency-review.yml): checks dependency changes in pull requests and fails on moderate+ severity vulnerabilities.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_plugin_meralion2-0.3.0.tar.gz (31.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vllm_plugin_meralion2-0.3.0-py3-none-any.whl (18.7 kB view details)

Uploaded Python 3

File details

Details for the file vllm_plugin_meralion2-0.3.0.tar.gz.

File metadata

  • Download URL: vllm_plugin_meralion2-0.3.0.tar.gz
  • Upload date:
  • Size: 31.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vllm_plugin_meralion2-0.3.0.tar.gz
Algorithm Hash digest
SHA256 f1ef10f0e65ff8ab7daed9d59fd43369ee833e4a1e5712400766da70e49808cf
MD5 dd863db38d403006cb0113f77fbb137d
BLAKE2b-256 b96ee224de23979f3ee7a9491c81384bcf32148ab6ff2157c95b92ccacd66e1d

See more details on using hashes here.

Provenance

The following attestation bundles were made for vllm_plugin_meralion2-0.3.0.tar.gz:

Publisher: publish.yml on YingxuH/vllm_plugin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file vllm_plugin_meralion2-0.3.0-py3-none-any.whl.

File metadata

File hashes

Hashes for vllm_plugin_meralion2-0.3.0-py3-none-any.whl
Algorithm Hash digest
SHA256 97a016896f813e831e56f92f128e549fd1c5c7330baf5ca8ee6f0ef23e08cbd8
MD5 08478843a15577427e09320bf8226758
BLAKE2b-256 6907fd785126b1dcecf796002cfd73723d8a678f046d2415d0f2095bc44977ea

See more details on using hashes here.

Provenance

The following attestation bundles were made for vllm_plugin_meralion2-0.3.0-py3-none-any.whl:

Publisher: publish.yml on YingxuH/vllm_plugin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page