Skip to main content

A vLLM plugin to register the MERaLiON-2-10B model architecture with vLLM’s plugin system.

Project description

MERaLiON2 vLLM Plugin

Security (Bandit) Dependency Audit (pip-audit) Dependency Review CodeQL Publish

Licence

MERaLiON-Public-Licence-v3

Set up Environment

This plugin family has two release lines:

  • v0.1.x: compatibility lane for vLLM version 0.6.5 ~ 0.7.3 (V0 engine), and 0.8.5 ~ 0.8.5.post1 (V1 engine).
  • v0.2.x: compatibility lane for vLLM >=0.8.5,<=0.10.0. Refer to matrix_summary.md for detailed vLLM + transformers compatibility.

Install by your vLLM version:

# For vLLM 0.6.5~0.7.3, 0.8.5.
pip install "vllm-plugin-meralion2<0.2"

# For vLLM 0.8.5 ~ 0.10.0
pip install "vllm-plugin-meralion2>=0.2,<0.3"

It's strongly recommended to install flash-attn for better memory and gpu utilization.

pip install flash-attn --no-build-isolation

Offline Inference

Refer to offline_example.py for offline inference example.

OpenAI-compatible Serving

Refer to openai_serve_example.sh for openAI-compatible serving example.

To call the server, you can refer to openai_client_example.py.

Alternatively, you can try calling the server with curl, refer to openai_client_curl.sh.

Full release history

See CHANGELOG.md.

vLLM + transformers compatibility

See matrix_summary.md

Security and dependency scanning

The repository uses separate workflows so each scan has a clear purpose:

  • Security (Bandit SAST) (.github/workflows/security.yml): static security linting of project Python source (bandit -r src).
  • CodeQL (.github/workflows/codeql.yml): semantic code scanning for Python + GitHub Actions security issues.
  • Dependency Audit (pip-audit) (.github/workflows/dependency-audit.yml): installed dependency vulnerability scanning.
  • Dependency Review (PR) (.github/workflows/dependency-review.yml): checks dependency changes in pull requests and fails on moderate+ severity vulnerabilities.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_plugin_meralion2-0.2.0.tar.gz (28.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vllm_plugin_meralion2-0.2.0-py3-none-any.whl (17.1 kB view details)

Uploaded Python 3

File details

Details for the file vllm_plugin_meralion2-0.2.0.tar.gz.

File metadata

  • Download URL: vllm_plugin_meralion2-0.2.0.tar.gz
  • Upload date:
  • Size: 28.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vllm_plugin_meralion2-0.2.0.tar.gz
Algorithm Hash digest
SHA256 481e94c685f5baf0f619493d2f5c009d74ffd940c06b4692ddee47e414273c06
MD5 722da7c5f5d197766a6e8f41d4053811
BLAKE2b-256 715de959dea5c8ca2b2aaeaf2f8b04b8c0b4c8b82ce2dc7931643c6429e94516

See more details on using hashes here.

Provenance

The following attestation bundles were made for vllm_plugin_meralion2-0.2.0.tar.gz:

Publisher: publish.yml on YingxuH/vllm_plugin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file vllm_plugin_meralion2-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for vllm_plugin_meralion2-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bfde75112208ef83e9494a94e185769d796f3752965a15e7062e7637d919c20d
MD5 e901ba8fe5b0bee7cf8c04df3e9af664
BLAKE2b-256 c1dcb1cb3f8987fbb02efd3b1b9a6647f0f557ce42ca1173e63d3eb98ac0153f

See more details on using hashes here.

Provenance

The following attestation bundles were made for vllm_plugin_meralion2-0.2.0-py3-none-any.whl:

Publisher: publish.yml on YingxuH/vllm_plugin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page