Skip to main content

A vLLM plugin to register the MERaLiON-2-10B model architecture with vLLM’s plugin system.

Project description

MERaLiON2 vLLM Plugin

Licence

MERaLiON-Public-Licence-v2

Set up Environment

This vLLM plugin for MERaLiON2 requires transformers version 4.50.1. It supports vLLM version 0.6.5 ~ 0.7.3 (V0 engine), and 0.8.5 ~ 0.8.5.post1 (V1 engine).

pip install transformers==4.50.1
pip install vllm==0.6.5

Install the MERaLiON2 vLLM plugin.

python install vllm-plugin-meralion2

It's strongly recommended to install flash-attn for better memory and gpu utilization.

pip install flash-attn --no-build-isolation

Offline Inference

Refer to offline_example.py for offline inference example.

OpenAI-compatible Serving

Refer to openai_serve_example.sh for openAI-compatible serving example.

To call the server, you can refer to openai_client_example.py.

Alternatively, you can try calling the server with curl, refer to openai_client_curl.sh.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_plugin_meralion2-0.1.2.post2.dev2.tar.gz (14.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file vllm_plugin_meralion2-0.1.2.post2.dev2.tar.gz.

File metadata

File hashes

Hashes for vllm_plugin_meralion2-0.1.2.post2.dev2.tar.gz
Algorithm Hash digest
SHA256 50ab9a4d07eff53ae2300148a2cd3735e41ecfdbfd6f0b7c8cde25121fca504e
MD5 1426764cb3d7c210eae98365d439516c
BLAKE2b-256 67dd03f397ee958f3bc3df48c904e11797e867715fe2b72d50c015836c0ffdda

See more details on using hashes here.

File details

Details for the file vllm_plugin_meralion2-0.1.2.post2.dev2-py3-none-any.whl.

File metadata

File hashes

Hashes for vllm_plugin_meralion2-0.1.2.post2.dev2-py3-none-any.whl
Algorithm Hash digest
SHA256 09bd08142f0ff96acac5b704fef652990d56a3c840436872bafcea9c5680a409
MD5 c97cf8ef04434f86262f943d69886e18
BLAKE2b-256 7c6aa90e84f9a847fdbaaf889b1805d9229228a29dbfe108ca90549845750f85

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page