Skip to main content

A vLLM plugin to register the MERaLiON-2-10B model architecture with vLLM’s plugin system.

Project description

MERaLiON2 vLLM Plugin

Licence

MERaLiON-Public-Licence-v2

Set up Environment

This vLLM plugin for MERaLiON2 requires transformers version 4.50.1. It supports vLLM version 0.6.5 ~ 0.7.3 (V0 engine), and 0.8.5 ~ 0.8.5.post1 (V1 engine).

pip install transformers==4.50.1
pip install vllm==0.6.5

Install the MERaLiON2 vLLM plugin.

python install vllm-plugin-meralion2

It's strongly recommended to install flash-attn for better memory and gpu utilization.

pip install flash-attn --no-build-isolation

Offline Inference

Refer to offline_example.py for offline inference example.

OpenAI-compatible Serving

Refer to openai_serve_example.sh for openAI-compatible serving example.

To call the server, you can refer to openai_client_example.py.

Alternatively, you can try calling the server with curl, refer to openai_client_curl.sh.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vllm_plugin_meralion2-0.1.3.tar.gz (14.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vllm_plugin_meralion2-0.1.3-py3-none-any.whl (18.3 kB view details)

Uploaded Python 3

File details

Details for the file vllm_plugin_meralion2-0.1.3.tar.gz.

File metadata

  • Download URL: vllm_plugin_meralion2-0.1.3.tar.gz
  • Upload date:
  • Size: 14.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.15

File hashes

Hashes for vllm_plugin_meralion2-0.1.3.tar.gz
Algorithm Hash digest
SHA256 ecbca7fb453031a9bdead0f205be47359e43202a8a7e1bfca4c6d5694eef25e6
MD5 593240e5120d790ea5af1f5777aa812a
BLAKE2b-256 a3c51ed84bccebbf95d7ad130385aa161863eacaad4f491b02c57d83b6155b2b

See more details on using hashes here.

File details

Details for the file vllm_plugin_meralion2-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for vllm_plugin_meralion2-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 2ede7b2371b5398fcf8120cef929d223973dca567748c999d9467a91d3fa954d
MD5 81ffbff42d38270a8727a17286fd446b
BLAKE2b-256 fefdc71a106a037008eb3131c82f0e982d5ab0c238b96f4478d08fbaff8c8c5e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page