SMG gRPC servicer implementations for LLM inference engines (vLLM, SGLang, MLX, TokenSpeed)

smg-grpc-servicer

gRPC servicer implementations for LLM inference engines. Supports vLLM and SGLang.

Installation

For vLLM:

pip install "smg-grpc-servicer[vllm]"

For SGLang:

pip install "smg-grpc-servicer[sglang]"
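
To confirm what ended up in the environment after installing, a short check along these lines may help. It is a sketch using only the standard library; the helper names `installed_backends` and `servicer_version` are made up for illustration, and it assumes the backends are importable under the top-level module names `vllm` and `sglang`.

```python
from importlib import metadata, util


def installed_backends(candidates=("vllm", "sglang")):
    """Return the candidate module names that can be found in this environment."""
    # find_spec locates a top-level module without importing it.
    return [name for name in candidates if util.find_spec(name) is not None]


def servicer_version():
    """Return the installed smg-grpc-servicer version, or None if it is absent."""
    try:
        return metadata.version("smg-grpc-servicer")
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    print("smg-grpc-servicer:", servicer_version())
    print("backends found:", installed_backends())
```

Because `find_spec` does not import the module, the check stays fast even when a heavy backend is present.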

Usage

vLLM

vllm serve meta-llama/Llama-2-7b-hf --grpc

SGLang

sglang serve --model-path meta-llama/Llama-2-7b-hf --grpc-mode
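
Either command can take a while to load the model before the server starts listening, so a script that launches it may want to wait for the port first. The helper below is a generic sketch, not part of this package: `wait_for_port` is a hypothetical name, and since this README does not state the gRPC port, pass whatever host and port your server is configured to use. It checks TCP reachability only, not gRPC-level health.

```python
import socket
import time


def wait_for_port(host: str, port: int, timeout: float = 30.0, interval: float = 0.5) -> bool:
    """Return True once a TCP connection to (host, port) succeeds, False on timeout."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            # Refused or timed out: retry until the overall deadline passes.
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)
```

For example, `wait_for_port("127.0.0.1", port, timeout=120)` with `port` set to your configured value would block for up to two minutes before giving up.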

Architecture

smg-grpc-servicer[vllm]    ──optional dep──>  vllm               (lazy import)
smg-grpc-servicer[sglang]  ──optional dep──>  sglang             (lazy import)
smg-grpc-servicer          ──depends on────>  smg-grpc-proto     (hard dependency)
vllm                       ──optional──────>  smg-grpc-servicer  (via vllm serve --grpc)
sglang                     ──optional──────>  smg-grpc-servicer  (via sglang serve --grpc-mode)

Backend dependencies are isolated via extras, so installing one backend does not pull in the other. This avoids dependency conflicts between vLLM and SGLang.
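
The lazy-import arrangement can be sketched roughly as follows. This is an illustration of the general pattern, not the package's actual code: `load_backend` and `BACKEND_MODULES` are hypothetical names, and the mapping assumes each extra provides a top-level module of the same name.

```python
import importlib
from types import ModuleType

# Hypothetical mapping from extra name to the module that extra provides.
BACKEND_MODULES = {"vllm": "vllm", "sglang": "sglang"}


def load_backend(name: str) -> ModuleType:
    """Import a backend module on first use, with a clear error if it is missing."""
    if name not in BACKEND_MODULES:
        raise ValueError(f"unknown backend: {name!r}")
    try:
        # The import happens here, at call time, not when this file is loaded.
        return importlib.import_module(BACKEND_MODULES[name])
    except ImportError as exc:
        raise ImportError(
            f"backend {name!r} is not installed; "
            f'try: pip install "smg-grpc-servicer[{name}]"'
        ) from exc
```

Deferring the import this way means the core package can be imported with neither backend present, and a missing backend only surfaces as an error when that backend is requested.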

Development

See DEVELOPMENT.md for local development setup, CI, and release workflows.
