Skip to main content

Bento service with SGLang Router included

Project description

bento-sgl-router

Helper utilities for bundling an SGLang Router inside a BentoML service. It exposes a drop-in replacement for bentoml.service that starts the router sidecar, keeps it in sync with your LLM workers, and surfaces router metrics alongside your Bento.

Overview

  • Wraps an existing BentoML service definition and injects an sglang-router process.
  • Automatically syncs worker hosts discovered from your llm dependency during readiness probes.
  • Exposes router Prometheus metrics by merging them into the Bento service metrics endpoint.

Installation

pip install bento-sgl-router

Python 3.10 or newer is required. The package installs bentoml>=1.4.25 and sglang-router>=0.1.9 as dependencies.

Quickstart

Decorate an existing BentoML LLM service to bundle it with an SGLang Router:

import bentoml
from bento_sgl_router import service


@service(name="chat-router")
class ChatLLM:
    ...

Any keyword arguments supported by bentoml.service (such as image, labels, or resource configs) can be passed through the decorator.

Router configuration

Control router behaviour via environment variables:

Variable Default Description
ROUTER_POLICY cache_aware Scheduling policy passed to sglang_router.launch_router.

Set the variable in your deployment environment or via envs=[...] when declaring the service.

Development

pdm install
pdm run pytest

The project keeps the code in bento_sgl_router.py and lightweight tests under tests/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bento_sgl_router-0.0.5.tar.gz (2.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

bento_sgl_router-0.0.5-py3-none-any.whl (3.6 kB view details)

Uploaded Python 3

File details

Details for the file bento_sgl_router-0.0.5.tar.gz.

File metadata

  • Download URL: bento_sgl_router-0.0.5.tar.gz
  • Upload date:
  • Size: 2.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for bento_sgl_router-0.0.5.tar.gz
Algorithm Hash digest
SHA256 e31f77d50c3a35894de530046753631e4781ccc53f0949f843c0868814af2bf0
MD5 1207233b34c95c5c93ab1aa1c3db2076
BLAKE2b-256 971599de45826ac46b1e4ab09f4ed9fe6fe39ad46553ba0316f13578bbfd2104

See more details on using hashes here.

Provenance

The following attestation bundles were made for bento_sgl_router-0.0.5.tar.gz:

Publisher: release.yml on bentoml/bento-sglang-router

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file bento_sgl_router-0.0.5-py3-none-any.whl.

File metadata

File hashes

Hashes for bento_sgl_router-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 5934cd81711963c482b8a4eb8af44b69d8ace6ffd82be38d0b87399fff734a88
MD5 f7c7c9d2a643acaf99b9af0e6af89282
BLAKE2b-256 ac3dbd352c783f274a33c595a58bbf3e985de59c7500b55ea16c8bb4a0d0aeb6

See more details on using hashes here.

Provenance

The following attestation bundles were made for bento_sgl_router-0.0.5-py3-none-any.whl:

Publisher: release.yml on bentoml/bento-sglang-router

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page