Bento service with SGLang Router included

bento-sgl-router

Helper utilities for bundling an SGLang Router inside a BentoML service. It exposes a drop-in replacement for bentoml.service that starts the router sidecar, keeps it in sync with your LLM workers, and surfaces router metrics alongside your Bento.

Overview

  • Wraps an existing BentoML service definition and injects an sglang-router process.
  • Automatically syncs worker hosts discovered from your llm dependency during readiness probes.
  • Exposes router Prometheus metrics by merging them into the Bento service metrics endpoint.
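The sync and metrics steps above can be pictured with a small sketch. This is not the package's implementation: the function names `plan_worker_sync` and `merge_metrics_text` and the example URLs are hypothetical, and the sketch assumes workers are identified by URL strings and that metrics are exchanged in the Prometheus text format.

```python
def plan_worker_sync(discovered, registered):
    """Return (to_add, to_remove) so the router's worker set matches `discovered`.

    Hypothetical helper: `discovered` are worker URLs found via the llm
    dependency, `registered` are URLs the router currently knows about.
    """
    discovered_set = set(discovered)
    registered_set = set(registered)
    to_add = sorted(discovered_set - registered_set)
    to_remove = sorted(registered_set - discovered_set)
    return to_add, to_remove


def merge_metrics_text(service_metrics, router_metrics):
    """Concatenate two Prometheus text payloads into one response body.

    Assumes the two payloads do not define the same metric family twice.
    """
    parts = [p.strip("\n") for p in (service_metrics, router_metrics) if p.strip()]
    return "\n".join(parts) + "\n" if parts else ""


to_add, to_remove = plan_worker_sync(
    discovered=["http://10.0.0.1:8000", "http://10.0.0.2:8000"],
    registered=["http://10.0.0.2:8000", "http://10.0.0.3:8000"],
)
```

Running a diff like this on each readiness probe would keep the router's worker list aligned with the workers that are actually up, without restarting the router.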

Installation

pip install bento-sgl-router

Python 3.10 or newer is required. The package installs bentoml>=1.4.25 and sglang-router>=0.1.9 as dependencies.

Quickstart

Decorate an existing BentoML LLM service to bundle it with an SGLang Router:

import bentoml
from bento_sgl_router import service


@service(name="chat-router")
class ChatLLM:
    ...

Any keyword arguments supported by bentoml.service (such as image, labels, or resource configs) can be passed through the decorator.
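The pass-through behaviour can be illustrated with a generic wrapper decorator. This is a minimal sketch of the pattern, not the package's code: `base_service` is a hypothetical stand-in for bentoml.service, and `service_with_router` and the `router_enabled` marker are invented for illustration.

```python
def base_service(**config):
    """Hypothetical stand-in for a configurable class decorator."""
    def decorate(cls):
        # Record the configuration on the class so it can be inspected.
        cls.service_config = dict(config)
        return cls
    return decorate


def service_with_router(**kwargs):
    """Add router-specific behaviour, then forward every keyword argument."""
    def decorate(cls):
        cls.router_enabled = True  # hypothetical marker for the injected router
        return base_service(**kwargs)(cls)
    return decorate


@service_with_router(name="chat-router", labels={"team": "llm"})
class ChatLLM:
    pass
```

Because the wrapper forwards `**kwargs` unchanged, options such as name or labels reach the underlying decorator exactly as if it had been applied directly.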

Router configuration

Control router behaviour via environment variables:

Variable      | Default     | Description
ROUTER_POLICY | cache_aware | Scheduling policy passed to sglang_router.launch_router.

Set the variable in your deployment environment or via envs=[...] when declaring the service.
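As a rough illustration of both options, the sketch below resolves the policy from the environment with the documented default and builds one entry for an envs=[...] list. The helper names are hypothetical, the name/value dictionary shape is an assumption about how BentoML expects envs entries, and `round_robin` is only an example value.

```python
import os

DEFAULT_ROUTER_POLICY = "cache_aware"  # default listed in the table above


def resolve_router_policy(environ=None):
    """Return ROUTER_POLICY from the environment, or the default if unset."""
    env = os.environ if environ is None else environ
    return env.get("ROUTER_POLICY", DEFAULT_ROUTER_POLICY)


def router_policy_env(policy):
    """Build one envs=[...] entry (assumed name/value dictionary shape)."""
    return {"name": "ROUTER_POLICY", "value": policy}


# Unset variable falls back to the default; an explicit value wins.
default_policy = resolve_router_policy({})
custom_policy = resolve_router_policy({"ROUTER_POLICY": "round_robin"})
envs = [router_policy_env(custom_policy)]
```

The resulting list could then be passed as envs=envs when declaring the service, provided the assumed entry shape matches your BentoML version.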

Development

pdm install
pdm run pytest

The project keeps the code in bento_sgl_router.py and lightweight tests under tests/.

Download files

Source Distribution

bento_sgl_router-0.0.2.tar.gz (2.8 kB view details)

Uploaded Source

Built Distribution

bento_sgl_router-0.0.2-py3-none-any.whl (3.6 kB view details)

Uploaded Python 3

File details

Details for the file bento_sgl_router-0.0.2.tar.gz.

File metadata

  • Download URL: bento_sgl_router-0.0.2.tar.gz
  • Size: 2.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for bento_sgl_router-0.0.2.tar.gz
Algorithm   | Hash digest
SHA256      | 9a43f51abef6d3cf8450c5f0deeebd20c6b223688970a91907711fb6afbc593a
MD5         | 497aabbae4d8b4d5f30b1b698629577c
BLAKE2b-256 | dc38157b6495145eee2fb91c91d10b7156d1042e9bb9e6ad4b732210db08b89b

Provenance

The following attestation bundles were made for bento_sgl_router-0.0.2.tar.gz:

Publisher: release.yml on bentoml/bento-sglang-router

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file bento_sgl_router-0.0.2-py3-none-any.whl.

File hashes

Hashes for bento_sgl_router-0.0.2-py3-none-any.whl
Algorithm   | Hash digest
SHA256      | 946fa10990e2ff4c4cd6362a5e9cc3da8130435f71b276df31f6bd046311e6e3
MD5         | fe45c03e4b8c01acfa3186a5b30257c7
BLAKE2b-256 | 132c87df76c87b22afcad07be8492bc58b04c93a5def2f17e3002cee97554fb1

Provenance

The following attestation bundles were made for bento_sgl_router-0.0.2-py3-none-any.whl:

Publisher: release.yml on bentoml/bento-sglang-router

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
