Bento service with SGLang Router included

bento-sgl-router

Helper utilities for bundling an SGLang Router inside a BentoML service. It exposes a drop-in replacement for bentoml.service that starts the router sidecar, keeps it in sync with your LLM workers, and surfaces router metrics alongside your Bento.

Overview

  • Wraps an existing BentoML service definition and injects an sglang-router process.
  • Automatically syncs worker hosts discovered from your llm dependency during readiness probes.
  • Exposes router Prometheus metrics by merging them into the Bento service metrics endpoint.
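To make the metrics merging concrete, here is a minimal sketch of combining two Prometheus text-format payloads into a single response body. It is not the package's actual implementation: the function name merge_metrics and the metric names in the example are hypothetical, and the sketch assumes the two payloads do not share metric family names.

```python
def merge_metrics(service_text: str, router_text: str) -> str:
    """Concatenate two Prometheus text-format payloads into one body.

    Assumption: the payloads expose disjoint metric families. Prometheus
    rejects a scrape that declares the same family twice, so a real merge
    would need to deduplicate or rename colliding families.
    """
    # Drop empty payloads and trim surrounding newlines so the join
    # produces exactly one newline between the two sections.
    parts = [p.strip("\n") for p in (service_text, router_text) if p.strip()]
    return "\n".join(parts) + "\n" if parts else ""


# Hypothetical payloads, for illustration only.
service_payload = "# TYPE bentoml_requests_total counter\nbentoml_requests_total 3\n"
router_payload = "# TYPE router_requests_total counter\nrouter_requests_total 5\n"

merged = merge_metrics(service_payload, router_payload)
print(merged, end="")
```

A scraper pointed at the Bento service metrics endpoint would then see both sets of samples in one response.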

Installation

pip install bento-sgl-router

Python 3.10 or newer is required. The package installs bentoml>=1.4.25 and sglang-router>=0.1.9 as dependencies.

Quickstart

Decorate an existing BentoML LLM service to bundle it with an SGLang Router:

import bentoml
from bento_sgl_router import service


@service(name="chat-router")
class ChatLLM:
    ...

Any keyword arguments supported by bentoml.service (such as image, labels, or resource configs) can be passed through the decorator.
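The pass-through behaviour can be pictured as one decorator forwarding its keyword arguments to another. The sketch below uses stand-in functions so that it runs without BentoML installed: base_service plays the role of bentoml.service and service plays the role of bento_sgl_router.service. Both are hypothetical stand-ins, and the labels and resources values are illustrative only.

```python
from typing import Any, Callable


def base_service(**config: Any) -> Callable[[type], type]:
    """Stand-in for bentoml.service: records the config on the class."""
    def decorate(cls: type) -> type:
        cls.service_config = dict(config)
        return cls
    return decorate


def service(**config: Any) -> Callable[[type], type]:
    """Stand-in for bento_sgl_router.service: forwards every keyword unchanged."""
    def decorate(cls: type) -> type:
        # The real package would also wire in the router sidecar here;
        # that part is deliberately left out of this sketch.
        return base_service(**config)(cls)
    return decorate


@service(name="chat-router", labels={"team": "llm"}, resources={"gpu": 1})
class ChatLLM:
    pass


print(ChatLLM.service_config)
```

Because the wrapper accepts arbitrary keywords and hands them on untouched, any option the underlying decorator understands stays available.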

Router configuration

Control router behaviour via environment variables:

Variable        Default        Description
ROUTER_POLICY   cache_aware    Scheduling policy passed to sglang_router.launch_router.

Set the variable in your deployment environment or via envs=[...] when declaring the service.
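As a rough illustration of how such a variable is typically resolved, the sketch below reads ROUTER_POLICY and falls back to the documented default. The helper resolve_router_policy is hypothetical and not part of the package. The envs entry at the end assumes the name/value dictionary shape used by BentoML service environment variables, and round_robin is assumed to be a policy name that sglang-router accepts.

```python
import os
from typing import Mapping, Optional

DEFAULT_POLICY = "cache_aware"  # default listed in the table above


def resolve_router_policy(environ: Optional[Mapping[str, str]] = None) -> str:
    """Return the configured router policy, or the default when unset or blank."""
    env = os.environ if environ is None else environ
    value = env.get("ROUTER_POLICY", "").strip()
    return value or DEFAULT_POLICY


# Assumed shape of one entry for envs=[...] on the service decorator.
router_envs = [{"name": "ROUTER_POLICY", "value": "round_robin"}]

print(resolve_router_policy({}))
print(resolve_router_policy({"ROUTER_POLICY": "round_robin"}))
```

Passing a mapping explicitly keeps the helper easy to test without touching the process environment.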

Development

pdm install
pdm run pytest

The project keeps the code in bento_sgl_router.py and lightweight tests under tests/.
