Generate Shenron docker-compose deployments from model config files

These details have been verified by PyPI

Maintainers

JamieDborin1

These details have not been verified by PyPI

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
- Rust

Project description

Shenron

Shenron now ships as a config-driven generator for production LLM docker-compose deployments.

shenron reads a model config YAML and generates:

- docker-compose.yml
- .generated/onwards_config.json
- .generated/prometheus.yml
- .generated/scouter_reporter.env
- .generated/engine_start.sh
- .generated/engine_start_N.sh (one per model) and .generated/sglangmux_start.sh, when the config sets models:
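
As a quick sanity check after generation, the file list above can be turned into a small script. This is a minimal sketch, not part of shenron: the helper name `missing_artifacts` is hypothetical, and it assumes the single-model layout (a config with models: would add the per-model scripts and the mux launcher).

```python
from pathlib import Path

# Paths copied from the list above. Assumption: single-model layout;
# a config with `models:` would add per-model scripts and sglangmux_start.sh.
EXPECTED_ARTIFACTS = [
    "docker-compose.yml",
    ".generated/onwards_config.json",
    ".generated/prometheus.yml",
    ".generated/scouter_reporter.env",
    ".generated/engine_start.sh",
]


def missing_artifacts(output_dir: str = ".") -> list[str]:
    """Return the expected artifact paths that are not files under output_dir."""
    root = Path(output_dir)
    return [rel for rel in EXPECTED_ARTIFACTS if not (root / rel).is_file()]
```

An empty return value means every listed file was found; anything else names the files to look for before running docker compose up.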

Quick Start

uv pip install shenron
shenron get
docker compose up -d

shenron get reads a per-release config index asset, shows available configs with arrow-key selection, downloads the chosen config, and generates deployment artifacts in the current directory. Using --release latest also rewrites shenron_version in the downloaded config to latest. You can also override config values on download with:

--api-key (writes api_key)
--scouter-api-key (writes scouter_ingest_api_key)
--scouter-collector-instance (writes scouter_collector_instance; alias: --scouter-colector-instance)

By default, shenron get pulls release configs from doublewordai/shenron-configs.

shenron . still works and expects exactly one config YAML (*.yml or *.yaml) in the current directory, unless you pass a config file path directly.
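
The "exactly one config YAML" rule can be pictured with a short sketch. This is an illustration only, not shenron's actual lookup: the function name `find_single_config` is hypothetical, and it counts every *.yml or *.yaml file in the directory, whereas the real CLI may well ignore generated files such as docker-compose.yml.

```python
from pathlib import Path


def find_single_config(directory: str = ".") -> Path:
    """Return the only *.yml / *.yaml file in `directory`, or raise ValueError."""
    root = Path(directory)
    # Collect both accepted extensions; sort so error messages are deterministic.
    candidates = sorted(
        p for p in root.iterdir()
        if p.is_file() and p.suffix in {".yml", ".yaml"}
    )
    if len(candidates) != 1:
        names = ", ".join(p.name for p in candidates) or "none"
        raise ValueError(
            f"expected exactly one config YAML in {root}, "
            f"found {len(candidates)}: {names}"
        )
    return candidates[0]
```

Passing a config file path directly, as described above, sidesteps this lookup entirely.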

Configs

Repo configs are stored in configs/.

Available starter configs:

configs/Qwen06B-cu126-TP1.yml
configs/Qwen06B-cu129-TP1.yml
configs/Qwen06B-cu130-TP1.yml
configs/Qwen30B-A3B-cu126-TP1.yml
configs/Qwen30B-A3B-cu129-TP1.yml
configs/Qwen30B-A3B-cu129-TP2.yml
configs/Qwen30B-A3B-cu130-TP2.yml
configs/Qwen235-A22B-cu129-TP2.yml
configs/Qwen235-A22B-cu129-TP4.yml
configs/Qwen235-A22B-cu130-TP2.yml

These starter configs use the same defaults that were previously hardcoded in docker/run_docker_compose.sh.

Engine selection and args:

- engine: vllm or sglang (default: vllm)
- vllm_args: vLLM CLI args appended after core settings (use for --gpu-memory-utilization, --scheduling-policy, --tool-call-parser, --override-generation-config, etc.)
- sglang_args: SGLang CLI args appended after core settings (use for --tp, --dp, --ep, --enable-dp-attention, etc.)
- sglang_use_cuda_ipc_transport: when true, exports SGLANG_USE_CUDA_IPC_TRANSPORT=1 before launching SGLang
- models: optional per-model overrides for multi-model SGLang mux mode
- sglangmux_listen_port, sglangmux_host, sglangmux_upstream_timeout_secs, sglangmux_model_ready_timeout_secs, sglangmux_model_switch_timeout_secs, sglangmux_log_dir: optional sglangmux settings (hyphenated aliases like sglangmux-listen-port are also accepted)

vllm_args and sglang_args accept YAML scalars (string/number/bool). If you need to pass a structured value (like --override-generation-config), provide a YAML mapping and it will be JSON-encoded.
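
To make the scalar versus mapping distinction concrete, here is a minimal sketch of how such an args list could be flattened into command-line tokens. It is not shenron's implementation: the function name `render_cli_args` is hypothetical, the list shape follows the sglang_args example in this README, and the lowercase rendering of booleans is an assumption.

```python
import json


def render_cli_args(items: list) -> list[str]:
    """Turn parsed YAML arg items into CLI tokens (illustrative sketch)."""
    rendered = []
    for item in items:
        if isinstance(item, dict):
            # Structured values become a single JSON-encoded argument.
            rendered.append(json.dumps(item))
        elif isinstance(item, bool):
            # Assumption: booleans are rendered lowercase, as in YAML.
            # Checked before the generic branch because bool is an int subclass.
            rendered.append("true" if item else "false")
        else:
            # Strings and numbers pass through as plain text.
            rendered.append(str(item))
    return rendered


tokens = render_cli_args(
    ["--tp", 2, "--override-generation-config", {"temperature": 0.7}]
)
# tokens == ["--tp", "2", "--override-generation-config", '{"temperature": 0.7}']
```

The point of the JSON step is that the engine receives one well-formed argument rather than a YAML mapping it cannot parse.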

Single Config `models:` Schema (SGLang + sglangmux)

When models: is set, Shenron generates one engine launch script per model plus a mux launcher:

engine: sglang
sglangmux_listen_port: 8100
sglangmux_host: 0.0.0.0
sglangmux_upstream_timeout_secs: 120
sglangmux_model_ready_timeout_secs: 600
sglangmux_model_switch_timeout_secs: 120
sglangmux_log_dir: /tmp/sglangmux

models:
- model_name: Qwen/Qwen3-0.6B
  vllm_port: 8001
  api_key: sk-model-a
  sglang_args: [--tp, 1]
- model_name: Qwen/Qwen3-30B-A3B
  vllm_port: 8002
  api_key: sk-model-b
  sglang_use_cuda_ipc_transport: true
  sglang_args: [--tp, 2]

Rules in models: mode:

engine must be sglang
each models[*].model_name must be unique
each models[*].vllm_port must be set and unique
sglangmux_listen_port must be different from all model ports
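
The four rules above translate directly into checks on the parsed config. The following is a sketch under stated assumptions rather than shenron's own validator: `validate_models_config` is a hypothetical name, the config is assumed to be already loaded into a plain dict, and the error strings are made up for illustration.

```python
def validate_models_config(config: dict) -> list[str]:
    """Return a list of rule violations for a config that sets `models:`."""
    errors = []
    models = config.get("models") or []

    # Rule 1: engine must be sglang (the documented default is vllm).
    if config.get("engine", "vllm") != "sglang":
        errors.append("engine must be sglang")

    # Rule 2: model names must be unique.
    names = [m.get("model_name") for m in models]
    if len(set(names)) != len(names):
        errors.append("each models[*].model_name must be unique")

    # Rule 3: every model needs a port, and ports must not repeat.
    ports = [m.get("vllm_port") for m in models]
    if any(p is None for p in ports):
        errors.append("each models[*].vllm_port must be set")
    set_ports = [p for p in ports if p is not None]
    if len(set(set_ports)) != len(set_ports):
        errors.append("each models[*].vllm_port must be unique")

    # Rule 4: the mux listen port must not collide with a model port.
    if config.get("sglangmux_listen_port") in set_ports:
        errors.append("sglangmux_listen_port must be different from all model ports")

    return errors
```

For the example config shown earlier (mux on 8100, models on 8001 and 8002) this returns an empty list.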

In this mode, .generated/onwards_config.json contains one target per model and all target URLs point to http://vllm:<sglangmux_listen_port>/v1.
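
The routing described here can be sketched in a few lines. The real schema of onwards_config.json is not shown in this README, so the snippet only derives the per-model URL; the function name `mux_target_urls` and the returned dict shape are assumptions made for illustration.

```python
def mux_target_urls(config: dict) -> dict[str, str]:
    """Map each model name to the shared sglangmux URL (illustrative only)."""
    # Every target points at the mux listen port, not at the model's own port.
    url = f"http://vllm:{config['sglangmux_listen_port']}/v1"
    return {m["model_name"]: url for m in config["models"]}


targets = mux_target_urls(
    {
        "sglangmux_listen_port": 8100,
        "models": [
            {"model_name": "Qwen/Qwen3-0.6B", "vllm_port": 8001},
            {"model_name": "Qwen/Qwen3-30B-A3B", "vllm_port": 8002},
        ],
    }
)
# Both entries resolve to http://vllm:8100/v1
```

The per-model vllm_port values stay internal to the engine container; clients only ever see the mux port.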

Generated Compose Behavior

docker-compose.yml is fully rendered from config values:

model image tag from shenron_version + cuda_version
onwards image tag from onwards_version
service ports from config
no ${SHENRON_VERSION} placeholders
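
One way to confirm that a generated file contains no leftover placeholders is to scan it for `${NAME}` patterns. This is a minimal sketch, not a shenron feature: `find_placeholders` is a hypothetical helper, and the regex only matches the plain `${NAME}` form, not Compose's `${NAME:-default}` variants.

```python
import re

# Matches simple ${NAME} interpolation markers only.
PLACEHOLDER = re.compile(r"\$\{[A-Za-z_][A-Za-z0-9_]*\}")


def find_placeholders(compose_text: str) -> list[str]:
    """Return the distinct ${NAME} placeholders found in the text, sorted."""
    return sorted(set(PLACEHOLDER.findall(compose_text)))
```

Running it over the contents of docker-compose.yml should give an empty list if the file is fully rendered.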

Development

# Run tests (Rust + CLI + compose checks)
./scripts/ci.sh

# Install local package for manual testing
python3 -m pip install -e .

# Generate from repo config
shenron configs/Qwen06B-cu126-TP1.yml --output-dir /tmp/shenron-test

Release Automation

release-assets.yaml publishes stamped config files (*.yml) as release assets.
release-assets.yaml also publishes configs-index.txt, which powers shenron get.
release-assets.yaml mirrors *.yml + configs-index.txt into ${OWNER}/shenron-configs under the same tag as the main shenron release.
Set CONFIGS_REPO_TOKEN (or reuse RELEASE_PLEASE_TOKEN) with write access to the configs repo release assets; optional repo variable CONFIGS_REPO overrides the default target (${OWNER}/shenron-configs).
python-release.yaml builds/publishes the shenron package to PyPI on release tags.
Docker image build/push via Depot remains in ci.yaml and still triggers when docker/Dockerfile.cu* or VERSION changes.

License

MIT, see LICENSE.

Release history

0.30.12

Apr 22, 2026

0.30.11

Apr 21, 2026

0.30.10

Apr 21, 2026

0.30.9

Apr 21, 2026

0.30.8

Apr 20, 2026

0.30.7

Apr 17, 2026

0.30.6

Apr 17, 2026

0.30.5

Apr 17, 2026

0.30.4

Apr 16, 2026

0.30.3

Apr 16, 2026

0.30.2

Apr 15, 2026

0.30.1

Apr 13, 2026

0.30.0

Apr 13, 2026

0.29.2

Apr 13, 2026

0.29.1

Apr 13, 2026

0.29.0

Apr 10, 2026

0.28.0

Apr 10, 2026

0.27.3

Apr 10, 2026

0.27.2

Apr 10, 2026

0.27.1

Apr 10, 2026

0.27.0

Apr 10, 2026

0.26.1

Apr 10, 2026

0.26.0

Apr 10, 2026

0.25.0

Apr 10, 2026

0.24.0

Apr 9, 2026

0.23.3

Apr 9, 2026

0.23.2

Apr 9, 2026

0.23.1

Apr 9, 2026

0.23.0

Apr 9, 2026

0.22.0

Apr 9, 2026

0.21.1

Apr 9, 2026

0.21.0

Apr 8, 2026

0.20.16

Apr 8, 2026

0.20.15

Apr 3, 2026

0.20.14

Apr 3, 2026

0.20.13

Apr 3, 2026

0.20.12

Apr 3, 2026

0.20.11

Apr 3, 2026

0.20.10

Apr 3, 2026

0.20.9

Apr 3, 2026

0.20.8

Apr 3, 2026

0.20.7

Apr 3, 2026

0.20.6

Apr 2, 2026

0.20.5

Apr 1, 2026

0.20.3

Mar 24, 2026

0.20.2

Mar 23, 2026

0.20.1

Mar 18, 2026

0.20.0

Mar 18, 2026

0.19.4

Mar 18, 2026

0.19.3

Mar 13, 2026

0.19.2

Mar 12, 2026

0.19.1

Mar 12, 2026

0.19.0

Mar 10, 2026

0.18.5

Mar 10, 2026

0.18.4

Mar 10, 2026

0.18.3

Mar 5, 2026

0.18.2

Mar 5, 2026

0.18.1

Mar 5, 2026

0.18.0

Mar 4, 2026

0.17.0

Mar 2, 2026

0.16.1

Feb 27, 2026

0.16.0

Feb 26, 2026

0.15.0

Feb 26, 2026

0.14.0

Feb 26, 2026

0.13.0

Feb 23, 2026

This version

0.12.0

Feb 23, 2026

0.11.1

Feb 18, 2026

0.11.0

Feb 17, 2026

0.10.1

Feb 17, 2026

0.10.0

Feb 12, 2026

0.9.0

Feb 12, 2026

0.8.2

Feb 12, 2026

0.8.1

Feb 12, 2026

0.7.0

Feb 12, 2026

0.6.3

Feb 11, 2026

0.6.2

Feb 11, 2026

0.6.1

Feb 11, 2026

0.5.3

Feb 11, 2026

Download files

Source Distribution

shenron-0.12.0.tar.gz (39.5 kB)

Uploaded Feb 23, 2026 Source

Built Distribution

shenron-0.12.0-cp311-cp311-manylinux_2_34_x86_64.whl (501.7 kB)

Uploaded Feb 23, 2026 CPython 3.11, manylinux: glibc 2.34+ x86-64

File details

Details for the file shenron-0.12.0.tar.gz.

File metadata

Download URL: shenron-0.12.0.tar.gz
Upload date: Feb 23, 2026
Size: 39.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for shenron-0.12.0.tar.gz
Algorithm	Hash digest
SHA256	`6a6998c10f335f9e68d869ee4aa2d2c3f4beb742ea7b0b946613d56d6941ebac`
MD5	`37855c62fd8db875587022f6f77098c3`
BLAKE2b-256	`fff8cccf4f8736c77364adca41f507a2d0d7fb0c6b84077bc4eeb29e1222ddc4`

Provenance

The following attestation bundles were made for shenron-0.12.0.tar.gz:

Publisher: python-release.yaml on doublewordai/shenron

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: shenron-0.12.0.tar.gz
- Subject digest: 6a6998c10f335f9e68d869ee4aa2d2c3f4beb742ea7b0b946613d56d6941ebac
- Sigstore transparency entry: 981787730
- Sigstore integration time: Feb 23, 2026
Source repository:
- Permalink: doublewordai/shenron@f9b839f52ab0f25137991d744af39ba8e598ca10
- Branch / Tag: refs/tags/v0.12.0
- Owner: https://github.com/doublewordai
- Access: internal
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-release.yaml@f9b839f52ab0f25137991d744af39ba8e598ca10
- Trigger Event: push

File details

Details for the file shenron-0.12.0-cp311-cp311-manylinux_2_34_x86_64.whl.

File metadata

Download URL: shenron-0.12.0-cp311-cp311-manylinux_2_34_x86_64.whl
Upload date: Feb 23, 2026
Size: 501.7 kB
Tags: CPython 3.11, manylinux: glibc 2.34+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for shenron-0.12.0-cp311-cp311-manylinux_2_34_x86_64.whl
Algorithm	Hash digest
SHA256	`276968807c91a3d522ba80725fdcd61cde75752363ffd8654ecb95cefca57234`
MD5	`b33fb73e06e5704e0b741bce22ee05c1`
BLAKE2b-256	`286ae9d5476202e268ea589a1bb968c4495137f467424fc6fcc761f5a7430661`

Provenance

The following attestation bundles were made for shenron-0.12.0-cp311-cp311-manylinux_2_34_x86_64.whl:

Publisher: python-release.yaml on doublewordai/shenron

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: shenron-0.12.0-cp311-cp311-manylinux_2_34_x86_64.whl
- Subject digest: 276968807c91a3d522ba80725fdcd61cde75752363ffd8654ecb95cefca57234
- Sigstore transparency entry: 981787785
- Sigstore integration time: Feb 23, 2026
Source repository:
- Permalink: doublewordai/shenron@f9b839f52ab0f25137991d744af39ba8e598ca10
- Branch / Tag: refs/tags/v0.12.0
- Owner: https://github.com/doublewordai
- Access: internal
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-release.yaml@f9b839f52ab0f25137991d744af39ba8e598ca10
- Trigger Event: push
