A lightweight CLI tool for managing LLMBoost model images and environments.

These details have not been verified by PyPI

Project links

Project description

LLMBoost Hub (lbh)

Manage LLMBoost™ model containers and environments to run, serve, and tune large language models.

Pre-requisites

Dependencies:

Python 3.10+
Docker 27.3.1+
NVIDIA GPU: nvidia-docker2 or AMD GPU: ROCm 6.3+

Install LLMBoost Hub:

pip install llmboost_hub

# Verify installation
lbh --version

Upgrade:

pip install --upgrade llmboost_hub

Note: This document uses lbh interchangeably with llmboost_hub.

Login to Hugging Face and Docker:

huggingface-cli login     # or set HF_TOKEN env var
docker login -u <your_docker_username>

Quick start

Fetch list of supported models from remote (automatically authenticates LLMBoost license):

lbh fetch # only needed once a day

One-liner to start serving a model (automatically downloads image and model, if needed):

lbh serve <Repo/Model-Name> # Full model name (including repository or organization name) must match the name from https://huggingface.co

For example:

lbh serve meta-llama/Llama-3.1-1B-Instruct

Basic workflow:

lbh fetch # authenticate LLMBoost license
lbh list [model] # list supported models; case-insensitive, regex-style match
lbh prep <Repo/Model-Name> # download image and model assets
lbh run <Repo/Model-Name> # start container
lbh serve <Repo/Model-Name> # start LLMBoost server inside container
lbh test <Repo/Model-Name> # send test request
lbh stop <Repo/Model-Name> # stop container

For more details, see the Command Reference section below. For more details, see the Configuration Options section below.

Shell completions:

eval "$(lbh completions)"                 # current shell
lbh completions [--venv|--profile]        # persist for venv or profile

Configuration Options

llmboost_hub uses the following environment variables:

LBH_HOME: base directory for all llmboost_hub data. (defaults: (host) ~/.llmboost_hub <- (container) /llmboost_hub)
LBH_MODELS: directory for storing and retrieving model assets. (default: $LBH_HOME/models)
LBH_WORKSPACE: mounted user workspace for manually transferring files out of containers. (defaults: (host) $LBH_HOME/workspace <- (container) /user_workspace)

Notes:

A configuation file is stored at $LBH_HOME/config.yaml with all the above mentioned settings (and other advanced settings).
- Precedence order for settings: Environment variables > Configuration file > Defaults
LBH_HOME can only be changed by setting the env var (or in ~/.bashrc).
- WARNING: Changing LBH_HOME will cause a new data directory to be used, and all configuration will be reset.
HF_TOKEN is injected automatically when set.

Command Reference

Use lbh -h for a summary of all commands, and lbh [COMMAND] -h for help with a specific command and all available options.

Use lbh -v [COMMAND] for verbose output with any command; shows useful diagnostic info for troubleshooting.

lbh login
- Reads $LBH_LICENSE_PATH or prompts for a token.
- Validates online and saves the license file.
lbh fetch
- Fetches latest models supported by LLMBoost.
- Filters to available GPU.
lbh list [model]
- Lists local images joined with lookup.
- Shows status:
  - pending: model not prepared; docker image or model assets missing
  - stopped: model prepared but container not running
  - running: container running but idling
  - initializing: container running and starting LLMBoost server
  - serving: LLMBoost server ready to accept requests
  - tuning: autotuner running
lbh prep <Repo/Model-Name> [--only-verify] [--fresh]
- Pulls the image and downloads HF assets.
- --only-verify checks digests and sizes.
- --fresh removes existing image and re-downloads model assets from Hugging Face.
lbh run <Repo/Model-Name> [OPTIONS] -- [DOCKER FLAGS...]
- Resolves and starts the container detached.
- Mounts $LBH_HOME and $LBH_WORKSPACE. Injects HF_TOKEN.
- NVIDIA GPUs use --gpus all. AMD maps /dev/dri and /dev/kfd.
- Useful options:
  - --image <image>: override docker image.
  - --model_path <model_path>: override model assets path.
  - --restart: restarts container, if already running.
  - Pass extra docker flags after --.
lbh serve <Repo/Model-Name> [--host 0.0.0.0] [--port 8011] [--detached] [--force] -- [LLMBOOST ARGS...]
- Starts LLMBoost server inside the container.
- Waits until ready, unless --detached.
- --force skips GPU utilization checks (use if GPU utilization is incorrectly reported by NVidia or AMD GPU drivers).
- Pass extra llmboost serve arguments after --.
lbh test <Repo/Model-Name> [--query "..."] [-t N] [--host 127.0.0.1] [--port 8011]
- Sends a test request to /v1/chat/completions.
lbh attach <Repo/Model-Name> [-c <container name or ID>]
- Opens a shell in the running container.
lbh stop <Repo/Model-Name> [-c <container name or ID>]
- Stops the container.
lbh status [model]
- Shows status and model.
lbh tune <Repo/Model-Name> [--metrics throughput] [--detached] [--image <image>]
- Runs the autotuner.
- Store results to $LBH_HOME/inference.db, and loads this on next lbh serve.

Support

Docs: https://docs.mangoboost.io/llmboost_hub/
Website: https://docs.mangoboost.io/llmboost_hub/
Email: support@mangoboost.io

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.0

Jan 5, 2026

This version

0.2.0rc0 pre-release

Dec 9, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

lbh-0.2.0rc0-py3-none-any.whl (53.6 kB view details)

Uploaded Dec 9, 2025 Python 3

File details

Details for the file lbh-0.2.0rc0-py3-none-any.whl.

File metadata

Download URL: lbh-0.2.0rc0-py3-none-any.whl
Upload date: Dec 9, 2025
Size: 53.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.14

File hashes

Hashes for lbh-0.2.0rc0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8270d5971a60226349c447b6976646955d24efb0fad6f60cb1707b884dea5cb9`
MD5	`ffae3b2ebebf0baad3cdd115c67c98c4`
BLAKE2b-256	`d57a0ecbbed67d4598aef809cda7ee281b443660bb1cbb81b332c8fcf527d859`

See more details on using hashes here.

lbh 0.2.0rc0

Navigation

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Project description

LLMBoost Hub (lbh)

Pre-requisites

Dependencies:

Install LLMBoost Hub:

Login to Hugging Face and Docker:

Quick start

Basic workflow:

Shell completions:

Configuration Options

Command Reference

Support

Project details

Verified details

Owner

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes