Skip to main content

Cluster resource management bot for IM platforms

Project description

lockbot

Cluster resource management bot for IM platforms (e.g., Baidu InfoFlow).

Lock and unlock GPU devices, cluster nodes, and queue slots via chat commands. Supports both standalone Flask deployment and a full platform mode with FastAPI + Vue.js frontend.

中文文档 | Live Demo

Features

  • Device Lock Bot — Lock/unlock individual GPUs or devices on a cluster
  • Node Lock Bot — Lock/unlock entire cluster nodes
  • Queue Bot — Manage a queue for resource allocation with booking and preemption
  • Platform Mode — Web UI (Vue 3 + Element Plus) for managing multiple bots, user authentication (JWT), role-based access control, and real-time logs
  • State Persistence — Bot state survives restarts (JSON file)
  • Bilingual — English and Chinese UI and bot responses

Quick Start — Platform Mode (Recommended)

Full management platform with Web UI, multi-bot orchestration, user authentication, and admin panel.

  1. Install:
pip install lockbot
  1. Set environment variables:
export JWT_SECRET="your-jwt-secret"
export ENCRYPTION_KEY="your-fernet-key"    # python -c "from cryptography.fernet import Fernet; print(Fernet.generate_key().decode())"
export DEV_MODE="true"                      # dev mode, auto-create admin user
  1. Start:
# Backend
uvicorn lockbot.backend.app.main:app --host 0.0.0.0 --port 8000 --reload

# Frontend (another terminal)
cd frontend && npm install && npm run dev
  1. Open http://localhost:8000 in your browser.

Docker

docker build -f docker/Dockerfile -t lockbot .

# Platform mode (default)
docker run -d -p 8000:8000 \
  -e JWT_SECRET=your-secret \
  -e ENCRYPTION_KEY=your-fernet-key \
  -v lockbot-data:/app/python/lockbot/data \
  -v lockbot-state:/data/bots \
  lockbot

Bot Configuration

Key Description Default
BOT_TYPE DEVICE, NODE, or QUEUE (required)
BOT_NAME Bot instance name demo_bot
CLUSTER_CONFIGS Cluster layout (dict or list) {}
TOKEN Bot signature verification token ""
AESKEY Message decryption AES key ""
WEBHOOK_URL Message webhook URL ""
PORT Server listen port 8090
DEFAULT_DURATION Default lock duration (seconds) 7200 (2h)
MAX_LOCK_DURATION Max lock duration (seconds) -1 (unlimited)
EARLY_NOTIFY Notify before lock expiry false

See python/lockbot/core/config.py for the full configuration reference.

Commands

Command Description
lock <node> [duration] Exclusive lock (e.g., lock gpu0 3d, lock node1 30m)
slock <node> [duration] Shared lock (multiple users)
unlock <node> / free <node> Release a specific node
unlock / free Release all your nodes
kickout <node> Force release (admin)
book <node> [duration] Queue: book a node for later
take <node> Queue: take the current lock
<node> Query current usage
help Show usage

Development

# Install dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Lint + format check
ruff check python/ tests/
ruff format --check python/ tests/

Standalone Mode

Single-bot deployment with a lightweight Flask webhook server.

Device Lock Bot (per-GPU locking):

from lockbot.core.bot_instance import BotInstance
from lockbot.core.entry import create_app

instance = BotInstance("DEVICE", {
    "BOT_NAME": "my-gpu-bot",
    "WEBHOOK_URL": "https://your-webhook-url",
    "TOKEN": "your-bot-token",
    "AESKEY": "your-aes-key",
    "CLUSTER_CONFIGS": {
        "node0": ["A800", "A800", "H100"],
        "node1": ["A800", "H100"],
    },
})

app = create_app(bot=instance.bot, bot_name="my-gpu-bot", port=8000)
app.run(host="0.0.0.0", port=8000)

Node Lock Bot / Queue Bot (per-node locking or queue scheduling):

from lockbot.core.bot_instance import BotInstance
from lockbot.core.entry import create_app

instance = BotInstance("NODE", {       # or "QUEUE" for queue scheduling
    "BOT_NAME": "my-node-bot",
    "WEBHOOK_URL": "https://your-webhook-url",
    "TOKEN": "your-bot-token",
    "AESKEY": "your-aes-key",
    "CLUSTER_CONFIGS": ["node0", "node1", "node2", "node3"],
})

app = create_app(bot=instance.bot, bot_name="my-node-bot", port=8000)
app.run(host="0.0.0.0", port=8000)

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

lockbot-2.0.0.tar.gz (54.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

lockbot-2.0.0-py3-none-any.whl (67.1 kB view details)

Uploaded Python 3

File details

Details for the file lockbot-2.0.0.tar.gz.

File metadata

  • Download URL: lockbot-2.0.0.tar.gz
  • Upload date:
  • Size: 54.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lockbot-2.0.0.tar.gz
Algorithm Hash digest
SHA256 5395c5b3343fa980ed3ee231009cd2896d5e3ca540dabb397d3fdb09e50043af
MD5 ea2cfad3c9f3440e7741f96ddd87b939
BLAKE2b-256 cf7435d7a1f60997987de53a8abea6047f5322aaad3d26350c7ed9dee8b164b6

See more details on using hashes here.

Provenance

The following attestation bundles were made for lockbot-2.0.0.tar.gz:

Publisher: publish.yml on dynamicheart/lockbot

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file lockbot-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: lockbot-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 67.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for lockbot-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5ffec990ce99e70c47bef221d9f0d49c55813a154e0577590e6c8888552132bf
MD5 8b3ed85d957e0354846e8f5143f4ee1e
BLAKE2b-256 1c721b674b44c1ff130b4f3f855b9eddcc4cf2fd08a3f185a45b516327deb464

See more details on using hashes here.

Provenance

The following attestation bundles were made for lockbot-2.0.0-py3-none-any.whl:

Publisher: publish.yml on dynamicheart/lockbot

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page