Skip to main content

A simple distributed task execution system for GPU servers

Project description

PyLet

PyPI version Python versions License

A simple distributed task execution system for GPU servers. Like Ray/K8s but much simpler.

Install

pip install pylet

For development:

git clone https://github.com/ServerlessLLM/pylet.git
cd pylet
pip install -e ".[dev]"

Quick Start

CLI

# Terminal 1: Start head node
pylet start

# Terminal 2: Start worker node with GPUs
pylet start --head localhost:8000 --gpu-units 4

# Terminal 3: Submit an instance
pylet submit 'vllm serve Qwen/Qwen2.5-1.5B-Instruct --port $PORT' \
    --gpu-units 1 --name my-vllm

# Check status
pylet get-instance --name my-vllm

# Get endpoint for inference
pylet get-endpoint --name my-vllm
# Output: 192.168.1.10:15600

# View logs
pylet logs <instance-id>

# Cancel
pylet cancel <instance-id>

Python API

import pylet

# Connect to head node
pylet.init()  # or pylet.init("http://head:8000")

# Submit an instance
instance = pylet.submit(
    "vllm serve Qwen/Qwen2.5-1.5B-Instruct --port $PORT",
    name="my-vllm",
    gpu=1,
    memory=4096,
)

# Wait for it to start
instance.wait_running()
print(f"Endpoint: {instance.endpoint}")

# Get logs
print(instance.logs())

# Cancel when done
instance.cancel()
instance.wait()

For local testing:

import pylet

with pylet.local_cluster(workers=2, gpu_per_worker=1) as cluster:
    instance = pylet.submit("nvidia-smi", gpu=1)
    instance.wait()
    print(instance.logs())

Async API available via import pylet.aio as pylet.

See examples/README.md for more detailed examples including vLLM and SGLang.

Commands

Command Description
pylet start Start head node
pylet start --head <ip:port> --gpu-units N Start worker with N GPUs
pylet submit <cmd> --gpu-units N --name <name> Submit instance
pylet get-instance --name <name> Get instance status
pylet get-endpoint --name <name> Get instance endpoint (host:port)
pylet logs <id> View instance logs
pylet logs <id> --follow Follow logs in real-time
pylet cancel <id> Cancel instance
pylet list-workers List registered workers

Key Features

  • Simple: No containers, no complex configs. Just pylet start and pylet submit.
  • GPU-aware: Automatic GPU allocation via CUDA_VISIBLE_DEVICES.
  • Fine-grained GPU control: Request specific GPUs by index, enable GPU sharing for daemons.
  • Service discovery: Instances get a PORT env var; endpoint available via get-endpoint.
  • Real-time logs: Stream logs from running instances.
  • Graceful shutdown: SIGTERM with configurable grace period before SIGKILL.

Why PyLet?

PyLet is the greatest common denominator for GPU cluster orchestration - minimal, reliable, and unopinionated. It manages one thing: instances (processes with GPU allocation).

  • No pods, replicas, services, or deployments - just instances
  • Applications compose instances however they need (via labels, custom scheduling)
  • Fine-grained GPU scheduling because research workloads need it
  • Simple enough to understand, reliable enough to depend on

See docs/architecture.md for design details. See CONTRIBUTING.md for contribution philosophy.

Requirements

  • Python 3.9+
  • Linux (tested on Ubuntu)

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pylet-0.5.0.tar.gz (95.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pylet-0.5.0-py3-none-any.whl (62.8 kB view details)

Uploaded Python 3

File details

Details for the file pylet-0.5.0.tar.gz.

File metadata

  • Download URL: pylet-0.5.0.tar.gz
  • Upload date:
  • Size: 95.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pylet-0.5.0.tar.gz
Algorithm Hash digest
SHA256 4a30e0d886d8fb7e3f870ff403b495175a0c714744b2e3244752105f3a1f724c
MD5 08c015a1d3e511a13640b10e669602bd
BLAKE2b-256 a95f2e68f51bfc10d6607dc98c24f6acc71923510560888279bb19551c3e7d8d

See more details on using hashes here.

Provenance

The following attestation bundles were made for pylet-0.5.0.tar.gz:

Publisher: release.yml on ServerlessLLM/pylet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pylet-0.5.0-py3-none-any.whl.

File metadata

  • Download URL: pylet-0.5.0-py3-none-any.whl
  • Upload date:
  • Size: 62.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pylet-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0ddd41ca7cd9cf46e3d4374142595cada77b64aebbb522ae3bab68b13d5eec64
MD5 fa6e784282a530b2ba1824d846e04366
BLAKE2b-256 4c57101ac5235a545db409e70e13cb8e6f8064f0859c96992d2f2a40175dd4db

See more details on using hashes here.

Provenance

The following attestation bundles were made for pylet-0.5.0-py3-none-any.whl:

Publisher: release.yml on ServerlessLLM/pylet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page