Skip to main content

A simple distributed task execution system for GPU servers

Project description

PyLet

PyPI version Python versions License

A simple distributed task execution system for GPU servers. Like Ray/K8s but much simpler.

Install

pip install pylet

For development:

git clone https://github.com/ServerlessLLM/pylet.git
cd pylet
pip install -e ".[dev]"

Quick Start

CLI

# Terminal 1: Start head node
pylet start

# Terminal 2: Start worker node with GPUs
pylet start --head localhost:8000 --gpu-units 4

# Terminal 3: Submit an instance
pylet submit 'vllm serve Qwen/Qwen2.5-1.5B-Instruct --port $PORT' \
    --gpu-units 1 --name my-vllm

# Check status
pylet get-instance --name my-vllm

# Get endpoint for inference
pylet get-endpoint --name my-vllm
# Output: 192.168.1.10:15600

# View logs
pylet logs <instance-id>

# Cancel
pylet cancel <instance-id>

Python API

import pylet

# Connect to head node
pylet.init()  # or pylet.init("http://head:8000")

# Submit an instance
instance = pylet.submit(
    "vllm serve Qwen/Qwen2.5-1.5B-Instruct --port $PORT",
    name="my-vllm",
    gpu=1,
    memory=4096,
)

# Wait for it to start
instance.wait_running()
print(f"Endpoint: {instance.endpoint}")

# Get logs
print(instance.logs())

# Cancel when done
instance.cancel()
instance.wait()

For local testing:

import pylet

with pylet.local_cluster(workers=2, gpu_per_worker=1) as cluster:
    instance = pylet.submit("nvidia-smi", gpu=1)
    instance.wait()
    print(instance.logs())

Async API available via import pylet.aio as pylet.

See examples/README.md for more detailed examples including vLLM and SGLang.

Commands

Command Description
pylet start Start head node
pylet start --head <ip:port> --gpu-units N Start worker with N GPUs
pylet submit <cmd> --gpu-units N --name <name> Submit instance
pylet get-instance --name <name> Get instance status
pylet get-endpoint --name <name> Get instance endpoint (host:port)
pylet logs <id> View instance logs
pylet logs <id> --follow Follow logs in real-time
pylet cancel <id> Cancel instance
pylet list-workers List registered workers

Key Features

  • Simple: No containers, no complex configs. Just pylet start and pylet submit.
  • GPU-aware: Automatic GPU allocation via CUDA_VISIBLE_DEVICES.
  • Service discovery: Instances get a PORT env var; endpoint available via get-endpoint.
  • Real-time logs: Stream logs from running instances.
  • Graceful shutdown: SIGTERM with configurable grace period before SIGKILL.

Requirements

  • Python 3.9+
  • Linux (tested on Ubuntu)

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pylet-0.4.0.tar.gz (81.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pylet-0.4.0-py3-none-any.whl (57.4 kB view details)

Uploaded Python 3

File details

Details for the file pylet-0.4.0.tar.gz.

File metadata

  • Download URL: pylet-0.4.0.tar.gz
  • Upload date:
  • Size: 81.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pylet-0.4.0.tar.gz
Algorithm Hash digest
SHA256 f0ad64890448297b3a01f4808cac86f8c168094ecfd5080147ee5060c7b6eded
MD5 4bdea9d4a6d630966feba8293ad0487f
BLAKE2b-256 90fd48ce24fdb13df1157a39a6952e6c88a0c3012f9a579962498edfb8d802c4

See more details on using hashes here.

Provenance

The following attestation bundles were made for pylet-0.4.0.tar.gz:

Publisher: release.yml on ServerlessLLM/pylet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pylet-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: pylet-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 57.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pylet-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 23718a81a129b2b1e19f1199d5c5ff4935626448a1870ac387cff1c47895f278
MD5 2c8d5bcaba5a80c064bb91ef7b0a9bb8
BLAKE2b-256 fced84045d66843f973f890539fb463c9fdc5add37cc6787edbf6c8b166d8b69

See more details on using hashes here.

Provenance

The following attestation bundles were made for pylet-0.4.0-py3-none-any.whl:

Publisher: release.yml on ServerlessLLM/pylet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page