Skip to main content

Kernel Library for SGLang

Project description

SGL Kernel

Kernel Library for SGLang

PyPI

Installation

For CUDA 11.8:

pip3 install sgl-kernel -i https://docs.sglang.ai/whl/cu118

For CUDA 12.1 or CUDA 12.4:

pip3 install sgl-kernel

Developer Guide

Development Environment Setup

Use Docker to set up the development environment. See Docker setup guide.

Create and enter development container:

docker run -itd --shm-size 32g --gpus all -v $HOME/.cache:/root/.cache --ipc=host --name sglang_zhyncs lmsysorg/sglang:dev /bin/zsh
docker exec -it sglang_zhyncs /bin/zsh

Project Structure

Dependencies

Third-party libraries:

Kernel Development

Steps to add a new kernel:

  1. Implement the kernel in csrc
  2. Expose the interface in include/sgl_kernel_ops.h
  3. Create torch extension in csrc/torch_extension.cc
  4. Update setup.py to include new CUDA source
  5. Expose Python interface in python

Build & Install

Development build:

make build

Note:

The sgl-kernel is rapidly evolving. If you experience a compilation failure, try using make rebuild.

Testing & Benchmarking

  1. Add pytest tests in tests/
  2. Add benchmarks using triton benchmark in benchmark/
  3. Run test suite

Release new version

Update version in pyproject.toml and version.py

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

sgl_kernel-0.0.4-cp39-abi3-manylinux2014_x86_64.whl (9.7 MB view details)

Uploaded CPython 3.9+

File details

Details for the file sgl_kernel-0.0.4-cp39-abi3-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for sgl_kernel-0.0.4-cp39-abi3-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 a50d46cd88f33c9bb98408189ce91d58b90b9b03513ac035acc29378dfcf605d
MD5 c8f118274dc97129b8ce54855a29e7f3
BLAKE2b-256 af54d52b9dc00e6bc36704a0ff595400ac6188a56d9d747880851dd53d30b1da

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page