Skip to main content

SGLang fork of DeepGemm

Project description

Introduction

sgl-deep-gemm is a pypi package built from SGLang's customized branch of DeepGemm. Comparing with origina DeepGemm, it supports the following features to better support SGLang:

  1. ABI support: with the help of tvm-ffi wrappers, a single wheel can run on different python versions.
  2. pypi support: easy installation with pip install sgl-deep-gemm. No need to manually search for wheel links.
  3. Fast iteration: add custom kernels and bump versions at no time.

Usage

To build it locally, run bash build_sgl_deep_gemm.sh, then pip install the wheel generated under dist.

To release a new set of wheels, please contact SGLang team and run the release workflow under SGLang repo

For each major version release (0.X.Y -> 0.(X+1).0), a new branch should be created (release/v0.(X+1).0) for stability purpose.

For any incoming pull requests, it should be rebased upon dev branch. Any newly added or modified tests should be put under sgl_deep_gemm/tests

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 f24771b5933530d4e0ec1d00bd2016e5797e13e50b9e6e125f356bb1db66f697
MD5 3db8228004e2a56d7da170895af79d28
BLAKE2b-256 fcd5982e94bac313b0fd14a178f05b4044246c725a51c05d0f58e5f4c87f92f3

See more details on using hashes here.

File details

Details for the file sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 77afc02024363418f4ee469c6d4ed9c5f443787adabd7f025a052681bbe1de8f
MD5 fd1ce4b079d5a9595c9983f9962b2c83
BLAKE2b-256 10d5fc8050e041f5d9c1764d003dc195305b14e0b69a130c82ba5970eefc9ba4

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page