Skip to main content

SGLang fork of DeepGemm

Project description

Introduction

sgl-deep-gemm is a pypi package built from SGLang's customized branch of DeepGemm. Comparing with origina DeepGemm, it supports the following features to better support SGLang:

  1. ABI support: with the help of tvm-ffi wrappers, a single wheel can run on different python versions.
  2. pypi support: easy installation with pip install sgl-deep-gemm. No need to manually search for wheel links.
  3. Fast iteration: add custom kernels and bump versions at no time.

Usage

To build it locally, run bash build_sgl_deep_gemm.sh, then pip install the wheel generated under dist.

To release a new set of wheels, please contact SGLang team and run the release workflow under SGLang repo

For each major version release (0.X.Y -> 0.(X+1).0), a new branch should be created (release/v0.(X+1).0) for stability purpose.

For any incoming pull requests, it should be rebased upon dev branch.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file sgl_deep_gemm-0.1.1-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for sgl_deep_gemm-0.1.1-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 a0dcf2e2afa7c4277ca5f8622d27bad4153ef3bdcfaf2036c765476def601abe
MD5 6ecc22c12ee68fce92c82eb20a992037
BLAKE2b-256 3278b6de540c2a4fc30047f5b3db38c49575afde94c96814cedd1b3819e04571

See more details on using hashes here.

File details

Details for the file sgl_deep_gemm-0.1.1-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sgl_deep_gemm-0.1.1-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 0e8c15b97b65199461d8de754129edc863d8ee33b19686795178e8b76a6757c3
MD5 53fa8ca9ead41f38cb19e25bad05dedd
BLAKE2b-256 30d47b23dcac0c3771d6ec98c9de106a1737d16c3797e0effca84c495e278897

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page