Skip to main content

SGLang fork of DeepGemm

Project description

Introduction

sgl-deep-gemm is a pypi package built from SGLang's customized branch of DeepGemm. Comparing with origina DeepGemm, it supports the following features to better support SGLang:

  1. ABI support: with the help of tvm-ffi wrappers, a single wheel can run on different python versions.
  2. pypi support: easy installation with pip install sgl-deep-gemm. No need to manually search for wheel links.
  3. Fast iteration: add custom kernels and bump versions at no time.

Usage

To build it locally, run bash build_sgl_deep_gemm.sh, then pip install the wheel generated under dist.

To release a new set of wheels, please contact SGLang team and run the release workflow under SGLang repo

For each major version release (0.X.Y -> 0.(X+1).0), a new branch should be created (release/v0.(X+1).0) for stability purpose.

For any incoming pull requests, it should be rebased upon dev branch.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file sgl_deep_gemm-0.1.2-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for sgl_deep_gemm-0.1.2-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 2afa47488de258adb245a7b92e8f7d13d13fd7be9cc5e2396349058b54f40137
MD5 9b1277e1d0b6499180a58a3c67b11137
BLAKE2b-256 d5d88bffb51667ce9ffe73a5a4699a137699edbd29a74a15031f523de1f25ff2

See more details on using hashes here.

File details

Details for the file sgl_deep_gemm-0.1.2-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sgl_deep_gemm-0.1.2-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 42d10698f2ee2a369e920878d623e48605a830898def2cd2eda44a6a75e8271f
MD5 789a224dfa5647ce520b6800037a4cf2
BLAKE2b-256 689ee73bc042ee6b6372f032755f20f2da3f5192bc23893f1d89cd9126f9b2c1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page