Skip to main content

SGLang fork of DeepGemm

Project description

Introduction

sgl-deep-gemm is a pypi package built from SGLang's customized branch of DeepGemm. Comparing with origina DeepGemm, it supports the following features to better support SGLang:

  1. ABI support: with the help of tvm-ffi wrappers, a single wheel can run on different python versions.
  2. pypi support: easy installation with pip install sgl-deep-gemm. No need to manually search for wheel links.
  3. Fast iteration: add custom kernels and bump versions at no time.

Usage

To build it locally, run bash build_sgl_deep_gemm.sh, then pip install the wheel generated under dist.

To release a new set of wheels, please contact SGLang team and run the release workflow under SGLang repo

For each major version release (0.X.Y -> 0.(X+1).0), a new branch should be created (release/v0.(X+1).0) for stability purpose.

For any incoming pull requests, it should be rebased upon dev branch.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file sgl_deep_gemm-0.1.3rc0-py3-none-manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for sgl_deep_gemm-0.1.3rc0-py3-none-manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 4dbf4e292500c29579c5721be99906e2c9f1042e2fd68d154b6fb153da1370c5
MD5 d93619c10084999306dbbfe5c9503b4f
BLAKE2b-256 8a9f954240c8d425589e4ad73111d72c650155cb95bb65a2fe384b471bb0ab33

See more details on using hashes here.

File details

Details for the file sgl_deep_gemm-0.1.3rc0-py3-none-manylinux2014_aarch64.whl.

File metadata

File hashes

Hashes for sgl_deep_gemm-0.1.3rc0-py3-none-manylinux2014_aarch64.whl
Algorithm Hash digest
SHA256 d1a38d2a30e2f315cdf949e7ed2b808874a7efaefafc0424cb44cafa15ee0614
MD5 2205ec92e7b3bd16f83dab42af03c3b2
BLAKE2b-256 f459e62282024627d8abcfa31bdcf8f64b3fa6fe0b1b7fffa391e79adab402a1

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page