SGLang fork of DeepGemm
Introduction
sgl-deep-gemm is a PyPI package built from SGLang's customized branch of DeepGemm. Compared with the original DeepGemm, it adds the following features to better support SGLang:
- ABI support: with the help of tvm-ffi wrappers, a single wheel can run on different Python versions.
- PyPI support: easy installation with `pip install sgl-deep-gemm`. No need to manually search for wheel links.
- Fast iteration: add custom kernels and bump versions in no time.
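The "single wheel for different Python versions" point is visible in the wheel filenames of this release, which carry the tags `py3-none-<platform>`. As a rough illustration, the sketch below splits a wheel filename into its fields using only the standard library. The `WheelTags` type and `parse_wheel_filename` helper are hypothetical names introduced for this example and are not part of sgl-deep-gemm.

```python
from typing import NamedTuple


class WheelTags(NamedTuple):
    distribution: str
    version: str
    python_tag: str
    abi_tag: str
    platform_tag: str


def parse_wheel_filename(filename: str) -> WheelTags:
    """Split a wheel filename into its name, version, and compatibility tags."""
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel filename: {filename!r}")
    parts = filename[: -len(".whl")].split("-")
    # Layout: name-version[-build]-python-abi-platform, so 5 or 6 fields.
    if len(parts) not in (5, 6):
        raise ValueError(f"unexpected wheel filename layout: {filename!r}")
    # The last three fields are always the python, ABI, and platform tags.
    python_tag, abi_tag, platform_tag = parts[-3:]
    return WheelTags(parts[0], parts[1], python_tag, abi_tag, platform_tag)


tags = parse_wheel_filename("sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_x86_64.whl")
print(tags.python_tag, tags.abi_tag, tags.platform_tag)
# prints: py3 none manylinux2014_x86_64
```

A `py3` Python tag together with a `none` ABI tag means the wheel is not pinned to one specific CPython version or ABI; only the platform tag differs between the x86_64 and aarch64 builds.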
Usage
To build it locally, run `bash build_sgl_deep_gemm.sh`, then `pip install` the wheel generated under `dist`.
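The install half of the local build can be scripted. The sketch below assumes the build script has already written at least one `.whl` file into `dist`; the helpers `newest_wheel` and `pip_install_command` are hypothetical names for this example, not utilities shipped with the package.

```python
import sys
from pathlib import Path
from typing import List


def newest_wheel(dist_dir: str = "dist") -> Path:
    """Return the most recently modified .whl file under dist_dir."""
    wheels = sorted(Path(dist_dir).glob("*.whl"), key=lambda p: p.stat().st_mtime)
    if not wheels:
        raise FileNotFoundError(f"no .whl files found under {dist_dir!r}")
    return wheels[-1]


def pip_install_command(wheel: Path) -> List[str]:
    """Build a pip command that installs the wheel into the current interpreter."""
    return [sys.executable, "-m", "pip", "install", str(wheel)]
```

After the build finishes, the command could be run with `subprocess.run(pip_install_command(newest_wheel()), check=True)`. Invoking pip through `sys.executable -m pip` keeps the install tied to the interpreter that runs the script.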
To release a new set of wheels, please contact the SGLang team and run the release workflow in the SGLang repo.
For each major version release (0.X.Y -> 0.(X+1).0), a new branch (release/v0.(X+1).0) should be created for stability.
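The branching rule can be written down as a small check. The following is only a sketch of the rule as stated above: `release_branch` is a hypothetical helper, and it assumes that pre-release suffixes such as `rc1` can be ignored when comparing versions.

```python
import re
from typing import Optional, Tuple

_VERSION_RE = re.compile(r"^(\d+)\.(\d+)\.(\d+)")


def _parse(version: str) -> Tuple[int, int, int]:
    """Extract the leading A.B.C numbers, ignoring suffixes like 'rc1'."""
    match = _VERSION_RE.match(version)
    if match is None:
        raise ValueError(f"unrecognized version: {version!r}")
    first, second, third = (int(group) for group in match.groups())
    return first, second, third


def release_branch(previous: str, new: str) -> Optional[str]:
    """Return the branch name for a 0.X.Y -> 0.(X+1).0 bump, otherwise None."""
    p_first, p_second, _ = _parse(previous)
    n_first, n_second, n_third = _parse(new)
    is_major_release = (
        n_first == p_first and n_second == p_second + 1 and n_third == 0
    )
    if not is_major_release:
        return None
    return f"release/v{n_first}.{n_second}.{n_third}"


print(release_branch("0.1.3", "0.2.0"))  # prints: release/v0.2.0
print(release_branch("0.1.3", "0.1.4"))  # prints: None
```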
Incoming pull requests should be rebased onto the dev branch. Any newly added or modified tests should be placed under sgl_deep_gemm/tests.
File details
Details for the file sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_x86_64.whl.
File metadata
- Download URL: sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_x86_64.whl
- Upload date:
- Size: 4.6 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | f24771b5933530d4e0ec1d00bd2016e5797e13e50b9e6e125f356bb1db66f697 |
| MD5 | 3db8228004e2a56d7da170895af79d28 |
| BLAKE2b-256 | fcd5982e94bac313b0fd14a178f05b4044246c725a51c05d0f58e5f4c87f92f3 |
File details
Details for the file sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_aarch64.whl.
File metadata
- Download URL: sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_aarch64.whl
- Upload date:
- Size: 4.5 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 77afc02024363418f4ee469c6d4ed9c5f443787adabd7f025a052681bbe1de8f |
| MD5 | fd1ce4b079d5a9595c9983f9962b2c83 |
| BLAKE2b-256 | 10d5fc8050e041f5d9c1764d003dc195305b14e0b69a130c82ba5970eefc9ba4 |
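A manually downloaded wheel can be checked against the SHA256 digests listed above. The sketch below uses only `hashlib` from the standard library. The `PUBLISHED_SHA256` mapping copies the two digests from this page, while `sha256_of` and `matches_published_hash` are hypothetical helper names for this example.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details listed on this page.
PUBLISHED_SHA256 = {
    "sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_x86_64.whl":
        "f24771b5933530d4e0ec1d00bd2016e5797e13e50b9e6e125f356bb1db66f697",
    "sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_aarch64.whl":
        "77afc02024363418f4ee469c6d4ed9c5f443787adabd7f025a052681bbe1de8f",
}


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Hash a file in chunks so large wheels need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path: Path) -> bool:
    """True only if the file name is listed above and its digest matches."""
    expected = PUBLISHED_SHA256.get(Path(path).name)
    return expected is not None and sha256_of(path) == expected
```

For example, `matches_published_hash(Path("sgl_deep_gemm-0.1.3rc1-py3-none-manylinux2014_x86_64.whl"))` should return `True` for an intact download of that file. A file whose name is not in the mapping is reported as not matching.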