No project description provided
Project description
MSLK Library
MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.
Installation
# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/
Release Compatibility Table
MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.
| MSLK Release | Corresponding PyTorch Release | Supported Python Versions | Supported CUDA Versions | Supported CUDA Architectures | Supported ROCm Versions | Supported ROCm Architectures |
|---|---|---|---|---|---|---|
| 1.0.0 | 2.10.x | 3.10, 3.11, 3.12, 3.13, 3.14 | 12.6, 12.8, 12.9, 13.0 | 8.0, 9.0a, 10.0a, 12.0a | 7.0, 7.1 | gfx908, gfx90a, gfx942, gfx950 |
Running Benchmarks
python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py
Running Tests
pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py
Build From Source
We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.
# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install
Join the MSLK community
For questions, support, news updates, or feature requests, please feel free to:
- File a ticket in GitHub Issues
For contributions, please see the CONTRIBUTING file for
ways to help out.
License
MSLK is BSD licensed, as found in the LICENSE file.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mslk_cuda_nightly-2026.3.26-cp314-cp314-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: mslk_cuda_nightly-2026.3.26-cp314-cp314-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 48.2 MB
- Tags: CPython 3.14, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8a09e6e34638ba3d48f389f894fa1fec6761d0c37a3dd7928edb7be12a55ea11
|
|
| MD5 |
ba899cd6c1b66b0e0a5664f22dfa5c1c
|
|
| BLAKE2b-256 |
b443d176eb89100abe964cf246db97ab11da8236c3bcf46d82f2ee3330112f78
|
File details
Details for the file mslk_cuda_nightly-2026.3.26-cp313-cp313-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: mslk_cuda_nightly-2026.3.26-cp313-cp313-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 46.6 MB
- Tags: CPython 3.13, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
04939a346a386eb37cd0c0c45e067cd98063ca66ae17d4b526f0e42a659c29c5
|
|
| MD5 |
12f1ab795bdd60b863114565217a4fb4
|
|
| BLAKE2b-256 |
bab939dc1883235b5c4779db361ad3737abeb657df7dd6e4c2e2a809fa7ec22b
|
File details
Details for the file mslk_cuda_nightly-2026.3.26-cp312-cp312-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: mslk_cuda_nightly-2026.3.26-cp312-cp312-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 46.6 MB
- Tags: CPython 3.12, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cbe4585b599ded8be7536d57f40af9624e1e85daf2cba3ffcef50fc2b67db815
|
|
| MD5 |
c44e2c57dea5ee3425b1b5019b38512e
|
|
| BLAKE2b-256 |
de47998ce45653495443a5a8d6c7e5c49a94fd440f6459bfc19c7999dd441ddb
|
File details
Details for the file mslk_cuda_nightly-2026.3.26-cp311-cp311-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: mslk_cuda_nightly-2026.3.26-cp311-cp311-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 46.6 MB
- Tags: CPython 3.11, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8b824b46b87470c1660a0a972e96f8bddfb1fc3ffab0c6c95fbb616334dcd49e
|
|
| MD5 |
1dcaafbd9d7cdcc45ec97edec8b02efb
|
|
| BLAKE2b-256 |
8a112284578cb8caa0888c4d47401f06f56cefddb498cf9e5e0768a487b5e1a6
|
File details
Details for the file mslk_cuda_nightly-2026.3.26-cp310-cp310-manylinux_2_28_x86_64.whl.
File metadata
- Download URL: mslk_cuda_nightly-2026.3.26-cp310-cp310-manylinux_2_28_x86_64.whl
- Upload date:
- Size: 48.2 MB
- Tags: CPython 3.10, manylinux: glibc 2.28+ x86-64
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
868c487b81fac9242c5c79569210379eeb33a0e9dbce54bf07438b47b0ab7ffe
|
|
| MD5 |
b30c3b9dda58fb6bb7f7b778192fb423
|
|
| BLAKE2b-256 |
479d09d4be2159d74e9ed3a0fa1c826f415f1a410acee746c1dea72aaf898938
|