Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.2.28-cp314-cp314-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.2.28-cp313-cp313-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.2.28-cp312-cp312-manylinux_2_28_x86_64.whl (47.8 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.2.28-cp311-cp311-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.2.28-cp310-cp310-manylinux_2_28_x86_64.whl (47.8 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.2.28-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.28-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 e314ac80de19bd4bda4a6c33a9eb306002abb5b6b8fd3f944d4761a0c4a090c0
MD5 e1942a0595367d0c404e0256d98addf6
BLAKE2b-256 4aa7f611bd900011070fe09d4b2d007b716a4773198a7d37fa517cb126c60d31

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.2.28-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.28-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 ed34ba9d81b4cd89d040eb11ef39cce2f86a506e5f8be34764f19abccc3d34dd
MD5 b9cab5505da9616e08be454b132b9982
BLAKE2b-256 0dd707e6d5d443b6c1254a72dd46448e7a4ab420aaa5f177ecc8bb2df574bbb2

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.2.28-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.28-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0b8fefe7f660de65b75086f59edbbb74252f2e70d4097921baa45bf50c3fdb66
MD5 bc4b0dcf118ef4192ac35573667bb86f
BLAKE2b-256 7e17515c81acb0d9e309011babd7a229bc3ee88fe9906fcdfddfceee7fb8de2b

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.2.28-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.28-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 096f969764790e38a0b45aac010ecc3d1c7bd586eeb22e695ff8cdddb3c2ed7b
MD5 1d36a7b1490b174766a5cb518e008596
BLAKE2b-256 56da643a30b7430c41fe9abad9b200ff00a3d502258deb1d90414f7058f00e36

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.2.28-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.28-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 ba7900c38666344517ec6e93b69d868aa1c65dde0de1b27317490e7b2e25d185
MD5 bddd6d699652bd0c0db2d112102bd581
BLAKE2b-256 d7d1d29afb0d76cd34c140d8cad6adbca10c5d46d7937a44a6408beb731b04f2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page