Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.14-cp314-cp314-manylinux_2_28_x86_64.whl (48.0 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.14-cp313-cp313-manylinux_2_28_x86_64.whl (49.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.14-cp312-cp312-manylinux_2_28_x86_64.whl (49.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.14-cp311-cp311-manylinux_2_28_x86_64.whl (49.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.14-cp310-cp310-manylinux_2_28_x86_64.whl (48.0 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.14-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.14-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 9513a0f2ae4c76ece1c477f3c65d794160a71c16b66fffbcaa1f4cd6b2bbca31
MD5 1a106739634a0f1d10193b4d90c637b9
BLAKE2b-256 38428beb1be4e93188afe1a12ca67ba62613e5296545e39992a56cfee7519124

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.14-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.14-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 cd5c75067b0639114a5e26ece0cf2d316446c250b1323392fd050985a050bb03
MD5 28c29223f8ca325f42157113d228ecf2
BLAKE2b-256 80b5566837c98cecabc4096fdbbf686ee4cc8d14445f309f17c93d8ac326430b

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.14-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.14-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 d18615b3de160c55a0f5bb4ab6e5dece986620d0cb000cb8993f4c6bd22ae752
MD5 c325678a9a7dd6dd9f9cfd5dc504ee2b
BLAKE2b-256 eae096d34f35d7fead8db9771c399b9d592f1d843543c1a3ff2a3510ccb4c8c8

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.14-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.14-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 e96983e2daf79e01ab4c86446a41169c2f1f98afc4450bc074ebd3f567eba736
MD5 5a8ec79b91c24b75bdd8e8d35f574b66
BLAKE2b-256 74764352959d8515f38ed26be42739988c667bd5ccb70ae670b1cad5df8cf453

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.14-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.14-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0ad2c3dbaab0e0ee21165524f63947dc3c5db17c28438ca6c44b2c1956fc2aff
MD5 6b7e63747a1079c25d5a79281dab901e
BLAKE2b-256 f9a388f7b06304d029ad579a167398bb21419dd037eadb0412e42345b16f61b2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page