Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.26-cp314-cp314-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.26-cp313-cp313-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.26-cp312-cp312-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.26-cp311-cp311-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.26-cp310-cp310-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.26-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.26-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 8a09e6e34638ba3d48f389f894fa1fec6761d0c37a3dd7928edb7be12a55ea11
MD5 ba899cd6c1b66b0e0a5664f22dfa5c1c
BLAKE2b-256 b443d176eb89100abe964cf246db97ab11da8236c3bcf46d82f2ee3330112f78

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.26-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.26-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 04939a346a386eb37cd0c0c45e067cd98063ca66ae17d4b526f0e42a659c29c5
MD5 12f1ab795bdd60b863114565217a4fb4
BLAKE2b-256 bab939dc1883235b5c4779db361ad3737abeb657df7dd6e4c2e2a809fa7ec22b

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.26-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.26-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 cbe4585b599ded8be7536d57f40af9624e1e85daf2cba3ffcef50fc2b67db815
MD5 c44e2c57dea5ee3425b1b5019b38512e
BLAKE2b-256 de47998ce45653495443a5a8d6c7e5c49a94fd440f6459bfc19c7999dd441ddb

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.26-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.26-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 8b824b46b87470c1660a0a972e96f8bddfb1fc3ffab0c6c95fbb616334dcd49e
MD5 1dcaafbd9d7cdcc45ec97edec8b02efb
BLAKE2b-256 8a112284578cb8caa0888c4d47401f06f56cefddb498cf9e5e0768a487b5e1a6

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.26-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.26-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 868c487b81fac9242c5c79569210379eeb33a0e9dbce54bf07438b47b0ab7ffe
MD5 b30c3b9dda58fb6bb7f7b778192fb423
BLAKE2b-256 479d09d4be2159d74e9ed3a0fa1c826f415f1a410acee746c1dea72aaf898938

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page