Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.17-cp314-cp314-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.17-cp313-cp313-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.17-cp312-cp312-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.17-cp311-cp311-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.17-cp310-cp310-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.17-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.17-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 6dedc474047256afac4d75b0aa60a6b9347cc3e010d7274999a5ff3581145d17
MD5 f00123ebbf00dbe9e2353515538784e0
BLAKE2b-256 2e3d5233e79c303e1a1cacc8bb0ebaa8c47886f5748847e488d1e3f84b299adf

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.17-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.17-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 7f9cc2b7313207bd4ca1b02509613b7c3611652e3b11bf14799445a03f17c009
MD5 791ecc66abc00039ca298bc5efe92d74
BLAKE2b-256 462beb5d6a15b0e07e096c8f2b9dffb9df4d68778495f1835f9ef01453911b75

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.17-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.17-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 c7538bd6c20c3b4039c9ee220eabd80bf83e8ac5d19b7eb35c4bb42ebfc76cc8
MD5 00e953377a6da0261357b7e0133b1e96
BLAKE2b-256 30637cce0e15f7307c87443faab783d0f0c3fcb62a96b4b81ba97d1972af3b2e

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.17-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.17-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0f9597913ad1291400fe8484475444e1ea58136f09174ade463e7eeee08b0d65
MD5 d89ac5aa2f8c9aafaa47633844872b14
BLAKE2b-256 a8c8fe53d848ae03dbd42cda27523073400c39c14af298f0f40dc09b8a52d791

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.17-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.17-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 12f89c8c3a3ab7f07277f16d97123102999c487327e02c5c716ae66a496a5547
MD5 2729a4ad1861e624c43de40dde31dd12
BLAKE2b-256 c76c81a0ef7c039564109eb837b6bf1a22626867fc6f588c27df1a874513859d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page