Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.24-cp314-cp314-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.24-cp313-cp313-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.24-cp312-cp312-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.24-cp311-cp311-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.24-cp310-cp310-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.24-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.24-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 6baf580867c5bdba53e29e70818b4e8c81bb4d26b326de3f4dcf8b862bc5297d
MD5 49fe178333a935bdaed1c02a23bf350a
BLAKE2b-256 8a3996738cd9d83368c49766b7d27cec84761d6874a50dfd5e3860f409a3690f

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.24-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.24-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 88bb8b6aedb58099ddd946ad3d9217bac1e76024e8d6b51ad9aa817b20c9af03
MD5 87747f2f14e548adf069aacdaa2338c0
BLAKE2b-256 20720a8f52968ec40d83cd166b6a622cb65a5f057618ebe502b578172ecf17bb

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.24-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.24-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 d4a93eec9ef2c5c406939992e65485e5882d42617a157b0cc4a0063fb8da9b12
MD5 d8ccc309554c2b520b892006b17f81df
BLAKE2b-256 1b3a3a0ed81291c653bd067a1d82ddbb45c355d2333108103870798bd85ef24c

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.24-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.24-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 1c3b4ff3952cb73c277fced5a775d6678296b44277a3dae90ad06cd4d7831885
MD5 c1cbdc46b4ac0628b2786778c6056885
BLAKE2b-256 5c1cd8e77b7464b8d3f395deb039f54c3bb073bba5e11268ed755cb0931a46e5

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.24-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.24-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 6a96b5eb96bba7a0e7696f25fe3cc235e0320f92ca69ede6b93a47e87947f180
MD5 934ae4f11862299fd8c036a0b19d20f7
BLAKE2b-256 f3f0c04f1e273669ce05151cd6121108753b0f63331dbc9034fb566eb01a7c0a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page