Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.3-cp314-cp314-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.3-cp313-cp313-manylinux_2_28_x86_64.whl (47.8 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.3-cp312-cp312-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.3-cp311-cp311-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.3-cp310-cp310-manylinux_2_28_x86_64.whl (47.8 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.3-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.3-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 282bf4e0097d5991365deba39f91ceb5878867e7b3f4602955b88a120161f8e2
MD5 cebdf6b32e2d44aac19a7d2d651e99d8
BLAKE2b-256 bc2abdeb5a3a13f6b2c3d1a8885e935ceb0f8d23cc2f49bc6431d2cf011c4b12

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.3-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.3-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 ebe474470ed4ac300442b84ac622bb1d79fe7dd5756b7ff8cbd198af0717681a
MD5 90e48a2c37289d620073719be854eacd
BLAKE2b-256 1aa56bd6fb0ff597b55183c1ed523b53971251db54a85f2d3e68b0544ffe90de

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.3-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.3-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 53d5ab96af5cbfdc67eb1bfda49fc960d16de342c4075908cac296c89fa51b68
MD5 d9ca292d2a85bc82e546e5dbba3953e0
BLAKE2b-256 cbaa10c4947bc6647facce360f1df6d1d16573839cad6435f7b1e0603fc98688

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.3-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.3-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 1852312510f10a6b60a985c07e702b468f170cb83cea958ea84254815dd5c70c
MD5 5922ec34ebedd0eafe38efb1c904b667
BLAKE2b-256 449ab5bc50604d6954abb2b503967f69c15774d07bca9b0e1ba43c582022d512

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.3-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.3-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 8e8ee20829da6c7eb2c2fcbe133ad6bc6d603e1b17a3da96ac2eaa30546837bf
MD5 c403df53a11a509dfb857d04cb283c5a
BLAKE2b-256 4a9e8b87e8d19b859fc17bab04a2b0eb6003d1da54bd27e6f43158081067f494

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page