Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.1-cp314-cp314-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.1-cp313-cp313-manylinux_2_28_x86_64.whl (47.8 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.1-cp312-cp312-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.1-cp311-cp311-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.1-cp310-cp310-manylinux_2_28_x86_64.whl (47.8 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.1-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.1-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 2f062027d5967ccaa9d0ede4332bc3aebbd8279c32a6167a5bbda36d41646886
MD5 9710f43888069d58d2affadcf5d660f6
BLAKE2b-256 aba10583343fad56d17530349890c754e1f6dd14db264c880918150cf3c601cc

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.1-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.1-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 db600bebd5abb2e1f00c2245942f7ad202ba55e6e56e5cc741d4d700969c3594
MD5 488062547578d1c0c269245a20816ba3
BLAKE2b-256 245b0e3234ddcaf85a1ef01c4bd6a33bda63a670b11a7ac042731ce5b371a455

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.1-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.1-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 35062e2ccf66920ab5de41611fd1de69ed397558b0ed0d60576aa4a26412c568
MD5 9b8fd5fe526a7cf9187aec81641b28e9
BLAKE2b-256 c6a09712b12becb97696fede2139d2f4c7a2298ac1e2207e87655a0e6bd1c10c

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.1-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.1-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0796c27d36b9253ff0ad1e139dcab82d6320096765f22f5353d15cb315d8e1fb
MD5 b96e17d020de6e504bef7d407713e56d
BLAKE2b-256 97b39607b9952138cda279bd726bdfd784f7b4ef033c24d8494b624a834ed4f7

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.1-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.1-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 cf25ae158f8f4c4989f0d827fbf12cc0579371047399368c73e41ba2ea1b4111
MD5 36fed824d4ca541dff03a0a26c6c00f4
BLAKE2b-256 38831aaab9d73eca11f750973e8569b0b6852a7491c298d61da03d8c00e4147f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page