Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.2-cp314-cp314-manylinux_2_28_x86_64.whl (47.8 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.2-cp313-cp313-manylinux_2_28_x86_64.whl (47.8 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.2-cp312-cp312-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.2-cp311-cp311-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.2-cp310-cp310-manylinux_2_28_x86_64.whl (49.4 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.2-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.2-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 508c0b541adaada4422b496d2f90f0eb47f3e304d029ed19c03835561dab12fc
MD5 d66f7f15bdc468ce47fd99d3ab29b6b7
BLAKE2b-256 dcdbf406732f04b843cc344f4598ec0683661a87da5f3c2ba7f9f76aaecbf21f

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.2-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.2-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b2193acf9bdf2144de9d5d93bac1344679a98d47c10a005a898c5b4068b85a3e
MD5 5616e52b415341c6ee94f14833bfe23c
BLAKE2b-256 cbc7a6f737ec2e91d7bbc77c4a9a1fc39cb57c6ec607ec590b792e291c3b7b18

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.2-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.2-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 27052836b49f70919fbfb186e801fe13f5f19eadf3edaccd871823d1573ab0a3
MD5 a332a6a3b9f192bb2908002ebad82e03
BLAKE2b-256 95ed61817d337b48bbd4efaada118aaf5ffd6580d624cc159d3b065362da351b

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.2-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.2-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 209ba8e70c1966519c24c40ab0d5c3968ae9112b5b6cb5a42403737d4e4a83a2
MD5 f37275e6665d0b37f6dccf601ca70ca5
BLAKE2b-256 b698fa573c62477a89e38c37fa2b12a03332955fc6152fb58d7a30d9ca711faf

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.2-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.2-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 9fda8ea6569dac46be6b4707b396ee2bbfcd025df8f0aa574f65bf0d2f159df6
MD5 3ff4b2616d8f7d8bf440c8efb091510f
BLAKE2b-256 a7b4fbe0015de4c6fcda59938c98755937bb1026beef5a98feb9f477f6c68511

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page