Skip to main content

No project description provided

Project description

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.2.24-cp314-cp314-manylinux_2_28_x86_64.whl (51.2 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.2.24-cp313-cp313-manylinux_2_28_x86_64.whl (49.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.2.24-cp312-cp312-manylinux_2_28_x86_64.whl (51.2 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.2.24-cp311-cp311-manylinux_2_28_x86_64.whl (51.2 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.2.24-cp310-cp310-manylinux_2_28_x86_64.whl (51.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.2.24-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.24-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 9da802674f91cad5a96be10a6ef1f1fabe063571e4c1ab1aa83cdf321b2bdfc8
MD5 793d8ef8eebf69f0a9887192c47a04a8
BLAKE2b-256 52a17581e331912238f0a12c42d08da5bc4ecdc941ad5fba3866a175911ce0e1

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.2.24-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.24-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 cf4b4222009e5e972467eeb6881b5752610d5f78a48692a9f42ddc147cf434ce
MD5 8a65196af3736da03492d0764fd84e23
BLAKE2b-256 3dde002ba72aa88b09139db4df7e13a7ab6711eeef57a846cf684fb3a8211016

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.2.24-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.24-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 40dd625f4e2f951b371f5f83a93c199bafa1036fa3d9b2b476e0142e11c3d412
MD5 e9146bfd343f6c253cd51e2c2e189879
BLAKE2b-256 b24343480afe719c60c98b1133e94b9edee6d97707d581091e8447a2dada369a

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.2.24-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.24-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b0c89cc1175c38f56dbb7e9fea9fca68eb88b57c13d8d56af033e8249c8bf39f
MD5 d126f21561af0ab46938c59ffb0f0552
BLAKE2b-256 3fef5016043fc9fd22e9d13f4df813bfb04224c66fd22ee7a43788091d8607ab

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.2.24-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.2.24-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 2224233602e74c9111cc622a3c9f8ff94a58646cbd540bb7a7928059c3415e57
MD5 6fd15f2ded013036cbfa6d881e30eb0a
BLAKE2b-256 80a2b0a983aedf64b5d6bf7a1d6409990b33849e738c27c59635e8d73e27d63a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page