Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.25-cp314-cp314-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.25-cp313-cp313-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.25-cp312-cp312-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.25-cp311-cp311-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.25-cp310-cp310-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.25-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.25-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 accfc700cbdc2599b87b08a04f8c704490fcd7d75ed8735f0750410d77be6bb9
MD5 8331c4c430be5f7cf5c2e6814be23eb0
BLAKE2b-256 6fed75e992ba4db31002739fb047e54862c8a31409e225abcac7f6100237ca4c

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.25-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.25-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 ca0051b5ae2938caafc02356604b0614dc3dc448445a954bacd614bbe5cfa071
MD5 5659c6bb3af62885bd1dfd2b68c5510c
BLAKE2b-256 3e3985ac7e0d3ee51fbd3afad590e7fd9ed8c1de2ae7dee75ea24dbb60af05f8

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.25-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.25-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 2883a687fc2780dc888203b7d63bd70a73ab2f4c377758f8938e87e7146b8659
MD5 82ed91248e8b2199f506fd266c7fecd3
BLAKE2b-256 d34891492f407b9386c30809dbdd947cd0f1ba9adb1e7161eebb066a5bd951eb

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.25-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.25-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 d96f052233e1f3efcd2fce9eafec0f1bf56c1950f432c73b420a4f2eed2bc58f
MD5 d09b1f68c5104b07bbfbe14df2d4e89e
BLAKE2b-256 c75078d0c301640183de3dab2679d074111c624163fa323941e1488ba9356014

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.25-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.25-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 d9c41363b37f678813bb3ec144ba2a138891081fab3b446c45e5bbf07a56f836
MD5 5e3e60a9fc277d99503f962772428e38
BLAKE2b-256 bc4bc8feedad4e7c01367dd0016c688934d9f835b62b21c4388cda7e6539e3d0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page