Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_nightly-2026.3.16-cp314-cp314-manylinux_2_28_x86_64.whl (49.5 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.16-cp313-cp313-manylinux_2_28_x86_64.whl (49.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.16-cp312-cp312-manylinux_2_28_x86_64.whl (49.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.16-cp311-cp311-manylinux_2_28_x86_64.whl (49.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_nightly-2026.3.16-cp310-cp310-manylinux_2_28_x86_64.whl (48.0 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_nightly-2026.3.16-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.16-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 5fc5d884ccb95885203df64c3f445d90d1966947d4351674591e75438f9128b6
MD5 052b0ad3ba2f9bb056dd15d89d7dba39
BLAKE2b-256 93291b8221f5ffd2216e06139c97f1d9cda843a70d5293a1a39f2d401cfa2da4

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.16-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.16-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 f3242b3eed7d4e192ddcce2d32ed88afcd7c92f1d6d767e527e39610c9991b1d
MD5 ded40818b725d16c5a13d81bfa09c738
BLAKE2b-256 4588cb3fc514b93986580d2d49d96e3b7dcea0977b2a1a99258a9e1f00c428f8

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.16-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.16-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 8c827f604b737c371d546f5d56a97457670fe08c2a90afc44544151b70c62f9b
MD5 c61b4215d4eab7882c678c4c1e07ddc8
BLAKE2b-256 ff097116c157937b0eb17d4f86286e3d695a2e7d286d7af9f72e10a2b0110d6f

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.16-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.16-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 8a835d583c7b4d31524c6f2e508e862f3d3b1b27142cad26c04541575960483d
MD5 65d0db5c45a8ab4c1997a0fe7278c3c2
BLAKE2b-256 5e176390b584e27fe2059ef785dbc4b6917f4abb6c3f2b36011ef59f6668ae15

See more details on using hashes here.

File details

Details for the file mslk_cuda_nightly-2026.3.16-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_nightly-2026.3.16-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0958bc7e23779fee11bfc05ddafeda739dd7cf1c45824c180e5feb1e47c8428e
MD5 28a7c64194e67a8cdacb5312db78e02c
BLAKE2b-256 9739ed9b27473b9f7cdd2e0fe357f4718df22460020aba8e0dab8c75d4ea6a93

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page