Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_test-1.1.0rc1-cp314-cp314-manylinux_2_28_x86_64.whl (44.5 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_test-1.1.0rc1-cp313-cp313-manylinux_2_28_x86_64.whl (44.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_test-1.1.0rc1-cp312-cp312-manylinux_2_28_x86_64.whl (42.9 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_test-1.1.0rc1-cp311-cp311-manylinux_2_28_x86_64.whl (42.9 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_test-1.1.0rc1-cp310-cp310-manylinux_2_28_x86_64.whl (42.9 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_test-1.1.0rc1-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.1.0rc1-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 5b47d29928aa2b7329564421ad3d2ad02974ccc418a580920b2d8d6f88538405
MD5 dd1dba7bc85fcf0bd96aa73d6834b849
BLAKE2b-256 9b79b640a1a323cac4c2151611f62c66ddfc5c1ab07770b5b474de926808a0d6

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-1.1.0rc1-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.1.0rc1-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 882c4e0b578783fcd2612988a6557913a09cc4d33719d970bea044b798e3105e
MD5 1fcd0b7ec20bd9ad767713cf0fbaede3
BLAKE2b-256 5f04ad5bb4c7983187eadbc099558e5bde52539c4e30d794ada199e6f15773ba

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-1.1.0rc1-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.1.0rc1-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 86d102f73aaa3c0491629e9dd81a4cf3a59538dac1cc3b8a55ac6166759fe095
MD5 62a19b60636ace3083389ab3ce4065f4
BLAKE2b-256 957924e8b208b4654b9567831debc4048f7976c9d318eed28be3c2904195432e

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-1.1.0rc1-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.1.0rc1-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 56032d728b907f40bb73037c7a703caeaa90483a68a677e04459553a10cc03fe
MD5 de5ec1ece5a5d10c89c4cb85991e59f3
BLAKE2b-256 ec91242d2b8fe197eeddd3f4cf595c128e7bc18c14ad2c0e7244e26d12c9b0fd

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-1.1.0rc1-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.1.0rc1-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 edae36d17ad8da3d89d68d63a66dcd44f519bd7d06ea1f95b2f487865fe1131e
MD5 115df285e0e0b639d4f6b54b651ec01d
BLAKE2b-256 e33a231c490d51b33a1272e0e6da94f2ba6cafd30b32a1372a64ec8f951649d0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page