Skip to main content

No project description provided

Project description

MSLK Logo

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_test-0.0.1-cp314-cp314-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_test-0.0.1-cp313-cp313-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_test-0.0.1-cp312-cp312-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_test-0.0.1-cp311-cp311-manylinux_2_28_x86_64.whl (46.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_test-0.0.1-cp310-cp310-manylinux_2_28_x86_64.whl (48.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_test-0.0.1-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-0.0.1-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 9688d3c8f6fb1e8bfb1c5e48ce9ecfc2c6f4bd516fa8fa8a1ff18d5473c8edee
MD5 ef0cb89648ec7a9173065fcffca26649
BLAKE2b-256 01c0de859111a300ffe3d04b9cf4e8d37550c8b108daa4eae2b8d70e5712d4cf

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-0.0.1-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-0.0.1-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 cd2dc19a9d91fa4b8e0a0c94034e757cff4dc2084b2fa1bf304e8357aafe7606
MD5 e6dee2585866a2857054a05fc513c760
BLAKE2b-256 ef8e21ba2d29c893138d640f22e3a900806853167ec000846a9a6fb59fd3bed0

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-0.0.1-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-0.0.1-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 95179922bfa3ae9012fa9a9ed3f95fad6b7eb845f40c0df812b39069e10e5b2c
MD5 3674a1a275ebff15cabb3c79563b10b1
BLAKE2b-256 f23de1c0fba318512ab654cf6675106dc2855d5bcc2fd0b402ebdcfd37c902ce

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-0.0.1-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-0.0.1-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 4c9d5062a4923abcba0524e3ad36ba3388366b8c43310376a6a92d7115e83a58
MD5 17279f3457c39d0c7ac3d525bd2c0a21
BLAKE2b-256 2a3b02ed087272370a0a7055ffdb5f4977aab50d19d24e65642cb5bef015067b

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-0.0.1-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-0.0.1-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 195ce29145f25f97d99204f756d3e93478fa90320ebe16726d88471bfc4ea147
MD5 e00f3f347857febeb457674ba9766a8a
BLAKE2b-256 5b3f333c3aec2d0e02b6990bcdc9433d4707cd9742938b4896eeb5789c18f06b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page