Skip to main content

No project description provided

Project description

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda_test-1.0.0rc1.post0-cp314-cp314-manylinux_2_28_x86_64.whl (50.3 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda_test-1.0.0rc1.post0-cp313-cp313-manylinux_2_28_x86_64.whl (50.3 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda_test-1.0.0rc1.post0-cp312-cp312-manylinux_2_28_x86_64.whl (48.7 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda_test-1.0.0rc1.post0-cp311-cp311-manylinux_2_28_x86_64.whl (48.7 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda_test-1.0.0rc1.post0-cp310-cp310-manylinux_2_28_x86_64.whl (48.7 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda_test-1.0.0rc1.post0-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.0.0rc1.post0-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 5bc5a7b18b00c9c3c5e96ce3a475b63ed356641190ed1a02ff193678de32d015
MD5 d896f02cd46be6979228b1cf1f62d576
BLAKE2b-256 511d6df29e94501ab21ca86e0aa161865f8f2a3ff94dd0985abb41ea8a7abcd4

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-1.0.0rc1.post0-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.0.0rc1.post0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b481caabf0716b285cf4041bb1d51fd66d8d755d461fbfa842238877f6267a52
MD5 944c0602129c8cc398ec7ca12ed3c09d
BLAKE2b-256 ac8d8b181f748fe1b48d8d267d12cea9bc930d3eb55d4bec5e12fb29b2117e7a

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-1.0.0rc1.post0-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.0.0rc1.post0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 1ea7832af3315be7e1d67279cef1ee10ffeb487d8ffe53cba3ca97f2b83a9743
MD5 387047b2199119846a563da4bc967a07
BLAKE2b-256 07bd8f8020eaa5a7d5f8028cf14405bef3d895bf308662ed2d846ace1fbbc2fe

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-1.0.0rc1.post0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.0.0rc1.post0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 205d435fca65e75afb93b9fd7b3e1c030112e9c7c9e434d8f6934c8c7ca72778
MD5 a1c51339f9f9cabb88a3e2292456eea9
BLAKE2b-256 82b49b800e225749cc3bb9de8ab0002578475733ad6a677f310dfea70f571897

See more details on using hashes here.

File details

Details for the file mslk_cuda_test-1.0.0rc1.post0-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda_test-1.0.0rc1.post0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 a8ff7aaa9d37345ebf1a37a4df6b1fda0807337792cb5b97b412f818b498a255
MD5 d88ff675b3ec0c0cd388fe4aaf344786
BLAKE2b-256 3dacbdeff0d41a6f8535b4935d8e168c481c68ab55f0f53311c56a576385ddce

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page