Skip to main content

No project description provided

Project description

MSLK Library

MSLK (Meta Superintelligence Labs Kernels, formerly known as FBGEMM GenAI) is a collection of high-performance kernels and optimizations built on top of PyTorch primitives for GenAI training and inference.

Installation

# Install MSLK for CUDA
pip install mslk-cuda==1.0.0
# Install MSLK for ROCm
pip install mslk-rocm==1.0.0
# Install a nightly CUDA version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/cu128
# Install a nightly ROCm version
pip install --pre mslk --index-url https://download.pytorch.org/whl/nightly/rocm7.1/

Release Compatibility Table

MSLK is released in accordance to the PyTorch release schedule, and each release has no guarantee to work in conjunction with PyTorch releases that are older than the one that the MSLK release corresponds to.

MSLK Release Corresponding PyTorch Release Supported Python Versions Supported CUDA Versions Supported CUDA Architectures Supported ROCm Versions Supported ROCm Architectures
1.0.0 2.10.x 3.10, 3.11, 3.12, 3.13, 3.14 12.6, 12.8, 12.9, 13.0 8.0, 9.0a, 10.0a, 12.0a 7.0, 7.1 gfx908, gfx90a, gfx942, gfx950

Running Benchmarks

python bench/gemm/gemm_bench.py --M 4096 --N 4096 --K 4096
python bench/quantize/quantize_bench.py --M 4096 --K 4096
python bench/conv/conv_bench.py

Running Tests

pytest test/gemm/gemm_test.py
pytest test/quantize/fp8_quantize_correctness_test.py
pytest test/conv/conv_test.py

Build From Source

We only support building on Linux. See the release compatibility table above for supported versions of Python, CUDA, ROCm.

# Clone repo
git clone https://github.com/meta-pytorch/MSLK
cd MSLK
git submodule sync
git submodule update --init --recursive
# Build and install
# The script will create a conda environment and install the required dependencies.
# The conda environment will look something like: build-py3.14-torchnightly-cuda12.9.1
./ci/integration/mslk_oss_build.bash
# After the initial environment setup, you can activate the environment and iterate faster:
conda activate build-py3.14-torchnightly-cuda12.9.1
python setup.py install

Join the MSLK community

For questions, support, news updates, or feature requests, please feel free to:

For contributions, please see the CONTRIBUTING file for ways to help out.

License

MSLK is BSD licensed, as found in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

mslk_cuda-1.0.0-cp314-cp314-manylinux_2_28_x86_64.whl (50.3 MB view details)

Uploaded CPython 3.14manylinux: glibc 2.28+ x86-64

mslk_cuda-1.0.0-cp313-cp313-manylinux_2_28_x86_64.whl (50.3 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

mslk_cuda-1.0.0-cp312-cp312-manylinux_2_28_x86_64.whl (50.3 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

mslk_cuda-1.0.0-cp311-cp311-manylinux_2_28_x86_64.whl (48.7 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

mslk_cuda-1.0.0-cp310-cp310-manylinux_2_28_x86_64.whl (48.7 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

File details

Details for the file mslk_cuda-1.0.0-cp314-cp314-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda-1.0.0-cp314-cp314-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0c198e82e3fb9118b6337b151964804d7345da429f1bb6889bf10cf738e81af5
MD5 69211c4ff2ca55ea2ebcb4137df2c256
BLAKE2b-256 c55a2b75b294c5ffedf6686f70f475ae52e734202130390da18535e081aa5d58

See more details on using hashes here.

File details

Details for the file mslk_cuda-1.0.0-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda-1.0.0-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 81e4889efb48e78b60cfb6d2558e8a67013b89c7079cb7a224a2465074f00df9
MD5 1cb24772dba97560e5141b98c55117e2
BLAKE2b-256 ed60807b03264f20e7ba6cdcb0e3c0612cb45bcf70ef7681c5dca29c16526bf6

See more details on using hashes here.

File details

Details for the file mslk_cuda-1.0.0-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda-1.0.0-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 d9bfe20b1ee1046a6fdf0561ac53bd4f0c4f8a5cba89d50c73d649df53230f72
MD5 b80298b3f2300e451712cf2010c4d1d2
BLAKE2b-256 5d8f559e5b9857f6fe6bca0373fada848d12c715e5f7b303b7adad04ce3c2e3e

See more details on using hashes here.

File details

Details for the file mslk_cuda-1.0.0-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda-1.0.0-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 13820f7694eeed362912f32a314a4ebc18857ef67a54065cb481a1716d528b66
MD5 a49a6040d5348fb12781816117d39112
BLAKE2b-256 50b603be38c75e3054c2a6719bbe1f55ee89f27895ce1c2148d5e8d278ce2761

See more details on using hashes here.

File details

Details for the file mslk_cuda-1.0.0-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for mslk_cuda-1.0.0-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 848114cf83b3f1e7d2d9addc7392029915db5074a8ad31e6c9cbd44385403bdd
MD5 f96ceb4a9752921915f4fe62ab7bc0c3
BLAKE2b-256 8933122ff3b88317ccba1de19aa59d28467eda015c7f3d816efbd1417e5d0afd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page