cumm

CUda Matrix Multiply library.

cumm grew out of my study of CUTLASS, which relies so heavily on C++ templates that the code becomes hard to maintain. To replace C++ template metaprogramming, I developed pccm, which uses Python as the metaprogramming language. pccm is now the foundational framework of cumm and of my other C++ projects, such as spconv. cumm also contains a Python asyncio-based GEMM simulator that shares the same meta program as the CUDA code, which enables GEMM visualization and easier debugging.
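To make the idea of "Python as a metaprogramming language" concrete, here is a minimal sketch in plain Python. It does not use pccm or cumm: the names TileShape, generate_gemm_tile and simulate_gemm_tile are hypothetical and exist only for this illustration. The same parameter object drives both a C++ source generator and a pure-Python simulation, which is roughly the role a shared meta program plays.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TileShape:
    """Hypothetical compile-time parameters for one GEMM tile (not a cumm type)."""
    m: int
    n: int
    k: int


def generate_gemm_tile(shape: TileShape, dtype: str = "float") -> str:
    """Emit C++ source with the tile shape baked in as literal constants."""
    name = f"gemm_tile_{shape.m}x{shape.n}x{shape.k}"
    return "\n".join([
        f"// generated for tile {shape.m}x{shape.n}x{shape.k}",
        f"void {name}(const {dtype}* a, const {dtype}* b, {dtype}* c) {{",
        f"  for (int i = 0; i < {shape.m}; ++i)",
        f"    for (int j = 0; j < {shape.n}; ++j) {{",
        f"      {dtype} acc = 0;",
        f"      for (int p = 0; p < {shape.k}; ++p)",
        f"        acc += a[i * {shape.k} + p] * b[p * {shape.n} + j];",
        f"      c[i * {shape.n} + j] += acc;",
        "    }",
        "}",
    ])


def simulate_gemm_tile(shape: TileShape, a, b, c):
    """Run the same loop nest in Python on flat row-major lists."""
    for i in range(shape.m):
        for j in range(shape.n):
            acc = 0
            for p in range(shape.k):
                acc += a[i * shape.k + p] * b[p * shape.n + j]
            c[i * shape.n + j] += acc
    return c


if __name__ == "__main__":
    print(generate_gemm_tile(TileShape(m=4, n=4, k=8)))
```

Where a C++ template would take the tile shape as template arguments, the sketch uses ordinary Python values, so the generator can be inspected and debugged with ordinary Python tools.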

Install

Prebuilt

We offer prebuilt binaries for Python 3.7-3.10 and CUDA 10.2/11.1/11.3/11.4 on Linux (manylinux).

We offer prebuilt binaries for Python 3.7-3.10 and CUDA 10.2/11.1/11.3/11.4 on Windows 10/11.

We will offer prebuilts for the CUDA versions supported by the latest PyTorch release. For example, PyTorch 1.9 supports CUDA 10.2 and 11.1, so we support them too.

pip install cumm-cu102 for CUDA 10.2

pip install cumm-cu111 for CUDA 11.1

pip install cumm-cu113 for CUDA 11.3

pip install cumm-cu114 for CUDA 11.4
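The package names above follow a simple pattern: cumm-cu followed by the CUDA major and minor version without the dot. As a small sketch, the helper below maps a CUDA version string to the matching package name. The function cumm_package_for is hypothetical and not part of cumm, and the version list only covers the four packages named in this section.

```python
# CUDA versions with a prebuilt package listed in this section.
PREBUILT_CUDA_VERSIONS = ("10.2", "11.1", "11.3", "11.4")


def cumm_package_for(cuda_version: str) -> str:
    """Return the pip package name for a CUDA version such as '11.3' or '10.2.89'."""
    major_minor = ".".join(cuda_version.split(".")[:2])
    if major_minor not in PREBUILT_CUDA_VERSIONS:
        raise ValueError(
            f"no prebuilt cumm package listed for CUDA {cuda_version}; "
            f"listed versions: {', '.join(PREBUILT_CUDA_VERSIONS)}"
        )
    return "cumm-cu" + major_minor.replace(".", "")


if __name__ == "__main__":
    print("pip install " + cumm_package_for("11.3"))
```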

Build from source

Linux

  1. Install build-essential and CUDA.
  2. Run export CUMM_DISABLE_JIT="1"
  3. Run one of the following: python setup.py install, or pip install -e ., or python setup.py bdist_wheel followed by pip install dist/xxx.whl
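The build steps can also be scripted. The sketch below is one possible way to do that from Python, assuming it is run against a checkout of the cumm repository. The function names and the wheelhouse output directory are hypothetical. The directory is passed explicitly through bdist_wheel's --dist-dir option so the script does not have to guess where the wheel lands.

```python
import glob
import os
import subprocess
import sys

WHEEL_DIR = "wheelhouse"  # hypothetical output directory, passed explicitly below


def source_build_env(base_env=None):
    """Copy the environment and set CUMM_DISABLE_JIT=1 (step 2)."""
    env = dict(os.environ if base_env is None else base_env)
    env["CUMM_DISABLE_JIT"] = "1"
    return env


def source_build_command(mode, python=sys.executable):
    """Return the argv for one of the three alternatives in step 3."""
    commands = {
        "install": [python, "setup.py", "install"],
        "editable": [python, "-m", "pip", "install", "-e", "."],
        "wheel": [python, "setup.py", "bdist_wheel", "--dist-dir", WHEEL_DIR],
    }
    if mode not in commands:
        raise ValueError(f"unknown build mode: {mode!r}")
    return commands[mode]


def build_from_source(repo_dir, mode="editable"):
    """Run the chosen build inside repo_dir; for 'wheel', install the result too."""
    env = source_build_env()
    subprocess.run(source_build_command(mode), cwd=repo_dir, env=env, check=True)
    if mode == "wheel":
        wheels = glob.glob(os.path.join(repo_dir, WHEEL_DIR, "*.whl"))
        if not wheels:
            raise RuntimeError("bdist_wheel produced no wheel file")
        subprocess.run([sys.executable, "-m", "pip", "install", *wheels],
                       env=env, check=True)
```

Calling build_from_source(".", mode="editable") from the repository root corresponds to the pip install -e . alternative.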

Windows 10/11

  1. Install Visual Studio 2019 or newer and make sure the C++ development package is installed. Install CUDA.
  2. Set the PowerShell script execution policy.
  3. Start a new PowerShell session and run tools/msvc_setup.ps1
  4. Run $Env:CUMM_DISABLE_JIT = "1"
  5. Run one of the following: python setup.py install, or pip install -e ., or python setup.py bdist_wheel followed by pip install dist/xxx.whl
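Before running the final build step, it can help to confirm that the earlier steps took effect in the current session. The sketch below is a hypothetical pre-flight check, not something cumm ships. It assumes that tools/msvc_setup.ps1 makes the MSVC compiler cl available on PATH and that the CUDA installation puts nvcc on PATH. Neither assumption is confirmed by this document.

```python
import os
import shutil


def missing_prerequisites(tools=("cl", "nvcc"), env=None, which=shutil.which):
    """Return human-readable problems; an empty list means nothing was found wrong.

    tools: executables assumed (not confirmed) to be needed on PATH.
    env, which: injectable so the check can be exercised without a real toolchain.
    """
    env = os.environ if env is None else env
    problems = [f"{tool} not found on PATH" for tool in tools if which(tool) is None]
    if env.get("CUMM_DISABLE_JIT") != "1":
        problems.append('CUMM_DISABLE_JIT is not set to "1"')
    return problems


if __name__ == "__main__":
    for problem in missing_prerequisites():
        print("warning:", problem)
```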

Note

This work was done while the author was an employee at TuSimple.

Download files

Source Distributions

No source distribution files are available for this release.

Built Distributions

cumm-0.2.2-cp310-cp310-win_amd64.whl (713.6 kB): CPython 3.10, Windows x86-64

cumm-0.2.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (848.8 kB): CPython 3.10, manylinux (glibc 2.17+) x86-64

cumm-0.2.2-cp39-cp39-win_amd64.whl (710.2 kB): CPython 3.9, Windows x86-64

cumm-0.2.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (848.8 kB): CPython 3.9, manylinux (glibc 2.17+) x86-64

cumm-0.2.2-cp38-cp38-win_amd64.whl (713.5 kB): CPython 3.8, Windows x86-64

cumm-0.2.2-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (848.3 kB): CPython 3.8, manylinux (glibc 2.17+) x86-64

cumm-0.2.2-cp37-cp37m-win_amd64.whl (713.8 kB): CPython 3.7m, Windows x86-64

cumm-0.2.2-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (852.9 kB): CPython 3.7m, manylinux (glibc 2.17+) x86-64

cumm-0.2.2-cp36-cp36m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (852.6 kB): CPython 3.6m, manylinux (glibc 2.17+) x86-64
