Skip to main content

spatial sparse convolution

Project description

SpConv: Spatially Sparse Convolution Library

Build Status

spconv is a project that provide heavily-optimized sparse convolution implementation with tensor core support.

Spconv 1.x code. We won't provide any support for spconv 1.x since it's deprecated. use spconv 2.x if possible.

Breaking changes in Spconv 2.x

Spconv 1.x users NEED READ THIS before using spconv 2.x.

Spconv 2.1 vs Spconv 1.x

  • spconv now can be installed by pip. see install section in readme for more details.
  • Microsoft Windows support (only windows 10 has been tested).
  • fp32 (not tf32) training/inference speed is increased (+50~80%)
  • fp16 training/inference speed is greatly increased when your layer support tensor core (channel size must be multiple of 8).
  • int8 op is ready, but we still need some time to figure out how to run int8 in pytorch.
  • doesn't depend on pytorch binary.
  • since spconv 2.x doesn't depend on pytorch binary (never in future), it's impossible to support torch.jit/libtorch inference.

Spconv 2.1 vs 1.x speed:

1080Ti Spconv 1.x F32 1080Ti Spconv 2.0 F32 3080M* Spconv 2.1 F16
27x128x128 Fwd 11ms 5.4ms 1.4ms

* 3080M (Laptop) ~= 3070 Desktop

Usage

Firstly you need to use import spconv.pytorch as spconv in spconv 2.x.

Then see this.

Don't forget to check performance guide.

Install

You need to install python >= 3.7 first to use spconv 2.x.

You need to install CUDA toolkit first before using prebuilt binaries or build from source.

You need at least CUDA 10.2 to build and run spconv 2.x. We won't offer any support for CUDA < 10.2.

Prebuilt

We offer python 3.7-3.10 and cuda 10.2/11.1/11.3/11.4 prebuilt binaries for linux (manylinux) and windows 10/11.

We will provide prebuilts for CUDA versions supported by latest pytorch release. For example, pytorch 1.10 provide cuda 10.2 and 11.3 prebuilts, so we provide them too.

For Linux users, you need to install pip >= 20.3 first to install prebuilt.

CUDA 11.1 will be removed in spconv 2.2 because pytorch 1.10 don't provide prebuilts for it.

pip install spconv for CPU only (Linux Only). you should only use this for debug usage, the performance isn't optimized due to manylinux limit (no omp support).

pip install spconv-cu102 for CUDA 10.2

pip install spconv-cu111 for CUDA 11.1

pip install spconv-cu113 for CUDA 11.3 (Linux Only)

pip install spconv-cu114 for CUDA 11.4

NOTE It's safe to have different minor cuda version between system and conda (pytorch) in Linux. for example, you can use spconv-cu114 with anaconda version of pytorch cuda 11.1 in a OS with CUDA 11.2 installed.

Build from source for development (JIT, recommend)

The c++ code will be built automatically when you change c++ code in project.

For NVIDIA Embedded Platforms, you need to specify cuda arch before build: export CUMM_CUDA_ARCH_LIST="7.2" for xavier.

Linux

  1. uninstall spconv and cumm installed by pip
  2. install build-essential, install CUDA
  3. git clone https://github.com/FindDefinition/cumm, cd ./cumm, pip install -e .
  4. git clone https://github.com/traveller59/spconv, cd ./spconv, pip install -e .
  5. in python, import spconv and wait for build finish.

Windows

  1. uninstall spconv and cumm installed by pip
  2. install visual studio 2019 or newer. make sure C++ development component is installed. install CUDA
  3. set powershell script execution policy
  4. start a new powershell, run tools/msvc_setup.ps1
  5. git clone https://github.com/FindDefinition/cumm, cd ./cumm, pip install -e .
  6. git clone https://github.com/traveller59/spconv, cd ./spconv, pip install -e .
  7. in python, import spconv and wait for build finish.

Build wheel from source (not recommend, this is done in CI.)

You need to rebuild cumm first if you are build along a CUDA version that not provided in prebuilts.

Linux

  1. install build-essential, install CUDA
  2. run export SPCONV_DISABLE_JIT="1"
  3. run pip install pccm cumm wheel
  4. run python setup.py bdist_wheel+pip install dists/xxx.whl

Windows

  1. install visual studio 2019 or newer. make sure C++ development component is installed. install CUDA
  2. set powershell script execution policy
  3. start a new powershell, run tools/msvc_setup.ps1
  4. run $Env:SPCONV_DISABLE_JIT = "1"
  5. run pip install pccm cumm wheel
  6. run python setup.py bdist_wheel+pip install dists/xxx.whl

Roadmap for Spconv 2.2-2.3:

  • TensorFormat32 support for faster fp32 training when you use NVIDIA Geforce RTX 30x0/Tesla A100/Quadro RTX Ax000 (2.2)
  • change implicit gemm weight layout from KRSC to RSKC to make sure we can use native algorithm with implicit gemm weight. (2.2)
  • documents (2.2)
  • Ampere feature support (2.3)
  • pytorch int8 inference, and QAT support (2.3)

TODO in Spconv 2.x

  • Ampere (A100 / RTX 3000 series) feature support (work in progress)
  • torch QAT support (work in progress)
  • TensorRT (torch.fx based)
  • Build C++ only package
  • JIT compilation for CUDA kernels
  • Document (low priority)

Note

The work is done when the author is an employee at Tusimple.

LICENSE

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

spconv-2.1.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (467.9 kB view details)

Uploaded CPython 3.10manylinux: glibc 2.17+ x86-64

spconv-2.1.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (467.8 kB view details)

Uploaded CPython 3.9manylinux: glibc 2.17+ x86-64

spconv-2.1.2-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (466.0 kB view details)

Uploaded CPython 3.8manylinux: glibc 2.17+ x86-64

spconv-2.1.2-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (455.8 kB view details)

Uploaded CPython 3.7mmanylinux: glibc 2.17+ x86-64

File details

Details for the file spconv-2.1.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for spconv-2.1.2-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 824b596836ed9ccd954ec988d8216803b4ce484efcf5953e519d235b43a1b823
MD5 fca710be6a46c1218fa79d54e238e5b0
BLAKE2b-256 836a2468ed8706fc45a360b3d826a16a0ae804a48954f2ead997579f3b09c44b

See more details on using hashes here.

File details

Details for the file spconv-2.1.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for spconv-2.1.2-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 0e0493f4c6de32ab29bcc540b9120fb427f77367b3af45aa8fcaf3090ec85f75
MD5 ce3783a7ba2a2b3cf8d06b5bebcfc7ff
BLAKE2b-256 75a56fa27449c1f4351b6d91216c66894b1661e1a74417ebe7ea65ae62028cff

See more details on using hashes here.

File details

Details for the file spconv-2.1.2-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for spconv-2.1.2-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 d0c6ae7d45a0bdef99231a11c0868e7e422b867878da5d7929927209a27ace95
MD5 25f7a1ece1fd0aae85614418f0aab0f0
BLAKE2b-256 a6b68326f900008bd988815b969aac3b1035de7c6396585579a0e9b23f3e8095

See more details on using hashes here.

File details

Details for the file spconv-2.1.2-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.

File metadata

File hashes

Hashes for spconv-2.1.2-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
Algorithm Hash digest
SHA256 46cf9123ba22b123051ad4ef6e3ea16aefbfd87ad0428b9c9cd93eb17bce5951
MD5 af35baf052a32e6db2e0d2c3726a9890
BLAKE2b-256 7f93521cfc143d77c8f10ed6f17f46e5a7068dd7395a7c7d9f8cd8f3214e96e9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page