gswarp

Pure-Python NVIDIA Warp backend for 3D Gaussian Splatting

gswarp is a pure-Python NVIDIA Warp backend for 3D Gaussian Splatting, reimplementing the three core CUDA modules used in 3DGS training — the rasterizer, SSIM loss, and KNN initialization. No C++/CUDA compilation required; install via pip and swap out the original CUDA implementations directly.

License: Apache License 2.0. Third-party attributions in NOTICE.



Three Replacement Modules

| Module | Replaces | Import Path | Notes |
|---|---|---|---|
| Rasterizer | diff_gaussian_rasterization | gswarp | Full differentiable Gaussian rasterization + auto-tuning |
| SSIM | fused_ssim | gswarp.fused_ssim | Separable Gaussian convolution with launch caching |
| KNN | simple_knn | gswarp.knn | Morton-sort + bounding-box pruning 3-NN |
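For context on the KNN module's "Morton-sort" note: Morton ordering interleaves the bits of quantized x/y/z coordinates so that spatially nearby points end up near each other after a 1D sort, which is what makes bounding-box pruning effective. A minimal sketch of 30-bit 3D Morton encoding (illustrative only, not gswarp's actual code):

```python
def expand_bits(v: int) -> int:
    """Spread the low 10 bits of v so each bit lands 3 positions apart."""
    v &= 0x3FF
    v = (v | (v << 16)) & 0x030000FF
    v = (v | (v << 8)) & 0x0300F00F
    v = (v | (v << 4)) & 0x030C30C3
    v = (v | (v << 2)) & 0x09249249
    return v


def morton3d(x: int, y: int, z: int) -> int:
    """Interleave 10-bit x/y/z into a 30-bit Morton code (x in the lowest bit)."""
    return (expand_bits(z) << 2) | (expand_bits(y) << 1) | expand_bits(x)
```

Sorting points by `morton3d` of their quantized coordinates gives the cache-friendly ordering the pruning step relies on.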

Requirements

| Component | Minimum |
|---|---|
| Python | 3.10+ |
| NVIDIA GPU | Compute capability ≥ 7.0 (Volta) |
| PyTorch | 2.0+ (with CUDA support) |
| NVIDIA Warp | 1.12.0+ |

Installation

```shell
pip install gswarp
```

This installs warp-lang automatically via package dependencies. If you want to pin the Warp version explicitly, use:

```shell
pip install "warp-lang>=1.12.0" gswarp
```

Or install from source:

```shell
git clone https://github.com/fancifulland2718/gswarp.git
cd gswarp
pip install .
```

No compilation steps are needed after installation. The first call to any Warp kernel triggers JIT compilation (a few seconds); subsequent runs use the cache.
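Because of that first-call JIT cost, keep a warm-up call outside any timing loop when benchmarking. A generic sketch (`bench` is a hypothetical helper; pass your first rasterizer/SSIM/KNN call as `fn`):

```python
import time


def bench(fn, *args, warmup=1, iters=10):
    """Average runtime of fn, excluding one-time costs such as kernel JIT.

    The warm-up calls absorb first-call compilation; only the timed
    iterations contribute to the reported average.
    """
    for _ in range(warmup):
        fn(*args)
    t0 = time.perf_counter()
    for _ in range(iters):
        fn(*args)
    return (time.perf_counter() - t0) / iters
```

The same pattern explains why the first training iteration is slow and the rest run at full speed from the kernel cache.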


Replacing CUDA Backends in a 3DGS Project

The examples below follow the gaussian-splatting reference implementation.

Rasterizer

Original (gaussian_renderer/__init__.py):

```python
from diff_gaussian_rasterization import (
    GaussianRasterizationSettings,
    GaussianRasterizer,
)
```

Replace with:

```python
from gswarp import (
    GaussianRasterizationSettings,
    GaussianRasterizer,
)
```

GaussianRasterizationSettings is a NamedTuple with the same fields as the original, plus Warp-specific fields that have default values. GaussianRasterizer.forward() also returns more outputs than the original:

```python
# Original CUDA:
color, radii = rasterizer(means3D=..., means2D=..., ...)

# gswarp:
color, radii, depth, alpha, proj_2D, conic_2D, conic_2D_inv, \
    gs_per_pixel, weight_per_gs_pixel, x_mu = rasterizer(...)
# Extra outputs can be ignored when only color and radii are needed
```
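If surrounding code expects the original two-output call, a thin adapter keeps call sites unchanged (`as_color_radii` is a hypothetical helper, not part of gswarp):

```python
def as_color_radii(outputs):
    """Reduce gswarp's 10-tuple of rasterizer outputs to the original
    (color, radii) pair, discarding the extra diagnostics."""
    color, radii, *_extras = outputs
    return color, radii
```

Call sites then read `color, radii = as_color_radii(rasterizer(...))` and need no other changes.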

Optional runtime configuration (call once before the training loop):

```python
from gswarp import initialize_runtime_tuning, set_binning_sort_mode

# Detect GPU and select optimal block_dim automatically (recommended)
initialize_runtime_tuning(device="cuda:0", verbose=True)

# Choose a sort mode (default warp_depth_stable_tile is usually best)
set_binning_sort_mode("warp_depth_stable_tile")  # recommended for large scenes
# set_binning_sort_mode("warp_radix")            # alternative
# set_binning_sort_mode("torch")                 # fallback
```

SSIM

Original (train.py):

```python
from fused_ssim import fused_ssim
```

Replace with:

```python
from gswarp.fused_ssim import fused_ssim
```

The function signature is identical:

```python
loss_ssim = fused_ssim(img1, img2, padding="same", train=True)
```
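In the reference train.py, the SSIM term enters the total loss as (1 − λ)·L1 + λ·(1 − SSIM) with λ = 0.2. A scalar sketch of that weighting (plain floats stand in for the tensor values; the formula is from the reference implementation, not gswarp-specific):

```python
def combined_loss(l1: float, ssim_value: float, lambda_dssim: float = 0.2) -> float:
    """3DGS reference training loss: (1 - λ)·L1 + λ·(1 - SSIM)."""
    return (1.0 - lambda_dssim) * l1 + lambda_dssim * (1.0 - ssim_value)
```

Since `fused_ssim` is a drop-in replacement, this surrounding loss code does not change when switching backends.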

KNN

Original (scene/gaussian_model.py):

```python
from simple_knn._C import distCUDA2
```

Replace with:

```python
from gswarp.knn import distCUDA2
```

The function signature is identical:

```python
dist2 = distCUDA2(points)  # points: (N, 3) float32 CUDA tensor
```
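For reference, distCUDA2 returns, for each point, the mean of the squared distances to its three nearest neighbors; the reference implementation uses this to initialize Gaussian scales. A brute-force pure-Python sketch of that contract (O(N²), illustrative only; gswarp's version uses Morton sorting and bounding-box pruning instead):

```python
def mean_sq_dist_3nn(points):
    """For each 3D point, mean of squared distances to its 3 nearest neighbors."""
    out = []
    for i, p in enumerate(points):
        d2 = sorted(
            sum((a - b) ** 2 for a, b in zip(p, q))
            for j, q in enumerate(points)
            if j != i
        )
        out.append(sum(d2[:3]) / 3.0)
    return out
```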

Recommended: Replace All at Once

Add the following near the top of train.py:

```python
try:
    from gswarp import GaussianRasterizationSettings, GaussianRasterizer
    from gswarp.fused_ssim import fused_ssim
    from gswarp.knn import distCUDA2
    GSWARP_AVAILABLE = True
except ImportError:
    GSWARP_AVAILABLE = False
```
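If you prefer to detect availability without triggering any import side effects, `importlib.util.find_spec` offers an equivalent check (a sketch of an alternative pattern, not something gswarp requires):

```python
import importlib.util


def backend_available(module_name: str = "gswarp") -> bool:
    """Return True if module_name is importable, without importing it."""
    return importlib.util.find_spec(module_name) is not None


GSWARP_AVAILABLE = backend_available()
```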

Then switch backends at each usage site using the GSWARP_AVAILABLE flag. A reference integration is available in gaussian-splatting/train.py.


Performance

Results from full 30K-step training on 12 standard 3DGS datasets. Hardware: RTX 5090D V2 (sm_120, 24 GiB), Python 3.14, PyTorch 2.11.0+cu130, Warp 1.12.0. All three modules use the Warp backend, with Python-layer overhead optimizations applied.

| Dataset | CUDA (it/s) | Warp (it/s) | Speedup |
|---|---|---|---|
| chair | 103.6 | 113.1 | ×1.09 |
| drums | 103.0 | 115.3 | ×1.12 |
| ficus | 139.5 | 148.2 | ×1.06 |
| hotdog | 144.5 | 156.5 | ×1.08 |
| lego | 117.5 | 126.5 | ×1.08 |
| materials | 134.0 | 144.9 | ×1.08 |
| mic | 95.4 | 105.1 | ×1.10 |
| ship | 107.0 | 113.2 | ×1.06 |
| train | 55.6 | 58.3 | ×1.05 |
| truck | 39.4 | 40.1 | ×1.02 |
| drjohnson | 30.8 | 32.0 | ×1.04 |
| playroom | 46.9 | 47.5 | ×1.01 |

NeRF Synthetic (8 scenes): average ×1.08 speedup. Tanks & Temples / Deep Blending (4 large scenes): average ×1.03 speedup. Gains shrink on the large scenes (drjohnson, playroom) because their higher Gaussian counts dilute the rasterizer kernel's advantage; see the rasterizer documentation for a per-phase breakdown.
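That dilution is just Amdahl's law: if the accelerated kernels account for a fraction f of each training step and speed up by s×, the end-to-end gain is 1 / ((1 − f) + f / s). A sketch with illustrative numbers (the fractions below are hypothetical, not measured):

```python
def overall_speedup(fraction: float, kernel_speedup: float) -> float:
    """Amdahl's law: end-to-end speedup when only `fraction` of the
    step time is accelerated by `kernel_speedup`."""
    return 1.0 / ((1.0 - fraction) + fraction / kernel_speedup)
```

With the same kernel speedup, a scene where the kernels dominate (f close to 1) sees nearly the full gain, while a large scene where other phases dominate sees much less.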


Quality Metrics

Test-set evaluation after 30K training steps:

NeRF Synthetic (8-scene average)

| Metric | CUDA | Warp | Δ |
|---|---|---|---|
| PSNR (dB) | 33.31 | 33.33 | +0.02 |
| SSIM | 0.9692 | 0.9693 | +0.0001 |
| LPIPS | 0.0303 | 0.0302 | −0.0001 |

Tanks & Temples (2-scene average)

| Metric | CUDA | Warp | Δ |
|---|---|---|---|
| PSNR (dB) | 23.74 | 23.79 | +0.05 |
| SSIM | 0.8512 | 0.8515 | +0.0003 |
| LPIPS | 0.1711 | 0.1707 | −0.0004 |

Deep Blending (2-scene average)

| Metric | CUDA | Warp | Δ |
|---|---|---|---|
| PSNR (dB) | 29.77 | 30.01 | +0.24 |
| SSIM | 0.9062 | 0.9063 | +0.0001 |
| LPIPS | 0.2390 | 0.2388 | −0.0002 |

Per-scene PSNR differences are within ±0.25 dB; SSIM differences are < 0.001. The Warp backend produces training quality equivalent to the CUDA baseline across all tested scenes.


Detailed Documentation

| Document | Contents |
|---|---|
| docs/rasterizer.md | Architecture, CUDA implementation differences, micro-benchmarks, correctness, known limitations |
| docs/ssim.md | SSIM kernel optimizations, performance analysis, correctness |
| docs/knn.md | KNN algorithm, Morton sorting, performance analysis |



Download files

Download the file for your platform.

Source Distribution

gswarp-1.0.2.tar.gz (65.6 kB)

Uploaded Source

Built Distribution


gswarp-1.0.2-py3-none-any.whl (62.9 kB)

Uploaded Python 3

File details

Details for the file gswarp-1.0.2.tar.gz.

File metadata

  • Download URL: gswarp-1.0.2.tar.gz
  • Size: 65.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gswarp-1.0.2.tar.gz

| Algorithm | Hash digest |
|---|---|
| SHA256 | 7f1a05f690e0ca0367bd8709a6a9377e27f4bf02deb5f539e843aed5d3b7e14f |
| MD5 | 807bc8458ede0e0966bdaff0bb5536fc |
| BLAKE2b-256 | 5f1387867246c75c7804f8f0ac21fd4351adc2eb0f1ba60f2f9d4acb6738e495 |


Provenance

The following attestation bundles were made for gswarp-1.0.2.tar.gz:

Publisher: python-publish.yml on fancifulland2718/gswarp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file gswarp-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: gswarp-1.0.2-py3-none-any.whl
  • Size: 62.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for gswarp-1.0.2-py3-none-any.whl

| Algorithm | Hash digest |
|---|---|
| SHA256 | f49400efc411b82f8c510f6a49faf4537eace20680f49dea5105e430815003e8 |
| MD5 | f1d83a66afe2150f18a2514268850864 |
| BLAKE2b-256 | a7e60f2791f6567c056b8435e30bfa6d9cb34e01f31e25c7b37adcac3a310973 |


Provenance

The following attestation bundles were made for gswarp-1.0.2-py3-none-any.whl:

Publisher: python-publish.yml on fancifulland2718/gswarp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
