Skip to main content

Real space transformations of 2D/3D images in PyTorch

Project description

torch-transform-image

License PyPI Python Version CI codecov

Real space transformations of 2D/3D images in PyTorch

Motivation

This package provides a simple, consistent API for applying affine transformations to 2D/3D images in PyTorch. It enables efficient, GPU-accelerated geometric transformations of images.

Installation

pip install torch-transform-image

Features

  • Apply arbitrary affine transformations to 2D and 3D images
  • Support for various interpolation methods (nearest, bilinear, bicubic for 2D; nearest, trilinear for 3D)
  • Batched operations for efficient processing
  • Fully differentiable operations compatible with PyTorch's autograd

Coordinate System

This package uses the same coordinate system as NumPy/PyTorch array indexing:

  • For 2D images: coordinates are ordered as [y, x] for dimensions (height, width)
  • For 3D images: coordinates are ordered as [z, y, x] for dimensions (depth, height, width)

Transformation matrices left-multiply homogeneous pixel coordinates ([y, x, 1] for 2D and [z, y, x, 1] for 3D).

Generating Transformation Matrices

The companion package torch-affine-utils provides convenient functions to generate transformation matrices that work with homogenous pixel coordinates (yxw/zyxw):

from torch_affine_utils.transforms_2d import R, T, S  # Rotation, Translation, Scale for 2D
from torch_affine_utils.transforms_3d import Rx, Ry, Rz, T, S  # Rotation, Translation, Scale for 3D

Usage

2D Transformations

import torch
from torch_transform_image import affine_transform_image_2d
from torch_affine_utils.transforms_2d import R, T, S  # Rotation, Translation, Scale

# Create a test image (28×28)
image = torch.zeros((28, 28), dtype=torch.float32)
image[14, 14] = 1  # Place a dot at the center

# Create a transformation matrix to translate coordinates 4 pixels in y direction
translation = T([4, 0])  # Uses [y, x] coordinate order matching dimensions (h, w)

# Apply the transformation
result = affine_transform_image_2d(
    image=image,
    matrices=translation, 
    interpolation='bilinear'  # Options: 'nearest', 'bilinear', 'bicubic'
)

# Compose multiple transformations
# First translate to origin, then rotate, then translate back
T1 = T([-14, -14])  # Move center to origin
R1 = R(45)  # Rotate 45 degrees
T2 = T([14, 14])  # Move back
transform = T2 @ R1 @ T1  # Matrix composition (applied right-to-left)

# Apply the composed transformation
rotated = affine_transform_image_2d(
    image=image,
    matrices=transform,
    interpolation='bicubic'
)

3D Transformations

import torch
from torch_transform_image import affine_transform_image_3d
from torch_affine_utils.transforms_3d import R, T, S  # Rotation, Translation, Scale

# Create a test volume (64×64×64)
volume = torch.zeros((64, 64, 64), dtype=torch.float32)
volume[32, 32, 32] = 1  # Place a dot at the center

# Create a transformation matrix (translate coordinates 5 voxels in z direction)
translation = T([5, 0, 0])  # Uses [z, y, x] coordinate order matching dimensions (d, h, w)

# Apply the transformation
result = affine_transform_image_3d(
    image=volume,
    matrices=translation, 
    interpolation='trilinear'  # Options: 'nearest', 'trilinear'
)

How It Works

Under the hood, the package:

  1. Creates a coordinate grid for the input image
  2. Applies the transformation matrix to these coordinates
  3. Samples the original image at the transformed coordinates using the specified interpolation method

All operations are performed in PyTorch, making them fully differentiable and GPU-compatible.

License

This project is licensed under the BSD 3-Clause License - see the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

torch_transform_image-0.1.0.tar.gz (7.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

torch_transform_image-0.1.0-py3-none-any.whl (5.7 kB view details)

Uploaded Python 3

File details

Details for the file torch_transform_image-0.1.0.tar.gz.

File metadata

  • Download URL: torch_transform_image-0.1.0.tar.gz
  • Upload date:
  • Size: 7.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for torch_transform_image-0.1.0.tar.gz
Algorithm Hash digest
SHA256 daf6baa41096fb4e8e725fdf7b5d7d5c3f6e2c50c68a4f2ae77c1753a1a827a1
MD5 8b13f881943721e3b10c782c89307407
BLAKE2b-256 49ccdb9414f9322413b889d84effe5147b9320bf797528ef5da2bad61c0940f3

See more details on using hashes here.

Provenance

The following attestation bundles were made for torch_transform_image-0.1.0.tar.gz:

Publisher: ci.yml on teamtomo/torch-transform-image

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file torch_transform_image-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for torch_transform_image-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bd1f3a49b3a3f41d82ed3b0dfee6eea4a94b387303b71aa20687b095b71eac67
MD5 ec05efee4d97ec82cfd73f62dfae7299
BLAKE2b-256 97f2ebef052a1424720400643687585cac4ed36ee0400b5c1fc71d7bd2085f23

See more details on using hashes here.

Provenance

The following attestation bundles were made for torch_transform_image-0.1.0-py3-none-any.whl:

Publisher: ci.yml on teamtomo/torch-transform-image

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page