Skip to main content

equirectangular image processing with python using minimum dependencies

Project description

equilib

Processing Equirectangular Images with Python

PyPI version GitHub license
equilib
  • A library for processing equirectangular image that runs on Python.
  • Developed using Python>=3.6 (c++ is WIP).
  • Compatible with cuda tensors for faster processing.
  • No need for other dependencies except for numpy and torch.
  • Added functionality like creating rotation matrices, batched processing, and automatic type detection.
  • Highly modular

If you found this module helpful to your project, please site this repository:

@software{pyequilib2021github,
  author = {Haruya Ishikawa},
  title = {PyEquilib: Processing Equirectangular Images with Python},
  url = {http://github.com/haruishi43/equilib},
  version = {0.5.0},
  year = {2021},
}

Installation:

Prerequisites:

  • Python (>=3.6)
  • Pytorch (tested on 1.12)
pip install pyequilib

For developing, use:

git clone --recursive https://github.com/haruishi43/equilib.git
cd equilib

pip install -r requirements.txt

pip install -e .
# or
python setup.py develop

NOTE: might not work for PyTorch>=2.0. If you have any issues, please open an issue.

Basic Usage:

equilib has different transforms of equirectangular (or cubemap) images (note each transform has class and func APIs):

  • Cube2Equi/cube2equi: cubemap to equirectangular transform
  • Equi2Cube/equi2cube: equirectangular to cubemap transform
  • Equi2Equi/equi2equi: equirectangular transform
  • Equi2Pers/equi2pers: equirectangular to perspective transform

There are no real differences in class or func APIs:

  • class APIs will allow instantiating a class which you can call many times without having to specify configurations (class APIs call the func API)
  • func APIs are useful when there are no repetitive calls
  • both class and func APIs are extensible, so you can extend them to your use-cases or create a method that's more optimized (pull requests are welcome btw)

Each API automatically detects the input type (numpy.ndarray or torch.Tensor), and outputs are the same type.

An example for Equi2Pers/equi2pers:

Equi2Pers
equi2pers
import numpy as np
from PIL import Image
from equilib import Equi2Pers

# Input equirectangular image
equi_img = Image.open("./some_image.jpg")
equi_img = np.asarray(equi_img)
equi_img = np.transpose(equi_img, (2, 0, 1))

# rotations
rots = {
    'roll': 0.,
    'pitch': np.pi/4,  # rotate vertical
    'yaw': np.pi/4,  # rotate horizontal
}

# Intialize equi2pers
equi2pers = Equi2Pers(
    height=480,
    width=640,
    fov_x=90.0,
    mode="bilinear",
)

# obtain perspective image
pers_img = equi2pers(
    equi=equi_img,
    rots=rots,
)
import numpy as np
from PIL import Image
from equilib import equi2pers

# Input equirectangular image
equi_img = Image.open("./some_image.jpg")
equi_img = np.asarray(equi_img)
equi_img = np.transpose(equi_img, (2, 0, 1))

# rotations
rots = {
    'roll': 0.,
    'pitch': np.pi/4,  # rotate vertical
    'yaw': np.pi/4,  # rotate horizontal
}

# Run equi2pers
pers_img = equi2pers(
    equi=equi_img,
    rots=rots,
    height=480,
    width=640,
    fov_x=90.0,
    mode="bilinear",
)

For more information about how each APIs work, take a look in .readme or go through example codes in the tests or scripts.

Coordinate System:

Right-handed rule XYZ global coordinate system. x-axis faces forward and z-axis faces up.

  • roll: counter-clockwise rotation about the x-axis
  • pitch: counter-clockwise rotation about the y-axis
  • yaw: counter-clockwise rotation about the z-axis

You can chnage the right-handed coordinate system so that the z-axis faces down by adding z_down=True as a parameter.

See demo scripts under scripts.

Grid Sampling

To process equirectangular images fast, whether to crop perspective images from the equirectangular image, the library takes advantage of grid sampling techniques. Some sampling techniques are already implemented, such as scipy.ndimage.map_coordiantes and cv2.remap. This project's goal was to reduce these dependencies and use cuda and batch processing with torch and c++ for a faster processing of equirectangular images. There were not many projects online for these purposes. In this library, we implement varieties of methods using c++, numpy, and torch. This part of the code needs cuda acceleration because grid sampling is parallelizable. For torch, the built-in torch.nn.functional.grid_sample function is very fast and reliable. I have implemented a pure torch implementation of grid_sample which is very customizable (might not be fast as the native function). For numpy, I have implemented grid sampling methods that are faster than scipy and more robust than cv2.remap. Just like with this implementation of torch, numpy implementation is just as customizable. It is also possible to pass the scipy and cv2's grid sampling function through the use of override_func argument in grid_sample. Developing faster approaches and c++ methods are WIP. See here for more info on implementations.

Some notes:

  • By default, numpy's grid_sample will use pure numpy implementation. It is possible to override this implementation with scipy and cv2's implementation using override_func.
  • By default, torch's grid_sample will use the official implementation.
  • Benchmarking codes are stored in tests/. For example, benchmarking codes for numpy's equi2pers is located in tests/equi2pers/numpy_run_baselines.py and you can benchmark the runtime performance using different parameters against scipy and cv2.

Develop:

Test files for equilib are included under tests.

Running tests:

pytest tests

Note that I have added codes to benchmark every step of the process so that it is possible to optimize the code. If you find there are optimal ways of the implementation or bugs, all pull requests and issues are welcome.

Check CONTRIBUTING.md for more information

TODO:

  • Documentations for each transform
  • Add table and statistics for speed improvements
  • Batch processing for numpy
  • Mixed precision for torch
  • c++ version of grid sampling
  • More accurate intrinsic matrix formulation using vertial FOV for equi2pers
  • Multiprocessing support (slow when running on torch.distributed)

Acknowledgements:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyequilib-0.5.8.tar.gz (71.1 kB view details)

Uploaded Source

Built Distribution

pyequilib-0.5.8-py3-none-any.whl (129.3 kB view details)

Uploaded Python 3

File details

Details for the file pyequilib-0.5.8.tar.gz.

File metadata

  • Download URL: pyequilib-0.5.8.tar.gz
  • Upload date:
  • Size: 71.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for pyequilib-0.5.8.tar.gz
Algorithm Hash digest
SHA256 fa34947162583bdf235d94e6b9d2d32847575ba1357ced9eeb4ec91e5650b592
MD5 f1cd2093a75db0a23be5a550b27017b7
BLAKE2b-256 e5a36b39bb3286212f79e89c1f5bb85096a05c0c5d5feefd99b50eb56ad33aa0

See more details on using hashes here.

File details

Details for the file pyequilib-0.5.8-py3-none-any.whl.

File metadata

  • Download URL: pyequilib-0.5.8-py3-none-any.whl
  • Upload date:
  • Size: 129.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.7

File hashes

Hashes for pyequilib-0.5.8-py3-none-any.whl
Algorithm Hash digest
SHA256 645cdb71d2432800df60f3176df6baf033cdfedbdadceb8bb2349ed70afb338e
MD5 43d2b72400eda8f712e51d38098609f3
BLAKE2b-256 e760b0a3eae47431fd1f9912a94ae9ce5e5afc17436bfeb74a082c11d94543f8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page