A 2D toy illustration of Neural Radiance Fields
Project description
nerf-toy
A 2D toy illustration of Neural Radiance Fields
Description • Features • Examples • References • License
Original | No Mapping | Basic Mapping | Fourier Feature Mapping |
---|---|---|---|
Description
nerf-toy is a 2D toy illustration of the Neural Radiance Fields. Memorizing a 2D image using an MLP is a nice problem to understand before going into solving the problem NeRF tackles i.e. memorising a 3D scene given multiple view points.
The task is to produce the (r, g, b) value of an image as a function of (x,y) image coordinates. Our model is a coordinate-based multilayer perceptron.
This is implemented using 2D convolutions with a kernel size of (1, 1), which act as "pointwise" convolutions. This is equivalent to a densely connected multilayer perceptron for each coordinate. Also, we use batchnorm to speed up convergence.
The model is trained with the following input mappings $\gamma (\mathbf{v})$ for comparison :
-
No mapping: $\gamma(\mathbf{v})= \mathbf{v}$.
-
Basic mapping: $\gamma(\mathbf{v})=\left[ \cos(2 \pi \mathbf{v}),\sin(2 \pi \mathbf{v}) \right]^\mathrm{T}$.
- Gaussian Fourier feature mapping: $\gamma(\mathbf{v})=\left[ \cos(2 \pi \mathbf B \mathbf{v}), \sin(2 \pi \mathbf B \mathbf{v}) \right]^\mathrm{T}$, where each entry in $\mathbf B \in \mathbb R^{m \times d}$ is sampled from $\mathcal N(0,\sigma^2)$
Features
- Transforms: Basic, Positional Encoding and Gaussian Fourier Feature
- Data loader for any input image, where filepath, image url or bytes can be passed as input
- Keras based NeRF toy model, which can be customised based on number of layer and output channels
- Metrics: PSNR , SSIM
- Custom training callback: PredictionVideoSaverCallback, PlotLossesAndMetricsCallback
- Utility functions to read and manipulate image
Examples
The demo notebook demonstrates the core idea with full model training from scratch.
Original | No Mapping | Basic Mapping | Fourier Feature Mapping |
---|---|---|---|
It can be observed that directly regressing the (x, y) image coordinates results in blurry reconstructions.
Instead, lifting the input pixel coordinates (x, y) to higher dimensions via transformations (e.g. gaussian fourier feature) makes it easier for network to learn high frequency functions in low dimensional domains. Training with transformed (x, y) coordinates shows dramatic improvements in the results and can preserve the sharp edges in the image.
References
- NeRF Paper
- Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains
- Neural Tangent Kernel: Convergence and Generalization in Neural Networks
- NeRF Video
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file nerf_toy-0.0.1.tar.gz
.
File metadata
- Download URL: nerf_toy-0.0.1.tar.gz
- Upload date:
- Size: 6.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6cef347043767f95ce97c64fce57f4f848bf53996f9abb666a05667bc2616385 |
|
MD5 | 3f20627b45638b145a437e846d21ad09 |
|
BLAKE2b-256 | d72b06d47ae9c040b4364cbb546ce19d605ad8daf7fabae3b9fa51b53f3c4168 |
File details
Details for the file nerf_toy-0.0.1-py3-none-any.whl
.
File metadata
- Download URL: nerf_toy-0.0.1-py3-none-any.whl
- Upload date:
- Size: 4.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ee378d406b4ef5a840bd116e37944cb25b29071320df988957404f40cc042a9c |
|
MD5 | de63f55e8af2a92c4d306bff7db1c735 |
|
BLAKE2b-256 | c18a8e0604ae8621c684c27a3172430f60e5002254a393ab3eb5446a1d6ba715 |