
Official toolkit for the NeRSemble Photorealistic 3D Head Avatar Benchmark


NeRSemble Photorealistic 3D Head Avatar Benchmark

This is the official NeRSemble Benchmark Toolkit for downloading the data and preparing submissions for the NeRSemble Photorealistic 3D Head Avatar benchmarks.
For submitting your results, please go to our submission system.

1. Data Access & Setup

  1. Request access to the NeRSemble dataset (only necessary if you have not requested access before): https://forms.gle/rYRoGNh2ed51TDWX9
  2. Once approved, you will receive an email with the download link in the form of
    NERSEMBLE_BENCHMARK_URL = "..."
    
  3. Create a file at ~/.config/nersemble_benchmark/.env with the following content:
    NERSEMBLE_BENCHMARK_URL = "<<<URL YOU GOT WHEN REQUESTING ACCESS TO NERSEMBLE>>>"
    
  4. Install this repository via pip install nersemble_benchmark
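The config file from step 3 can also be created programmatically. A minimal sketch (the path and variable name are the ones stated above; the placeholder URL must be replaced with the one you received by email):

```python
from pathlib import Path

# Path and variable name as specified in the setup instructions above.
config_dir = Path.home() / ".config" / "nersemble_benchmark"
config_dir.mkdir(parents=True, exist_ok=True)

# Replace the placeholder with the URL you received when requesting access.
url = "<<<URL YOU GOT WHEN REQUESTING ACCESS TO NERSEMBLE>>>"
(config_dir / ".env").write_text(f'NERSEMBLE_BENCHMARK_URL = "{url}"\n')
```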

2. Data Download

After installation of the benchmark repository, a nersemble-benchmark-download command will be available in your environment. This is the main tool to download the benchmark data. To get a detailed description of download options, run nersemble-benchmark-download --help. In the following, ${benchmark_folder} denotes the path to your local folder where the benchmark data should be downloaded to.

Overview

NVS Benchmark (1604 x 1100)

| Participant ID | Sequence       | #Frames  | Size        | Size (incl. pointclouds) |
|----------------|----------------|----------|-------------|--------------------------|
| 388            | GLASSES        | 1118     | 1.06 GB     | 21.8 GB                  |
| 422            | EXP-2-eyes     | 517      | 386 MB      | 16.1 GB                  |
| 443            | FREE           | 1108     | 1.19 GB     | 17.3 GB                  |
| 445            | EXP-6-tongue-1 | 514      | 401 MB      | 13.4 GB                  |
| 475            | HAIR           | 259      | 325 MB      | 773 MB                   |
|                |                | Σ = 3516 | Σ = 3.34 GB | Σ = 69.6 GB              |

13 of the available 16 cameras are provided for training; the remaining 3 cameras (222200046, 222200037, 222200039) are held out and used to compute the test metrics.

Mono FLAME Avatar Benchmark (512 x 512)

| Participant ID | #Sequences (train / test) | #Frames (train / test) | Size       |
|----------------|---------------------------|------------------------|------------|
| 393            | 18 / 4                    | 2,964 / 816            | 27 MB      |
| 404            | 18 / 4                    | 2,009 / 665            | 28 MB      |
| 461            | 18 / 4                    | 2,057 / 486            | 29 MB      |
| 477            | 18 / 4                    | 2,543 / 530            | 37 MB      |
| 486            | 18 / 4                    | 2,440 / 608            | 23 MB      |
|                |                           | Σ = 12,013 / 3,105     | Σ = 144 MB |

Only a single camera is provided for training: 222200037. For all participants, the same 4 sequences are held out: EMO-1-shout+laugh, FREE, SEN-09-frown_events_bad, and SEN-10-port_strong_smokey. Test metrics are computed on both the training camera and 3 hold-out cameras (222200046, 220700191, 222200039).

NVS Benchmark download

nersemble-benchmark-download ${benchmark_folder} nvs 

NVS pointclouds

The NVS benchmark also comes with pointclouds for each timestep that can be used to solve the task. Due to their size, by default only the first pointcloud of each sequence is downloaded, which can be helpful, for example, to initialize 3D Gaussians. To download the pointclouds for all frames of the benchmark sequences, use --pointcloud_frames all. The pointclouds contain 3D point positions, colors, and normals.

Mono FLAME Avatar Benchmark download

nersemble-benchmark-download ${benchmark_folder} mono_flame_avatar 

FLAME tracking

The Mono FLAME Avatar benchmark comes with FLAME tracking for each timestep of both the train sequences and the hold-out sequences.
These are downloaded by default, but can also be specifically targeted for download via --assets flame2023_tracking.

3. Usage

Data Managers

The benchmark repository provides data managers to simplify loading individual assets such as images in Python code.

from nersemble_benchmark.data.benchmark_data import NVSDataManager
from nersemble_benchmark.constants import BENCHMARK_NVS_IDS_AND_SEQUENCES, BENCHMARK_NVS_TRAIN_SERIALS

benchmark_folder = "path/to/local/benchmark/folder"
participant_id, sequence_name = BENCHMARK_NVS_IDS_AND_SEQUENCES[0]  # <- Use first benchmark subject
serial = BENCHMARK_NVS_TRAIN_SERIALS[0]  # <- Use first train camera
timestep = 0  # <- Use first timestep

data_manager = NVSDataManager(benchmark_folder, participant_id)

Load image

image = data_manager.load_image(sequence_name, serial, timestep)  # <- Load first frame. Background is already removed
Loaded example image

Load Alpha Map

image = data_manager.load_alpha_map(sequence_name, serial, timestep)  # <- Load alpha map
Loaded example alpha map
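Since the loaded images already have the background removed, the alpha map is mainly useful for compositing or loss masking. A minimal compositing sketch onto a white background, assuming the image arrives as a (H, W, 3) uint8 array and the alpha map as a (H, W) uint8 array (these shapes and dtypes are assumptions, not guaranteed by the toolkit):

```python
import numpy as np

# Hedged sketch: composite a foreground image onto a white background using
# its alpha map. Assumes a (H, W, 3) uint8 image and a (H, W) uint8 alpha
# map in 0-255.
def composite_on_white(image: np.ndarray, alpha: np.ndarray) -> np.ndarray:
    a = alpha.astype(np.float32)[..., None] / 255.0
    white = np.full(image.shape, 255.0, dtype=np.float32)
    out = image.astype(np.float32) * a + white * (1.0 - a)
    return out.round().astype(np.uint8)

# Tiny synthetic example (in practice, use data_manager.load_image / load_alpha_map).
img = np.zeros((2, 2, 3), dtype=np.uint8)           # black foreground
alpha = np.array([[255, 0], [128, 255]], dtype=np.uint8)
result = composite_on_white(img, alpha)
```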

Load cameras

camera_params = data_manager.load_camera_calibration()
world_2_cam_pose = camera_params.world_2_cam[serial]  # <- 4x4 world2cam extrinsic matrix in OpenCV camera coordinate convention
intrinsics = camera_params.intrinsics[serial]  # <- 3x3 intrinsic matrix
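As a sanity check of the stated OpenCV convention, a world point can be projected into pixel coordinates by applying the extrinsics and then the intrinsics. The camera values below are synthetic; in practice, plug in the world_2_cam and intrinsics matrices loaded above:

```python
import numpy as np

# Hedged sketch: project a 3D world point into pixel coordinates following
# the OpenCV pinhole convention (camera looks down +z after the extrinsics).
def project(world_2_cam: np.ndarray, intrinsics: np.ndarray,
            point_world: np.ndarray) -> np.ndarray:
    p_h = np.append(point_world, 1.0)      # homogeneous world point
    p_cam = (world_2_cam @ p_h)[:3]        # camera-space point
    p_img = intrinsics @ p_cam             # unnormalized image coordinates
    return p_img[:2] / p_img[2]            # perspective divide -> (u, v)

# Synthetic camera: identity extrinsics, 1000 px focal length,
# principal point at (800, 550).
w2c = np.eye(4)
K = np.array([[1000.0, 0.0, 800.0],
              [0.0, 1000.0, 550.0],
              [0.0, 0.0, 1.0]])
uv = project(w2c, K, np.array([0.1, 0.0, 2.0]))  # point 2 m in front of camera
```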

Furthermore, the visualize_cameras.py script shows the arrangement of the cameras in 3D. The hold-out cameras used for the hidden test set are shown in red. The 388 argument is the ID of the participant (see the data section above for the participant IDs available in the benchmark):

python scripts/visualize/visualize_cameras.py ${benchmark_folder} 388
Loaded example cameras

NVS Data Manager assets

The dynamic NVS benchmark has some benchmark-specific assets. The following code assumes the use of an NVSDataManager:

from nersemble_benchmark.data.benchmark_data import NVSDataManager
nvs_data_manager = NVSDataManager(benchmark_folder, participant_id)

Load Pointcloud

points, colors, normals = nvs_data_manager.load_pointcloud(sequence_name, timestep)  # <- Load pointcloud of some timestep
Loaded example pointcloud
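As mentioned in the download section, the first-frame pointcloud can be used, for example, to initialize 3D Gaussians. A hedged sketch of such an initialization; the nearest-neighbor scale heuristic and the parameter layout below are illustrative assumptions, not part of the toolkit:

```python
import numpy as np

# Hedged sketch: seed per-point 3D Gaussian parameters from a pointcloud.
# The nearest-neighbor scale heuristic is a common but arbitrary choice.
def init_gaussians(points: np.ndarray, colors: np.ndarray) -> dict:
    n = points.shape[0]
    # Isotropic scale: distance to each point's nearest neighbor (O(n^2),
    # fine for a sketch; use a KD-tree for real pointclouds).
    dists = np.linalg.norm(points[:, None, :] - points[None, :, :], axis=-1)
    np.fill_diagonal(dists, np.inf)
    scales = np.repeat(dists.min(axis=1, keepdims=True), 3, axis=1)   # (n, 3)
    rotations = np.tile([1.0, 0.0, 0.0, 0.0], (n, 1))                 # identity quaternions
    opacities = np.full((n, 1), 0.1)
    return {"means": points, "colors": colors, "scales": scales,
            "rotations": rotations, "opacities": opacities}

# Tiny synthetic example (in practice:
# points, colors, normals = nvs_data_manager.load_pointcloud(sequence_name, 0)).
pts = np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 2.0, 0.0]])
g = init_gaussians(pts, np.ones_like(pts))
```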

Mono FLAME Avatar assets

The Mono FLAME Avatar benchmark has some additional assets specific to the benchmark. The following code assumes the use of a MonoFlameAvatarDataManager:

from nersemble_benchmark.data.benchmark_data import MonoFlameAvatarDataManager
mono_flame_data_manager = MonoFlameAvatarDataManager(benchmark_folder, participant_id)

FLAME tracking

The FLAME tracking for the benchmark has been conducted with the FLAME 2023 model.
The tracking result can be loaded via:

flame_tracking = mono_flame_data_manager.load_flame_tracking(sequence_name)  # <- Load the FLAME tracking for an entire sequence  

It contains shape and expression codes, jaw and eye parameters, as well as rigid head rotation and translation in world space:

class FlameTracking:
    shape               # (1, 300)
    expression          # (T, 100)
    rotation            # (T, 3)
    rotation_matrices   # (T, 3, 3)
    translation         # (T, 3)
    jaw                 # (T, 3)
    frames              # (T,)
    scale               # (1, 1)
    neck                # (T, 3)
    eyes                # (T, 6)
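The toolkit does not state how rotation relates to rotation_matrices. A common convention, and an assumption here (FLAME pose parameters conventionally use it), is that the (T, 3) rotation vectors are axis-angle and the matrices follow via Rodrigues' formula:

```python
import math

# Hedged sketch: convert an axis-angle vector to a 3x3 rotation matrix via
# Rodrigues' formula. Whether FlameTracking.rotation is axis-angle is an
# assumption, not stated by the toolkit documentation.
def axis_angle_to_matrix(rvec):
    theta = math.sqrt(sum(c * c for c in rvec))
    if theta < 1e-12:
        return [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
    kx, ky, kz = (c / theta for c in rvec)
    c, s = math.cos(theta), math.sin(theta)
    v = 1.0 - c
    return [
        [c + kx * kx * v,      kx * ky * v - kz * s, kx * kz * v + ky * s],
        [ky * kx * v + kz * s, c + ky * ky * v,      ky * kz * v - kx * s],
        [kz * kx * v - ky * s, kz * ky * v + kx * s, c + kz * kz * v],
    ]

# A 90-degree rotation about the z-axis maps the x-axis onto the y-axis.
R = axis_angle_to_matrix([0.0, 0.0, math.pi / 2])
```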

The FLAME tracking provides a FLAME mesh that is already aligned with the given benchmark cameras.
The easiest way to obtain the mesh from the tracking parameters is using the FlameProvider class:

from nersemble_benchmark.models.flame import FlameProvider

flame_provider = FlameProvider(flame_tracking)
mesh = flame_provider.get_mesh(timestep)  # <- Get tracked mesh for the specified timestep in the sequence

The visualize_flame_tracking.py script shows how to load the FLAME tracking and visualizes the corresponding FLAME mesh with the correct cameras:

python scripts/visualize/visualize_flame_tracking.py ${benchmark_folder} --participant_id 461
FLAME Tracking example

4. Submission

Submissions to the benchmark tasks are done by uploading a submission .zip file to our submission system. The following describes the expected format of a submission .zip file.

4.1. NVS Benchmark

Submission .zip creation

For each of the 5 benchmark sequences, you need to render the whole sequence from the three hold-out cameras (222200046, 222200037, 222200039).
The corresponding camera extrinsics and intrinsics can be loaded the same way as the train cameras:

from nersemble_benchmark.constants import BENCHMARK_NVS_HOLD_OUT_SERIALS

camera_params = data_manager.load_camera_calibration()
for serial in BENCHMARK_NVS_HOLD_OUT_SERIALS:
    world_2_cam_pose = camera_params.world_2_cam[serial]
    intrinsics = camera_params.intrinsics[serial]
    ...  #  <- Render video from your reconstructed 4D representation

Once you have rendered the images from the hold-out viewpoints for all frames of the 5 benchmark sequences, you can pack them into a .zip file for submission.
The expected structure of the .zip file is as follows:

nvs_submission.zip
├── 388
│   └── GLASSES
│       ├── cam_222200037.mp4  # <- Video predictions from your method
│       ├── cam_222200039.mp4
│       └── cam_222200046.mp4
├── 422
│   └── EXP-2-eyes
│       ├── cam_222200037.mp4
│       ├── cam_222200039.mp4
│       └── cam_222200046.mp4

└── 475
    └── ...

Since .mp4 is a lossy compression format, we use a very high quality setting of --crf 14 to ensure the metric calculation is not affected by compression artifacts.

To facilitate the creation of the submission .zip, this repository also contains some Python helpers that you can use:

from nersemble_benchmark.data.submission_data import NVSSubmissionDataWriter

zip_path = ...  #  <- Local path where you want to create your submission .zip file
images = ...  # <-  List of uint8 numpy arrays (H, W, 3) in range 0-255 that hold the image data for all frames of a single camera

with NVSSubmissionDataWriter(zip_path) as submission_data_manager:
    submission_data_manager.add_video(participant, sequence_name, serial, images)  #  <- will automatically package the images into a .mp4 file and place it correctly into the .zip

Note that the NVSSubmissionDataWriter will overwrite any previously existing .zip file at the same path, so the predictions for all sequences and all hold-out cameras have to be added at once.
After creation, you can submit the .zip to the Dynamic NVS benchmark.
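Renderers often produce float images in [0, 1], while the writer expects uint8 arrays in 0-255. A small conversion sketch (the float-input convention is an assumption about your renderer):

```python
import numpy as np

# Hedged sketch: convert float renders in [0, 1] to the list of (H, W, 3)
# uint8 arrays that the submission writer expects.
def to_uint8_frames(float_frames):
    return [
        (np.clip(f, 0.0, 1.0) * 255.0).round().astype(np.uint8)
        for f in float_frames
    ]

# Tiny synthetic example: two 4x4 frames (mid-gray and white).
frames = to_uint8_frames([np.full((4, 4, 3), 0.5), np.ones((4, 4, 3))])
```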

4.2. Monocular FLAME Avatar Benchmark

Submission .zip creation

For each of the 4 hold-out sequences of the 5 benchmark participants, you need to render the whole sequence from the three hold-out cameras (222200046, 220700191, 222200039) as well as from the train camera (222200037).
The corresponding camera extrinsics and intrinsics can be loaded the same way as the train cameras:

from nersemble_benchmark.constants import BENCHMARK_MONO_FLAME_AVATAR_IDS, BENCHMARK_MONO_FLAME_AVATAR_TRAIN_SERIAL, BENCHMARK_MONO_FLAME_AVATAR_HOLD_OUT_SERIALS, BENCHMARK_MONO_FLAME_AVATAR_SEQUENCES_TEST

camera_params = data_manager.load_camera_calibration()
for participant_id in BENCHMARK_MONO_FLAME_AVATAR_IDS:
    for sequence_name in BENCHMARK_MONO_FLAME_AVATAR_SEQUENCES_TEST:
        flame_tracking = data_manager.load_flame_tracking(sequence_name)
        flame_provider = FlameProvider(flame_tracking)  # <- Use FLAME tracking to get expression codes / tracked meshes for hold-out sequence
        # 3 hold-out cameras
        for serial in BENCHMARK_MONO_FLAME_AVATAR_HOLD_OUT_SERIALS:
            world_2_cam_pose = camera_params.world_2_cam[serial]
            intrinsics = camera_params.intrinsics[serial]
            ...  #  <- Render video from your reconstructed 3D head avatar representation

        # train viewpoint
        serial = BENCHMARK_MONO_FLAME_AVATAR_TRAIN_SERIAL
        world_2_cam_pose = camera_params.world_2_cam[serial]
        intrinsics = camera_params.intrinsics[serial]
        ...  #  <- Render video from your reconstructed 3D head avatar representation

Once you have rendered all frames from the 4 viewpoints for all 4 hold-out sequences of the 5 benchmark participants, you can pack them into a .zip file for submission.
The expected structure of the .zip file is as follows:

mono_flame_avatar_submission.zip
├── 393
│   ├── EMO-1-shout+laugh
│   │   ├── cam_220700191.mp4  # <- Video predictions from your method
│   │   ├── cam_222200037.mp4
│   │   ├── cam_222200039.mp4
│   │   └── cam_222200046.mp4
│   ┆
│   └── SEN-10-port_strong_smokey
│       ├── cam_220700191.mp4
│       ├── cam_222200037.mp4
│       ├── cam_222200039.mp4
│       └── cam_222200046.mp4

└── 486
    └── ...

Since .mp4 is a lossy compression format, we use a very high quality setting of --crf 14 to ensure the metric calculation is not affected by compression artifacts.

To facilitate the creation of the submission .zip, this repository also contains some Python helpers that you can use:

from nersemble_benchmark.data.submission_data import MonoFlameAvatarSubmissionDataWriter

zip_path = ...  #  <- Local path where you want to create your submission .zip file
images = ...  # <-  List of uint8 numpy arrays (H, W, 3) in range 0-255 that hold the image data for all frames of a single camera

with MonoFlameAvatarSubmissionDataWriter(zip_path) as submission_data_manager:
    submission_data_manager.add_video(participant, sequence_name, serial, images)  #  <- will automatically package the images into a .mp4 file and place it correctly into the .zip

Note that the MonoFlameAvatarSubmissionDataWriter will overwrite any previously existing .zip file at the same path, so the predictions for all sequences and all hold-out cameras have to be added at once.
After creation, you can submit the .zip to the Monocular FLAME Avatar benchmark.
