A simple approach to 3D keypoint detection using 2D estimation methods and multiview rendering.
Project description
Multiview 3D Keypoint Detection (Muke)
A simple approach to 3D keypoint detection using 2D estimation methods and multiview rendering, based on the blender project for automatic keypoint retopology.
Basically, the 3D model is rendered from different angles (views) and a 2D keypoint detection is performed. For each detected keypoint, a ray-cast is performed to determine the intersection point with the mesh surface. In the end, all intersection points of the different views are combined to calculate the current 3D position of the keypoint within the mesh. It is possible to define view-dependent keypoint indices to extract only the points that are visible in the current rendering. Muke returns a list of 3D keypoints containing both the position and the nearest vertex index.
Muke Process
Direct 3D keypoint recognition using mesh data would be more accurate, but it is still difficult to train 3D models or find already trained weights for them. By using 2D recognition alone, it is possible to use the entire zoo of keypoint image recognition models. Muke comes with a built-in MediaPipe face and pose detector, but can be extended with any other 2D keypoint detection framework.
3D Facial Landmark Estimation (Human Head by VistaPrime CC Attribution)
The project was originally implemented to have a simple and fast solution for 3D keypoints detection for retopology purposes. However, it can also be used for any other application where 3D keypoints are needed, such as rigging, animation, etc.
Installation
To install the package use the following pip command:
pip install muke
Usage
Muke can be used as a command line tool to extract the keypoints in a specific format (e.g. Wrap3). For that a configuration has to be created which defines the detection parameters as well as the rendering views.
Configuration
Example configuration:
{
"description": "MP Face",
"detector": "media-pipe-face",
"resolution": 1024,
"generator": "wrap3",
"views": [
{
"name": "frontal",
"rotation": 0,
"keypoints": [
4,
76,
306
]
}
]
}
To select a range of keypoint indices, it is possible to define a start
and end
(included) index. It is also possible to skip certain indices in that range. Here an example on how to create a range (skip
is optional):
{
"start": 10,
"end": 15,
"skip": [13, 14]
}
Infinite Ray
Per view it is possible to set the infinite-ray
value to True
to shoot a ray through the mesh to infinity. Every intersection point with the mesh is used as a point to calculate the average center of the keypoint inside the mesh.
Demo
Quickly try out Muke by using the following commands.
python -m muke assets/person.ply --display --resolution 1024
python -m muke assets/human_head.obj --display --resolution 1024 --detector media-pipe-face
python -m muke assets/human_head.obj --config config/media-pipe-face.json --display
Help
usage: muke [-h] [--detector {media-pipe-pose,media-pipe-face}]
[--resolution RESOLUTION] [--infinite-ray] [--generator {wrap3}]
[--config CONFIG] [--load-raw] [--display] [--debug]
input
Detects keypoint locations in a 3d model.
positional arguments:
input Input mesh to process.
optional arguments:
-h, --help show this help message and exit
--detector {media-pipe-pose,media-pipe-face}
Detection method for 2d keypoint detection (default:
media-pipe-pose).
--resolution RESOLUTION
Render resolution for each view pass (default: 512).
--infinite-ray Send ray through mesh to infinity and use average of
intersections (default: False)
--generator {wrap3} Generator methods for output generation (default:
wrap3).
--config CONFIG Path to the configuration JSON file.
--load-raw Load mesh raw without post-processing (default: False)
--display Shows result rendering with keypoints (default: False)
--debug Shows debug frames and information (default: False)
Library
It is also possible to use Muke as a library to detect keypoints on an existing 3d mesh.
import open3d as o3d
from muke.Muke import Muke
from muke.detector.MediaPipePoseDetector import MediaPipePoseDetector
from muke.model.DetectionView import DetectionView
# load mesh from filesystem
mesh = o3d.io.read_triangle_mesh("assets/person.ply")
# define rendered views
keypoint_indexes = {28, 27, 26, 25, 24, 23, 12, 11, 14, 13, 16, 15, 5, 2, 0}
views = [
DetectionView("front", 0, keypoint_indexes),
DetectionView("back", 180, keypoint_indexes),
]
# detect keypoints
with Muke(MediaPipePoseDetector()) as m:
result = m.detect(mesh, views)
# present results
for kp in result:
print(f"KP {kp.index}: {kp.x:.2f} {kp.y:.2f} {kp.z:.2f}")
Detectors
It is possible to implement custom keypoint detectors. The custom detector has to implement the BaseDetector
interface as shown in the following example.
import numpy as np
from muke.detector.BaseDetector import BaseDetector
from muke.detector.KeyPoint2 import KeyPoint2
class CustomDetector(BaseDetector):
def setup(self):
# todo: initialize the custom detector
pass
def detect(self, image: np.ndarray) -> [KeyPoint2]:
# todo: implement the custom 2d keypoint detection
pass
def release(self):
# todo: clean up allocated resources
pass
Renderer
The current version uses pygfx as lightweight and offscreen renderer, trimesh for model loading into pygfx and Open3D for raycasting. Initially, trimesh was used for everything, which is archived in the trimesh-renderer branch. Open3D was also once used for everything, but has been archived in version 0.2.x
and the open3d-renderer branch.
About
MIT License - Copyright (c) 2024 Florian Bruggisser
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distribution
File details
Details for the file muke-0.3.5.4-py3-none-any.whl
.
File metadata
- Download URL: muke-0.3.5.4-py3-none-any.whl
- Upload date:
- Size: 19.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 787240f71e8f4747ac5661cd5a258e027d2ccd7cb579ed931439d8360b2fbf4c |
|
MD5 | 6dd9ff28cee72f8214a394257da6a749 |
|
BLAKE2b-256 | d84132079feb531244fb60cb4d703289e315aba2d51356e159e4e5c34ba4c282 |