A module to run the FaceXFormer model as a pipeline
FaceXFormer Pipeline Implementation
This repository contains an easy-to-use pipeline implementation of the FaceXFormer, a unified transformer model for comprehensive facial analysis, as described in the paper by Kartik Narayan et al. from Johns Hopkins University.
Here is the official code repository: FaceXFormer Official Repository
What Does This Implementation Do Differently?
The official implementation is excellent, but it primarily focuses on benchmarking and is not yet application-ready. With this implementation:
- No need to deal with reverse transforms, resizing, or remapping to the original image size.
- Cropping is handled internally (different crops are used for face parsing and landmarks for better accuracy).
- You can run only one task or any combination of tasks (see the sketch after this list).
- You can pass your own face detector's coordinates as arguments, so you are not forced to rerun face detection.
- Visual debugging is much easier thanks to the visual_debugger package.
- Results are provided with all the extra information you may need.
What Is It?
You can use FaceXFormer to extract
- faceparsing mask
- landmarks
- headpose orientation
- various attributes
- visibility
- age, gender, and race
information from a single unified model, and it runs really fast (37 FPS).
Installation
pip install facexformer_pipeline
Usage
To use the FaceXFormer pipeline, follow these steps:
# Import the pipeline class
from facexformer_pipeline import FacexformerPipeline
# Initialize the pipeline with desired tasks
pipeline = FacexformerPipeline(debug=True, tasks=['headpose', 'landmark', 'faceparsing'])
# Read an image with your own code, for example:
# image_path = "sample_image_head_only.jpg"
# uih = UniversalImageInputHandler(image_path) # to use UniversalImageInputHandler you need "pip install image_input_handler"
# img = uih.img
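# Alternatively (not part of this package, just an illustrative assumption), read the image with OpenCV:
# import cv2
# img = cv2.imread("sample_image_head_only.jpg")
# img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)  # check which channel order the pipeline expects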
# Run the model on an image
results = pipeline.run_model(img)
# Access the results from results dictionary
print(results['headpose'])
print(results['landmarks'])
print(results['faceparsing_mask'])
# You can also access intermediate results such as the face region crop, face coordinates, etc.
print(results['face_ROI'])
print(results['face_coordinates'])
print(results['head_coordinates'])
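If you prefer to post-process the outputs yourself instead of using visual_debugger, a rough sketch along these lines should work. It assumes faceparsing_mask is a 2D integer label map and landmarks is an (N, 2) array of (x, y) points in original-image coordinates; neither shape is guaranteed by this README.
import numpy as np
# Count how many pixels each face-parsing class occupies (assumes an integer label mask)
mask = np.asarray(results['faceparsing_mask'])
labels, counts = np.unique(mask, return_counts=True)
print(dict(zip(labels.tolist(), counts.tolist())))
# Bounding box of the predicted landmarks (assumes an (N, 2) array of x, y points)
landmarks = np.asarray(results['landmarks'])
x_min, y_min = landmarks.min(axis=0)
x_max, y_max = landmarks.max(axis=0)
print("Landmark bounding box:", (x_min, y_min, x_max, y_max))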
You can visualize the results easily with visual_debugger (the lines below create the image shown above):
# Show the results on the image
from visual_debugger import VisualDebugger, Annotation, AnnotationType
vdebugger = VisualDebugger(tag="facex", debug_folder_path="./", active=True)
annotation_landmarks_face_ROI = [
Annotation(type=AnnotationType.POINTS, coordinates=results["landmarks_face_ROI"], color=(0, 255, 0))
]
annotation_landmarks = [
Annotation(type=AnnotationType.POINTS, coordinates=results["landmarks"], color=(0, 255, 0))
]
annotation_headpose = [
Annotation(type=AnnotationType.PITCH_YAW_ROLL, orientation=results["headpose"], color=(0, 255, 0))
]
annotation_face_coordinates = [
Annotation(type=AnnotationType.RECTANGLE, coordinates=results["face_coordinates"], color=(0, 255, 0))
]
annotation_head_coordinates = [
Annotation(type=AnnotationType.RECTANGLE, coordinates=results["head_coordinates"], color=(0, 255, 0))
]
annotation_faceparsing = [
Annotation(type=AnnotationType.MASK, mask=results["faceparsing_mask"], color=(0, 255, 0))
]
annotation_faceparsing_head_ROI = [
Annotation(type=AnnotationType.MASK, mask=results["faceparsing_mask_head_ROI"], color=(0, 255, 0))
]
vdebugger.visual_debug(img, name="original_image")
vdebugger.visual_debug(img, annotation_face_coordinates, name="", stage_name="face_coor")
vdebugger.visual_debug(results["face_ROI"], name="", stage_name="cropped_face_ROI")
vdebugger.visual_debug(img, annotation_head_coordinates, name="", stage_name="head_coor")
vdebugger.visual_debug(results["head_ROI"], name="", stage_name="cropped_head_ROI")
vdebugger.visual_debug(results["face_ROI"], annotation_landmarks_face_ROI, name="landmarks", stage_name="on_face_ROI")
vdebugger.visual_debug(img, annotation_landmarks, name="landmarks", stage_name="on_image")
vdebugger.visual_debug(results["face_ROI"], annotation_headpose, name="headpose")
vdebugger.visual_debug(results["head_ROI"], annotation_faceparsing_head_ROI, name="faceparsing", stage_name="mask_on_head_ROI")
vdebugger.visual_debug(img, annotation_faceparsing, name="faceparsing", stage_name="mask_on_full_image")
vdebugger.cook_merged_img() # creates merged image
Acknowledgements
This implementation is based on the research conducted by Kartik Narayan and his team at Johns Hopkins University. All credit for the conceptual model and its validation belongs to them.