Skip to main content

SPIGA: Shape Preserving Facial Landmarks with Graph Attention Networks

Project description

SPIGA: Shape Preserving Facial Landmarks with Graph Attention Networks.

Project Page arXiv Open in Colab PyPI version License

This repository contains the source code of SPIGA, a face alignment and headpose estimator that takes advantage of the complementary benefits from CNN and GNN architectures producing plausible face shapes in presence of strong appearance changes.

It achieves top-performing results in:

PWC PWC PWC PWC PWC PWC PWC PWC

Setup

The repository has been tested on Ubuntu 20.04 with CUDA 11.4, the latest version of cuDNN, Python 3.8 and Pytorch 1.12.1. To run the video analyzer demo or evaluate the algorithm, install the repository from the source code:

# Best practices: 
#  1. Create a virtual environment.
#  2. Install Pytorch according to your CUDA version.
#  3. Install SPIGA from source code:

git clone https://github.com/andresprados/SPIGA.git
cd spiga
pip install -e .  

# To run the video analyzer demo install the extra requirements.
pip install -e .[demo]

Models: By default, model weights are automatically downloaded on demand and stored at ./spiga/models/weights/. You can also download them from Google Drive.

Note: All the callable files provide a detailed parser that describes the behaviour of the program and their inputs. Please, check the operational modes by using the extension --help.

Inference and Demo

We provide an inference framework for SPIGA available at ./spiga/inference. The models can be easily deployed in third-party projects by adding a few lines of code. Check out our image inference example for more information: Open in Colab

Face Video Analyzer Demo:

The demo application provides a general framework for tracking, detecting and extracting features of human faces in images or videos. You can use the following commands to run the demo:

python ./spiga/demo/app.py \
            [--input] \      # Webcam ID or Video Path. Dft: Webcam '0'.
            [--dataset] \    # SPIGA pretrained weights per dataset. Dft: 'wflw'.
            [--tracker] \    # Tracker name. Dft: 'RetinaSort'.
            [--show] \       # Select the attributes of the face to be displayed. Dft: ['fps', 'face_id', 'landmarks', 'headpose']
            [--save] \       # Save record.
            [--noview] \     # Do not visualize window.
            [--outpath] \    # Recorded output directory. Dft: './spiga/demo/outputs'
            [--fps] \        # Frames per second.
            [--shape] \      # Visualizer shape (W,H).

Note: For more information check the Demo Readme or call the app parser --help.

Dataloaders and Benchmarks

This repository provides general-use tools for the task of face alignment and headpose estimation:

  • Dataloaders: Training and inference dataloaders are available at ./spiga/data. Including the data augmentation tools used for training SPIGA and data-visualizer to analyze the dataset images and features. For more information check the Data Readme .

  • Benchmark: A common benchmark framework to test any algorithm in the task of face alignment and headpose estimation is available at ./spiga/eval/benchmark. For more information check the following Evaluation Section and the Benchmark Readme.

Datasets: To run the data visualizers or the evaluation benchmark please download the dataset images from the official websites (300W, AFLW, WFLW, COFW). By default they should be saved following the next folder structure:

./spiga/data/databases/   # Default path can be updated by modifying 'db_img_path' in ./spiga/data/loaders/dl_config.py
|
└───/300w               
│   └─── /images           
│        | /private     
│        | /test                   
|        └ /train             
|
└───/cofw                   
│   └─── /images
|  
└───/aflw                   
│   └─── /data
|        └ /flickr
|  
└───/wflw
    └─── /images

Annotations: We have stored for simplicity the datasets annotations directly in ./spiga/data/annotations. We strongly recommend to move them out of the repository if you plan to use it as a git directory.

Results: Similar to the annotations problem, we have stored the SPIGA results in ./spiga/eval/results/<dataset_name>. Remove them if need it.

Evaluation

The models evaluation is divided in two scripts:

Results generation: The script extracts the data alignments and headpose estimation from the desired <dataset_name> trained network. Generating a ./spiga/eval/results/results_<dataset_name>_test.json file which follows the same data structure defined by the dataset annotations.

python ./spiga/eval/results_gen.py <dataset_name>

Benchmark metrics: The script generates the desired landmark or headpose estimation metrics. We have implemented an useful benchmark which allows you to test any model using a results file as input.

python ./spiga/eval/benchmark/evaluator.py /path/to/<results_file.json> --eval lnd pose -s

Note: You will have to interactively select the NME_norm and other parameters in the terminal window.

Results Sum-up

WFLW Dataset
PWC NME_ioc AUC_10 FR_10 NME_P90 NME_P95 NME_P99
full 4.060 60.558 2.080 6.766 8.199 13.071
pose 7.141 35.312 11.656 10.684 13.334 26.890
expression 4.457 57.968 2.229 7.023 8.148 22.388
illumination 4.004 61.311 1.576 6.528 7.919 11.090
makeup 3.809 62.237 1.456 6.320 8.289 11.564
occlusion 4.952 53.310 4.484 8.091 9.929 16.439
blur 4.650 55.310 2.199 7.311 8.693 14.421
MERLRAV Dataset
PWC NME_bbox AUC_7 FR_7 NME_P90 NME_P95 NME_P99
full 1.509 78.474 0.052 2.163 2.468 3.456
frontal 1.616 76.964 0.091 2.246 2.572 3.621
half_profile 1.683 75.966 0.000 2.274 2.547 3.397
profile 1.191 82.990 0.000 1.735 2.042 2.878
300W Private Dataset
PWC NME_bbox AUC_7 FR_7 NME_P90 NME_P95 NME_P99
full 2.031 71.011 0.167 2.788 3.078 3.838
indoor 2.035 70.959 0.333 2.726 3.007 3.712
outdoor 2.027 37.174 0.000 2.824 3.217 3.838
COFW68 Dataset
PWC NME_bbox AUC_7 FR_7 NME_P90 NME_P95 NME_P99
full 2.517 64.050 0.000 3.439 4.066 5.558
300W Public Dataset
PWC NME_ioc AUC_8 FR_8 NME_P90 NME_P95 NME_P99
full 2.994 62.726 0.726 4.667 5.436 7.320
common 2.587 44.201 0.000 3.710 4.083 5.215
challenge 4.662 42.449 3.704 6.626 7.390 10.095

Coming soon...

  • Release evaluation code and pretrained models.
  • Project page and demo.
  • Training code.

BibTeX Citation

@inproceedings{Prados-Torreblanca_2022_BMVC,
  author    = {Andrés  Prados-Torreblanca and José M Buenaposada and Luis Baumela},
  title     = {Shape Preserving Facial Landmarks with Graph Attention Networks},
  booktitle = {33rd British Machine Vision Conference 2022, {BMVC} 2022, London, UK, November 21-24, 2022},
  publisher = {{BMVA} Press},
  year      = {2022},
  url       = {https://bmvc2022.mpi-inf.mpg.de/0155.pdf}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

spiga-0.0.4-py3-none-any.whl (95.1 kB view details)

Uploaded Python 3

File details

Details for the file spiga-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: spiga-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 95.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 colorama/0.4.4 importlib-metadata/4.6.4 keyring/23.5.0 pkginfo/1.8.2 readme-renderer/34.0 requests-toolbelt/0.9.1 requests/2.25.1 rfc3986/1.5.0 tqdm/4.57.0 urllib3/1.26.5 CPython/3.10.6

File hashes

Hashes for spiga-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 97c90a2b387e0177661bffdc2002c36ac3e097ecbd8c1dfba512fb8d745a8ca3
MD5 17c52134371bf3e3b7c5d6677af95ed6
BLAKE2b-256 236955c577a595ace210d71572017206250a208aa61bc2f9dd2f687e6498c1fe

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page