A Python library for generating synthetic face mask datasets by mixing facial regions

Synthetic Face Mask Generator

A Python library for generating synthetic face datasets by swapping and blending facial regions (eyes and mouth) between different face images. It is intended for building training datasets for computer vision and machine learning applications.

Background

Presentation Attack Detection (PAD) systems rely on analyzing facial dynamics, particularly the movement of eyes and mouth regions, to distinguish between live faces and spoofing attempts. Face mask attacks are one of the challenges in biometric security, where physical cutouts or synthetic overlays are used to circumvent facial recognition systems.

Acquiring real-world face mask attack datasets is resource-intensive and requires controlled environments. This tool addresses this limitation by generating synthetic face mask datasets through computational facial region manipulation, enabling researchers and security professionals to develop and evaluate PAD systems with diverse training data.

Face Mask Generation Process

Features

Face Detection & Landmark Extraction: Uses MediaPipe for robust face detection and landmark extraction
Facial Region Masking: Creates precise masks for eyes and mouth regions with elliptical or rectangular shapes
Mask Region Blending: Blends facial regions between different images with smooth transitions
COCO Format Output: Generates datasets in COCO format for easy integration with ML frameworks
Background Integration: Supports masking with random background images
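
The difference between the elliptical and rectangular mask shapes mentioned above can be illustrated with a small geometric sketch. This is not the library's implementation: the function name `region_mask` and the bounding-box convention are assumptions made for illustration, and plain Python lists are used only to keep the example self-contained (a real pipeline would normally work on NumPy or OpenCV arrays).

```python
def region_mask(height, width, box, ellipse=True):
    """Return a height x width grid of 0/1 flags marking one facial region.

    box is (x0, y0, x1, y1) in pixels, inclusive of x0/y0 and exclusive of
    x1/y1. With ellipse=True only pixels inside the ellipse inscribed in the
    box are set; otherwise the whole rectangle is set.
    """
    x0, y0, x1, y1 = box
    cx, cy = (x0 + x1) / 2.0, (y0 + y1) / 2.0   # centre of the box
    rx, ry = (x1 - x0) / 2.0, (y1 - y0) / 2.0   # semi-axes of the ellipse
    mask = [[0] * width for _ in range(height)]
    for y in range(max(y0, 0), min(y1, height)):
        for x in range(max(x0, 0), min(x1, width)):
            if ellipse:
                # Test the pixel centre against the ellipse equation.
                dx = (x + 0.5 - cx) / rx
                dy = (y + 0.5 - cy) / ry
                inside = dx * dx + dy * dy <= 1.0
            else:
                inside = True
            mask[y][x] = 1 if inside else 0
    return mask


# Hypothetical mouth bounding box on a tiny 12 x 8 image.
rect = region_mask(8, 12, (2, 2, 10, 6), ellipse=False)
oval = region_mask(8, 12, (2, 2, 10, 6), ellipse=True)
print(sum(map(sum, rect)), sum(map(sum, oval)))
```

The elliptical mask covers fewer pixels than the rectangle because the corners of the bounding box fall outside the inscribed ellipse.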

Installation

From PyPI (Recommended)

pip install synthetic-face-masks

From Source

git clone https://github.com/AmadeusITGroup/synthetic-face-masks.git
cd synthetic-face-masks
pip install -e .

Development Installation

For development with additional tools:

git clone https://github.com/AmadeusITGroup/synthetic-face-masks.git
cd synthetic-face-masks
pip install -e ".[dev]"
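
After any of the installation routes above, a quick way to confirm that the distribution is visible to the interpreter can be sketched with the standard library. The helper name `installed_version` is illustrative, and `importlib.metadata` requires Python 3.8 or newer.

```python
from importlib import metadata


def installed_version(dist_name="synthetic-face-masks"):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


version = installed_version()
print(version if version else "synthetic-face-masks is not installed")
```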

Prerequisites

Python 3.7 or higher
OpenCV-compatible system
Sufficient disk space for output datasets

Dependencies

The package automatically installs these dependencies:

opencv-python>=4.8.0: Image processing operations
mediapipe>=0.9.1.0: Face detection and landmark extraction
numpy>=1.21.0: Numerical operations
imgaug>=0.4.0: Image augmentation
tqdm>=4.64.0: Progress bars
Pillow>=8.0.0: Additional image processing support

Quick Start

Command Line Usage

After installation, you can use the synthetic-face-masks command:

# Generate dataset with default settings
synthetic-face-masks examples/testImages/ output_face_masks/



# Custom configuration
synthetic-face-masks examples/testImages/ output_face_masks/ \
    --num_images 100 \
    --mask_types eye mouth both \
    --train_ratio 0.8 \
    --output_format coco

Python Script Usage

You can also run the main script directly:

# If installed from source
python main.py examples/testImages/ output_face_masks/

# With custom parameters
python main.py examples/testImages/ output_face_masks/ \
    --num_images 50 \
    --mask_types eye mouth

Project Structure

synthetic-face-masks/
├── face_mask/                     # Main Python package
│   ├── core/                      # Core processing modules
│   │   ├── face_processor.py      # Face detection and landmarks
│   │   ├── mask_generator.py      # Facial region mask creation
│   │   ├── image_mixer.py         # Image masking and blending
│   │   └── dataset_generator.py   # Main orchestration class
│   └── utils/                     # Utility modules
│       ├── image_utils.py         # Image processing utilities
│       ├── display_utils.py       # Visualization utilities
│       └── coco_utils.py          # COCO format utilities
├── examples/                      # Example scripts and notebooks
│   ├── basic_example.py           # Simple usage example
│   ├── face_mixing_example.ipynb  # Jupyter notebook demo
│   └── testImages/                # Sample test images
├── tests/                         # Unit tests
│   └── test_installation.py       # Installation verification
├── .github/                       # GitHub Actions workflows
│   └── workflows/
│       └── release-pypi.yml       # Automated PyPI publishing
├── main.py                        # Command-line interface script
├── setup.py                       # Minimal setup (backward compatibility)
├── pyproject.toml                 # Modern Python packaging configuration
├── MANIFEST.in                    # Package manifest for distribution
├── requirements.txt               # Runtime dependencies
├── LICENSE                        # MIT License
└── README.md                      # This documentation

Detailed Usage

Command Line Options

Usage:

synthetic-face-masks [INPUT_DIR] [OUTPUT_DIR] [OPTIONS]

Options:

| Option | Description | Default |
|---|---|---|
| `--input_folder` | Path to folder containing face images | Required |
| `--output_folder` | Path to output folder for generated dataset | Required |
| `--background_folder` | Path to folder containing random background images | None |
| `--crop_border` | Number of pixels to crop from image borders | 50 |
| `--target_width` | Target width for processed images | 320 |
| `--target_height` | Target height for processed images | 320 |
| `--mix_probability` | Probability of creating masked vs normal images (0.0-1.0) | 0.5 |
| `--ellipse_probability` | Probability of using ellipse vs rectangle masks (0.0-1.0) | 0.5 |
| `--train_split_ratio` | Ratio of images for training set (0.0-1.0) | 0.8 |
| `--config_file` | Path to JSON configuration file | None |
| `--validate_only` | Only validate existing dataset without generating new images | False |
| `--generate_report` | Generate dataset analysis report | False |
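
To make the two probability options concrete, here is a minimal sketch of how per-image choices could be drawn from them. The helper `plan_image` and its return values are hypothetical; the tool's actual sampling logic is not documented here and may differ.

```python
import random


def plan_image(rng, mix_probability=0.5, ellipse_probability=0.5):
    """Draw the per-image choices described by the two probability options.

    Hypothetical helper: returns ("normal", None) or ("masked", shape),
    where shape is "ellipse" or "rectangle".
    """
    if rng.random() >= mix_probability:
        return ("normal", None)
    shape = "ellipse" if rng.random() < ellipse_probability else "rectangle"
    return ("masked", shape)


rng = random.Random(0)  # fixed seed so a run can be repeated
plans = [plan_image(rng, mix_probability=0.7, ellipse_probability=0.6)
         for _ in range(1000)]
masked = sum(1 for kind, _ in plans if kind == "masked")
print(f"{masked} of {len(plans)} images would be masked")
```

Under this reading, a `mix_probability` of 0.0 yields only normal images and 1.0 yields only masked ones.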

Configuration

You can configure the dataset generation by creating a simple configuration dictionary:

# Create configuration directly
config = {
    "mix_probability": 0.7,
    "ellipse_probability": 0.6,
    "train_split_ratio": 0.85,
    "max_images_per_run": 2000,
    "target_width": 512,
    "target_height": 512
}

# Use with DatasetConfig
from face_mask.core.dataset_generator import DatasetConfig

dataset_config = DatasetConfig(
    input_folder="/path/to/input",
    output_folder="/path/to/output",
    **config
)

Using a configuration file for reusable settings:

# Use configuration file
python main.py --config_file config.json --input_folder /path/to/faces --output_folder /path/to/output
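
The layout of `config.json` is not shown above. The sketch below writes one from Python, assuming the file accepts the same keys as the configuration dictionary; that mapping is an assumption, and `write_config` is an illustrative helper rather than part of the package.

```python
import json
import tempfile
from pathlib import Path


def write_config(settings, path):
    """Write settings as JSON and return the path that was written."""
    path = Path(path)
    path.write_text(json.dumps(settings, indent=2), encoding="utf-8")
    return path


# Keys copied from the configuration dictionary above. Whether --config_file
# accepts exactly these names is an assumption.
settings = {
    "mix_probability": 0.7,
    "ellipse_probability": 0.6,
    "train_split_ratio": 0.85,
    "target_width": 512,
    "target_height": 512,
}

# A temporary directory keeps this demo from touching the working directory;
# in practice, write to the path you pass to --config_file.
with tempfile.TemporaryDirectory() as tmp:
    config_path = write_config(settings, Path(tmp) / "config.json")
    # Read the file back to confirm it is valid JSON.
    loaded = json.loads(config_path.read_text(encoding="utf-8"))

print(loaded == settings)
```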

Programming Interface

Basic Example

from face_mask.core.dataset_generator import DatasetGenerator, DatasetConfig

# Create configuration
config = DatasetConfig(
    input_folder="/path/to/faces",
    output_folder="/path/to/output",
    background_folder="/path/to/backgrounds",
    mix_probability=0.7,
    target_size=(512, 512)
)

# Generate dataset
generator = DatasetGenerator(config)
stats = generator.generate_dataset()

print(f"Generated {stats['total_images']} images")

Advanced Usage

import cv2  # needed for cv2.imwrite below

from face_mask.core.face_processor import FaceProcessor
from face_mask.core.mask_generator import MaskGenerator
from face_mask.core.image_mixer import ImageMixer

# Initialize components
face_processor = FaceProcessor(min_detection_confidence=0.7)
mask_generator = MaskGenerator(face_processor)
image_mixer = ImageMixer(mask_generator)

# Process individual images
source_data = mask_generator.generate_face_masks("source.jpg", is_ellipse=True)
target_data = mask_generator.generate_face_masks("target.jpg", is_ellipse=True)

# Apply masks to images
if source_data and target_data:
    mask_result = image_mixer.mix_images(
        source_data, target_data, 
        mix_eyes=True, mix_mouth=True
    )
    
    # Save results
    cv2.imwrite("masked_result.jpg", mask_result.mixed_image)
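
The internals of `mix_images` are not described here. As a rough mental model only, blending a source region into a target with a soft-edged mask can be sketched as a per-pixel weighted average. The function `blend` and the tiny single-channel images are illustrative and are not taken from the package.

```python
def blend(target, source, alpha):
    """Per-pixel blend: alpha=1 takes the source pixel, alpha=0 keeps the target."""
    return [
        [round(a * s + (1.0 - a) * t) for t, s, a in zip(t_row, s_row, a_row)]
        for t_row, s_row, a_row in zip(target, source, alpha)
    ]


# Tiny single-channel images: a dark target and a bright source.
target = [[10, 10, 10, 10, 10]]
source = [[200, 200, 200, 200, 200]]
# A soft-edged mask: fully source in the middle, fading out towards the sides.
alpha = [[0.0, 0.5, 1.0, 0.5, 0.0]]

print(blend(target, source, alpha))
```

Intermediate alpha values at the mask border are what produce a smooth transition instead of a hard seam.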

Output Structure

The generated dataset follows this structure:

output_folder/
├── images/                     # Generated images
│   ├── img_000001.png         # Normal image
│   ├── img_000002-Eye.png     # Eye-masked image
│   ├── img_000002-EyeBG.png   # Eye-masked with background
│   ├── img_000002-Mouth.png   # Mouth-masked image
│   ├── img_000002-MouthBG.png # Mouth-masked with background
│   ├── img_000002-Both.png    # Both regions masked
│   └── img_000002-BothBG.png  # Both regions with background
├── annotations/                # COCO format annotations
│   ├── annotations.json       # Complete dataset annotations
│   ├── train.json            # Training set annotations
│   └── test.json             # Test set annotations
├── dataset_report.json        # Dataset analysis report
└── generation_config.json     # Configuration used for generation
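
The `train.json` and `test.json` files imply a ratio-based split of the generated images. A minimal sketch of such a split follows; `split_images` is a hypothetical helper, and whether the tool shuffles, or splits per image rather than per source face, is not stated here.

```python
import random


def split_images(image_ids, train_ratio=0.8, seed=0):
    """Shuffle image ids and cut them into (train, test) lists by ratio."""
    if not 0.0 <= train_ratio <= 1.0:
        raise ValueError("train_ratio must be between 0.0 and 1.0")
    ids = list(image_ids)
    random.Random(seed).shuffle(ids)      # seeded so the split is repeatable
    cut = round(len(ids) * train_ratio)   # number of ids in the training set
    return ids[:cut], ids[cut:]


train_ids, test_ids = split_images(range(1, 11), train_ratio=0.8)
print(len(train_ids), len(test_ids))
```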

Image Types Generated

Normal Images: Processed original face images (category_id: 1)
Eye-Masked Images: Images with eye regions from different sources (category_id: 2)
Mouth-Masked Images: Images with mouth regions from different sources (category_id: 2)
Both-Masked Images: Images with both eye and mouth regions masked (category_id: 2)
Background Variants: All masked types with random background patterns in mask cutout areas instead of face regions
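
When consuming the dataset, the category can also be inferred from the file name. The sketch below assumes the suffixes shown in the output structure (`-Eye`, `-EyeBG`, and so on) are the complete set; `category_for` is an illustrative helper, and the annotation files remain the authoritative source of labels.

```python
from pathlib import Path

NORMAL_CATEGORY_ID = 1
MASKED_CATEGORY_ID = 2
# Suffixes taken from the example file names in the output structure.
MASK_SUFFIXES = ("Eye", "EyeBG", "Mouth", "MouthBG", "Both", "BothBG")


def category_for(filename):
    """Return the category id implied by an output image's file name."""
    stem = Path(filename).stem               # e.g. "img_000002-EyeBG"
    _, sep, suffix = stem.rpartition("-")    # sep is "" when there is no "-"
    if sep and suffix in MASK_SUFFIXES:
        return MASKED_CATEGORY_ID
    return NORMAL_CATEGORY_ID


for name in ("img_000001.png", "img_000002-Eye.png", "img_000002-BothBG.png"):
    print(name, category_for(name))
```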

Dataset Validation

The tool includes comprehensive dataset validation:

# Validate existing dataset
python main.py --output_folder /path/to/dataset --validate_only

# Generate detailed report
python main.py --output_folder /path/to/dataset --generate_report

Validation checks include:

COCO format compliance
Image-annotation consistency
File existence verification
Category distribution analysis
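
The checks listed above can be approximated independently of the tool. The following sketch works on a COCO-style dictionary using the standard `images`, `annotations`, and `categories` keys; the function names and the sample category name are assumptions, and the tool's own validator may check more or report differently.

```python
import os
from collections import Counter


def validate_coco(coco, image_dir):
    """Return a list of human-readable problems found in a COCO-style dict."""
    problems = []
    for key in ("images", "annotations", "categories"):
        if not isinstance(coco.get(key), list):
            problems.append(f"missing or invalid top-level key: {key}")
    if problems:
        return problems

    image_ids = {img["id"] for img in coco["images"]}
    category_ids = {cat["id"] for cat in coco["categories"]}

    # File existence: every listed image should be present on disk.
    for img in coco["images"]:
        if not os.path.isfile(os.path.join(image_dir, img["file_name"])):
            problems.append(f"image file not found: {img['file_name']}")
    # Consistency: annotations must point at known images and categories.
    for ann in coco["annotations"]:
        if ann["image_id"] not in image_ids:
            problems.append(f"annotation {ann['id']} refers to unknown image")
        if ann["category_id"] not in category_ids:
            problems.append(f"annotation {ann['id']} has unknown category")
    return problems


def category_counts(coco):
    """Count annotations per category id."""
    return Counter(ann["category_id"] for ann in coco["annotations"])


# Deliberately inconsistent sample; the category name is made up.
sample = {
    "images": [{"id": 1, "file_name": "img_000001.png"}],
    "annotations": [{"id": 1, "image_id": 1, "category_id": 1},
                    {"id": 2, "image_id": 9, "category_id": 2}],
    "categories": [{"id": 1, "name": "normal"}],
}
for problem in validate_coco(sample, "output_folder/images"):
    print(problem)
print(dict(category_counts(sample)))
```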

Examples and Tutorials

Check the examples/ directory for:

Basic usage scripts
Advanced configuration examples
Jupyter notebook tutorials
Integration examples with ML frameworks

Development

Setting up Development Environment

Clone the repository:

git clone https://github.com/AmadeusITGroup/synthetic-face-masks.git
cd synthetic-face-masks

Create a virtual environment:

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install development dependencies:

pip install -e ".[dev]"

Building the Package

To build the package locally:

pip install build
python -m build

This creates distribution files in the dist/ directory.

Releasing and Publishing

This project uses modern Python packaging with automated GitHub Actions workflows for seamless publishing to PyPI.

Package Status

📦 Available on PyPI: pip install synthetic-face-masks
🔄 Automated Publishing: GitHub Actions handles building and publishing
✅ Modern Packaging: Uses pyproject.toml with setuptools backend
🏷️ Semantic Versioning: Version tags trigger automatic releases

How It Works

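The GitHub Actions workflow automatically builds the package, runs the tests, and publishes the result: version tags go to PyPI, and pushes to the main branch go to TestPyPI.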

Release Process

Update Version: The version is automatically managed by setuptools_scm based on Git tags.
Create and Push Tag:

git tag v1.0.1
git push origin v1.0.1

Automated Publication: The GitHub Actions workflow will:
- Build the package
- Run tests
- Publish to PyPI (for version tags)
- Create a GitHub release with signed artifacts
- Publish to TestPyPI (for main branch pushes)
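
Choosing the next tag by hand is error-prone, so a small helper can compute it. This sketch assumes tags follow the `vMAJOR.MINOR.PATCH` form used in the example above; the exact pattern the workflow matches is not stated here, and `bump` is an illustrative name.

```python
import re

TAG_PATTERN = re.compile(r"^v(\d+)\.(\d+)\.(\d+)$")


def bump(tag, part="patch"):
    """Return the next vMAJOR.MINOR.PATCH tag after the given one."""
    match = TAG_PATTERN.match(tag)
    if match is None:
        raise ValueError(f"not a vMAJOR.MINOR.PATCH tag: {tag!r}")
    major, minor, patch = map(int, match.groups())
    if part == "major":
        return f"v{major + 1}.0.0"
    if part == "minor":
        return f"v{major}.{minor + 1}.0"
    if part == "patch":
        return f"v{major}.{minor}.{patch + 1}"
    raise ValueError(f"unknown part: {part!r}")


print(bump("v1.0.0"))
```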

Manual Publication

For manual publication (if needed):

TestPyPI (for testing):

python -m twine upload --repository testpypi dist/*

PyPI (for production):

python -m twine upload dist/*

GitHub Environments

The workflow uses GitHub environments for secure publishing:

testpypi: For TestPyPI publications
pypi: For PyPI publications

Make sure these environments are configured in your GitHub repository settings with appropriate secrets or trusted publishing.

Installation from PyPI

Once published, users can install the package:

# From PyPI
pip install synthetic-face-masks

# From TestPyPI (for testing)
pip install -i https://test.pypi.org/simple/ synthetic-face-masks

License

This project is licensed under the MIT License - see the LICENSE file for details.

Citation

If you use this tool in your research, please cite:

@software{synthetic_face_masks,
  title = {Synthetic Face Mask Generator},
  author = {Eldho Abraham},
  year = {2025},
  url = {https://github.com/AmadeusITGroup/synthetic-face-masks}
}

Acknowledgments

MediaPipe for robust face detection and landmark extraction
OpenCV for computer vision operations
imgaug for image augmentation capabilities
Sample images taken from https://thispersondoesnotexist.com/

Support

For support and questions:

Create an issue on GitHub
Check the troubleshooting section
Review example scripts and documentation

Note: This tool is designed for research and educational purposes. Please ensure you have appropriate permissions for any face images you process and comply with relevant privacy and data protection regulations.

Version 1.0.0, released Sep 24, 2025.
