FaceAI-BGImpact implements various generative methods and studies the impact of altering the background on the generated images and the training of the models.

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

FaceAI - Background Impact

This project implements various Generative AI models to generate faces:

Deep Convolutional Generative Adversarial Network (DCGAN)
Progressive-growing StyleGAN
Variational Autoencoder (VAE)

The projects also implements two new versions of the Flicker-Face-HQ (FFHQ)

ffhq_blur (Where the background is blurred)
ffhq_grey (Where the background is greyed-out)

The motivation stemmed from the fact that a lot of the variance in VAEs seemed to be wasted on the background of the image. There are no existing large-scale faces datasets with uniform background (the ORL dataset only has 400 images), so we decided to create our own.

Folder structure

faceai-bgimpact
├── configs                     # Configuration files for models
├── data_processing             # Scripts and notebooks for data preprocessing   
│   ├── paths.py                # Script to define data paths
│   ├── download_raw_ffhq.py    # Script to download raw FFHQ images
│   ├── create_masks.py         # Script to create image masks
│   ├── create_blur_and_grey.py # Script to create blurred and greyscale images
│   └── download_all_ffhq.py    # Script to download our pre-processed datasets
├── models
│   ├── dcgan_                  # DCGAN model implementation
│   │   ├── __init__.py         
│   │   ├── dcgan.py            # DCGAN model definition
│   │   ├── discriminator.py    # Discriminator part of DCGAN
│   │   └── generator.py        # Generator part of DCGAN
│   ├── stylegan_               # StyleGAN model implementation
│   │   ├── __init__.py         
│   │   ├── discriminator.py    # Discriminator part of StyleGAN
│   │   ├── generator.py        # Generator part of StyleGAN
│   │   ├── loss.py             # Loss functions for StyleGAN
│   │   ├── stylegan.py         # StyleGAN model definition
│   │   └── utils.py            # Utility functions for StyleGAN   
│   ├── abstract_model.py       # Abstract model class for common functionalities
│   ├── data_loader.py          # Data loading utilities
│   └── utils.py                # General utility functions
├── tests
│   ├── test_data_loader.py     # Pytests for data loading utilities
│   ├── test_dcgan.py           # Pytests for DCGAN model
│   └── test_stylegan.py        # Pytests for StyleGAN model
├── create_video.py             # Utility script to create videos
├── generate_images.py          # Script to generate images using models
├── graph_fids.py               # Script to graph FID scores
├── train.py                    # Script to train models
├── pyproject.toml              # Poetry package management configuration file
└── README.md

Training Script (train.py)

The train.py script is an entry point to train a model. It includes command-line arguments to specify the model type, configuration parameters, learning rate, latent dimension, batch size, number of epochs, and intervals for saving.

How to Use

Ensure you have the required dependencies installed (see Dependencies section below).
To train a model, you need to specify the model type using the --model flag, and the dataset with the --dataset flag.
Additional command-line arguments allow for fine-tuning the training process:

Model arguments:

--model: Required. Specifies the type of model to train with options "DCGAN", "StyleGAN".
--dataset: Required. Selects the dataset to use with options "ffhq_raw", "ffhq_blur", "ffhq_grey".
--latent-dim: Optional. Defines the dimension of the latent space for generative models.
--config-path: Optional. Path to a custom JSON configuration file for model training.

Training arguments:

--lr: Optional. Learning rate for DCGAN model training.
--dlr: Optional. Discriminator learning rate for StyleGAN training.
--glr: Optional. Generator learning rate for StyleGAN training.
--mlr: Optional. W-Mapping learning rate for StyleGAN training.
--loss: Optional. Specifies the loss function to use with defaults to "wgan-gp"; choices are "wgan", "wgan-gp", "basic".
--batch-size: Optional. Defines the batch size during training.
--num-epochs: Optional. Sets the number of epochs for training the model.
--save-interval: Optional. Epoch interval to wait before saving models and generated images.
--image-interval: Optional. Iteration interval to wait before saving generated images.

Checkpoint arguments:

--list: Optional. Lists all available checkpoints if set.
--checkpoint-path: Optional. Path to a specific checkpoint file to resume training, takes precedence over --checkpoint-epoch.
--checkpoint-epoch: Optional. Specifies the epoch number from which to resume training.

Example usage:

python train.py --model StyleGAN --dataset ffhq_raw

You can also provide additional arguments as needed.

Dependencies

We use Poetry for dependency management. To install the dependencies, you should use the provided poetry environment. If you do not have poetry installed, you can install it using pip:

pip install poetry

Then, you can install the dependencies using:

poetry install

And activate the environment using:

poetry shell

Since these packages are heavy (especially PyTorch), you may use your own environment if you wish, but it might not work as expected.

Training on GPU is highly recommended. If you have a CUDA-enabled GPU, you should install the CUDA version of PyTorch.

Testing

Automated tests can be run using PyTest. Ensure you have PyTest installed and run:

pytest -v

in the project root to execute all tests.

Alternatively, with poetry, you can run:

poetry run pytest -v

TODO:

VAE
BMSG-GAN (https://github.com/akanimax/BMSG-GAN)

Project details

These details have not been verified by PyPI

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.2.4

Jan 19, 2024

0.2.3

Jan 17, 2024

0.2.2

Nov 26, 2023

0.2.1

Nov 26, 2023

0.2.0

Nov 24, 2023

This version

0.1.0

Nov 23, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

faceai_bgimpact-0.1.0.tar.gz (711.1 kB view hashes)

Uploaded Nov 23, 2023 Source

Built Distribution

faceai_bgimpact-0.1.0-py3-none-any.whl (719.8 kB view hashes)

Uploaded Nov 23, 2023 Python 3

Hashes for faceai_bgimpact-0.1.0.tar.gz

Hashes for faceai_bgimpact-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`28c944dc479d74d8c1e5b154aaa19d7a015379bc257f8c8dff662edcb456ba5e`
MD5	`140504ae2771fae179b59aae3c067c79`
BLAKE2b-256	`2c13098473ae8743464b05e21166587c61d3c5fcb210b6325f7b220b7229613e`

Hashes for faceai_bgimpact-0.1.0-py3-none-any.whl

Hashes for faceai_bgimpact-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bd5d6bef1cba24039c7cc2267ecbd2e1f9bb3bb65b383591fce8ece89a8317d0`
MD5	`84c7b68175e3925918b97cc2c032f76c`
BLAKE2b-256	`dac2317971f662f8e6b6be32e4992d92767c890cf34099e4e122eac142fcd979`