Skip to main content

This package provides measurement tools for Generative Adversarial Networks (GANs), including Inception Score (IS), Fréchet Inception Distance (FID), Kernel Inception Distance (KID), and Precision and Recall (PR). These metrics are used to evaluate the quality and diversity of generated images in GANs. The package streamlines the use of these metrics, making it easier to apply them to your work.

Project description

GANs evaluation metrics Pytorch version

Downloads

This implementation covers all the current state-of-the-art evaluation metrics for Generative Adversarial Networks (GANs), including: Inception Score (IS), Fréchet Inception Distance (FID), Kernel Inception Distance (KID), and Precision and Recall (PR).

Although there are many existing implementations of these metrics in other repositories, they have different usage and requirements, which may cause inconvenience during the evaluation process. This package aims to simplify the usage by wrapping all the metrics into one setting.

Inception Score (IS) evaluates the reality and diversity of generated images based on the Kullback-Leibler (KL) divergence between the probability outputs of the Inception model for each fake image and the marginal distribution (the cumulative distribution of all samples). The original Tensorflow implementation can be found here.

Fréchet Inception Distance (FID) compares the Fréchet distance between two datasets by comparing the mean and covariance of the features extracted from the last average pooling layer of the Inception model. The original Tensorflow implementation can be found here.

Kernel Inception Distance (KID) compares the Maximum Mean Discrepancy (MMD) between two datasets by comparing the features extracted using the same method as FID. The original Tensorflow implementation can be found here.

Precision and Recall (PR) of GANs evaluate the quality and diversity of generated images, respectively. The method uses VGG16 as the feature extractor, and the real and fake domains are decided by k-nearest neighbor in the Euclidean space. The precision and recall values are calculated by the intersection between the two domains. The original Tensorflow implementation can be found here.

The Inception and FID models have been modified from the unofficial Pytorch FID implementation. The VGG model uses the timm package, and the implementation of IS, FID, and PR has been modified from the Pytorch StyleGAN2-ADA.

Installation

Install from pip:

pip install gans-eval

Requirements:

  • numpy
  • opencv-python
  • pillow
  • scipy
  • tqdm
  • timm
  • torch>=1.7.0
  • torchvision>=0.8.0

Usage

Except for IS, other metrics need two individual datasets to process, the simple usage:

gans-eval --fake-dir path_to_syn_dataset --real-dir path_to_real_dataset --metrics is fid kid pr

All available arguments:

  • --fake-dir Synthesis dataset directory.

  • --real-dir Real dataset directory.

  • --data-name the statistic features of dataset will be save as pickle file in ['HOME']/.cache/gans_metrics/dataset_name.pkl. Default: default.

  • --save-json path to save result as json file. Default: None.

  • --metrics evaluation method. Available options is, fid, kid, pr. Default: fid.

  • --num-splits parameter to calculate IS, number of divisions to calculate statistics. This parameter together with the number of samples can cause the IS to change. Default: 10.

  • --max-real parameter to calculate KID, the maximum number of real samples. Default: 1000000.

  • --num-subsets parameter to calculate KID, the number of subsets while compute KID. Default: 100.

  • --max-subset-size parameter to calculate KID, the maximum number of sample in one subset. Default: 1000.

  • --k-nearest parameter to calculate PR, the kth nearest neighbor. Default: 3.

  • --row-batch-size parameter to calculate PR, number of samples per once Euclidean distance computing. Default: 10000.

  • --col-batch-size parameter to calculate PR, number of samples per once Euclidean distance computing. Default: 10000.

  • --dims as pytorch-fid package, there are four available positions to extract features from Inception module. Available options: 64, 192, 768, 2048. Default: 2048.

    • 64: first max pooling features.
    • 192: second max pooling features.
    • 768: pre-aux classifier features.
    • 2048: final average pooling features.
  • --batch-size Batch size. Default: 50.

  • --num-workers The number of workers use for dataloder. Default: 8.

  • --not-save-cache Store true parameter. Call it if you don't want to save dataset statistic as cache.

  • --verbose Store true parameter. Call it if you want to see the process bar while computing statistics information from the image folder.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gans-eval-0.0.3.tar.gz (14.5 kB view hashes)

Uploaded Source

Built Distribution

gans_eval-0.0.3-py3-none-any.whl (14.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page