Skip to main content

ArtFID: Quantitative Evaluation of Neural Style Transfer

Project description

ArtFID: Quantitative Evaluation of Neural Style Transfer (GCPR Oral 2022)

ArtFID: Quantitative Evaluation of Neural Style Transfer
Matthias Wright and Björn Ommer.

Abstract

The field of neural style transfer has experienced a surge of research exploring different avenues ranging from optimization-based approaches and feed-forward models to meta-learning methods. The developed techniques have not just progressed the field of style transfer, but also led to breakthroughs in other areas of computer vision, such as all of visual synthesis. However, whereas quantitative evaluation and benchmarking have become pillars of computer vision research, the reproducible, quantitative assessment of style transfer models is still lacking. Even in comparison to other fields of visual synthesis, where widely used metrics exist, the quantitative evaluation of style transfer is still lagging behind. To support the automatic comparison of different style transfer approaches and to study their respective strengths and weaknesses, the field would greatly benefit from a quantitative measurement of stylization performance. Therefore, we propose a method to complement the currently mostly qualitative evaluation schemes. We provide extensive evaluations and a large-scale user study to show that the proposed metric strongly coincides with human judgment.

Installation

> pip install art-fid

Usage

CUDA_VISIBLE_DEVICES=0 python -m art_fid --style_images path/to/style-images --content_images path/to/content-images --stylized_images path/to/stylized-images

The content images and the corresponding stylized images are compared in pairs. In order to ensure that a content image is matched up with the correct stylized image, both the content images and the stylized images are processed in lexicographical order. A simple way of pairing the content images and the stylized images is to use the name of content image for the corresponding stylized image.

Arguments

--batch_size - Batch size for computing activations.
--num_workers - Number of threads used for data loading.
--mode - Evaluate ArtFID or ArtFID_infinity, choices = ['art_fid', 'art_fid_inf'].
--content_metric - Content metric, choices = ['lpips', 'vgg', 'alexnet'].
--device - Device to use, choices = ['cuda', 'cpu'].
--style_images - Path to style images.
--content_images - Path to content images.
--stylized_images - Path to stylized images.

Data

The dataset is contained in artfid_dataset.csv. It consists of 250k labeled artworks.

Acknowledgments

Citation

@article{wright_gcpr_2022,
    title={ArtFID: Quantitative Evaluation of Neural Style Transfer},
    author={Matthias Wright and Bj{\"o}rn Ommer},
    journal={GCPR},
    year={2022}
}

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

art_fid-0.0.2.tar.gz (11.7 kB view details)

Uploaded Source

Built Distribution

art_fid-0.0.2-py3-none-any.whl (11.4 kB view details)

Uploaded Python 3

File details

Details for the file art_fid-0.0.2.tar.gz.

File metadata

  • Download URL: art_fid-0.0.2.tar.gz
  • Upload date:
  • Size: 11.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for art_fid-0.0.2.tar.gz
Algorithm Hash digest
SHA256 bcaa39e8c7b5de3fcb71425f065062ca246e066cd3c4f1169435b3e6ba602407
MD5 58c3fe18a64fe66b21b692c1e00a9af0
BLAKE2b-256 46f78c88715a6067b8b3257c07ad49868d601c5ebc8f76febff0c904e4d9dc71

See more details on using hashes here.

File details

Details for the file art_fid-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: art_fid-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 11.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.4

File hashes

Hashes for art_fid-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 eecf4d9682a263d7cd9264e40c4d2326ae97bb9b2b2b586d4530784c47daf864
MD5 06772ff785174c9c336734f51807f7b5
BLAKE2b-256 e2ef714cf2e50a204a07d11e983bcb1f1d3110917379496bb66943323238a00d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page