Synthesizing and manipulating 2048x1024 images with conditional GANs

Project description

pix2pixHD

Project | Youtube | Paper

Pytorch implementation of our method for high-resolution (e.g. 2048x1024) photorealistic image-to-image translation. It can be used for turning semantic label maps into photo-realistic images or synthesizing portraits from face label maps.

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs
Ting-Chun Wang¹, Ming-Yu Liu¹, Jun-Yan Zhu², Andrew Tao¹, Jan Kautz¹, Bryan Catanzaro¹
¹NVIDIA Corporation, ²UC Berkeley
In CVPR 2018.

Image-to-image translation at 2k/1k resolution

Our label-to-streetview results

- Interactive editing results

- Additional streetview results

Label-to-face and interactive editing results

Our editing interface

Prerequisites

Linux or macOS
Python 2 or 3
[optionally:] NVIDIA GPU (11G memory or larger) + CUDA cuDNN

Getting Started

Installation

Create a virtual environment, and activate it

Clone this repo:

  git clone https://github.com/NVIDIA/pix2pixHD
  pip install pix2pixHD

Testing

A few example Cityscapes test images are included in the datasets folder.
Please download the pre-trained Cityscapes model from here (google drive link), and put it under ./checkpoints/label2city_1024p/
Test the model (bash ./scripts/test_1024p.sh):

#!./scripts/test_1024p.sh
pix2pixhd-test --name label2city_1024p --netG local --ngf 32 --resize_or_crop none

The test results will be saved to a html file here: ./results/label2city_1024p/test_latest/index.html.

More example scripts can be found in the scripts directory.

For other options, cf.

pix2pixhd-test --help

Dataset

We use the Cityscapes dataset. To train a model on the full dataset, please download it from the official website (registration required). After downloading, please put it under the datasets folder in the same way the example images are provided.

Training

Train a model at 1024 x 512 resolution (bash ./scripts/train_512p.sh):

#!./scripts/train_512p.sh
pix2pixhd-train --name label2city_512p

To view training results, please checkout intermediate results in ./checkpoints/label2city_512p/web/index.html. If you have tensorflow installed, you can see tensorboard logs in ./checkpoints/label2city_512p/logs by adding --tf_log to the training scripts.

For other options, cf.

pix2pixhd-train --help

Multi-GPU training

Train a model using multiple GPUs (bash ./scripts/train_512p_multigpu.sh):

#!./scripts/train_512p_multigpu.sh
pix2pixhd-train --name label2city_512p --batchSize 8 --gpu_ids 0,1,2,3,4,5,6,7

Note: this is not tested and we trained our model using single GPU only. Please use at your own discretion.

Training with Automatic Mixed Precision (AMP) for faster speed

To train with mixed precision support, please first install apex from: https://github.com/NVIDIA/apex
You can then train the model by adding --fp16. For example,

#!./scripts/train_512p_fp16.sh
python -m torch.distributed.launch pix2pixhd/train.py --name label2city_512p --fp16

In our test case, it trains about 80% faster with AMP on a Volta machine.

Training at full resolution

To train the images at full resolution (2048 x 1024) requires a GPU with 24G memory (bash ./scripts/train_1024p_24G.sh), or 16G memory if using mixed precision (AMP).
If only GPUs with 12G memory are available, please use the 12G script (bash ./scripts/train_1024p_12G.sh), which will crop the images during training. Performance is not guaranteed using this script.

Training with your own dataset

If you want to train with your own dataset, please generate label maps which are one-channel whose pixel values correspond to the object labels (i.e. 0,1,...,N-1, where N is the number of labels). This is because we need to generate one-hot vectors from the label maps. Please also specity --label_nc N during both training and testing.
If your input is not a label map, please just specify --label_nc 0 which will directly use the RGB colors as input. The folders should then be named train_A, train_B instead of train_label, train_img, where the goal is to translate images from A to B.
If you don't have instance maps or don't want to use them, please specify --no_instance.
The default setting for preprocessing is scale_width, which will scale the width of all training images to opt.loadSize (1024) while keeping the aspect ratio. If you want a different setting, please change it by using the --resize_or_crop option. For example, scale_width_and_crop first resizes the image to have width opt.loadSize and then does random cropping of size (opt.fineSize, opt.fineSize). crop skips the resizing step and only performs random cropping. If you don't want any preprocessing, please specify none, which will do nothing other than making sure the image is divisible by 32.

More Training/Test Details

Flags: see options/train_options.py and options/base_options.py for all the training flags; see options/test_options.py and options/base_options.py for all the test flags.
Instance map: we take in both label maps and instance maps as input. If you don't want to use instance maps, please specify the flag --no_instance.

Citation

If you find this useful for your research, please use the following.

@inproceedings{wang2018pix2pixHD,
  title={High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs},
  author={Ting-Chun Wang and Ming-Yu Liu and Jun-Yan Zhu and Andrew Tao and Jan Kautz and Bryan Catanzaro},  
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2018}
}

Acknowledgments

This code borrows heavily from pytorch-CycleGAN-and-pix2pix.

Project details

Release history Release notifications | RSS feed

This version

1.0

Mar 24, 2022

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pix2pixhd-1.0.tar.gz (29.3 kB view details)

Uploaded Mar 24, 2022 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pix2pixhd-1.0-py2.py3-none-any.whl (38.8 kB view details)

Uploaded Mar 24, 2022 Python 2Python 3

File details

Details for the file pix2pixhd-1.0.tar.gz.

File metadata

Download URL: pix2pixhd-1.0.tar.gz
Upload date: Mar 24, 2022
Size: 29.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.9

File hashes

Hashes for pix2pixhd-1.0.tar.gz
Algorithm	Hash digest
SHA256	`d7f653819eb924e233665012a9a35ca3a8657a89db95a55870553ca1d13a8359`
MD5	`c596b079cd425d7447c16768f587ca9c`
BLAKE2b-256	`116b1fcea45e08c3debc23256c526533a450ccda28b116ef80a531a64bde3a87`

See more details on using hashes here.

File details

Details for the file pix2pixhd-1.0-py2.py3-none-any.whl.

File metadata

Download URL: pix2pixhd-1.0-py2.py3-none-any.whl
Upload date: Mar 24, 2022
Size: 38.8 kB
Tags: Python 2, Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.63.1 importlib-metadata/4.8.3 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.9

File hashes

Hashes for pix2pixhd-1.0-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`45b3fbc295fb296f3f415a01c62dba474e5943994a5b1c503aa142ba189440c2`
MD5	`8c1377e77ff493dbcf399c33ded5ca60`
BLAKE2b-256	`d9670cb242dda6948c7a4c8093ec9556ebd1759503694673bc686f1f6806d8d3`

See more details on using hashes here.

pix2pixhd 1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

pix2pixHD

Project | Youtube | Paper

Image-to-image translation at 2k/1k resolution

Prerequisites

Getting Started

Installation

Testing

Dataset

Training

Multi-GPU training

Training with Automatic Mixed Precision (AMP) for faster speed

Training at full resolution

Training with your own dataset

More Training/Test Details

Citation

Acknowledgments

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes