Skip to main content

Image Composition Toolbox

Project description


libcom: everything about image composition


PyPI Downloads Hits GitHub Repo stars Static Badge Static Badge GitHub

Introduction

libcom is an image composition toolbox. The goal of image composition is inserting one foreground into a background image to get a realistic composite image. Generally speaking, image composition could be used to combine the visual elements from different images.

libcom covers various related tasks in the field of image composition, including image harmonization, painterly image harmonization, shadow generation, object placement, generative composition, quality evaluation, etc. For each task, we integrate one or two selected methods considering both efficiency and effectiveness. The selected methods will be continuously updated upon the emergence of better methods.

Main Parts

  • get_composite_image generates composite images using naive copy-and-paste followed by image blending.
  • OPAScoreModel is an object placement assessment model that evaluates the rationality of object placement by predicting a rationality scores.
  • FOPAHeatMapModel can predict the rationality scores for all locations with a pair of background and scaled foreground as input in a single forward pass.
  • color_transfer tranfers the color of foreground to fit background scene using reinhard algorithm.
  • ImageHarmonizationModel contains several pretrained models for image harmonization, which aims to adjust the illumination statistics of foreground to fit background.
  • PainterlyHarmonizationModel contains serveral pretrained models for painterly image harmonization, which aims to adjust the foreground style of the painterly composite image to make it compatible with the background.
  • HarmonyScoreModel predicts harmony score for a composite image, in which larger harmony score implies more harmonious composite image.
  • InharmoniousLocalizationModel supports the localization of the inharmonious region in a synthetic image.
  • FOSScoreModel contains two foreground object search models, which can be used to evaluate the compatibility between foreground and background in terms of geometry and semantics.
  • ControlComModel is a controllable image composition model, which unifies image blending and image harmonization in one diffusion model.
  • ShadowGenerationModel takes in deshadowed composite image and foreground object mask, and generates images with semantically plausible foreground shadows.

For more detailed user guidance and method description, please refer to our [documents].

Requirements

The main branch is built on the Linux system with Python 3.8 and PyTorch 1.10.1. For other dependencies, please refer to [conda_env] and [runtime_dependencies].

Get Started

Please refer to [Installation] for installation instructions and [documents] for user guides.

Contributors

License

This project is released under the Apache 2.0 license.

Bibtex

If you use our toolbox, please cite our survey paper using the following BibTeX [arxiv]:

@article{niu2021making,
  title={Making images real again: A comprehensive survey on deep image composition},
  author={Niu, Li and Cong, Wenyan and Liu, Liu and Hong, Yan and Zhang, Bo and Liang, Jing and Zhang, Liqing},
  journal={arXiv preprint arXiv:2106.14490},
  year={2021}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

libcom-0.0.1.post3.tar.gz (354.9 kB view hashes)

Uploaded Source

Built Distribution

libcom-0.0.1.post3-cp38-cp38-manylinux1_x86_64.whl (2.3 MB view hashes)

Uploaded CPython 3.8

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page