Skip to main content

A package for baking images.

Project description

Image-Baker

Dependabot Status GitHub License commit activity code size in bytes

Code Formatting PyPI version

Centered Demo

An example of baked images. (Each object is a layer and an annotation will also be extracted for all layers.)

Let's bake an image.

Why is it relevant?

When training computer vision models (especially for detection and segmentation), labeling large amounts of data is crucial for better model performance. Often, the process involves multiple cycles of labeling, training, and evaluation. By generating multiple realistic labeled datasets from a single image, the time spent on labeling can be significantly reduced.

What's up with the name?

The concept involves extracting portions of an image (e.g., objects of interest) using tools like polygons or models such as Segment Anything. These extractions are treated as layers, which can then be copied, pasted, and manipulated to create multiple instances of the desired object. By combining these layers step by step, a new labeled image with annotations in JSON format is created. The term "baking" refers to the process of merging these layers into a single cohesive image.

Getting Started

ImageBaker can be used in three ways:

  1. GUI Application - Visual interface for interactive image composition
  2. CLI Tool - Command-line interface for automation and scripting
  3. Python Library - Programmatic API for integration into your projects

Installation

Using PIP

This project is also available on the PyPI server.

pip install imagebaker

Developing

Please, clone this repository and install it locally:

For Windows users:

git clone https://github.com/q-viper/image-baker.git
cd image-baker
python -m venv .venv       # create a virtual environment
.venv\Scripts\activate     # activate the virtual environment
pip install -e .

For Linux/macOS users:

git clone https://github.com/q-viper/image-baker.git 
cd image-baker
pip install -e .

Usage

🖥️ GUI Mode

Launch the visual interface for interactive work:

# Simple launch (default)
imagebaker

# Or with explicit gui command
imagebaker gui

# With models
imagebaker gui --models-file examples/loaded_models.py

⌨️ CLI Mode

Automate image composition from the command line:

# Simple composition
imagebaker cli bake simple bg.png fg.png -o output.png \
    --positions "0,0;100,100" --opacities "1.0,0.5"

# Config-based baking
imagebaker cli bake from-config config.py

# Get image info
imagebaker cli info image.png

# Check version
imagebaker cli version

🐍 Python API

Integrate into your projects programmatically:

from imagebaker import ImageBaker

baker = ImageBaker()
layer1 = baker.add_layer_from_file("background.png")
layer2 = baker.add_layer_from_file("foreground.png")
baker.set_layer_position(layer2, 100, 100)
baker.set_layer_opacity(layer2, 0.7)

result = baker.bake()
baker.save(result, "output.png")

📚 Documentation:

🎨 Examples:

  • GUI: Traditional visual interface (unchanged)
  • CLI: examples/bake_config.py - Configuration file example
  • API: examples/api_example.py - Full Python example
  • Samples: assets/demo/api_samples/ - Generated outputs

Features

  • Annotating Images: Load a folder of images and annotate them using bounding boxes or polygons.
  • Model Testing: Define models for detection, segmentation, and prompts (e.g., points or rectangles) by following the base model structure in imagebaker/models/base_model.py. See examples/loaded_models.py for a working example.
  • Layerifying: Crop images based on annotations to create reusable layers. Each cropped image represents a single layer.
  • Baking States: Arrange layers to create image variations by dragging, rotating, adjusting opacity, and more. Save the state using the Save State button or Ctrl + S.
  • Playing States: Replay saved states, export them locally, or use them for further predictions.
  • Exporting States: Export the final annotated JSON and the baked multilayer image.
  • Drawing On Layers: First select a layer then draw upon it. Only selected layer will be drawn. And if no layers are selected, then the drawing will not be exported.

Shortcuts

  • Ctrl + C: Copy selected annotation/layer.
  • Ctrl + V: Paste copied annotation/layer in its parent image/layer if it is currently open.
  • Delete: Delete selected annotation/layer.
  • Left Click: Select an annotation/layer on mouse position.
  • Left Click + Drag: Drag a selected annotation/layer.
  • Double Left Click: When using polygon annotation, completes the polygon.
  • Right Click: Unselect an annotation/layer. While annotating the polygon, undo the last point.
  • Ctrl + Mouse Wheel: Zoom In/Out on the mouse position, i.e., resize the viewport.
  • Ctrl + Drag: If done on the background, the viewport is panned.
  • Ctrl + S: Save State on Baker Tab.
  • Ctrl + D: Draw Mode on Baker Tab. Drawing can happen on a selected or main layer.
  • Ctrl + E: Erase Mode on Baker Tab.
  • Ctrl + H: Opens a help window.
  • Wheel: Change the size of the drawing pointer.
  • Q: Point mode on annotation.
  • W: Polygon mode on annotation. Moves selected layer one step up in layer lists in baker.
  • S: Moves selected layer one step down in layer list in baker.
  • E: Rectangle mode on annotation.
  • H: Hides/un-hides selected annotation/layer.
  • L: Creates layer from an annotation. If any annotation selected, creates only its, else creates layers from all visible annotations.
  • C: If any annotation is selected, a input box for Caption is created. It can be edited on baker tab as well and is state aware.
  • Numerics: Selecting number 1, 2, till 9 selects label. If not available, asks for a new label.
  • Escape: Closes the application.

Demo

Annotation Page

This is where the loading of the image folder and annotation, connection with the model running in the backend, and layerifying happen.

Baker Page

This is where the layer baking happens. And the extraction of the layers as well.

An example of drawing:

Annotated

The JSON and the baked image will be exported to the local folder, and in debug mode, the annotations and the mask for each layer will be exported too.

Demo Video

To see the tool in action, check out the demo video below:

Demo Video

Click on the image above to play the video on YouTube.

Contributions

Contributions are welcome!

Do you find this project to be useful and are you looking for some features that are not implemented yet? Feel free to open issues or submit pull requests to improve the project.

For more please visit CONTRIBUTING.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

imagebaker-0.0.55.tar.gz (913.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

imagebaker-0.0.55-py3-none-any.whl (92.2 kB view details)

Uploaded Python 3

File details

Details for the file imagebaker-0.0.55.tar.gz.

File metadata

  • Download URL: imagebaker-0.0.55.tar.gz
  • Upload date:
  • Size: 913.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for imagebaker-0.0.55.tar.gz
Algorithm Hash digest
SHA256 286abe115d174703212d8907c47fe9dfcd5749b26adab4e4e66eb425226aeb0f
MD5 608018fc1ee51c15a6c622a8c8cb4868
BLAKE2b-256 3b39da08f16e7010137d28b78e6eb9e27d395240b7e820268725d2db132c01a0

See more details on using hashes here.

File details

Details for the file imagebaker-0.0.55-py3-none-any.whl.

File metadata

  • Download URL: imagebaker-0.0.55-py3-none-any.whl
  • Upload date:
  • Size: 92.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for imagebaker-0.0.55-py3-none-any.whl
Algorithm Hash digest
SHA256 d407dc202cd47f45455668cb1bd19ec720b609d19fa7ca56760c2db5c0af81e0
MD5 67fa7e8112da3cc9c76e5431518c8c99
BLAKE2b-256 1a0638808434abb682b0b7eead6a3dfb72f24b7e33481e2d509a0e352b98c881

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page