gpt-image-cookbook
Multi-provider AI image generation cookbook — prompt gallery, agentic skill, and CLI for OpenAI gpt-image-2, Google Imagen, Flux, and more.
gpt-image-cookbook is an open-source multi-provider AI image generation toolkit that bundles a curated prompt gallery, an agentic skill (SKILL.md runbook for Claude Code, Codex, OpenClaw, and Hermes), and a Python CLI (gic) into one repository. It supports OpenAI gpt-image-2, Google Imagen, and Flux (fal.ai / Replicate) under a single interface for text-to-image generation, reference-image editing, inpainting, and multi-reference workflows.
If you build with AI image models, this gives you copy-paste prompts that work, a CLI that handles auth and edits, and an agent runbook that wires it all together.
What this is
Three things bundled together:
- A curated prompt gallery: copy-paste prompts organized by category (posters, UI mockups, photography, diagrams, brand systems, edit/inpaint workflows) that produce reliable results across providers.
- An agentic skill: a SKILL.md runbook for Claude Code, Codex, OpenClaw, Hermes, and other skill-capable agent runtimes. It tells the agent how to search the gallery, refine the prompt, and call the CLI without writing one-off scripts.
- A CLI (gic): one command, multiple providers. Switch between OpenAI gpt-image-2, Google Imagen, and Flux with a single --provider flag.
Why a cookbook
Most prompt galleries are read-only inspiration. This one is wired into an agent that uses the gallery: search → refine → generate, with consistent semantics across providers. The CLI handles auth, encoding, sizing, edits, and inpainting so the agent never reinvents API plumbing.
Install
# pip
pip install gpt-image-cookbook
# or one-shot via uvx
uvx --from git+https://github.com/eugeniughelbur/gpt-image-cookbook gic --help
Set at least one provider key:
export OPENAI_API_KEY=sk-... # for openai (default)
export GOOGLE_API_KEY=... # for imagen
export FAL_KEY=... # for flux
The CLI also reads ./.env and ~/.env (without overriding env vars already set).
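The lookup order above (process environment, then ./.env, then ~/.env, never overriding a value that is already set) can be sketched in a few lines of Python. This is an illustrative sketch, not the CLI's actual loader: `parse_env_line` and `load_env_files` are hypothetical names, and the parsing rules (optional `export` prefix, one pair of surrounding quotes, no inline trailing comments) are assumptions.

```python
import os
from pathlib import Path


def parse_env_line(line: str):
    """Return (key, value) for a KEY=VALUE line, or None for blanks and comments.

    Assumption: inline trailing comments are not handled in this sketch.
    """
    line = line.strip()
    if not line or line.startswith("#") or "=" not in line:
        return None
    if line.startswith("export "):
        line = line[len("export "):]
    key, _, value = line.partition("=")
    key = key.strip()
    value = value.strip()
    # Strip one pair of matching surrounding quotes, if present.
    if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
        value = value[1:-1]
    return (key, value) if key else None


def load_env_files(paths, environ=None):
    """Apply each file in order; the first definition of a key wins."""
    environ = os.environ if environ is None else environ
    for path in paths:
        path = Path(path)
        if not path.is_file():
            continue  # a missing .env file is not an error
        for raw in path.read_text(encoding="utf-8").splitlines():
            parsed = parse_env_line(raw)
            if parsed is not None:
                # setdefault never replaces a value that is already present.
                environ.setdefault(parsed[0], parsed[1])
    return environ
```

Calling `load_env_files([Path(".env"), Path.home() / ".env"])` would give shell variables the highest priority, then the project file, then the home file, because each later source only fills in keys that are still missing.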
Quickstart
# Text-to-image, OpenAI default
gic -p "A minimalist conference poster, headline reads exactly 'Signal over Noise'" --quality high --size portrait
# Reference edit
gic -p "Make the sky a stormy sunset" -i ref.png
# Inpaint
gic -p "Replace the masked area with a coffee cup" -i ref.png -m mask.png
# Switch provider
gic -p "Photoreal product shot of a ceramic mug on oak" --provider imagen --quality high
Outputs land in ./generated/<timestamp>-<slug>.png unless you pass -f.
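To make the `<timestamp>-<slug>` naming concrete, here is a minimal Python sketch of how such a default path could be derived from a prompt. The timestamp format, the 40-character slug limit, and the helper names `slugify` and `default_output_path` are assumptions for illustration; the CLI's real naming rules may differ.

```python
import re
from datetime import datetime
from pathlib import Path


def slugify(prompt: str, max_len: int = 40) -> str:
    """Lowercase the prompt and collapse non-alphanumeric runs into hyphens."""
    slug = re.sub(r"[^a-z0-9]+", "-", prompt.lower()).strip("-")
    # Fall back to a fixed name if the prompt has no usable characters.
    return slug[:max_len].rstrip("-") or "image"


def default_output_path(prompt: str, now=None, root: str = "generated", ext: str = "png") -> Path:
    """Build <root>/<timestamp>-<slug>.<ext>; the timestamp format is an assumption."""
    now = now or datetime.now()
    stamp = now.strftime("%Y%m%d-%H%M%S")
    return Path(root) / f"{stamp}-{slugify(prompt)}.{ext}"
```

Putting the timestamp first keeps files sorted chronologically in a directory listing, while the slug makes each file recognizable without opening it.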
Use as a Claude Code plugin
# In Claude Code
/plugin install eugeniughelbur/gpt-image-cookbook
The agent loads skills/gpt-image-cookbook/SKILL.md, follows the operating loop (classify → search gallery → refine → generate), and calls gic for you.
Use as an agent skill (Codex, OpenClaw, Hermes, …)
Point your runtime at skills/gpt-image-cookbook/SKILL.md. Compatible runtimes auto-resolve the gic CLI via command -v gic, uv, or uvx.
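A runtime without built-in resolution could locate the CLI with a fallback chain like the one sketched below. This is a hedged illustration rather than the launcher's actual logic: `resolve_gic_command` is a hypothetical helper, `shutil.which` stands in for `command -v`, and the `uv tool run --from` form is an assumption based on uvx being uv's alias for that subcommand.

```python
import shutil

# Repository URL as given in the Install section.
REPO = "git+https://github.com/eugeniughelbur/gpt-image-cookbook"


def resolve_gic_command(which=shutil.which):
    """Return an argv prefix that launches gic, trying gic, then uv, then uvx."""
    if which("gic"):
        return ["gic"]
    if which("uv"):
        # Assumption: run the CLI through uv's tool runner without installing it.
        return ["uv", "tool", "run", "--from", REPO, "gic"]
    if which("uvx"):
        return ["uvx", "--from", REPO, "gic"]
    raise FileNotFoundError("none of gic, uv, or uvx was found on PATH")
```

The `which` parameter exists only so the lookup can be replaced in tests; in normal use, `resolve_gic_command() + ["--help"]` yields a full command line.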
Repo layout
.claude-plugin/ # Claude Code plugin + marketplace metadata
skills/
gpt-image-cookbook/
SKILL.md # agent runbook
references/ # gallery routing index, per-category prompts, craft cheatsheet
scripts/ # generate.py launcher
agents/ # runtime metadata for OpenClaw / Hermes / etc.
src/gic/ # the CLI (Python)
docs/ # gallery thumbnails (added as you build entries)
CLI reference
| Flag | Values | Use |
|---|---|---|
| -p, --prompt | string | required prompt or edit instruction |
| -f, --file | path | output path; auto-named if omitted |
| -i, --image | repeatable path | reference image; switches to edits endpoint |
| -m, --mask | PNG path | alpha mask for inpaint; requires -i |
| --provider | openai, imagen, flux | provider router |
| --model | string | override the provider's default model |
| --size | 1k, 2k, 4k, portrait, landscape, square, wide, tall, or WxH | canvas size |
| --quality | low, medium, high, auto | cost/quality dial |
| -n, --n | integer | number of images |
| --background | auto, opaque, transparent | background mode |
| --format | png, jpeg, webp | output encoding |
| --user | string | passed to provider for end-user attribution |
Exit codes: 0 success · 1 API/refusal · 2 bad args/missing key.
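When driving the CLI from a script instead of an agent, the flags and exit codes above can be wrapped as in the following sketch. It uses only flags listed in the table; `build_gic_args` and `run_gic` are hypothetical helper names, and the assumption is that gic is already on PATH.

```python
import subprocess

# Exit code meanings as documented above.
EXIT_MEANINGS = {
    0: "success",
    1: "API error or refusal",
    2: "bad arguments or missing key",
}


def build_gic_args(prompt, images=(), mask=None, provider=None, quality=None, size=None):
    """Assemble a gic argv list from keyword options."""
    if mask and not images:
        # The table states that -m requires -i.
        raise ValueError("a mask requires at least one reference image")
    args = ["gic", "-p", prompt]
    for image in images:
        args += ["-i", str(image)]  # -i is repeatable
    if mask:
        args += ["-m", str(mask)]
    if provider:
        args += ["--provider", provider]
    if quality:
        args += ["--quality", quality]
    if size:
        args += ["--size", size]
    return args


def run_gic(args):
    """Run gic and translate its exit code into a readable status."""
    result = subprocess.run(args, capture_output=True, text=True)
    meaning = EXIT_MEANINGS.get(result.returncode, "unknown exit code")
    return result.returncode, meaning
```

A caller might retry on exit code 1 (a transient API failure or a refusal worth rephrasing) but fail fast on exit code 2, since bad arguments or a missing key will not fix themselves.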
Adding a prompt to the gallery
- Generate something you like with gic.
- Save the preview thumbnail under docs/<category>/.
- Add an entry to the matching skills/gpt-image-cookbook/references/gallery-<category>.md using the template documented inside that file.
- Open a PR.
Adding a new provider
The provider abstraction lives in src/gic/providers/. Each provider implements Provider.generate(req: GenerateRequest) and gets registered in providers/__init__.py. See openai_provider.py for the reference shape.
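As a rough picture of what that shape might look like, the sketch below defines a request object, a provider interface, and a small registry. Only the names `Provider`, `generate`, and `GenerateRequest` come from the text above; the request fields, the `list[bytes]` return type, the `register` and `get_provider` helpers, and the toy `EchoProvider` are assumptions, so check openai_provider.py for the real contract.

```python
from dataclasses import dataclass, field
from typing import Protocol


@dataclass
class GenerateRequest:
    """Hypothetical request fields, loosely mirroring the CLI flags."""
    prompt: str
    size: str = "square"
    quality: str = "auto"
    n: int = 1
    images: list = field(default_factory=list)


class Provider(Protocol):
    def generate(self, req: GenerateRequest) -> list[bytes]:
        """Return one encoded image per requested output (assumed return type)."""
        ...


class EchoProvider:
    """Toy provider for illustration: returns the prompt bytes, calls no API."""

    def generate(self, req: GenerateRequest) -> list[bytes]:
        return [req.prompt.encode("utf-8") for _ in range(req.n)]


# Hypothetical registry, standing in for whatever providers/__init__.py does.
PROVIDERS: dict = {}


def register(name: str, provider: Provider) -> None:
    PROVIDERS[name] = provider


def get_provider(name: str) -> Provider:
    try:
        return PROVIDERS[name]
    except KeyError:
        raise ValueError(f"unknown provider: {name}") from None
```

With a registry like this, a `--provider` value reduces to a dictionary lookup, which is what lets the rest of the CLI stay provider-agnostic.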
FAQ
What is gpt-image-cookbook?
gpt-image-cookbook is an open-source toolkit for AI image generation that bundles three things in one repository: a curated prompt gallery (copy-paste prompts that work), an agentic skill (SKILL.md runbook for Claude Code, Codex, OpenClaw, and Hermes agent runtimes), and a Python CLI (gic) wrapping OpenAI gpt-image-2, Google Imagen, and Flux under one interface.
Which AI image models does it support?
OpenAI gpt-image-2 is fully implemented for text-to-image, reference-image edits, inpainting (with PNG alpha mask), and multi-reference workflows. Google Imagen (imagen-4) and Flux (flux-pro-1.1, flux-schnell via fal.ai or Replicate) have provider stubs ready — the abstraction is in place, implementations land progressively.
How do I install it?
The fastest one-shot path is uvx --from git+https://github.com/eugeniughelbur/gpt-image-cookbook gic --help. For repeated use, clone the repo and pip install -e ".[openai]". A PyPI release (pip install gpt-image-cookbook) is planned.
How do I use it as a Claude Code plugin?
Run /plugin install eugeniughelbur/gpt-image-cookbook inside Claude Code. The plugin loads the agent skill at skills/gpt-image-cookbook/SKILL.md and resolves the gic CLI automatically. The agent then follows the operating loop documented in SKILL.md (classify → search gallery → refine → generate).
What's the difference between this and just calling the OpenAI API directly?
Three things you don't get from the raw API: a gallery of working prompts organized by category, a multi-provider abstraction so you can switch between OpenAI / Imagen / Flux without rewriting code, and an agent runbook that lets a Claude Code or Codex agent use the gallery + CLI without you writing one-off scripts.
Does it cost money to use?
Yes. Calls go to OpenAI / Google / fal.ai / Replicate and bill the user's account. The CLI itself is free and open-source (MIT). The cookbook recommends drafting at --quality low first ($0.01 per image on gpt-image-2) and only moving to --quality high ($0.17 per image) when the prompt is locked.
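A quick calculation shows why drafting at low quality pays off. The sketch below uses the per-image figures quoted in this answer ($0.01 low, $0.17 high on gpt-image-2) expressed in whole cents to avoid floating-point rounding; `iteration_cost_cents` is a hypothetical helper, and real bills may vary with size and other options.

```python
# Per-image prices quoted in this FAQ, in cents.
LOW_CENTS = 1
HIGH_CENTS = 17


def iteration_cost_cents(drafts: int, finals: int,
                         draft_cents: int = LOW_CENTS,
                         final_cents: int = HIGH_CENTS) -> int:
    """Total cost in cents for a number of draft renders plus final renders."""
    return drafts * draft_cents + finals * final_cents


# Ten low-quality drafts plus one high-quality final render.
draft_first = iteration_cost_cents(drafts=10, finals=1)

# The same eleven images, all rendered at high quality.
all_high = iteration_cost_cents(drafts=0, finals=11)

print(f"draft-first: ${draft_first / 100:.2f}, all-high: ${all_high / 100:.2f}")
```

Under these figures, iterating ten times at low quality and rendering once at high quality comes to 27 cents, against 187 cents for eleven high-quality renders.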
Where are API keys stored?
The CLI reads keys from process environment first, then ./.env in the project, then ~/.env in the user's home directory — never overriding values already in env. Keys are never written to disk by the tool and never printed in output.
Can I add my own prompts to the gallery?
Yes — that's the point. Each category has a gallery-<category>.md file with an entry template. Generate something via gic, save the preview to docs/<category>/, paste an entry, open a PR.
How do I add a new image provider?
Implement Provider.generate(req: GenerateRequest) in a new file under src/gic/providers/ and register it in providers/__init__.py. The OpenAI implementation in openai_provider.py is the reference shape.
License
MIT — see LICENSE.
Citation
If you use this cookbook in research or tooling, please cite via CITATION.cff or:
Ghelbur, E. (2026). gpt-image-cookbook: a multi-provider AI image generation cookbook with prompt gallery, agentic skill, and CLI [Software]. https://github.com/eugeniughelbur/gpt-image-cookbook
Author
Built and maintained by Eugeniu Ghelbur (@eugeniughelbur).