Multi-provider AI image generation cookbook: prompt gallery, agentic skill, and CLI for OpenAI, Google Imagen, Flux, and more.

gpt-image-cookbook

Multi-provider AI image generation cookbook — prompt gallery, agentic skill, and CLI for OpenAI gpt-image-2, Google Imagen, Flux, and more.

License: MIT · Python ≥ 3.11

gpt-image-cookbook is an open-source, multi-provider AI image generation toolkit that bundles a curated prompt gallery, an agentic skill (a SKILL.md runbook for Claude Code, Codex, OpenClaw, and Hermes), and a Python CLI (gic) in one repository. It puts OpenAI gpt-image-2, Google Imagen, and Flux (fal.ai / Replicate) behind a single interface for text-to-image generation, reference-image editing, inpainting, and multi-reference workflows. At the time of writing, gpt-image-2 is fully implemented, while Imagen and Flux are provider stubs (see the FAQ).

If you build with AI image models, this gives you copy-paste prompts that work, a CLI that handles auth and edits, and an agent runbook that wires it all together.

What this is

Three things bundled together:

A curated prompt gallery — copy-paste prompts organized by category (posters, UI mockups, photography, diagrams, brand systems, edit/inpaint workflows) that produce reliable results across providers.
An agentic skill — SKILL.md runbook for Claude Code, Codex, OpenClaw, Hermes, and other skill-capable agent runtimes. Tells the agent how to search the gallery, refine the prompt, and call the CLI without writing one-off scripts.
A CLI (gic) — one command, multiple providers. Switch between OpenAI gpt-image-2, Google Imagen, and Flux with a single --provider flag.

Why a cookbook

Most prompt galleries are read-only inspiration. This one is wired into an agent that uses the gallery: search → refine → generate, with consistent semantics across providers. The CLI handles auth, encoding, sizing, edits, and inpainting so the agent never reinvents API plumbing.

Install

# pip
pip install gpt-image-cookbook

# or one-shot via uvx
uvx --from git+https://github.com/eugeniughelbur/gpt-image-cookbook gic --help

Set at least one provider key:

export OPENAI_API_KEY=sk-...        # for openai (default)
export GOOGLE_API_KEY=...           # for imagen
export FAL_KEY=...                  # for flux

The CLI also reads ./.env and ~/.env (without overriding env vars already set).
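The lookup order described above (process environment first, then ./.env, then ~/.env, never overriding a value that is already set) can be sketched roughly as follows. This is an illustrative sketch, not the code gic ships: the function names `parse_env_line` and `load_env_files` are hypothetical, and the parser is deliberately simplified (it does not strip inline comments).

```python
import os
from pathlib import Path
from typing import List, MutableMapping, Optional, Tuple


def parse_env_line(line: str) -> Optional[Tuple[str, str]]:
    """Parse one KEY=VALUE line; return None for blanks, comments, or malformed lines."""
    stripped = line.strip()
    if not stripped or stripped.startswith("#") or "=" not in stripped:
        return None
    if stripped.startswith("export "):
        stripped = stripped[len("export "):]
    key, _, value = stripped.partition("=")
    key = key.strip()
    value = value.strip().strip("'\"")  # drop surrounding quotes, if any
    return (key, value) if key else None


def load_env_files(
    paths: List[Path],
    environ: Optional[MutableMapping[str, str]] = None,
) -> MutableMapping[str, str]:
    """Fill in missing keys from each file in order; never replace an existing value."""
    env = os.environ if environ is None else environ
    for path in paths:
        if not path.is_file():
            continue  # a missing .env file is not an error
        for line in path.read_text(encoding="utf-8").splitlines():
            parsed = parse_env_line(line)
            if parsed is not None:
                key, value = parsed
                env.setdefault(key, value)  # earlier sources win
    return env
```

Under these assumptions, calling `load_env_files([Path(".env"), Path.home() / ".env"])` would reproduce the documented precedence: a key exported in the shell beats ./.env, which in turn beats ~/.env.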

Quickstart

# Text-to-image, OpenAI default
gic -p "A minimalist conference poster, headline reads exactly 'Signal over Noise'" --quality high --size portrait

# Reference edit
gic -p "Make the sky a stormy sunset" -i ref.png

# Inpaint
gic -p "Replace the masked area with a coffee cup" -i ref.png -m mask.png

# Switch provider (Imagen and Flux are still stubs; see the FAQ)
gic -p "Photoreal product shot of a ceramic mug on oak" --provider imagen --quality high

Outputs land in ./generated/<timestamp>-<slug>.png unless you pass -f.
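To make the auto-naming scheme concrete, here is a minimal sketch of how a `./generated/<timestamp>-<slug>.png` path could be derived from a prompt. The timestamp format, the six-word slug limit, and the helper names `slugify` and `default_output_path` are all assumptions for illustration; the actual rules used by gic may differ.

```python
import re
from datetime import datetime
from pathlib import Path
from typing import Optional


def slugify(prompt: str, max_words: int = 6) -> str:
    """Lowercase the prompt, keep alphanumeric words, and join the first few with '-'."""
    words = re.findall(r"[a-z0-9]+", prompt.lower())
    return "-".join(words[:max_words]) or "image"  # fallback for symbol-only prompts


def default_output_path(
    prompt: str,
    now: Optional[datetime] = None,
    out_dir: Path = Path("generated"),
    ext: str = "png",
) -> Path:
    """Build <out_dir>/<timestamp>-<slug>.<ext>; the timestamp format is an assumption."""
    stamp = (now or datetime.now()).strftime("%Y%m%d-%H%M%S")
    return out_dir / f"{stamp}-{slugify(prompt)}.{ext}"
```

Passing `now` explicitly keeps the function deterministic, which is convenient when testing naming logic without touching the clock.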

Use as a Claude Code plugin

# In Claude Code
/plugin install eugeniughelbur/gpt-image-cookbook

The agent loads skills/gpt-image-cookbook/SKILL.md, follows the operating loop (classify → search gallery → refine → generate), and calls gic for you.

Use as an agent skill (Codex, OpenClaw, Hermes, …)

Point your runtime at skills/gpt-image-cookbook/SKILL.md. Compatible runtimes auto-resolve the gic CLI via command -v gic, uv, or uvx.
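A runtime that wants to mimic the resolution order above (gic, then uv, then uvx) could do something like the following. This is a sketch under stated assumptions: the function name `resolve_gic` is hypothetical, the `uvx --from ...` form is taken from the Install section, and `uv tool run` is used as the long form of `uvx`. What a given runtime actually executes may differ.

```python
import shutil
from typing import Callable, List, Optional

# Repository spec from the Install section of this README.
REPO_SPEC = "git+https://github.com/eugeniughelbur/gpt-image-cookbook"


def resolve_gic(which: Callable[[str], Optional[str]] = shutil.which) -> List[str]:
    """Return an argv prefix that launches gic, preferring an installed binary."""
    if which("gic"):
        return ["gic"]
    if which("uv"):
        # Assumption: fall back to running the tool straight from the repo.
        return ["uv", "tool", "run", "--from", REPO_SPEC, "gic"]
    if which("uvx"):
        return ["uvx", "--from", REPO_SPEC, "gic"]
    raise FileNotFoundError("none of gic, uv, or uvx was found on PATH")
```

The `which` parameter is injectable so the lookup can be tested without modifying PATH; in normal use the default `shutil.which` is enough, and the caller appends flags such as `-p` to the returned list.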

Repo layout

.claude-plugin/        # Claude Code plugin + marketplace metadata
skills/
  gpt-image-cookbook/
    SKILL.md           # agent runbook
    references/        # gallery routing index, per-category prompts, craft cheatsheet
    scripts/           # generate.py launcher
    agents/            # runtime metadata for OpenClaw / Hermes / etc.
src/gic/               # the CLI (Python)
docs/                  # gallery thumbnails (added as you build entries)

CLI reference

| Flag | Values | Use |
| --- | --- | --- |
| `-p, --prompt` | string | required prompt or edit instruction |
| `-f, --file` | path | output path; auto-named if omitted |
| `-i, --image` | repeatable path | reference image; switches to edits endpoint |
| `-m, --mask` | PNG path | alpha mask for inpaint; requires `-i` |
| `--provider` | `openai`, `imagen`, `flux` | provider router |
| `--model` | string | override the provider's default model |
| `--size` | `1k`, `2k`, `4k`, `portrait`, `landscape`, `square`, `wide`, `tall`, or `WxH` | canvas size |
| `--quality` | `low`, `medium`, `high`, `auto` | cost/quality dial |
| `-n, --n` | integer | number of images |
| `--background` | `auto`, `opaque`, `transparent` | background mode |
| `--format` | `png`, `jpeg`, `webp` | output encoding |
| `--user` | string | passed to provider for end-user attribution |

Exit codes: 0 success · 1 API/refusal · 2 bad args/missing key.
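For scripts that wrap the CLI, the flag rules and exit codes above can be captured in a small helper. The sketch below only uses flags listed in the table; the helper names `build_gic_args` and `describe_exit` are hypothetical and are not part of the package.

```python
from typing import List, Optional, Sequence

# Exit-code meanings as documented in the CLI reference.
EXIT_MEANINGS = {
    0: "success",
    1: "API error or refusal",
    2: "bad arguments or missing key",
}


def build_gic_args(
    prompt: str,
    images: Sequence[str] = (),
    mask: Optional[str] = None,
    provider: Optional[str] = None,
    quality: Optional[str] = None,
    size: Optional[str] = None,
) -> List[str]:
    """Assemble a gic argv, enforcing that -m is only used together with -i."""
    if not prompt:
        raise ValueError("a prompt is required")
    if mask and not images:
        raise ValueError("-m/--mask requires at least one -i/--image")
    args = ["gic", "-p", prompt]
    for image in images:  # -i is repeatable for multi-reference workflows
        args += ["-i", image]
    if mask:
        args += ["-m", mask]
    for flag, value in (("--provider", provider), ("--quality", quality), ("--size", size)):
        if value:
            args += [flag, value]
    return args


def describe_exit(code: int) -> str:
    """Translate a gic exit code into the documented meaning."""
    return EXIT_MEANINGS.get(code, f"unexpected exit code {code}")
```

A caller would typically run `subprocess.run(build_gic_args(...)).returncode` and pass the result to `describe_exit`, retrying only on code 1 since code 2 indicates a problem that a retry cannot fix.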

Adding a prompt to the gallery

1. Generate something you like with gic.
2. Save the preview thumbnail under docs/<category>/.
3. Add an entry to the matching skills/gpt-image-cookbook/references/gallery-<category>.md using the template documented inside that file.
4. Open a PR.

Adding a new provider

The provider abstraction lives in src/gic/providers/. Each provider implements Provider.generate(req: GenerateRequest) and gets registered in providers/__init__.py. See openai_provider.py for the reference shape.
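As a rough picture of what such an abstraction can look like, here is a self-contained sketch of a provider protocol plus a registry. Only the names `Provider`, `GenerateRequest`, and `generate(req)` come from the text above; the request fields, the `List[bytes]` return type, the `PROVIDERS` registry, the `register` helper, and `EchoProvider` are all assumptions made for illustration. Consult openai_provider.py for the real shape.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Protocol


@dataclass
class GenerateRequest:
    # Hypothetical fields, loosely mirroring the CLI flags.
    prompt: str
    images: List[str] = field(default_factory=list)
    mask: Optional[str] = None
    size: str = "square"
    quality: str = "auto"
    n: int = 1


class Provider(Protocol):
    def generate(self, req: GenerateRequest) -> List[bytes]:
        """Return one encoded image per requested output (assumed return type)."""
        ...


# Hypothetical registry: provider name -> provider instance.
PROVIDERS: Dict[str, Provider] = {}


def register(name: str, provider: Provider) -> None:
    PROVIDERS[name] = provider


class EchoProvider:
    """Stand-in provider for tests: returns the prompt bytes instead of image data."""

    def generate(self, req: GenerateRequest) -> List[bytes]:
        return [req.prompt.encode("utf-8")] * req.n


def generate(provider_name: str, req: GenerateRequest) -> List[bytes]:
    """Route a request to the named provider, as a --provider flag might."""
    try:
        provider = PROVIDERS[provider_name]
    except KeyError:
        raise ValueError(f"unknown provider: {provider_name}") from None
    return provider.generate(req)


register("echo", EchoProvider())
```

Using a structural `Protocol` means a new provider only needs a matching `generate` method and does not have to inherit from a base class; whether the real package uses a protocol or an abstract base class is not stated in this README.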

FAQ

What is gpt-image-cookbook?

gpt-image-cookbook is an open-source toolkit for AI image generation that bundles three things in one repository: a curated prompt gallery (copy-paste prompts that work), an agentic skill (SKILL.md runbook for Claude Code, Codex, OpenClaw, and Hermes agent runtimes), and a Python CLI (gic) wrapping OpenAI gpt-image-2, Google Imagen, and Flux under one interface.

Which AI image models does it support?

OpenAI gpt-image-2 is fully implemented for text-to-image, reference-image edits, inpainting (with a PNG alpha mask), and multi-reference workflows. Google Imagen (imagen-4) and Flux (flux-pro-1.1, flux-schnell via fal.ai or Replicate) currently exist as provider stubs: the abstraction is in place, and their implementations will land progressively.

How do I install it?

The fastest one-shot path is uvx --from git+https://github.com/eugeniughelbur/gpt-image-cookbook gic --help. For repeated use, run pip install gpt-image-cookbook (version 0.1.0 is published on PyPI), or clone the repo and run pip install -e ".[openai]" for an editable install.

How do I use it as a Claude Code plugin?

Run /plugin install eugeniughelbur/gpt-image-cookbook inside Claude Code. The plugin loads the agent skill at skills/gpt-image-cookbook/SKILL.md and resolves the gic CLI automatically. The agent then follows the operating loop documented in SKILL.md (classify → search gallery → refine → generate).

What's the difference between this and just calling the OpenAI API directly?

Three things you don't get from the raw API: a gallery of working prompts organized by category, a multi-provider abstraction so you can switch between OpenAI / Imagen / Flux without rewriting code, and an agent runbook that lets a Claude Code or Codex agent use the gallery + CLI without you writing one-off scripts.

Does it cost money to use?

Yes. Calls go to OpenAI / Google / fal.ai / Replicate and are billed to your account with that provider. The CLI itself is free and open-source (MIT). The cookbook recommends drafting at --quality low first (~$0.01 per image on gpt-image-2) and only moving to --quality high (~$0.17 per image) when the prompt is locked.
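The saving from drafting at low quality is easy to work out. The sketch below uses the approximate per-image prices quoted above, expressed in whole cents to avoid floating-point rounding; the function name `iteration_cost_cents` and the 10-drafts scenario are hypothetical, and actual provider pricing may differ.

```python
LOW_CENTS = 1    # ~$0.01 per image at --quality low (approximate figure from this README)
HIGH_CENTS = 17  # ~$0.17 per image at --quality high (approximate figure from this README)


def iteration_cost_cents(
    drafts: int,
    finals: int,
    draft_cents: int = LOW_CENTS,
    final_cents: int = HIGH_CENTS,
) -> int:
    """Total cost in cents for some draft renders plus some final renders."""
    return drafts * draft_cents + finals * final_cents


# Ten low-quality drafts, then one high-quality final render.
draft_first = iteration_cost_cents(drafts=10, finals=1)   # 27 cents

# The same eleven renders, all at high quality.
all_high = iteration_cost_cents(drafts=0, finals=11)      # 187 cents

print(f"draft-first: ${draft_first / 100:.2f}, all-high: ${all_high / 100:.2f}")
```

Under these assumed prices, iterating at low quality costs roughly a seventh of rendering every attempt at high quality.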

Where are API keys stored?

The CLI reads keys from process environment first, then ./.env in the project, then ~/.env in the user's home directory — never overriding values already in env. Keys are never written to disk by the tool and never printed in output.

Can I add my own prompts to the gallery?

Yes — that's the point. Each category has a gallery-<category>.md file with an entry template. Generate something via gic, save the preview to docs/<category>/, paste an entry, open a PR.

How do I add a new image provider?

Implement Provider.generate(req: GenerateRequest) in a new file under src/gic/providers/ and register it in providers/__init__.py. The OpenAI implementation in openai_provider.py is the reference shape.

License

MIT — see LICENSE.

Citation

If you use this cookbook in research or tooling, please cite via CITATION.cff or:

Ghelbur, E. (2026). gpt-image-cookbook: a multi-provider AI image generation cookbook with prompt gallery, agentic skill, and CLI [Software]. https://github.com/eugeniughelbur/gpt-image-cookbook

Author

Built and maintained by Eugeniu Ghelbur (@eugeniughelbur).

This version

0.1.0

May 8, 2026

Download files

Source Distribution

gpt_image_cookbook-0.1.0.tar.gz (22.6 kB)

Uploaded May 8, 2026 Source

Built Distribution

gpt_image_cookbook-0.1.0-py3-none-any.whl (12.3 kB)

Uploaded May 8, 2026 Python 3

File details

Details for the file gpt_image_cookbook-0.1.0.tar.gz.

File metadata

Download URL: gpt_image_cookbook-0.1.0.tar.gz
Upload date: May 8, 2026
Size: 22.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for gpt_image_cookbook-0.1.0.tar.gz
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `98885ad72501bdc6f67dd2eb15c9b045c40c46242ffffce6f11118c6a5a09c72` |
| MD5 | `f9dd4e3a9f41198bf86a5aed63039b7e` |
| BLAKE2b-256 | `327ee301c9b31b22f88b64dbfb4b1b79b5ef5362e6482dfdf41d370419f832e5` |

File details

Details for the file gpt_image_cookbook-0.1.0-py3-none-any.whl.

File metadata

Download URL: gpt_image_cookbook-0.1.0-py3-none-any.whl
Upload date: May 8, 2026
Size: 12.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for gpt_image_cookbook-0.1.0-py3-none-any.whl
| Algorithm | Hash digest |
| --- | --- |
| SHA256 | `d9160d06c2d0df7729b00eaac89150898746371a3d940bf50fa4bedc5f5ecc2b` |
| MD5 | `00c0f4beb87902f620f9940d144a4976` |
| BLAKE2b-256 | `4084c92cc58bfbbc66ae538914ee59cf59cf03fbbc803002b5d273aadda26c9c` |
