MCP Image Generation Server

Image Generation MCP Server

A FastMCP server for generating and editing images using OpenAI's gpt-image-1 and Azure FLUX.1-Kontext-pro models. This component is part of the AppKit platform and is designed to be integrated into the main application.

Features

  • Text-to-Image Generation: Create images from natural language prompts using multiple AI models
  • Image Editing & Inpainting: Edit existing images with text prompts and optional masks (gpt-image-1)
  • Multiple Formats: Output as PNG, JPEG, or WEBP with customizable quality
  • Prompt Enhancement: Auto-refine prompts via LLM for better results

Configuration

This component uses AppKit's configuration system (appkit_commons) and is configured via MCPImageGeneratorConfig. The settings are loaded from configuration/config.yaml or environment variables mapped by appkit_commons.

Settings

| Setting | Description | Default |
|---|---|---|
| backend_server | URL of the backend server (for retrieving images) | http://localhost:8000 |
| max_file_size_mb | Maximum allowed file size for input images, in MB | 10 |
| max_images_to_keep | Storage retention limit for generated images | 50 |
| generator | Active image generator backend (azure or google) | azure |
| azure_api_key | Azure OpenAI API key | None |
| azure_base_url | Azure OpenAI endpoint URL | None |
| azure_prompt_optimizer | LLM model used for prompt enhancement (Azure) | gpt-5-mini |
| azure_image_model | Image generation model identifier (Azure) | FLUX.1-Kontext-pro |
| google_api_key | Google AI API key | None |
| google_prompt_optimizer | LLM model used for prompt enhancement (Google) | gemini-2.0-flash-001 |
| google_image_model | Image generation model identifier (Google) | imagen-4.0-generate-preview-06-06 |
| auth_tokens | List of MCP tokens and scopes for authentication | [] |
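
As a reading aid, the documented defaults can be mirrored in a plain dataclass. This is an illustrative sketch only: ImageSettingsSketch and missing_credentials are hypothetical names, the real MCPImageGeneratorConfig lives in appkit_mcp_image.configuration and may differ in field types and validation, and the mapping of backends to required credentials is an assumption inferred from the None defaults above.

```python
from __future__ import annotations

from dataclasses import dataclass, field
from typing import Optional


@dataclass
class ImageSettingsSketch:
    """Hypothetical mirror of the documented settings; not the real config class."""

    backend_server: str = "http://localhost:8000"
    max_file_size_mb: int = 10
    max_images_to_keep: int = 50
    generator: str = "azure"  # documented values: "azure" or "google"
    azure_api_key: Optional[str] = None
    azure_base_url: Optional[str] = None
    azure_prompt_optimizer: str = "gpt-5-mini"
    azure_image_model: str = "FLUX.1-Kontext-pro"
    google_api_key: Optional[str] = None
    google_prompt_optimizer: str = "gemini-2.0-flash-001"
    google_image_model: str = "imagen-4.0-generate-preview-06-06"
    auth_tokens: list = field(default_factory=list)

    def missing_credentials(self) -> list[str]:
        """Return unset settings that the active backend is assumed to need."""
        # Assumption: Azure needs key + endpoint, Google needs only a key.
        required = {
            "azure": ["azure_api_key", "azure_base_url"],
            "google": ["google_api_key"],
        }
        if self.generator not in required:
            raise ValueError(f"unknown generator: {self.generator!r}")
        return [n for n in required[self.generator] if getattr(self, n) is None]
```

With the defaults, the azure backend is selected and both Azure credentials are reported as missing, which matches the table: the keys have no usable default and must be supplied through config.yaml or the environment.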

Integration

This module is designed to be integrated into the main AppKit application rather than running standalone. The create_image_mcp_server function returns a configured FastMCP instance which is mounted by the main application.

Usage in AppKit

In app/app.py, the server is initialized roughly as follows:

from appkit_commons.registry import service_registry
from appkit_user.authentication.services import get_verifier
from appkit_mcp_image.server import create_image_mcp_server, init_generators
from appkit_mcp_image.configuration import MCPImageGeneratorConfig

# ... inside initialization ...
image_mcp_config = service_registry().get(MCPImageGeneratorConfig)
_generators = init_generators(image_mcp_config)

# Create and mount the MCP server
servers["/image"] = create_image_mcp_server(
    _generators[image_mcp_config.generator],
    auth=get_verifier(),
)

Tools & API

generate_image

Create images from text descriptions.

Parameters:

| Parameter | Type | Default | Description |
|---|---|---|---|
| prompt | string | required | Image description (max 32,000 chars) |
| size | string | 1024x1024 | Dimensions: 1024x1024, 1536x1024, 1024x1536, or auto |
| output_format | string | jpeg | Output format: png, jpeg, or webp |
| seed | integer | 0 | Random seed for reproducibility (0 = random) |
| enhance_prompt | boolean | true | Auto-enhance prompt via LLM |
| background | string | auto | Background: transparent, opaque, or auto |

Example:

generate_image(
    prompt="A serene mountain landscape at sunset with golden light reflecting off a lake",
    size="1536x1024",
    output_format="png",
    enhance_prompt=True
)
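
A caller may want to check arguments against the documented limits before invoking the tool. The sketch below does that on the client side; check_generate_args is a hypothetical helper, not part of the package, and the server may apply different or additional validation.

```python
# Limits copied from the generate_image parameter table.
ALLOWED_SIZES = {"1024x1024", "1536x1024", "1024x1536", "auto"}
ALLOWED_FORMATS = {"png", "jpeg", "webp"}
ALLOWED_BACKGROUNDS = {"transparent", "opaque", "auto"}
MAX_PROMPT_CHARS = 32_000


def check_generate_args(prompt, size="1024x1024", output_format="jpeg",
                        seed=0, background="auto"):
    """Return a list of problems; an empty list means the arguments fit the table."""
    problems = []
    if not isinstance(prompt, str) or not prompt:
        problems.append("prompt is required")
    elif len(prompt) > MAX_PROMPT_CHARS:
        problems.append(f"prompt exceeds {MAX_PROMPT_CHARS} characters")
    if size not in ALLOWED_SIZES:
        problems.append(f"size must be one of {sorted(ALLOWED_SIZES)}")
    if output_format not in ALLOWED_FORMATS:
        problems.append(f"output_format must be one of {sorted(ALLOWED_FORMATS)}")
    if background not in ALLOWED_BACKGROUNDS:
        problems.append(f"background must be one of {sorted(ALLOWED_BACKGROUNDS)}")
    # bool is a subclass of int in Python, so reject it explicitly.
    if isinstance(seed, bool) or not isinstance(seed, int):
        problems.append("seed must be an integer")
    return problems
```

Returning a list rather than raising on the first failure lets a client report every invalid argument in one pass.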

edit_image

Edit existing images with text prompts and optional masks for inpainting.

Parameters:

| Parameter | Type | Default | Description |
|---|---|---|---|
| prompt | string | required | Description of desired edits (max 32,000 chars) |
| image_paths | array | required | Image URLs, file paths, or base64 data URLs (max 16) |
| mask_path | string | optional | Optional mask image for inpainting (transparent areas indicate edit zones) |
| size | string | auto | Output dimensions |
| output_format | string | jpeg | Output format: png, jpeg, or webp |
| background | string | auto | Background setting |

Example:

edit_image(
    prompt="Add a vibrant rainbow across the sky",
    image_paths=["https://example.com/landscape.jpg"],
    mask_path="https://example.com/sky_mask.png",
    output_format="png"
)

Image Input Formats

Supported input methods for the image_paths parameter:

  • HTTP/HTTPS URLs: https://example.com/image.jpg
  • Local file paths: /path/to/image.png
  • Base64 data URLs: data:image/png;base64,iVBORw0KG...
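
The three input forms above can be told apart by their prefix. The following is a minimal sketch using only the standard library; classify_image_input and to_data_url are hypothetical helpers for illustration, and the server's own detection logic may differ.

```python
import base64
from urllib.parse import urlparse


def classify_image_input(value: str) -> str:
    """Return "data_url", "http_url", or "file_path" for one image_paths entry."""
    if value.startswith("data:image/") and ";base64," in value:
        return "data_url"
    if urlparse(value).scheme in ("http", "https"):
        return "http_url"
    # Anything else is treated as a local file path.
    return "file_path"


def to_data_url(raw: bytes, mime: str = "image/png") -> str:
    """Encode raw image bytes as a base64 data URL."""
    encoded = base64.b64encode(raw).decode("ascii")
    return f"data:{mime};base64,{encoded}"
```

to_data_url is handy when the image exists only in memory, for example to_data_url(Path("photo.png").read_bytes()) after importing Path from pathlib.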

Inpainting with Masks

For precise control over edits, use mask images:

  1. Create a PNG image with alpha transparency
  2. Transparent areas (alpha=0) mark regions to edit
  3. Opaque areas remain unchanged
  4. Mask dimensions must match the input image
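
The mask rules above can be illustrated without an imaging library. The sketch below writes an RGBA PNG whose alpha is 0 inside a rectangular edit zone and 255 elsewhere, and reads PNG dimensions so a mask can be compared with its input image. make_mask_png and png_size are hypothetical helpers written for this example; in practice an image library or editor would normally be used to produce masks.

```python
import struct
import zlib

PNG_SIGNATURE = b"\x89PNG\r\n\x1a\n"


def _chunk(kind: bytes, data: bytes) -> bytes:
    """Serialize one PNG chunk: length, type, data, CRC-32 over type + data."""
    crc = zlib.crc32(kind + data) & 0xFFFFFFFF
    return struct.pack(">I", len(data)) + kind + data + struct.pack(">I", crc)


def make_mask_png(width: int, height: int, edit_box) -> bytes:
    """Build an RGBA mask: alpha=0 inside edit_box, alpha=255 elsewhere.

    edit_box is (left, top, right, bottom) with right/bottom exclusive.
    """
    left, top, right, bottom = edit_box
    rows = bytearray()
    for y in range(height):
        rows.append(0)  # filter type 0 (None) for this scanline
        for x in range(width):
            inside = left <= x < right and top <= y < bottom
            rows += bytes((0, 0, 0, 0 if inside else 255))
    # IHDR: 8-bit depth, color type 6 (RGBA), no interlacing.
    ihdr = struct.pack(">IIBBBBB", width, height, 8, 6, 0, 0, 0)
    return (PNG_SIGNATURE + _chunk(b"IHDR", ihdr)
            + _chunk(b"IDAT", zlib.compress(bytes(rows)))
            + _chunk(b"IEND", b""))


def png_size(data: bytes):
    """Read (width, height) from the IHDR chunk of a PNG."""
    if data[:8] != PNG_SIGNATURE or data[12:16] != b"IHDR":
        raise ValueError("not a PNG file")
    return struct.unpack(">II", data[16:24])
```

The bytes returned by make_mask_png can be written to disk and passed as mask_path; comparing png_size(mask) with png_size(image) before the call catches a dimension mismatch early when the input is also a PNG.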

License

This project is licensed under the MIT License - see LICENSE.md for details.
