AI-powered image generation MCP server with multi-model support
Project description
PixelForge MCP
An MCP server for AI-powered image generation, editing, analysis, and transformation using Google's Gemini and Imagen 4 models.
Features
- 21 MCP tools for image generation, editing, analysis, transformation, and more
- 6 models including Gemini image models and Imagen 4 family
- Quality presets (fast/balanced/quality) for simplified model selection
- Parallel multi-image generation via asyncio.gather
- Image transforms — crop, resize, rotate, flip, blur, sharpen, grayscale, watermark
- Prompt template library — 24 curated templates across 10 categories
- Generation history — full audit trail with search and pagination
- Batch processing — generate up to 10 images in parallel
- Optional Vertex AI — upscaling (x2/x4) and advanced editing modes
- Thinking mode & grounding — extended reasoning and Google Search for analysis
- 14 aspect ratios (including panoramic) and temperature control
- Async-first architecture with full Pydantic validation
- EXIF metadata embedding (prompt, model, timestamp)
Quick Start
Requirements: Python 3.10+ and a Google API key
Install
pipx install pixelforge-mcp
Configure
Claude Code
claude mcp add pixelforge --scope user -e GOOGLE_API_KEY="your-key" -- pixelforge-mcp
Cursor
Add to .cursor/mcp.json:
{
"mcpServers": {
"pixelforge": {
"command": "pixelforge-mcp",
"env": {
"GOOGLE_API_KEY": "your-key"
}
}
}
}
VS Code
code --add-mcp '{"name":"pixelforge","command":"pixelforge-mcp","env":{"GOOGLE_API_KEY":"your-key"}}'
Windsurf
Add to ~/.codeium/windsurf/mcp_config.json:
{
"mcpServers": {
"pixelforge": {
"command": "pixelforge-mcp",
"env": {
"GOOGLE_API_KEY": "your-key"
}
}
}
}
Kiro
kiro-cli mcp add --name pixelforge --scope global --command pixelforge-mcp --env "GOOGLE_API_KEY=your-key"
Claude Desktop
Edit ~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"pixelforge": {
"command": "pixelforge-mcp",
"env": {
"GOOGLE_API_KEY": "your-key"
}
}
}
}
Restart Claude Desktop after saving.
Use
Ask Claude to generate, edit, or analyze images — all 21 tools are available automatically.
Available Tools
Generation
| Tool | Description |
|---|---|
generate_image |
Generate images from text (6 models, 14 aspect ratios, quality presets, parallel multi-image) |
edit_image |
Modify existing images with text prompts |
remove_background |
Remove image background (transparent or white) |
transform_image |
Crop, resize, rotate, flip, blur, sharpen, grayscale, or watermark |
batch_generate |
Generate up to 10 images in parallel from multiple prompts |
Analysis
| Tool | Description |
|---|---|
analyze_image |
AI-powered image description with optional grounding |
extract_text |
OCR — extract text with confidence levels |
detect_objects |
Detect objects with bounding boxes |
compare_images |
Compare 2-10 images for differences |
Utility
| Tool | Description |
|---|---|
optimize_prompt |
Enhance prompts for better image generation (14 styles) |
estimate_cost |
Calculate generation costs per model/operation |
list_templates |
Browse 24 curated prompt templates across 10 categories |
apply_template |
Fill a template with your subject for a ready-to-use prompt |
list_available_models |
Model capabilities, speed, quality, and selection guidance |
get_server_info |
Server configuration and status |
History
| Tool | Description |
|---|---|
list_history |
Browse generation history with pagination and filtering |
get_generation_details |
Get full details of a specific generation |
Vertex AI (Optional)
| Tool | Description |
|---|---|
upscale_image |
Upscale images x2 or x4 (requires Vertex AI) |
advanced_edit |
Inpainting, outpainting, background swap, style transfer (requires Vertex AI) |
Model Selection
PixelForge supports per-request model switching with 6 models:
Gemini Models (via Gemini API)
| Use case | Model | Why |
|---|---|---|
| Fast iterations | gemini-2.5-flash-image (default) |
Cheapest, lowest latency |
| Panoramic & grounded | gemini-3.1-flash-image-preview |
1:4/4:1/1:8/8:1, web+image grounding |
| Max text fidelity | gemini-3-pro-image-preview |
~94% accuracy, complex multi-turn edits |
Imagen 4 Models (via Gemini API)
| Use case | Model | Why |
|---|---|---|
| Cost-effective batch | imagen-4.0-generate-001 |
$0.04/img, excellent quality |
| Cheapest generation | imagen-4.0-fast-generate-001 |
$0.02/img, fastest |
| Maximum quality | imagen-4.0-ultra-generate-001 |
$0.06/img, best output |
Quality Presets
Instead of choosing a model manually, use quality presets:
generate_image(prompt="...", quality="fast") # gemini-2.5-flash-image
generate_image(prompt="...", quality="balanced") # gemini-3.1-flash-image-preview + 1K
generate_image(prompt="...", quality="quality") # gemini-3-pro-image-preview + 2K
Vertex AI (Optional)
For advanced features like image upscaling and specialized editing modes:
- Set up a Google Cloud project with Vertex AI enabled
- Set the environment variable:
GOOGLE_CLOUD_PROJECT=your-project-id - Install the Vertex AI dependency:
pip install pixelforge-mcp[vertex]
PixelForge auto-detects Vertex AI credentials and unlocks upscale_image and advanced_edit tools.
Supported Aspect Ratios
| Ratio | Description |
|---|---|
1:1 |
Square (default) |
16:9 |
Widescreen landscape |
9:16 |
Mobile portrait |
2:3 |
Classic portrait |
3:2 |
Classic landscape |
3:4 |
Portrait |
4:3 |
Traditional landscape |
4:5 |
Instagram portrait |
5:4 |
Medium format |
21:9 |
Ultrawide |
1:4 |
Tall panoramic* |
4:1 |
Wide panoramic* |
1:8 |
Extra tall panoramic* |
8:1 |
Extra wide panoramic* |
* Panoramic ratios require gemini-3.1-flash-image-preview model
Troubleshooting
"Invalid API key" or "Authentication failed"
Double-check your Google API key is correct and has access to the Gemini API. Get a key at Google AI Studio.
"Command not found: pixelforge-mcp"
Ensure the pipx bin directory is in your PATH:
export PATH="$HOME/.local/bin:$PATH"
"Server not starting"
Check that pixelforge-mcp is installed:
pipx list | grep pixelforge
Vertex AI features not available
Ensure GOOGLE_CLOUD_PROJECT is set and google-cloud-aiplatform is installed:
pip install pixelforge-mcp[vertex]
Documentation
- Configuration Guide — detailed setup and environment options
- Changelog — version history
Contributing
See CONTRIBUTING.md for development setup, code standards, and pull request guidelines.
License
Acknowledgments
Built with FastMCP, Pydantic, and google-genai.
Author
Ahmed Al-Eryani @ Tehnolabs
PixelForge MCP - Forging pixels with AI
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pixelforge_mcp-0.5.2.tar.gz.
File metadata
- Download URL: pixelforge_mcp-0.5.2.tar.gz
- Upload date:
- Size: 49.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
62ffc9448b4a98a1aeda5860454dacf6cb2a9c47bb8a11788bad2d11cd3f67cb
|
|
| MD5 |
2532425faf97ad54dbccc6fac22b8b92
|
|
| BLAKE2b-256 |
11a46f3d1462083d12f443f1c3e0e1f328bbb2ba17b98981e67f0e660359a027
|
File details
Details for the file pixelforge_mcp-0.5.2-py3-none-any.whl.
File metadata
- Download URL: pixelforge_mcp-0.5.2-py3-none-any.whl
- Upload date:
- Size: 49.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5b310be8da1359a3ca50af8b22c6267da697f9ccd2878982cca409e4831532ee
|
|
| MD5 |
150e1d35801a5e19196e8da26c420f77
|
|
| BLAKE2b-256 |
428e40f607a25340cff8320c8f6e48bc30418f27d1275de75dc6cb70bba449e5
|