Ultimate image generation MCP server unifying Gemini 2.5 Flash Image and Imagen 4/Fast/Ultra with advanced features

These details have not been verified by PyPI

Project links

Project description

Ultimate Gemini MCP Server 🎨

The most comprehensive MCP server for AI image generation, unifying Google's Gemini 2.5 Flash Image and Imagen 4/Ultra models with advanced features.

✨ Features

Unified API Support

Gemini 2.5 Flash Image: Advanced image generation with AI-powered prompt enhancement and editing
Imagen 4 & 4-Ultra: High-quality image generation with professional controls
Automatic model detection and parameter optimization

Advanced Capabilities

🤖 AI Prompt Enhancement: Automatically optimize prompts using Gemini Flash for superior results
🎨 Image Editing: Modify existing images with natural language instructions
🚀 Batch Processing: Generate multiple images efficiently with parallel processing
🎯 Character Consistency: Maintain character features across multiple generations
🌍 World Knowledge: Integrate accurate real-world context for historical/factual subjects
🎭 Multi-Image Blending: Combine multiple visual elements naturally
🎲 Reproducible Results: Use seeds for consistent generation (Imagen)
⚫ Negative Prompts: Specify what to avoid in images (Imagen)

Production Ready

Comprehensive error handling and validation
Configurable settings via environment variables
Detailed logging and debugging
MCP resources for configuration and model information

🚀 Quick Start

Prerequisites

Python 3.11 or higher
Google Gemini API key (free)

Installation

Option 1: Using uv (Recommended)

# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install and run the server
uvx ultimate-gemini-mcp

Option 2: Using pip

pip install ultimate-gemini-mcp

Option 3: From Source

git clone <repository-url>
cd ultimate-gemini-mcp
uv sync

Configuration

Create a .env file in your project directory:

cp .env.example .env
# Edit .env and add your GEMINI_API_KEY

Or set environment variables directly:

export GEMINI_API_KEY=your_api_key_here

📖 Usage

With Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "ultimate-gemini": {
      "command": "uvx",
      "args": ["ultimate-gemini-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

Config file locations:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

With Claude Code (VS Code)

# Add MCP server to Claude Code
claude mcp add ultimate-gemini --env GEMINI_API_KEY=your-api-key -- uvx ultimate-gemini-mcp

With Cursor

Add to Cursor's MCP configuration (.cursor/mcp.json):

{
  "mcpServers": {
    "ultimate-gemini": {
      "command": "uvx",
      "args": ["ultimate-gemini-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

🎯 Available Models

Gemini Models

gemini-2.5-flash-image (default): Advanced image generation with prompt enhancement and editing

Imagen Models

imagen-4: High-quality image generation with improved text rendering
imagen-4-fast: Optimized for faster generation with good quality
imagen-4-ultra: Highest quality with best prompt adherence and professional results

🛠️ Tools

`generate_image`

Generate images using any supported model with comprehensive parameters.

Parameters:

prompt (required): Text description of the image
model: Model to use (default: gemini-2.5-flash-image)
enhance_prompt: Automatically enhance prompt (default: true)
number_of_images: Number of images to generate, 1-4 (default: 1)
aspect_ratio: Aspect ratio like 1:1, 16:9, 9:16 (default: 1:1)
output_format: Image format: png, jpeg, webp (default: png)

Gemini-Specific Parameters:

input_image_path: Path to input image for editing
maintain_character_consistency: Maintain character features across generations
blend_images: Enable multi-image blending
use_world_knowledge: Use real-world knowledge for context

Imagen-Specific Parameters:

person_generation: Person policy: dont_allow, allow_adult, allow_all
negative_prompt: What to avoid in the image
seed: Random seed for reproducibility

Example:

Generate an image of "a serene mountain landscape at sunset with a lake reflection" using imagen-4-ultra

`batch_generate`

Process multiple prompts efficiently with parallel batch processing.

Parameters:

prompts (required): List of text prompts
model: Model to use for all images
enhance_prompt: Enhance all prompts (default: true)
aspect_ratio: Aspect ratio for all images
batch_size: Parallel processing size (default: from config)

Example:

Batch generate images for these prompts:
1. "minimalist logo design for a tech startup"
2. "modern dashboard UI design"
3. "mobile app wireframe"

🎨 Advanced Features

AI Prompt Enhancement

When enabled (default), the server uses Gemini Flash to automatically enhance your prompts:

Original: a cat wearing a space helmet

Enhanced: A photorealistic portrait of a domestic tabby cat wearing a futuristic space helmet, close-up composition, warm studio lighting, detailed fur texture, reflective helmet visor showing subtle reflections, soft focus background, professional photography style

This significantly improves image quality without requiring you to be a prompt engineering expert!

Image Editing

Use natural language to edit existing images (Gemini model):

Generate an image with:
- prompt: "Add a red scarf to the person"
- input_image_path: "/path/to/image.jpg"

Character Consistency

Generate the same character in different scenes:

Generate an image of "a young wizard in a library, studying ancient books"
with maintain_character_consistency: true

Then:

Generate an image of "the same young wizard, now in a magical forest"
with maintain_character_consistency: true

Reproducible Results

Use seeds for consistent generation (Imagen models):

Generate an image with:
- prompt: "a futuristic cityscape"
- model: "imagen-4-ultra"
- seed: 42

Running with the same seed will produce the same image.

⚙️ Configuration

Environment Variables

Variable	Description	Default
`GEMINI_API_KEY`	Google Gemini API key (required)	-
`OUTPUT_DIR`	Directory for generated images	`generated_images`
`ENABLE_PROMPT_ENHANCEMENT`	Enable AI prompt enhancement	`true`
`ENABLE_BATCH_PROCESSING`	Enable batch processing	`true`
`DEFAULT_GEMINI_MODEL`	Default Gemini model	`gemini-2.5-flash-image`
`DEFAULT_IMAGEN_MODEL`	Default Imagen model	`imagen-4-ultra`
`REQUEST_TIMEOUT`	API request timeout (seconds)	`60`
`MAX_BATCH_SIZE`	Maximum parallel batch size	`8`
`LOG_LEVEL`	Logging level	`INFO`

📚 MCP Resources

`models://list`

View all available models with descriptions and features.

`settings://config`

View current server configuration.

🎭 Use Cases

Web Development

Hero images and banners
UI/UX mockups and wireframes
Logo and branding assets
Placeholder images

App Development

App icons and splash screens
User interface elements
Marketing materials
Documentation images

Content Creation

Blog post illustrations
Social media graphics
Presentation visuals
Product mockups

Creative Projects

Character design iterations
Concept art exploration
Style variations
Scene composition

📊 Comparison

Feature	Gemini 2.5 Flash	Imagen 4/Fast/Ultra
Prompt Enhancement	✅ Built-in	✅ Built-in
Image Editing	✅ Yes	❌ No
Character Consistency	✅ Yes	❌ No
Multi-Image Blending	✅ Yes	❌ No
Negative Prompts	❌ No	✅ Yes
Seed-based Reproducibility	❌ No	✅ Yes
Person Generation Controls	❌ No	✅ Yes
Speed Options	Standard	Fast/Standard/Ultra
Best For	Editing, iteration, context-aware	Photorealism, final quality, speed

🐛 Troubleshooting

"GEMINI_API_KEY not found"

Add your API key to .env or environment variables
Get a free key at Google AI Studio

"Content blocked by safety filters"

Modify your prompt to comply with content policies
Try rephrasing without potentially sensitive content

"Rate limit exceeded"

Wait a few moments and try again
Consider upgrading your API plan for higher limits

Images not saving

Check that OUTPUT_DIR exists and is writable
Verify you have sufficient disk space

🤝 Contributing

Contributions are welcome! This project combines the best features from multiple MCP servers:

mcp-image (TypeScript): Prompt enhancement and editing features
nanobanana-mcp-server (Python): Architecture and FastMCP integration
gemini-imagen-mcp-server (TypeScript): Imagen API support and batch processing

📄 License

MIT License - see LICENSE file for details.

🙏 Acknowledgments

Built on the excellent work of:

mcp-image - Prompt enhancement concept
nanobanana-mcp-server - FastMCP architecture
gemini-imagen-mcp-server - Imagen integration

🔗 Links

Ready to create amazing AI-generated images? Install now and start generating! 🚀

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

6.0.9

Mar 3, 2026

6.0.8

Mar 3, 2026

6.0.7

Mar 2, 2026

6.0.6

Mar 2, 2026

6.0.5

Mar 2, 2026

6.0.4

Feb 28, 2026

6.0.3

Feb 28, 2026

6.0.2

Feb 26, 2026

6.0.1

Feb 26, 2026

5.0.6

Feb 26, 2026

5.0.5

Feb 26, 2026

5.0.4

Feb 26, 2026

5.0.3

Feb 26, 2026

5.0.2

Feb 26, 2026

5.0.1

Feb 19, 2026

3.0.19

Feb 19, 2026

3.0.18

Feb 19, 2026

3.0.17

Feb 19, 2026

3.0.16

Feb 19, 2026

3.0.15

Jan 18, 2026

3.0.14

Dec 28, 2025

3.0.13

Dec 28, 2025

3.0.12

Nov 21, 2025

3.0.11

Nov 21, 2025

3.0.10

Nov 21, 2025

3.0.9

Nov 21, 2025

3.0.8

Nov 21, 2025

3.0.7

Nov 21, 2025

3.0.6

Nov 21, 2025

3.0.5

Nov 21, 2025

3.0.4

Nov 21, 2025

3.0.3

Nov 21, 2025

3.0.2

Nov 21, 2025

3.0.1

Nov 21, 2025

2.0.1

Nov 21, 2025

1.6.2

Oct 31, 2025

1.6.1

Oct 30, 2025

1.6.0

Oct 26, 2025

1.5.1

Oct 26, 2025

1.0.19

Oct 26, 2025

1.0.18

Oct 26, 2025

1.0.17

Oct 26, 2025

1.0.16

Oct 26, 2025

1.0.15

Oct 26, 2025

1.0.14

Oct 26, 2025

1.0.13

Oct 26, 2025

1.0.12

Oct 26, 2025

1.0.11

Oct 26, 2025

1.0.10

Oct 26, 2025

1.0.9

Oct 26, 2025

1.0.8

Oct 26, 2025

1.0.7

Oct 26, 2025

1.0.6

Oct 26, 2025

This version

1.0.5

Oct 25, 2025

1.0.4

Oct 25, 2025

1.0.3

Oct 25, 2025

1.0.2

Oct 25, 2025

1.0.1

Oct 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ultimate_gemini_mcp-1.0.5.tar.gz (34.8 kB view details)

Uploaded Oct 25, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ultimate_gemini_mcp-1.0.5-py3-none-any.whl (28.2 kB view details)

Uploaded Oct 25, 2025 Python 3

File details

Details for the file ultimate_gemini_mcp-1.0.5.tar.gz.

File metadata

Download URL: ultimate_gemini_mcp-1.0.5.tar.gz
Upload date: Oct 25, 2025
Size: 34.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for ultimate_gemini_mcp-1.0.5.tar.gz
Algorithm	Hash digest
SHA256	`ddf45e5038ad8cddbef32da238653447455ea34259d284c33238ff433c27b2ac`
MD5	`c3e44f9ee3e0e204fb39cac045e32566`
BLAKE2b-256	`1fdf53af4abb9845c093070d4ffe09a0eb9cbaa76b1ec576923096ea53cb76c0`

See more details on using hashes here.

File details

Details for the file ultimate_gemini_mcp-1.0.5-py3-none-any.whl.

File metadata

Download URL: ultimate_gemini_mcp-1.0.5-py3-none-any.whl
Upload date: Oct 25, 2025
Size: 28.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for ultimate_gemini_mcp-1.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fb72194471f1339e9b98a34b07a3539c21c88ed0ca4f40093255733ac8a6acd4`
MD5	`2c4ad0d40b3b142fe072ff16c4e1d764`
BLAKE2b-256	`192d0d277da0d580d01019f374c2a6d877fe423d9ffdb9196682e426878631d9`

See more details on using hashes here.

ultimate-gemini-mcp 1.0.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Ultimate Gemini MCP Server 🎨

✨ Features

Unified API Support

Advanced Capabilities

Production Ready

🚀 Quick Start

Prerequisites

Installation

Option 1: Using uv (Recommended)

Option 2: Using pip

Option 3: From Source

Configuration

📖 Usage

With Claude Desktop

With Claude Code (VS Code)

With Cursor

🎯 Available Models

Gemini Models

Imagen Models

🛠️ Tools

generate_image

batch_generate

🎨 Advanced Features

AI Prompt Enhancement

Image Editing

Character Consistency

Reproducible Results

⚙️ Configuration

Environment Variables

📚 MCP Resources

models://list

settings://config

🎭 Use Cases

Web Development

App Development

Content Creation

Creative Projects

📊 Comparison

🐛 Troubleshooting

"GEMINI_API_KEY not found"

"Content blocked by safety filters"

"Rate limit exceeded"

Images not saving

🤝 Contributing

📄 License

🙏 Acknowledgments

🔗 Links

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`generate_image`

`batch_generate`

`models://list`

`settings://config`