Skip to main content

Ultimate image generation MCP server unifying Gemini 2.5 Flash Image and Imagen 4/Fast/Ultra with advanced features

Project description

Ultimate Gemini MCP Server 🎨

The most comprehensive MCP server for AI image generation, unifying Google's Gemini 2.5 Flash Image and Imagen 4/Ultra models with advanced features.

✨ Features

Unified API Support

  • Gemini 2.5 Flash Image: Advanced image generation with AI-powered prompt enhancement and editing
  • Imagen 4 & 4-Ultra: High-quality image generation with professional controls
  • Automatic model detection and parameter optimization

Advanced Capabilities

  • 🤖 AI Prompt Enhancement: Automatically optimize prompts using Gemini Flash for superior results
  • 🎨 Image Editing: Modify existing images with natural language instructions
  • 🚀 Batch Processing: Generate multiple images efficiently with parallel processing
  • 🎯 Character Consistency: Maintain character features across multiple generations
  • 🌍 World Knowledge: Integrate accurate real-world context for historical/factual subjects
  • 🎭 Multi-Image Blending: Combine multiple visual elements naturally
  • 🎲 Reproducible Results: Use seeds for consistent generation (Imagen)
  • Negative Prompts: Specify what to avoid in images (Imagen)

Production Ready

  • Comprehensive error handling and validation
  • Configurable settings via environment variables
  • Detailed logging and debugging
  • MCP resources for configuration and model information

🚀 Quick Start

Prerequisites

Installation

Option 1: Using uv (Recommended)

# Install uv if you haven't already
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install and run the server
uvx ultimate-gemini-mcp

Option 2: Using pip

pip install ultimate-gemini-mcp

Option 3: From Source

git clone <repository-url>
cd ultimate-gemini-mcp
uv sync

Configuration

Create a .env file in your project directory:

cp .env.example .env
# Edit .env and add your GEMINI_API_KEY

Or set environment variables directly:

export GEMINI_API_KEY=your_api_key_here

📖 Usage

With Claude Desktop

Add to your claude_desktop_config.json:

{
  "mcpServers": {
    "ultimate-gemini": {
      "command": "uvx",
      "args": ["ultimate-gemini-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

Config file locations:

  • macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
  • Windows: %APPDATA%\Claude\claude_desktop_config.json

With Claude Code (VS Code)

# Add MCP server to Claude Code
claude mcp add ultimate-gemini --env GEMINI_API_KEY=your-api-key -- uvx ultimate-gemini-mcp

With Cursor

Add to Cursor's MCP configuration (.cursor/mcp.json):

{
  "mcpServers": {
    "ultimate-gemini": {
      "command": "uvx",
      "args": ["ultimate-gemini-mcp"],
      "env": {
        "GEMINI_API_KEY": "your-api-key-here"
      }
    }
  }
}

🎯 Available Models

Gemini Models

  • gemini-2.5-flash-image (default): Advanced image generation with prompt enhancement and editing

Imagen Models

  • imagen-4: High-quality image generation with improved text rendering
  • imagen-4-fast: Optimized for faster generation with good quality
  • imagen-4-ultra: Highest quality with best prompt adherence and professional results

🛠️ Tools

generate_image

Generate images using any supported model with comprehensive parameters.

Parameters:

  • prompt (required): Text description of the image
  • model: Model to use (default: gemini-2.5-flash-image)
  • enhance_prompt: Automatically enhance prompt (default: true)
  • number_of_images: Number of images to generate, 1-4 (default: 1)
  • aspect_ratio: Aspect ratio like 1:1, 16:9, 9:16 (default: 1:1)
  • output_format: Image format: png, jpeg, webp (default: png)

Gemini-Specific Parameters:

  • input_image_path: Path to input image for editing
  • maintain_character_consistency: Maintain character features across generations
  • blend_images: Enable multi-image blending
  • use_world_knowledge: Use real-world knowledge for context

Imagen-Specific Parameters:

  • person_generation: Person policy: dont_allow, allow_adult, allow_all
  • negative_prompt: What to avoid in the image
  • seed: Random seed for reproducibility

Example:

Generate an image of "a serene mountain landscape at sunset with a lake reflection" using imagen-4-ultra

batch_generate

Process multiple prompts efficiently with parallel batch processing.

Parameters:

  • prompts (required): List of text prompts
  • model: Model to use for all images
  • enhance_prompt: Enhance all prompts (default: true)
  • aspect_ratio: Aspect ratio for all images
  • batch_size: Parallel processing size (default: from config)

Example:

Batch generate images for these prompts:
1. "minimalist logo design for a tech startup"
2. "modern dashboard UI design"
3. "mobile app wireframe"

🎨 Advanced Features

AI Prompt Enhancement

When enabled (default), the server uses Gemini Flash to automatically enhance your prompts:

Original: a cat wearing a space helmet

Enhanced: A photorealistic portrait of a domestic tabby cat wearing a futuristic space helmet, close-up composition, warm studio lighting, detailed fur texture, reflective helmet visor showing subtle reflections, soft focus background, professional photography style

This significantly improves image quality without requiring you to be a prompt engineering expert!

Image Editing

Use natural language to edit existing images (Gemini model):

Generate an image with:
- prompt: "Add a red scarf to the person"
- input_image_path: "/path/to/image.jpg"

Character Consistency

Generate the same character in different scenes:

Generate an image of "a young wizard in a library, studying ancient books"
with maintain_character_consistency: true

Then:

Generate an image of "the same young wizard, now in a magical forest"
with maintain_character_consistency: true

Reproducible Results

Use seeds for consistent generation (Imagen models):

Generate an image with:
- prompt: "a futuristic cityscape"
- model: "imagen-4-ultra"
- seed: 42

Running with the same seed will produce the same image.

⚙️ Configuration

Environment Variables

Variable Description Default
GEMINI_API_KEY Google Gemini API key (required) -
OUTPUT_DIR Directory for generated images generated_images
ENABLE_PROMPT_ENHANCEMENT Enable AI prompt enhancement true
ENABLE_BATCH_PROCESSING Enable batch processing true
DEFAULT_GEMINI_MODEL Default Gemini model gemini-2.5-flash-image
DEFAULT_IMAGEN_MODEL Default Imagen model imagen-4-ultra
REQUEST_TIMEOUT API request timeout (seconds) 60
MAX_BATCH_SIZE Maximum parallel batch size 8
LOG_LEVEL Logging level INFO

📚 MCP Resources

models://list

View all available models with descriptions and features.

settings://config

View current server configuration.

🎭 Use Cases

Web Development

  • Hero images and banners
  • UI/UX mockups and wireframes
  • Logo and branding assets
  • Placeholder images

App Development

  • App icons and splash screens
  • User interface elements
  • Marketing materials
  • Documentation images

Content Creation

  • Blog post illustrations
  • Social media graphics
  • Presentation visuals
  • Product mockups

Creative Projects

  • Character design iterations
  • Concept art exploration
  • Style variations
  • Scene composition

📊 Comparison

Feature Gemini 2.5 Flash Imagen 4/Fast/Ultra
Prompt Enhancement ✅ Built-in ✅ Built-in
Image Editing ✅ Yes ❌ No
Character Consistency ✅ Yes ❌ No
Multi-Image Blending ✅ Yes ❌ No
Negative Prompts ❌ No ✅ Yes
Seed-based Reproducibility ❌ No ✅ Yes
Person Generation Controls ❌ No ✅ Yes
Speed Options Standard Fast/Standard/Ultra
Best For Editing, iteration, context-aware Photorealism, final quality, speed

🐛 Troubleshooting

"GEMINI_API_KEY not found"

  • Add your API key to .env or environment variables
  • Get a free key at Google AI Studio

"Content blocked by safety filters"

  • Modify your prompt to comply with content policies
  • Try rephrasing without potentially sensitive content

"Rate limit exceeded"

  • Wait a few moments and try again
  • Consider upgrading your API plan for higher limits

Images not saving

  • Check that OUTPUT_DIR exists and is writable
  • Verify you have sufficient disk space

🤝 Contributing

Contributions are welcome! This project combines the best features from multiple MCP servers:

  • mcp-image (TypeScript): Prompt enhancement and editing features
  • nanobanana-mcp-server (Python): Architecture and FastMCP integration
  • gemini-imagen-mcp-server (TypeScript): Imagen API support and batch processing

📄 License

MIT License - see LICENSE file for details.

🙏 Acknowledgments

Built on the excellent work of:

🔗 Links


Ready to create amazing AI-generated images? Install now and start generating! 🚀

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ultimate_gemini_mcp-1.0.5.tar.gz (34.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ultimate_gemini_mcp-1.0.5-py3-none-any.whl (28.2 kB view details)

Uploaded Python 3

File details

Details for the file ultimate_gemini_mcp-1.0.5.tar.gz.

File metadata

  • Download URL: ultimate_gemini_mcp-1.0.5.tar.gz
  • Upload date:
  • Size: 34.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for ultimate_gemini_mcp-1.0.5.tar.gz
Algorithm Hash digest
SHA256 ddf45e5038ad8cddbef32da238653447455ea34259d284c33238ff433c27b2ac
MD5 c3e44f9ee3e0e204fb39cac045e32566
BLAKE2b-256 1fdf53af4abb9845c093070d4ffe09a0eb9cbaa76b1ec576923096ea53cb76c0

See more details on using hashes here.

File details

Details for the file ultimate_gemini_mcp-1.0.5-py3-none-any.whl.

File metadata

File hashes

Hashes for ultimate_gemini_mcp-1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 fb72194471f1339e9b98a34b07a3539c21c88ed0ca4f40093255733ac8a6acd4
MD5 2c4ad0d40b3b142fe072ff16c4e1d764
BLAKE2b-256 192d0d277da0d580d01019f374c2a6d877fe423d9ffdb9196682e426878631d9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page