Session-based conversational image generation MCP server using OpenAI's advanced models
Project description
OpenAI Image MCP Server
A Model Context Protocol (MCP) server that provides conversational OpenAI image generation capabilities. Generate, edit, and refine images through multi-turn conversations with advanced models like GPT-4o and GPT-4.1.
🎯 What Problems Does This Solve?
Traditional Image Generation Pain Points
❌ Single-shot limitations - "Make it more blue" requires re-describing everything
❌ No conversation memory - Each request starts from scratch
❌ Context loss - Can't reference previous images naturally
❌ Manual workflows - Complex multi-step processes require multiple tools
Our Solution
✅ Conversational refinement - "Make it more blue" works naturally
✅ Session memory - Builds on previous context automatically
✅ Reference awareness - "Use the same style as the previous image"
✅ Integrated workflows - Single interface for complex creative projects
🚀 Key Capabilities
🔄 Session-Based Conversations
# Start a focused session
session = create_image_session("Logo design for tech startup")
# Initial generation
result1 = generate_image_in_session(session_id, "modern tech logo")
# Natural refinement - no need to repeat everything
result2 = generate_image_in_session(session_id, "make it more minimalist")
# Build on context
result3 = generate_image_in_session(session_id, "try it in dark blue")
🔄 Hybrid Workflows
Start simple, expand when needed:
# Quick one-shot for immediate need
result = generate_image("modern office workspace")
# Later, promote to session for refinement
session = promote_image_to_session(
result["image_path"],
"Office workspace refinement project"
)
# Continue with conversational context
generate_image_in_session(session_id, "add more plants and warmer lighting")
🎨 Specialized Tools
- Product photography - E-commerce optimized with multiple angles
- UI/UX assets - Design elements with consistent styling
- Reference-based editing - Use existing images as style guides
- Batch processing - Multiple variations with consistent themes
🎯 Perfect For
LLM Applications
- Claude Desktop integration - Conversational image workflows
- AI assistants - Contextual image generation capabilities
- Chatbots - Visual content creation with memory
Creative Workflows
- Iterative design - Refine concepts through conversation
- Brand development - Consistent visual identity across assets
- Product visualization - Multiple angles and contexts
- Content creation - Blog headers, social media, presentations
Development Teams
- Rapid prototyping - Quick UI mockups and concepts
- Documentation - Visual aids and diagrams
- Marketing assets - Consistent brand imagery
- User testing - Visual variations for A/B testing
🚀 Quick Start
1. Installation
pip install openai-image-mcp
For development installation from source, see DEVELOPMENT.md
2. Claude Desktop Integration
Add to your Claude Desktop MCP configuration:
{
"mcpServers": {
"openai-image-mcp": {
"command": "sh",
"args": [
"-c",
"openai-image-mcp 2> mcp_server_stderr.log"
],
"env": {
"OPENAI_API_KEY": "your_openai_api_key_here"
}
}
}
}
For development setup and alternative configurations, see DEVELOPMENT.md
3. Start Creating
# Create a session for your project
session = create_image_session("Website hero images")
# Generate with natural language
generate_image_in_session(session_id, "modern tech office with diverse team")
# Refine naturally
generate_image_in_session(session_id, "make the lighting warmer")
# Add context
generate_image_in_session(session_id, "create a mobile version of this scene")
🛠️ Available Tools
Core Session Management
create_image_session- Start conversational sessiongenerate_image_in_session- Generate with context awarenessget_session_status- View conversation history and progressclose_session- End session and cleanup
Image Generation & Editing
generate_image- General purpose (session optional)edit_image- Modify existing imagesgenerate_product_image- E-commerce optimizedgenerate_ui_asset- UI/UX design elementsanalyze_and_improve_image- AI-powered image enhancement
Workflow Tools
promote_image_to_session- Upgrade one-shot to conversationallist_active_sessions- Manage multiple projectsget_usage_guide- Comprehensive tool documentation
🎯 Usage Patterns
📱 Conversational Design Sessions (Recommended)
Best for: Multi-image projects, iterative refinement, brand consistency
session = create_image_session("App icon design")
generate_image_in_session(session_id, "colorful chat app icon")
generate_image_in_session(session_id, "make it more professional")
generate_image_in_session(session_id, "try different color schemes")
⚡ Quick One-Shot Generation
Best for: Immediate needs, single images, uncertain scope
generate_image("professional headshot for LinkedIn")
generate_product_image("wireless headphones", background_type="white")
🔄 Hybrid Start-Simple-Expand-Later
Best for: Testing concepts, uncertain requirements, flexible workflows
# Start quick
result = generate_image("logo concept for bakery")
# Expand when needed
session = promote_image_to_session(result["image_path"], "Bakery brand development")
generate_image_in_session(session_id, "create business card version")
🎨 Example Workflows
Brand Identity Development
session = create_image_session("TechCorp brand identity")
# Logo concepts
generate_image_in_session(session_id, "modern tech company logo")
generate_image_in_session(session_id, "make it more geometric and minimal")
# Expand to brand elements
generate_image_in_session(session_id, "business card design using this logo")
generate_image_in_session(session_id, "website header with the logo")
Product Marketing Suite
session = create_image_session("Wireless headphones marketing")
# Product shots
generate_product_image("premium wireless headphones", angle="45deg")
result = promote_image_to_session(previous_result["image_path"], "headphones campaign")
# Marketing variations
generate_image_in_session(session_id, "lifestyle shot with person using them")
generate_image_in_session(session_id, "create packaging design mockup")
📚 Documentation
- LLM.md - Comprehensive guide for LLMs using this server
- DEVELOPMENT.md - Technical implementation, testing, and contribution guide
📋 Requirements
- Python 3.11+
- OpenAI API key with GPT-4o/GPT-4.1 access
- Poetry for dependency management
🔐 Environment Variables
OPENAI_API_KEY(required) - Your OpenAI API keyMCP_MAX_SESSIONS(optional) - Maximum concurrent sessions (default: 100)MCP_SESSION_TIMEOUT(optional) - Session timeout in seconds (default: 3600)
🤝 Contributing
We welcome contributions! Please see DEVELOPMENT.md for:
- Technical architecture details
- Development setup instructions
- Testing guidelines
- Code style requirements
📄 License
This project is licensed under the MIT License - see the LICENSE file for details.
🔗 Related Resources
- Model Context Protocol - Protocol specification
- OpenAI Responses API - Underlying API
- Claude Desktop - Primary integration target
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file openai_image_mcp-0.2.0.tar.gz.
File metadata
- Download URL: openai_image_mcp-0.2.0.tar.gz
- Upload date:
- Size: 26.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.13.2 Darwin/24.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dbb37c8d4d054c337216614c0aafcb0358d8d2bd92510766007bc3258f77018e
|
|
| MD5 |
142cd72be63cc6ae41e708c35ed89714
|
|
| BLAKE2b-256 |
3f0aa88d6b3be38759af6f856e4a8ee08fc976a262d08bebd36b0c045b061019
|
File details
Details for the file openai_image_mcp-0.2.0-py3-none-any.whl.
File metadata
- Download URL: openai_image_mcp-0.2.0-py3-none-any.whl
- Upload date:
- Size: 28.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.1.1 CPython/3.13.2 Darwin/24.5.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e6dc1fa202d031f75f1f1d4f8b965a14e2ac1deeb4080c10c06c798dd3a742b1
|
|
| MD5 |
bd0b5ed865fa3fd921241dd5089b29df
|
|
| BLAKE2b-256 |
2ec632d63b60d459c259248fff1ad305710c4fbff5e2c1ae28c11859887095e7
|