Skip to main content

YOLO MCP Service - A powerful YOLO computer vision service that integrates with Claude AI through Model Context Protocol

Project description

YOLO MCP Service

A powerful YOLO (You Only Look Once) computer vision service that integrates with Claude AI through Model Context Protocol (MCP). This service enables Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models.

Features

  • Object detection, segmentation, classification, and pose estimation
  • Real-time camera integration for live object detection
  • Support for model training, validation, and export
  • Comprehensive image analysis combining multiple models
  • Support for both file paths and base64-encoded images
  • Seamless integration with Claude AI

Setup Instructions

Prerequisites

  • Python 3.10 or higher
  • Git (optional, for cloning the repository)

Environment Setup

  1. Create a directory for the project and navigate to it:

    mkdir yolo-mcp-service
    cd yolo-mcp-service
    
  2. Download the project files or clone from repository:

    # If you have the files, copy them to this directory
    # If using git:
    git clone https://github.com/GongRzhe/YOLO-MCP-Server.git .
    
  3. Create a virtual environment:

    # On Windows
    python -m venv .venv
    
    # On macOS/Linux
    python3 -m venv .venv
    
  4. Activate the virtual environment:

    # On Windows
    .venv\Scripts\activate
    
    # On macOS/Linux
    source .venv/bin/activate
    
  5. Run the setup script:

    python setup.py
    

    The setup script will:

    • Check your Python version
    • Create a virtual environment (if not already created)
    • Install required dependencies
    • Generate an MCP configuration file (mcp-config.json)
    • Output configuration information for different MCP clients including Claude
  6. Note the output from the setup script, which will look similar to:

    MCP configuration has been written to: /path/to/mcp-config.json
    
    MCP configuration for Cursor:
    
    /path/to/.venv/bin/python /path/to/server.py
    
    MCP configuration for Windsurf/Claude Desktop:
    {
      "mcpServers": {
        "yolo-service": {
          "command": "/path/to/.venv/bin/python",
          "args": [
            "/path/to/server.py"
          ],
          "env": {
            "PYTHONPATH": "/path/to"
          }
        }
      }
    }
    
    To use with Claude Desktop, merge this configuration into: /path/to/claude_desktop_config.json
    

Downloading YOLO Models

Before using the service, you need to download the YOLO models. The service looks for models in the following directories:

  • The current directory where the service is running
  • A models subdirectory
  • Any other directory configured in the CONFIG["model_dirs"] variable in server.py

Create a models directory and download some common models:

# Create models directory
mkdir models

# Download YOLOv8n for basic object detection
curl -L https://github.com/ultralytics/assets/releases/download/v0.0.0/yolov8n.pt -o models/yolov8n.pt

# Download YOLOv8n-seg for segmentation
curl -L https://github.com/ultralytics/assets/releases/download/v0.0.0/yolov8n-seg.pt -o models/yolov8n-seg.pt

# Download YOLOv8n-cls for classification
curl -L https://github.com/ultralytics/assets/releases/download/v0.0.0/yolov8n-cls.pt -o models/yolov8n-cls.pt

# Download YOLOv8n-pose for pose estimation
curl -L https://github.com/ultralytics/assets/releases/download/v0.0.0/yolov8n-pose.pt -o models/yolov8n-pose.pt

For Windows PowerShell users:

# Create models directory
mkdir models

# Download models using Invoke-WebRequest
Invoke-WebRequest -Uri "https://github.com/ultralytics/assets/releases/download/v0.0.0/yolov8n.pt" -OutFile "models/yolov8n.pt"
Invoke-WebRequest -Uri "https://github.com/ultralytics/assets/releases/download/v0.0.0/yolov8n-seg.pt" -OutFile "models/yolov8n-seg.pt"
Invoke-WebRequest -Uri "https://github.com/ultralytics/assets/releases/download/v0.0.0/yolov8n-cls.pt" -OutFile "models/yolov8n-cls.pt"
Invoke-WebRequest -Uri "https://github.com/ultralytics/assets/releases/download/v0.0.0/yolov8n-pose.pt" -OutFile "models/yolov8n-pose.pt"

Configuring Claude

To use this service with Claude:

  1. For Claude web: Set up the service on your local machine and use the configuration provided by the setup script in your MCP client.

  2. For Claude Desktop:

    • Run the setup script and note the configuration output
    • Locate your Claude Desktop configuration file (the path is provided in the setup script output)
    • Add or merge the configuration into your Claude Desktop configuration file
    • Restart Claude Desktop

Using YOLO Tools in Claude

1. First Check Available Models

Always check which models are available on your system first:

I'd like to use the YOLO tools. Can you first check which models are available on my system?

<function_calls>
<invoke name="list_available_models">
</invoke>
</function_calls>

2. Detecting Objects in an Image

For analyzing an image file on your computer:

Can you analyze this image file for objects?

<function_calls>
<invoke name="analyze_image_from_path">
<parameter name="image_path">/path/to/your/image.jpg</parameter>
<parameter name="confidence">0.3</parameter>
</invoke>
</function_calls>

You can also specify a different model:

Can you analyze this image using a different model?

<function_calls>
<invoke name="analyze_image_from_path">
<parameter name="image_path">/path/to/your/image.jpg</parameter>
<parameter name="model_name">yolov8n.pt</parameter>
<parameter name="confidence">0.4</parameter>
</invoke>
</function_calls>

3. Running Comprehensive Image Analysis

For more detailed analysis that combines object detection, classification, and more:

Can you perform a comprehensive analysis on this image?

<function_calls>
<invoke name="comprehensive_image_analysis">
<parameter name="image_path">/path/to/your/image.jpg</parameter>
<parameter name="confidence">0.3</parameter>
</invoke>
</function_calls>

4. Image Segmentation

For identifying object boundaries and creating segmentation masks:

Can you perform image segmentation on this photo?

<function_calls>
<invoke name="segment_objects">
<parameter name="image_data">/path/to/your/image.jpg</parameter>
<parameter name="is_path">true</parameter>
<parameter name="model_name">yolov8n-seg.pt</parameter>
</invoke>
</function_calls>

5. Image Classification

For classifying the entire image content:

What does this image show? Can you classify it?

<function_calls>
<invoke name="classify_image">
<parameter name="image_data">/path/to/your/image.jpg</parameter>
<parameter name="is_path">true</parameter>
<parameter name="model_name">yolov8n-cls.pt</parameter>
<parameter name="top_k">5</parameter>
</invoke>
</function_calls>

6. Using Your Computer's Camera

Start real-time object detection using your computer's camera:

Can you turn on my camera and detect objects in real-time?

<function_calls>
<invoke name="start_camera_detection">
<parameter name="model_name">yolov8n.pt</parameter>
<parameter name="confidence">0.3</parameter>
</invoke>
</function_calls>

Get the latest camera detections:

What are you seeing through my camera right now?

<function_calls>
<invoke name="get_camera_detections">
</invoke>
</function_calls>

Stop the camera when finished:

Please turn off the camera.

<function_calls>
<invoke name="stop_camera_detection">
</invoke>
</function_calls>

7. Advanced Model Operations

Training a Custom Model

I want to train a custom object detection model on my dataset.

<function_calls>
<invoke name="train_model">
<parameter name="dataset_path">/path/to/your/dataset</parameter>
<parameter name="model_name">yolov8n.pt</parameter>
<parameter name="epochs">50</parameter>
</invoke>
</function_calls>

Validating a Model

Can you validate the performance of my model on a test dataset?

<function_calls>
<invoke name="validate_model">
<parameter name="model_path">/path/to/your/trained/model.pt</parameter>
<parameter name="data_path">/path/to/validation/dataset</parameter>
</invoke>
</function_calls>

Exporting a Model to Different Formats

I need to export my YOLO model to ONNX format.

<function_calls>
<invoke name="export_model">
<parameter name="model_path">/path/to/your/model.pt</parameter>
<parameter name="format">onnx</parameter>
</invoke>
</function_calls>

8. Testing Connection

Check if the YOLO service is running correctly:

Is the YOLO service running correctly?

<function_calls>
<invoke name="test_connection">
</invoke>
</function_calls>

Troubleshooting

Camera Issues

If the camera doesn't work, try different camera IDs:

<function_calls>
<invoke name="start_camera_detection">
<parameter name="camera_id">1</parameter>  <!-- Try 0, 1, or 2 -->
</invoke>
</function_calls>

Model Not Found

If a model is not found, make sure you've downloaded it to one of the configured directories:

<function_calls>
<invoke name="get_model_directories">
</invoke>
</function_calls>

Performance Issues

For better performance with limited resources, use the smaller models (e.g., yolov8n.pt instead of yolov8x.pt)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iflow_mcp_gongrzhe_yolo_mcp_server-0.1.0.tar.gz (32.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file iflow_mcp_gongrzhe_yolo_mcp_server-0.1.0.tar.gz.

File metadata

  • Download URL: iflow_mcp_gongrzhe_yolo_mcp_server-0.1.0.tar.gz
  • Upload date:
  • Size: 32.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iflow_mcp_gongrzhe_yolo_mcp_server-0.1.0.tar.gz
Algorithm Hash digest
SHA256 5aab6b62a7037f13627eb742ec3b9cb7a236a0539140a3c5cc7502b8709280a3
MD5 19d8989a94cd4ac8db7c477d8e7240dd
BLAKE2b-256 64e3ac739220bdd632ba8ea0f6d7d3809227464f7fc18135ae1053ae883f2553

See more details on using hashes here.

File details

Details for the file iflow_mcp_gongrzhe_yolo_mcp_server-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: iflow_mcp_gongrzhe_yolo_mcp_server-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 39.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iflow_mcp_gongrzhe_yolo_mcp_server-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 cd7c7c18bf1075ecf5e8e35c78d0fbc2fad9c73a251cc5433440579cbd20bf3d
MD5 40cc88efca1a967793899d8826fc24b0
BLAKE2b-256 32bbb91473bd9a9b47ed057522f40697ab208c2786d7adde337636f4b1ba8ec5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page