Skip to main content

Universal vision tools for AI agents via Model Context Protocol

Project description

agent-vision-mcp

agent-vision-mcp exposes image analysis, inspection, cropping, OCR, and comparison tools through the Model Context Protocol.

Quickstart

Run the published package without installing it permanently:

uvx agent-vision-mcp

Example MCP client configuration:

{
  "mcpServers": {
    "agent-vision": {
      "command": "uvx",
      "args": ["agent-vision-mcp"],
      "env": {
        "VISION_API_KEY": "your-api-key",
        "VISION_BASE_URL": "https://your-provider.example/v1",
        "VISION_MODEL_ID": "your-vision-model"
      }
    }
  }
}

Development

python -m venv .venv
.venv/bin/pip install -e ".[dev]"
cp .env.example .env
.venv/bin/agent-vision-mcp

Configure an OpenAI-compatible multimodal endpoint with VISION_API_KEY, VISION_BASE_URL, and VISION_MODEL_ID.

URL Handling

VISION_URL_MODE controls how remote images are sent to the model:

  • auto passes URLs through for analysis and comparison, but downloads them when inspection, cropping, or OCR requires image bytes.
  • passthrough prefers URL passthrough, except for tools that require bytes.
  • download always downloads and verifies remote images before model calls.

Downloads are streamed with byte limits, redirects are security checked, and all downloaded or encoded inputs are verified as supported images. URL passthrough relies on the configured model provider to fetch URLs safely; use download when the provider is not trusted to enforce outbound-network restrictions.

Dedicated OCR is disabled by default. Set OCR_ENABLED=true and configure the OCR_* variables to use a separate OCR model; otherwise OCR uses the VLM.

Run Tests

.venv/bin/python -m unittest discover -s tests -v

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_vision_mcp-0.0.1.tar.gz (21.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agent_vision_mcp-0.0.1-py3-none-any.whl (22.6 kB view details)

Uploaded Python 3

File details

Details for the file agent_vision_mcp-0.0.1.tar.gz.

File metadata

  • Download URL: agent_vision_mcp-0.0.1.tar.gz
  • Upload date:
  • Size: 21.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agent_vision_mcp-0.0.1.tar.gz
Algorithm Hash digest
SHA256 8a8977b0e8c42f8268304e8f71e18869c86ccf4722799c75c48959e972ca1efa
MD5 f3640554ebe98ccf10575f54a0269c63
BLAKE2b-256 982bfd0f961497f75ad665d219812219e9acb41a5a1595da2b449df54cdf0dd4

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_vision_mcp-0.0.1.tar.gz:

Publisher: publish.yml on idealizing/agent-vision-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file agent_vision_mcp-0.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for agent_vision_mcp-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 60982f5ce41d0b311418f085741a5d4496e188c13bfd00ec42a2f4d83a7f746f
MD5 8bb1d30378b72562ea36e8b9a3dd7dcb
BLAKE2b-256 85663998900e193a6baf094423b6c223fda45be1367baca05ae8a463829d941e

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_vision_mcp-0.0.1-py3-none-any.whl:

Publisher: publish.yml on idealizing/agent-vision-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page