Universal vision tools for AI agents via Model Context Protocol
Project description
agent-vision-mcp
agent-vision-mcp exposes image analysis, inspection, cropping, OCR, and comparison
tools through the Model Context Protocol.
Quickstart
Run the published package without installing it permanently:
uvx agent-vision-mcp
Example MCP client configuration:
{
"mcpServers": {
"agent-vision": {
"command": "uvx",
"args": ["agent-vision-mcp"],
"env": {
"VISION_API_KEY": "your-api-key",
"VISION_BASE_URL": "https://your-provider.example/v1",
"VISION_MODEL_ID": "your-vision-model"
}
}
}
}
Development
python -m venv .venv
.venv/bin/pip install -e ".[dev]"
cp .env.example .env
.venv/bin/agent-vision-mcp
Configure an OpenAI-compatible multimodal endpoint with VISION_API_KEY,
VISION_BASE_URL, and VISION_MODEL_ID.
URL Handling
VISION_URL_MODE controls how remote images are sent to the model:
autopasses URLs through for analysis and comparison, but downloads them when inspection, cropping, or OCR requires image bytes.passthroughprefers URL passthrough, except for tools that require bytes.downloadalways downloads and verifies remote images before model calls.
Downloads are streamed with byte limits, redirects are security checked, and
all downloaded or encoded inputs are verified as supported images.
URL passthrough relies on the configured model provider to fetch URLs safely;
use download when the provider is not trusted to enforce outbound-network
restrictions.
Dedicated OCR is disabled by default. Set OCR_ENABLED=true and configure the
OCR_* variables to use a separate OCR model; otherwise OCR uses the VLM.
Run Tests
.venv/bin/python -m unittest discover -s tests -v
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agent_vision_mcp-0.0.1.tar.gz.
File metadata
- Download URL: agent_vision_mcp-0.0.1.tar.gz
- Upload date:
- Size: 21.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8a8977b0e8c42f8268304e8f71e18869c86ccf4722799c75c48959e972ca1efa
|
|
| MD5 |
f3640554ebe98ccf10575f54a0269c63
|
|
| BLAKE2b-256 |
982bfd0f961497f75ad665d219812219e9acb41a5a1595da2b449df54cdf0dd4
|
Provenance
The following attestation bundles were made for agent_vision_mcp-0.0.1.tar.gz:
Publisher:
publish.yml on idealizing/agent-vision-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
agent_vision_mcp-0.0.1.tar.gz -
Subject digest:
8a8977b0e8c42f8268304e8f71e18869c86ccf4722799c75c48959e972ca1efa - Sigstore transparency entry: 1767881391
- Sigstore integration time:
-
Permalink:
idealizing/agent-vision-mcp@68e2f0c50f75cd50e186aea69deae0009159e0ca -
Branch / Tag:
refs/tags/v0.0.1 - Owner: https://github.com/idealizing
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@68e2f0c50f75cd50e186aea69deae0009159e0ca -
Trigger Event:
push
-
Statement type:
File details
Details for the file agent_vision_mcp-0.0.1-py3-none-any.whl.
File metadata
- Download URL: agent_vision_mcp-0.0.1-py3-none-any.whl
- Upload date:
- Size: 22.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
60982f5ce41d0b311418f085741a5d4496e188c13bfd00ec42a2f4d83a7f746f
|
|
| MD5 |
8bb1d30378b72562ea36e8b9a3dd7dcb
|
|
| BLAKE2b-256 |
85663998900e193a6baf094423b6c223fda45be1367baca05ae8a463829d941e
|
Provenance
The following attestation bundles were made for agent_vision_mcp-0.0.1-py3-none-any.whl:
Publisher:
publish.yml on idealizing/agent-vision-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
agent_vision_mcp-0.0.1-py3-none-any.whl -
Subject digest:
60982f5ce41d0b311418f085741a5d4496e188c13bfd00ec42a2f4d83a7f746f - Sigstore transparency entry: 1767881630
- Sigstore integration time:
-
Permalink:
idealizing/agent-vision-mcp@68e2f0c50f75cd50e186aea69deae0009159e0ca -
Branch / Tag:
refs/tags/v0.0.1 - Owner: https://github.com/idealizing
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@68e2f0c50f75cd50e186aea69deae0009159e0ca -
Trigger Event:
push
-
Statement type: