An MCP server providing tools to control web browsers using the Amazon Nova Act SDK

nova-act-mcp

nova‑act‑mcp‑server is a zero‑install Model Context Protocol (MCP) server that exposes Amazon Nova Act browser‑automation tools.

What's New in v3.0.0

  • On-Demand Screenshots: New inspect_browser tool to explicitly request screenshots only when needed
  • Reduced Token Usage: Browser actions no longer automatically include screenshots, saving context space
  • More Efficient Workflows: Agents can now control when to get visual feedback
  • Better Performance: Smaller response payloads improve overall agent experience

New inspect_browser Tool Example

# Start a browser session
start_result = await control_browser(action="start", url="https://example.com")
session_id = start_result["session_id"]

# Execute an action without getting a screenshot
execute_result = await control_browser(
    action="execute",
    session_id=session_id,
    instruction="Click on the 'More information...' link"
)

# Now explicitly request a screenshot to see the result
inspect_result = await inspect_browser(session_id=session_id)

# Example output from inspect_browser:
{
  "session_id": "f8a53291-b3a7-4e1e-8c9d-9a12b3c45d67",
  "current_url": "https://www.iana.org/domains/reserved",
  "page_title": "IANA — IANA-managed Reserved Domains",
  "content": [
    {
      "type": "image_base64",
      "data": "data:image/jpeg;base64,/9j/4AAQSkZJRgABAQEASABIAAD/2wBDAAMCA...",
      "caption": "Current viewport"
    },
    {
      "type": "text",
      "text": "Current URL: https://www.iana.org/domains/reserved\nPage Title: IANA — IANA-managed Reserved Domains"
    }
  ],
  "agent_thinking": [],
  "success": true
}
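A client that receives this response usually needs the raw image bytes rather than the data URI. The following is a minimal sketch of one way to pull them out, assuming the response has already been parsed into a Python dict shaped like the example output above. The helper name extract_images is hypothetical and is not part of nova-act-mcp.

```python
import base64


def extract_images(result: dict) -> list[bytes]:
    """Decode every image_base64 item found in result["content"].

    Hypothetical helper: assumes the response shape shown in the
    example output (a "content" list of typed items).
    """
    images = []
    for item in result.get("content", []):
        if item.get("type") != "image_base64":
            continue  # skip text items and any other content types
        data = item.get("data", "")
        # Strip a "data:image/jpeg;base64," style prefix if one is present.
        if data.startswith("data:") and "," in data:
            data = data.split(",", 1)[1]
        images.append(base64.b64decode(data))
    return images
```

The decoded bytes can then be written to a .jpg file or passed to an image library. Text items in the content array are ignored by this helper.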

What's New in v0.2.9

  • Improved Screenshot Reliability: More dependable screenshot delivery in responses
  • Enhanced Log Path Discovery: Smart, efficient path tracking for logs and screenshots
  • Better Agent Communication: Clear messaging when screenshots can't be embedded
  • Improved Performance: Eliminated inefficient directory scanning for faster responses

What's New in v0.2.8

  • Enhanced Inline Screenshots: Screenshots now appear directly in the response content array
  • Improved compatibility with vision-capable models like Claude
  • Screenshots include descriptive captions based on the executed instruction
  • Each screenshot is delivered as { type: "image_base64", data: "..." } in the content array

What's New in v0.2.7

  • Automatic Inline Screenshots: Every browser action now includes an optimized screenshot
  • Improved screenshot quality and reliability for AI agents
  • Added environment variables to customize screenshot quality and size limits
  • Comprehensive test coverage ensuring screenshots work in all scenarios

New Feature: Inline Screenshots

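Starting with v0.2.7, every successful execute response contains inline_screenshot, a base64-encoded JPEG of the current viewport. (As of v3.0.0, browser actions no longer include screenshots automatically; use inspect_browser to request one.)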

  • Quality ≈ 45, capped at 250 KB by default (configurable via the NOVA_MCP_MAX_INLINE_IMG environment variable)
  • If the raw JPEG is larger than the cap, the field is null
  • No extra API calls needed: screenshots are included automatically
  • For full-resolution images and HAR/HTML logs, use the compress_logs tool
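The size-cap rule described above can be sketched as follows. This is an illustration of the stated behavior, not the server's actual code: the function name inline_screenshot is hypothetical, and the sketch assumes that NOVA_MCP_MAX_INLINE_IMG holds a byte count and that 250 KB means 250 * 1024 bytes, neither of which the text above confirms.

```python
import base64
import os
from typing import Optional

# Assumption: the default cap of 250 KB is interpreted as 250 * 1024 bytes.
DEFAULT_MAX_INLINE_IMG = 250 * 1024


def inline_screenshot(jpeg_bytes: bytes, max_bytes: Optional[int] = None) -> Optional[str]:
    """Return a base64 data URI for the JPEG, or None if it exceeds the cap.

    Hypothetical helper mirroring the documented rule: when the raw JPEG
    is larger than the cap, the inline field is null.
    """
    if max_bytes is None:
        # Assumption: the environment variable stores the cap in bytes.
        max_bytes = int(os.environ.get("NOVA_MCP_MAX_INLINE_IMG", DEFAULT_MAX_INLINE_IMG))
    if len(jpeg_bytes) > max_bytes:
        return None
    return "data:image/jpeg;base64," + base64.b64encode(jpeg_bytes).decode("ascii")
```

Comparing the raw byte length (before base64 encoding) matches the wording "if the raw JPEG is larger than the cap"; the encoded string is roughly a third larger than the raw image.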

What's New in v0.2.6

  • Added compatibility with NovaAct SDK 0.9+ by normalizing log directory handling
  • Improved test organization with clear markers for unit, mock, smoke and e2e tests
  • Moved mock HTML creation logic from production code to test helpers
  • Fixed several syntax errors and incomplete code blocks
  • Added SCREENSHOT_QUALITY constant for consistent compression settings

Quick start (uvx)

Add it to your MCP client configuration:

{
  "mcpServers": {
    "nova-act-mcp-server": {
      "command": "uvx",
      "args": ["nova-act-mcp-server@latest"],
      "env": { "NOVA_ACT_API_KEY": "<your_api_key>" }
    }
  }
}

That's all you need to start controlling browsers from any MCP‑compatible client such as Claude Desktop or VS Code.
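If the client configuration already lists other servers, the entry has to be merged in rather than pasted over the file. A minimal sketch of that merge, operating on an already-loaded configuration dict; the function name with_nova_act_server and the "other-server" entry are hypothetical, and where the configuration file lives depends on the client.

```python
import copy
import json


def with_nova_act_server(config: dict, api_key: str) -> dict:
    """Return a copy of config with the nova-act-mcp-server entry added.

    Hypothetical helper: existing entries under "mcpServers" are preserved.
    """
    updated = copy.deepcopy(config)
    servers = updated.setdefault("mcpServers", {})
    servers["nova-act-mcp-server"] = {
        "command": "uvx",
        "args": ["nova-act-mcp-server@latest"],
        "env": {"NOVA_ACT_API_KEY": api_key},
    }
    return updated


# Example: merge into a configuration that already defines another server.
existing = {"mcpServers": {"other-server": {"command": "other"}}}
print(json.dumps(with_nova_act_server(existing, "<your_api_key>"), indent=2))
```

The helper returns a new dict instead of mutating its input, so the original configuration can still be written back unchanged if anything goes wrong.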

Local development (optional)

git clone https://github.com/madtank/nova-act-mcp.git
cd nova-act-mcp
uv sync
uv run nova_mcp.py

License

MIT
