An MCP server providing tools to control web browsers using the Amazon Nova Act SDK

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

nova-act-mcp

An MCP server providing tools to control web browsers using the Amazon Nova Act SDK. Enables interactive browser automation with transparent agent reasoning via MCP.

What is nova-act-mcp?
Prerequisites
Installation
Getting Started
Understanding Agent Thinking
Tips for Effective Browser Automation
Advanced Features
Example Capabilities
Performance Considerations
Testing
Troubleshooting
Contributing
License
Acknowledgements
Feedback for Amazon Nova Act
Limitations
Development
Future Enhancements

What is nova-act-mcp?

Nova Act MCP is a bridge between Amazon's Nova Act browser automation SDK and the Model Context Protocol (MCP). It allows AI assistants like Claude to control web browsers to perform tasks through natural language instructions, while providing visibility into the agent's reasoning process.

This project exposes Nova Act's powerful browser automation capabilities through an MCP server interface, making it easy to:

Control web browsers directly from AI assistants
Execute interactive browser automation tasks
Persist cookies and local storage between interactions (profile data is reused)
See the agent's step-by-step reasoning process

Prerequisites

Before getting started, you'll need:

Python 3.10 or higher
An Amazon Nova Act API key (get one from https://nova.amazon.com/act)
An MCP-compatible client (Claude Desktop, VSCode with Copilot Plugin, or any other client that supports the Model Context Protocol)

Installation

Clone this repository:

git clone https://github.com/madtank/nova-act-mcp.git
cd nova-act-mcp

Install dependencies:

# Update dependencies after changing pyproject.toml or cloning
uv sync

# For development only - if you need to add a new dependency
# uv add package_name

Quick Start with uvx

For the fastest way to get started without cloning the repository or setting up a virtual environment:

# No clone, no venv – zero-install test
uvx nova-act-mcp control_browser --action start --url https://example.com

This uses the Python package manager UVX to directly run the latest version from PyPI.

Getting Started

This guide focuses on setting up the server with the Claude Desktop application, but nova-act-mcp is designed to work with any Model Context Protocol (MCP) compatible client, such as Visual Studio Code or others. Consult your specific client's documentation for instructions on integrating MCP servers.

Nova Act MCP Claude Example

1. Obtain a Nova Act API Key

Go to https://nova.amazon.com/act
Generate a new API key
Save it for the next step

2. Configure Your MCP Client (Example: Claude Desktop)

The following steps show how to configure the Claude Desktop application. If you are using a different MCP client, adapt these instructions accordingly.

Install Claude Desktop from https://claude.ai/desktop (if using Claude).

Open your Claude for Desktop App configuration:

# MacOS
code ~/Library/Application\ Support/Claude/claude_desktop_config.json

# Windows
code %USERPROFILE%\AppData\Roaming\Claude\claude_desktop_config.json

Add the nova-act-mcp server configuration:

{
  "mcpServers": {
    "nova-browser": {
      "command": "uv",
      "args": [
        "--directory",
        "~/path/to/nova-act-mcp", // Replace with the absolute path to the cloned repo
        "run",
        "nova_mcp.py"
      ],
      "env": {
        "NOVA_ACT_API_KEY": "your_api_key_here" // Replace with your Nova Act API key
      }
    }
  }
}

Replace:

~/path/to/nova-act-mcp with the absolute path to where you cloned this repository
your_api_key_here with your actual Nova Act API key

Save the file and restart Claude Desktop

3. Using with Your MCP Client (Example: Claude)

Once configured, you can use the browser automation tool with your MCP client. For Claude Desktop, look for the hammer icon (🔨), which indicates available MCP tools.

Try a simple example like this:

Can you help me search for news on Google? Use the nova-browser tool to:
1. Start a session on google.com
2. Execute the instruction "search for news"
3. Tell me what you see and your reasoning process

Understanding Agent Thinking

One of the key features of nova-act-mcp is the ability to see the agent's reasoning process. When the agent performs actions in the browser, you'll see detailed explanations of:

How it's interpreting your instructions
What elements it sees on the page
Why it's choosing specific actions
How it confirms successful completion

This transparency helps you understand how the agent is approaching tasks and makes debugging easier when something doesn't work as expected.

Tips for Effective Browser Automation

Navigation Limitations and Best Practices

IMPORTANT: The Nova Act agent has specific navigation limitations that are critical to understand:

Always Provide URLs for Page Changes: When navigating to a different website or page, you MUST include the full URL in the url parameter:

{"action": "execute", "session_id": "your-session-id", "url": "https://www.amazon.com/", "instruction": "Search for headphones"}

No Back Button Navigation: The agent doesn't understand browser history concepts like "back" or "forward" buttons. To return to a previous page, you must provide its full URL again.
Direct URL Access is Best: Rather than trying to navigate through multiple pages to reach your target, provide the direct URL of your target page whenever possible.
Page-Specific Actions: Each instruction should be specific to the current page. If you need to work on different pages, use separate commands with the appropriate URLs.

Example: Correct Workflow

# CORRECT: Direct navigation to target site with specific action
control_browser {"action": "execute", "session_id": "your-session-id", "url": "https://www.amazon.com", "instruction": "Search for bluetooth headphones"}

# CORRECT: New direct navigation to a different site
control_browser {"action": "execute", "session_id": "your-session-id", "url": "https://www.google.com", "instruction": "Search for weather forecast"}

Example: Problematic Workflow

# PROBLEMATIC: Expecting navigation without URL
control_browser {"action": "execute", "session_id": "your-session-id", "instruction": "Go to Amazon and search for headphones"}

# PROBLEMATIC: Assuming back button functionality
control_browser {"action": "execute", "session_id": "your-session-id", "instruction": "Go back to the previous page"}

Be Specific and Concise

When prompting for browser actions:

✅ DO:

Provide clear, actionable instructions
Focus on one task at a time
Be specific about what elements to interact with

❌ DON'T:

Use vague or ambiguous language
Request complex multi-step actions in a single instruction
Ask for subjective evaluations without clear criteria

Example of Good Instructions

Instead of trying to do everything at once, break it down into interactive steps:

1. Start a session on amazon.com
2. Execute "search for bluetooth headphones"
3. Execute "filter by 4+ stars"
4. Execute "sort by price low to high"
5. Execute "find the price of the first item"

Handling Invalid URLs and Navigation Errors

The Nova Act agent handles invalid or inaccessible URLs in different ways, which is important to understand when working with AI systems that might reference non-existent websites:

Non-existent Domains: If the domain doesn't exist (e.g., https://this-website-definitely-does-not-exist-123456789.com), you'll receive an error message like:
```
Error executing instruction: Failed to navigate to execute URL https://non-existent-site.com: 
Page.goto: net::ERR_NAME_NOT_RESOLVED
```
Valid Domain with Non-existent Path: If the domain exists but the specific page doesn't (e.g., https://example.com/non-existent-page-12345), the browser will still load the domain's default page or an error page provided by the website. The agent will describe whatever content appears.
Connection Refused: For URLs pointing to servers that aren't running or aren't accessible (e.g., http://localhost:9999), you'll receive an error like:
```
Error executing instruction: Failed to navigate to execute URL http://localhost:9999: 
Page.goto: net::ERR_CONNECTION_REFUSED
```

Best Practice: When dealing with URLs that might not exist (such as those suggested by an AI model), start with a verified root domain first, then navigate to more specific pages:

# Start with a known working domain
control_browser {"action": "execute", "session_id": "your-session-id", "url": "https://example.com", "instruction": "Tell me what you see"}

# Then try a more specific path if needed
control_browser {"action": "execute", "session_id": "your-session-id", "url": "https://example.com/specific-path", "instruction": "Tell me what you see"}

Advanced Features

Persistent Browser Profiles

The nova-act-mcp server maintains a separate profile directory for each session under profiles/{session_id}, reusing that profile on every execute. This ensures:

Cookies, login tokens, and local storage persist between calls
Profile isolation per session ID

Note: Each execute command launches a new browser context on the same profile to pick up cookies/storage. In-page state (e.g. form fields, DOM changes) will not persist across separate execute calls. To preserve multi-step page state, combine your steps into a single execute instruction.

Browser Session Management and Persistence

The following actions are available for browser session management:

start - Starts a new browser session with a specified URL
execute - Executes a natural language instruction in the current session
end - Ends the current browser session and cleans up resources

Example flow:

# Start a session
start: https://www.google.com

# Execute instructions
execute: search for news
execute: click on the first result

# End the session
end

Important: Browser Persistence

The browser launched during a start action remains open throughout the session lifecycle:

The same browser instance is reused for all execute commands within a session
Each browser runs in its own dedicated thread to ensure thread safety
The browser continues running in the background between commands, maintaining its state
Browser windows are only closed when:
- You explicitly call the end action for a session
- The nova-act-mcp server process exits (using a cleanup mechanism)
- Sessions are automatically cleaned up after 10 minutes of inactivity

Warning: If you don't properly end sessions with the end action, browser instances may continue running in the background, consuming system resources. Always make sure to end your sessions when you're done using them.

To check for and close any lingering sessions, you can use the list_browser_sessions tool to identify active sessions, then call end on any sessions you no longer need.

Browser Visibility and Screenshots

The nova-act-mcp server has been configured with some important defaults:

Headless Mode by Default: Browsers now start in headless mode (invisible) by default, which improves performance and resource usage. This is ideal for production environments and automated workflows.
Screenshot Embedding Disabled: Screenshot embedding has been disabled to prevent excessive token usage in the context window. Large base64-encoded screenshots were causing token explosions, making conversations difficult to maintain.

You can still:

Explicitly set "headless": false in your requests if you want to observe the browser's actions
Access Nova Act's HTML logs which are referenced in the response (the path is provided)

Example:

# Start a headless browser (default)
control_browser {"action": "start", "url": "https://www.google.com"}

# Start a visible browser by overriding the default
control_browser {"action": "start", "url": "https://www.google.com", "headless": false}

Secure Login Handling

Nova Act MCP provides a secure way to handle login credentials that avoids exposing sensitive information in instructions:

// SECURE: Using dedicated username/password parameters
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "username": "tomsmith", 
  "password": "SuperSecretPassword!"
}

When you provide the username and/or password parameters:

The MCP server directly uses Playwright to find and fill credential fields
Credentials are never exposed in the instruction text or agent thinking
If a login form is detected, it will automatically try to submit the form after filling credentials
If just username is provided without a password, it will still attempt form submission (useful for testing invalid login scenarios)

This approach is significantly more secure than including credentials in the instruction text:

// INSECURE: Don't do this!
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "Type username 'tomsmith' and password 'SuperSecretPassword!' into the login form"
}

How Login Form Detection Works

The tool uses these strategies to find and fill login forms:

First tries explicit selectors: input#username, input[name='username'], input[type='text'], input[name*='user'] for username fields
Similarly uses input#password, input[name='password'], input[type='password'], input[name*='pass'] for password fields
If those fail, it falls back to using Nova Act to "focus the username/password field" and then uses keyboard typing
After filling fields, if no further instruction is provided, it automatically clicks the Login button

Example: Complete Login Flow

// Start a session on a login page
control_browser {"action": "start", "url": "https://the-internet.herokuapp.com/login"}

// Provide credentials securely (will auto-submit form)
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "username": "tomsmith", 
  "password": "SuperSecretPassword!"
}

// After login, perform actions on the authenticated page
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "Check if we're logged in successfully"
}

This approach keeps credentials out of instruction text while providing a seamless login experience.

Browser Session State Persistence

One of the key features of nova-act-mcp is its ability to maintain session state across multiple navigation actions. This makes it ideal for complex workflows that span multiple pages on a website.

Our testing has verified that the browser automation tool properly maintains:

Authentication State: Login sessions persist across page navigations
Cookies & Local Storage: Site preferences and settings are preserved
Navigation History: The ability to navigate between pages while maintaining state

This has been verified with workflows that:

Log in to a secure area
Navigate to different pages
Interact with elements on those pages
Return to the secure area with authentication still intact

For optimal session persistence:

// Start a session
control_browser {"action": "start", "url": "https://example.com/login"}

// Login securely
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "username": "your-username", 
  "password": "your-password"
}

// Navigate to another page while maintaining session cookies
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "Navigate to https://example.com/account"
}

// Continue working with the authenticated session
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "Check my account balance"
}

Important Note: While authentication state (cookies, local storage) persists across page navigations, each execute command launches a fresh browser context on that same profile. This means that in-page state changes (form fields, JavaScript variables) do not persist between separate execute commands. For multi-step interactions on a single page, include all steps in a single instruction.

Example Capabilities

Nova Act MCP can handle a variety of browser automation tasks:

Basic Navigation & Form Handling: Navigate websites, fill forms, and handle login flows securely
Dynamic Content: Wait for and interact with content that loads asynchronously
Structured Data Extraction: Extract table data and other structured content using JSON schemas
Complex Interactions: Handle alerts, dropdowns, and other interactive elements

For best results, provide clear, specific instructions and follow the navigation best practices outlined above.

Structured Data Extraction

One of the most powerful capabilities of Nova Act MCP is its ability to extract structured data from web pages using JSON schemas:

// Example schema for extracting table data
{
  "type": "object",
  "properties": {
    "tableData": {
      "type": "array",
      "items": {
        "type": "object",
        "properties": {
          "name": {"type": "string"},
          "price": {"type": "string"},
          "description": {"type": "string"}
        }
      }
    }
  }
}

To use this feature:

// Extract structured data using a schema
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "schema": {your_schema_here}
}

The agent will analyze the page content and extract data that matches your schema structure. This is particularly useful for:

Extracting product information from e-commerce sites
Retrieving data from tables and lists
Gathering structured information from search results
Collecting consistent data across multiple pages

Note on Data Size: The default response size limit is 5000 characters. If you're extracting large datasets that might exceed this limit, consider:

Breaking your extraction into smaller chunks
Extracting only the most relevant columns
Adding pagination to retrieve data in batches

Form Element Interactions

Nova Act MCP provides specific capabilities for interacting with common form elements. Based on our testing, here are the recommended ways to interact with different form elements:

Dropdown Menus

For reliable dropdown interactions, use explicit phrasing:

// Recommended approach for dropdown interaction
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "select Option 2 from the dropdown"
}

Our testing shows that these phrasings work consistently:

"select [option name] from dropdown"
"choose [option name] from the dropdown menu"

To verify which option is currently selected, use a schema:

// Check current dropdown selection
control_browser {
  "action": "execute", 
  "session_id": "your-session-id",
  "schema": {
    "type": "object", 
    "properties": {
      "selectedOption": {"type": "string"}
    }
  }
}

Less reliable approaches to avoid:

❌ "Click on the dropdown" (too vague, may not complete the selection)
❌ "Open the dropdown menu" (only opens but doesn't select)

Checkboxes

For checkboxes, use precise instructions:

// Toggle a checkbox
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "check the checkbox labeled 'I agree to terms'"
}

// Or uncheck
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "uncheck the checkbox labeled 'Subscribe to newsletter'"
}

Radio Buttons

For radio button selection:

// Select a radio button
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "select the radio button labeled 'Option B'"
}

Input Fields

For text input fields:

// Type into an input field
control_browser {
  "action": "execute", 
  "session_id": "your-session-id", 
  "instruction": "type 'example@email.com' into the email field"
}

Performance Considerations

When working with browser automation at scale, keep these performance considerations in mind:

Resource Management

Concurrent Sessions: Each browser session consumes significant memory. For large-scale automation, consider implementing session pooling and limiting concurrent sessions.
Memory Usage: Chrome instances can use 300MB-1GB of RAM each. Monitor memory usage when running multiple sessions.
CPU Utilization: Browser automation is CPU-intensive, especially when parsing complex pages. Schedule tasks to avoid CPU saturation.

Optimization Strategies

Headless Mode: For production environments, consider explicitly setting headless mode to true ("headless": true) for better resource efficiency. While the default is now non-headless for better debugging, headless browsers use fewer resources.
Request Throttling: Implement rate limiting to avoid triggering anti-bot measures on websites.
Caching: Cache responses when appropriate to reduce redundant requests.
Efficient Selectors: Use precise selectors rather than broad instructions to improve performance.

Best Practices for Large-Scale Automation

Session Reuse: Where appropriate, reuse browser sessions instead of creating new ones.
Cleanup: Always end sessions properly when done to free up system resources.
Explicit Session Ending: Always call the end action when you're done with a session to prevent resource leaks.
Session Monitoring: Periodically check for active sessions using list_browser_sessions to ensure proper cleanup.
Batching: Group related tasks into batches to minimize the overhead of session creation.
Error Handling: Implement robust error handling with automatic retries for transient failures.

Testing

Setting Up Your Test Environment

Before running tests, ensure you have all the required testing dependencies installed:

# Install development dependencies including testing packages
uv sync --extra dev

# Verify test dependencies are installed
uv pip list | grep pytest

This will install pytest and all related testing packages defined in the [project.optional-dependencies] section of your pyproject.toml.

Running Tests

You can test the nova-act-mcp server in several ways:

1. Running Specific Test Files with Detailed Output

For detailed test output with verbose logging, use the -v flag. To see print statements in your tests, add the -s flag:

# Run a specific test file with verbose output and print statements
python -m pytest -v -s tests/test_nova_mcp_integration.py

# Run all tests with verbose output
python -m pytest -v


The `-s` flag is particularly useful for debugging as it allows print statements in your tests to be displayed in the console.

#### 2. Using the MCP Inspector Tool

You can test the nova-act-mcp server using the MCP Inspector tool:

```bash
# Install MCP Inspector if you haven't already
npm install -g @modelcontextprotocol/inspector

# Run the server with the MCP Inspector
NOVA_ACT_API_KEY="your_api_key_here" npx @modelcontextprotocol/inspector uv --directory /path/to/nova-act-mcp run nova_mcp.py

Then use the following input format to test a simple browser action:

{
  "action": "start",
  "url": "https://www.google.com"
}

Followed by:

{
  "action": "execute",
  "session_id": "your_session_id_from_previous_response",
  "instruction": "search for tea pots"
}

This lets you verify that the server is working correctly and see exactly what the agent is thinking during each step.

For manual testing, we provide a test script that works with The Internet Herokuapp (a testing website):

# Run the test script to generate a set of test commands
./testing/test_form_interaction.sh

# Follow the instructions to test interactive browser automation

Troubleshooting

Nova Act API Key Issues

If you see an error about the Nova Act API key:

Verify your API key is valid at https://nova.amazon.com/act
Check that the key is correctly set in your Claude Desktop configuration
Try setting it as an environment variable: export NOVA_ACT_API_KEY="your_key_here"

Browser Automation Problems

If the browser is not behaving as expected:

Check the agent_thinking field in the response to see how the agent is interpreting your instructions
Make your instructions more specific and actionable
For tasks involving forms or logins, be explicit about field names

Lingering Browser Windows

If you notice browser windows staying open after you're done using the tool:

Make sure you always call the end action when you're finished with a session
Use list_browser_sessions to check for any active sessions that need to be closed
Restart the MCP server if needed to force cleanup of all browser sessions
Check your system's task manager for any lingering browser processes

Viewing Browser Activity

Since browsers now run in visible mode by default:

You can directly observe the automation actions in real-time
If you need to hide the browser (for example, in CI/CD environments), explicitly set "headless": true
For debugging, the response includes the path to Nova Act's HTML log file, which shows detailed execution steps

Debug Mode

To enable more detailed debugging information:

export NOVA_MCP_DEBUG=1

This will include additional diagnostic information in the responses.

Common Issues

"nova-act is installed but Python can't import it"

This happens because uv --directory ... run nova_mcp.py creates a fresh, isolated virtual environment each time it runs. That environment only contains the packages in the current directory (your nova-act-mcp installation), without access to globally installed packages.

Solution 1: For local development (with a cloned repository)

Install the "nova-act" extra into the same uv environment before running:

# From your nova-act-mcp repository directory
cd /path/to/nova-act-mcp
uv pip install -e .[nova-act]   # Pulls nova-act + playwright into the .uv-env folder

Your MCP client configuration can remain unchanged:

"nova-browser": {
  "command": "uv",
  "args": [
    "--directory",
    "/path/to/nova-act-mcp",
    "run",
    "nova_mcp.py"
  ],
  "env": { "NOVA_ACT_API_KEY": "your_api_key_here" }
}

Once the dependencies are cached in the .uv-env folder, every subsequent uv run will find nova_act in the Python path.

Solution 2: Zero-install approach (with uvx)

For production environments or other machines, use uvx to automatically handle all dependencies:

# settings.toml or claude_desktop_config.json
[servers.nova-browser]
command   = "uvx"
args      = ["nova-act-mcp[nova-act]@0.2.0"]  # The [nova-act] extra contains browser automation deps
transport = "stdio"

[servers.nova-browser.env]
NOVA_ACT_API_KEY = "${NOVA_ACT_API_KEY}"
HEADLESS         = "true"  # Optional: run browser in headless mode

With this approach, uvx will:

Download nova-act-mcp version 0.2.0
Detect the [nova-act] extra and automatically install nova-act and playwright
Launch the nova-act-mcp console script in an isolated environment

This requires no pre-installation or path configuration.

Clearing the Cache

If you're experiencing persistent issues, try clearing the uv cache:

uv cache clear  # Wipes previous temporary environments

Verify Installation

To verify that nova-act is correctly installed in your uv environment:

uv --directory /path/to/nova-act-mcp run -c "import nova_act; print(f'nova-act version: {nova_act.__version__}')"

If this prints the version without any import errors, your setup is correct.

Contributing

We welcome contributions to improve nova-act-mcp! Here's how you can help:

Getting Started with Development

Fork the repository
Create a feature branch: git checkout -b feature-name
Make your changes and add tests if applicable
Run tests to ensure your changes work correctly
Commit your changes: git commit -m 'Add some feature'
Push to the branch: git push origin feature-name
Submit a pull request

Development Guidelines

Follow the existing code style and naming conventions
Write clear, commented code with meaningful variable/function names
Include documentation for any new features
Add tests for new functionality
Update the README if necessary

Reporting Issues

If you find a bug or have a feature request, please create an issue on the repository. Include:

A clear description of the issue or feature request
Steps to reproduce (for bugs)
Expected behavior
Actual behavior
Environment information (OS, Python version, etc.)

Code Reviews

All submissions require review before being merged. We aim to review pull requests promptly and provide constructive feedback.

License

This project is licensed under the MIT License - see the LICENSE file for details.

Acknowledgements

Amazon Nova Act for providing the browser automation SDK
Model Context Protocol (MCP) for the AI agent communication standard

Feedback for Amazon Nova Act

While we've implemented agent thinking extraction, we note that this information should ideally be directly accessible through the Nova Act API. The current approach requires parsing HTML trace files, which is less efficient than having a dedicated API method. We welcome any future improvements to the Nova Act SDK that would make this process more straightforward.

Limitations

No file uploads: The browser automation can't upload files from your local system
Limited to web interactions: Can only interact with elements visible on the webpage
Hover interactions not supported: The current implementation cannot reliably perform hover-based interactions or extract content that appears on hover
JavaScript alerts not reliably handled: While alert dialogs visually appear, the agent cannot properly control or interact with them
Some sites may block automation: Sites with strong anti-bot measures may present challenges
Session timeouts: Long-running sessions may be terminated by websites

Development

Managing Dependencies

This project uses uv for dependency management, which provides faster and more reliable Python package management.

To update or add dependencies:

Edit the pyproject.toml file to add or modify dependencies in the appropriate sections:

dependencies = [
    "new-package>=1.0.0",  # Add your new dependency here
]

Use uv sync to update the lock file:
```
uv sync
```
Commit both the modified pyproject.toml and the updated uv.lock file to the repository.

This ensures that all developers and users have consistent dependency versions.

Local Development Setup

Setting up a local development environment:

Create and activate a virtual environment with uv:

# From repo root
uv venv .venv

# On macOS/Linux
source .venv/bin/activate

# On Windows
.venv\Scripts\activate

Install the package in development mode with dev dependencies:
```
uv pip install -e .[dev]
```
Verify the installation:
```
python -m nova_mcp --version
```
Run tests to verify everything works:
```
pytest -q
```
For UVX testing (even from within the venv):
```
uvx nova-act-mcp@0.2.0 --version
```

Dependency Locking

To ensure reproducible environments:

Generate a lockfile (once):
```
uv pip compile -o uv.lock
```
Commit the lockfile to your repository.
On new machines or in CI, use:
```
uv pip sync -r uv.lock
```

This approach prevents dependency issues and ensures consistent behavior across different environments.

Future Enhancements

Multi-Step Session State: Support true in-page session persistence across multiple execute calls so that form inputs and DOM changes remain live between commands.
Improved Browser Visibility: Default to visible browsers for better debugging and observation.
Reduced Token Usage: Disabled screenshot embedding to prevent token explosion in LLM context windows.
Improve Profile Persistence: Enhance handling of profiles for even more reliable cookie and storage reuse.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

madtank10

These details have not been verified by PyPI

Release history Release notifications | RSS feed

3.2.2

Jun 5, 2025

3.2.1

Jun 5, 2025

3.2.0

Jun 5, 2025

3.1.1

May 17, 2025

3.1.0

May 17, 2025

3.0.1.dev3 pre-release

May 17, 2025

3.0.0

May 8, 2025

0.2.9

May 8, 2025

0.2.8

May 5, 2025

0.2.7

May 5, 2025

0.2.6

May 4, 2025

0.2.5

May 4, 2025

0.2.4

May 1, 2025

0.2.3

May 1, 2025

0.2.2

May 1, 2025

This version

0.2.1

Apr 30, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nova_act_mcp_server-0.2.1.tar.gz (58.0 kB view details)

Uploaded Apr 30, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

nova_act_mcp_server-0.2.1-py3-none-any.whl (29.8 kB view details)

Uploaded Apr 30, 2025 Python 3

File details

Details for the file nova_act_mcp_server-0.2.1.tar.gz.

File metadata

Download URL: nova_act_mcp_server-0.2.1.tar.gz
Upload date: Apr 30, 2025
Size: 58.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.7.2

File hashes

Hashes for nova_act_mcp_server-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`15eb875edde58688b3d76ee047b9eadf385538a21c3d02960b91c6bf98e30646`
MD5	`64dd6eae0a28544e27ac16fb000c9a10`
BLAKE2b-256	`befce4a63f839f87add0082baa64d51f26bf977036c6215611db117d0a1ad0b8`

See more details on using hashes here.

File details

Details for the file nova_act_mcp_server-0.2.1-py3-none-any.whl.

File metadata

Download URL: nova_act_mcp_server-0.2.1-py3-none-any.whl
Upload date: Apr 30, 2025
Size: 29.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.7.2

File hashes

Hashes for nova_act_mcp_server-0.2.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`17c5525d8e9ea768ed24e1784a6139669a602a4c76567ab3911ae68d0a6b7ea6`
MD5	`6b3ff25c2e60426e9d0410989c4887aa`
BLAKE2b-256	`0ac76cd300dfcd8e307a7953ad397bd0395d946de76646ffcc889ef4f3a4c163`

See more details on using hashes here.

nova-act-mcp-server 0.2.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Project description

nova-act-mcp

Table of Contents

What is nova-act-mcp?

Prerequisites

Installation

Quick Start with uvx

Getting Started

1. Obtain a Nova Act API Key

2. Configure Your MCP Client (Example: Claude Desktop)

3. Using with Your MCP Client (Example: Claude)

Understanding Agent Thinking

Tips for Effective Browser Automation

Navigation Limitations and Best Practices

Example: Correct Workflow

Example: Problematic Workflow

Be Specific and Concise

Example of Good Instructions

Handling Invalid URLs and Navigation Errors

Advanced Features

Persistent Browser Profiles

Browser Session Management and Persistence

Important: Browser Persistence

Browser Visibility and Screenshots

Secure Login Handling

How Login Form Detection Works

Example: Complete Login Flow

Browser Session State Persistence

Example Capabilities

Structured Data Extraction

Form Element Interactions

Dropdown Menus

Checkboxes

Radio Buttons

Input Fields

Performance Considerations

Resource Management

Optimization Strategies

Best Practices for Large-Scale Automation

Testing

Setting Up Your Test Environment

Running Tests

1. Running Specific Test Files with Detailed Output

Troubleshooting

Nova Act API Key Issues

Browser Automation Problems

Lingering Browser Windows

Viewing Browser Activity

Debug Mode

Common Issues

"nova-act is installed but Python can't import it"

Clearing the Cache

Verify Installation

Contributing

Getting Started with Development

Development Guidelines

Reporting Issues

Code Reviews

License

Acknowledgements

Feedback for Amazon Nova Act

Limitations

Development

Managing Dependencies

Local Development Setup

Dependency Locking

Future Enhancements

Project details

Verified details

Project links

GitHub Statistics

Maintainers