Skip to main content

Automate anything with a vision agent.

Project description

🤖 AskUI Vision Agent

🔧 Setup

1. Install AskUI Agent OS

Windows
AMD64

AskUI Installer for AMD64

ARM64

AskUI Installer for ARM64

Linux

⚠️ Warning: Agent OS currently does not work on Wayland. Switch to XOrg to use it.

AMD64
curl -o /tmp/AskUI-Suite-24.9.1-User-Installer-Linux-x64-Full.run https://files.askui.com/releases/Installer/24.9.1/AskUI-Suite-24.9.1-User-Installer-Linux-x64-Full.run
bash /tmp/AskUI-Suite-24.9.1-User-Installer-Linux-x64-Full.run
ARM64
curl -o /tmp/AskUI-Suite-24.9.1-User-Installer-Linux-ARM64-Full.run https://files.askui.com/releases/Installer/24.9.1/AskUI-Suite-24.9.1-User-Installer-Linux-ARM64-Full.run
bash /tmp/AskUI-Suite-24.9.1-User-Installer-Linux-ARM64-Full.run
MacOS
curl -o /tmp/AskUI-Suite-24.9.1-User-Installer-MacOS-ARM64-Full.run https://files.askui.com/releases/Installer/24.9.1/AskUI-Suite-24.9.1-User-Installer-MacOS-ARM64-Full.run
bash /tmp/AskUI-Suite-24.9.1-User-Installer-MacOS-ARM64-Full.run

2. Install vision-agent in your Python environment

pip install askui

3. Authenticate with Anthropic

Set the ANTHROPIC_API_KEY environment variable to access the Claude computer use model. (Create a Anthropic key here)

Linux & MacOS

Use export to set an evironment variable:

export ANTHROPIC_API_KEY=<your-api-key-here>
Windows PowerShell

Set an environment variable with $env:

$env:ANTHROPIC_API_KEY="<your-api-key-here>"

▶️ Start Building

from askui import VisionAgent

# Initialize your agent context manager
with VisionAgent() as agent:
    # Use the webbrowser tool to start browsing
    agent.webbrowser.open_new("http://www.google.com")

    # Start to automate individual steps
    agent.click("url bar")
    agent.type("http://www.google.com")
    agent.keyboard("enter")

    # Extract information from the screen
    datetime = agent.get("What is the datetime at the top of the screen?")
    print(datetime)

    # Or let the agent work on its own
    agent.act("search for a flight from Berlin to Paris in January")

📜 Logging

You want a better understanding of what you agent is doing? Set the log_level to DEBUG.

import logging

with VisionAgent(log_level=logging.DEBUG) as agent:
    agent...

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

askui-0.1.1.tar.gz (50.2 kB view details)

Uploaded Source

Built Distribution

askui-0.1.1-py3-none-any.whl (31.2 kB view details)

Uploaded Python 3

File details

Details for the file askui-0.1.1.tar.gz.

File metadata

  • Download URL: askui-0.1.1.tar.gz
  • Upload date:
  • Size: 50.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: pdm/2.20.0.post1 CPython/3.10.12 Linux/6.5.0-1025-azure

File hashes

Hashes for askui-0.1.1.tar.gz
Algorithm Hash digest
SHA256 d38324316b974e5cd812b3971ff5a6c770a926d8022862b092f2ef92ad7d87aa
MD5 61b67de5d4e42277e281d38aa48e94a3
BLAKE2b-256 a3ecc11afc0a04edd8969b7744612704ab866fabdfa2dce949c5b33eb395066b

See more details on using hashes here.

File details

Details for the file askui-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: askui-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 31.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: pdm/2.20.0.post1 CPython/3.10.12 Linux/6.5.0-1025-azure

File hashes

Hashes for askui-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 695ea03bf07f492a759bb1bff1fd12b913119ad2de7e91e4e0dd6ab6094889df
MD5 338cfb682ad5b87db62614c76d0bb828
BLAKE2b-256 b3d88ac3016cb57e33d8e94a106e8554d01f83415fdee571a08b4333dcef9404

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page