Skip to main content

A desktop for AI agents

Project description


AgentDesk

Desktops for AI agents   :computer:
Explore the docs »

View Demo · Report Bug · Request Feature


AgentDesk provides full-featured desktop environments which can be programatically controlled by AI agents. Spin them up locally or in the cloud.

▶ Built on agentd a runtime daemon which exposes a REST API for interacting with the desktop.

▶ Implements the DeviceBay Protocol.

Installation

pip install agentdesk

Quick Start

from agentdesk import Desktop

# Create a desktop container
desktop = Desktop.docker()

# Launch the UI for it
desktop.view(background=True)

# Open a browser to Google
desktop.open_url("https://google.com")

# Take actions on the desktop
desktop.move_mouse(500, 500)
desktop.click()
img = desktop.take_screenshots()

Usage

Create a local desktop

from agentdesk import Desktop

desktop = Desktop.docker()
$ agentdesk create --provider docker

*requires docker

Create a remote desktop on Kubernetes

desktop = Desktop.kube()
$ agentdesk create --provider kube

View the desktop in the UI

desktop.view()
$ agentdesk view old_mckinny

*requires docker

List desktops

Desktop.find()
$ agentdesk get

Delete a desktop

Desktop.delete("old_mckinny")
$ agentdesk delete old_mckinny

Use the desktop

desktop.open_url("https://google.com")

coords = desktop.mouse_coordinates()

desktop.move_mouse(500, 500)

desktop.click()

desktop.type_text("What kind of ducks are in Canada?")

desktop.press_key('Enter')

desktop.scroll()

img = desktop.take_screenshots()

Processors

Process images to make them more accessible to LMMs.

Grid

Add a coordinate grid on top of the image

from agentdesk.processors import GridProcessor

img = desktop.take_screenshots()

processor = GridProcessor()
grid_img = processor.process_b64(img)

Community

Come join us on Discord.

Developing

Please open an issue before creating a PR.

Changes to the VM happen in agentd.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentdesk-0.2.137.tar.gz (53.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agentdesk-0.2.137-py3-none-any.whl (67.3 kB view details)

Uploaded Python 3

File details

Details for the file agentdesk-0.2.137.tar.gz.

File metadata

  • Download URL: agentdesk-0.2.137.tar.gz
  • Upload date:
  • Size: 53.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.10.1 Darwin/24.3.0

File hashes

Hashes for agentdesk-0.2.137.tar.gz
Algorithm Hash digest
SHA256 7c6ca663cfd5299cdd7eec4f5cf652ee99575dba5cf2c719a4645f0e044b7272
MD5 69c017c24a6919d14587a211be1179a9
BLAKE2b-256 9455e72660577f4b065823d1bcd81c888587ea20367d5dd0eafeaba092472c5e

See more details on using hashes here.

File details

Details for the file agentdesk-0.2.137-py3-none-any.whl.

File metadata

  • Download URL: agentdesk-0.2.137-py3-none-any.whl
  • Upload date:
  • Size: 67.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.3 CPython/3.10.1 Darwin/24.3.0

File hashes

Hashes for agentdesk-0.2.137-py3-none-any.whl
Algorithm Hash digest
SHA256 13fca16b2e0c4b71c6105d4fe1eab8c5297dc7eb8f0259d7e32abdd262857807
MD5 859044572b09d372d848ab0e5f7e65cc
BLAKE2b-256 72d0c2769b7f3ba4bfb4c72f522a0067b6e7139f4d186c00d8d38b3b8df7f71d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page