A desktop for AI agents
Project description
AgentDesk
Desktops for AI agents :computer:
Built on agentd to make desktop VMs accessible to AI agents.
Implements the ToolsV1 protocol
Installation
pip install agentdesk
Quick Start
from agentdesk import Desktop
# Create a local VM
desktop = Desktop.local()
# Launch the UI for it
desktop.view(background=True)
# Open a browser to Google
desktop.open_url("https://google.com")
# Take actions on the desktop
desktop.move_mouse(500, 500)
desktop.click()
img = desktop.take_screenshot()
Usage
Create a local desktop
from agentdesk import Desktop
desktop = Desktop.local()
$ agentdesk create --provider qemu
*requires qemu
Create a remote desktop on GCE
desktop = Desktop.gce()
$ agentdesk create --provider gce
Create a remote desktop on EC2
desktop = Desktop.ec2()
$ agentdesk create --provider ec2
View the desktop in the UI
desktop.view()
$ agentdesk view old_mckinny
*requires docker
List desktops
Desktop.list()
$ agentdesk get
Delete a desktop
Desktop.delete("old_mckinny")
$ agentdesk delete old_mckinny
Use the desktop
desktop.open_url("https://google.com")
coords = desktop.mouse_coordinates()
desktop.move_mouse(500, 500)
desktop.click()
desktop.type_text("What kind of ducks are in Canada?")
desktop.press_key('Enter')
desktop.scroll()
img = desktop.take_screenshot()
Processors
Process images to make them more accessible to LMMs.
Grid
Add a coordinate grid on top of the image
from agentdesk.processors import GridProcessor
img = desktop.take_screenshot()
processor = GridProcessor()
grid_img = processor.process_b64(img)
Examples
GPT-4V
See how to use GPT-4V with AgentDesk in our notebook or agent
Developing
Please open an issue before creating a PR.
Changes to the VM happen in agentd
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
agentdesk-0.2.5.tar.gz
(30.1 kB
view hashes)
Built Distribution
agentdesk-0.2.5-py3-none-any.whl
(36.7 kB
view hashes)
Close
Hashes for agentdesk-0.2.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 58b9de3644bdef7bf016be147e308d5d9760d729e7ec143d42807c3a56ed76ca |
|
MD5 | 7d7b00d2d16bf32523f26b5a1373adfe |
|
BLAKE2b-256 | d92f5cd6b23b1b355ac6f15d2583c0852e07919f32030d0ecc40cde875b0e9a0 |