Agentsphere Desktop
Project description
title: AgentSphere Desktop description: Discover AgentSphere Desktop, a secure virtual desktop designed for AI agents performing 'Computer Use' tasks. This guide shows you how to programmatically control a complete, isolated desktop environment to launch GUI applications, all via our Python and JavaScript SDKs. # for seo optimization keywords: AgentSphere Desktop, Computer Use, virtual desktop, GUI automation, autonomous agents, stream desktop, mouse control, keyboard control, launch applications, screenshot, secure agent runtime, cloud sandbox, AI agent infra, AgentSphere alternative, build autonomous AI agent # for seo optimization
AgentSphere Desktop - Virtual Computer for Computer Use
AgentSphere Desktop is a secure virtual desktop ready for Computer Use. Each sandbox is isolated from the others and can be customized with any dependencies you need.
Get Started
AgentSphere Desktop is built on top of AgentSphere Sandbox.
1. Get AgentSphere API key
-
Sign up at AgentSphere and get your API key.
-
Set environment variable
AGENTSPHERE_API_KEYwith your API key.
2. Install SDK
pip install agentsphere-desktop
npm install agentsphere-desktop
3. Create Your Desktop
from agentsphere_desktop import Sandbox
# Create a new desktop sandbox
desktop = Sandbox()
# Launch an application
desktop.launch('google-chrome') # or vscode, firefox, etc.
# Wait 10s for the application to open
desktop.wait(10000)
# Stream the application's window
# Note: There can be only one stream at a time
# You need to stop the current stream before streaming another application
desktop.stream.start(
window_id=desktop.get_current_window_id(), # if not provided the whole desktop will be streamed
require_auth=True
)
# Get the stream auth key
auth_key = desktop.stream.get_auth_key()
# Print the stream URL
print('Stream URL:', desktop.stream.get_url(auth_key=auth_key))
# Kill the sandbox after the tasks are finished
# desktop.kill()
import { Sandbox } from 'agentsphere-desktop'
// Start a new desktop sandbox
const desktop = await Sandbox.create()
// Launch an application
await desktop.launch('google-chrome') // or vscode, firefox, etc.
// Wait 10s for the application to open
await desktop.wait(10000)
// Stream the application's window
// Note: There can be only one stream at a time
// You need to stop the current stream before streaming another application
await desktop.stream.start({
windowId: await desktop.getCurrentWindowId(), // if not provided the whole desktop will be streamed
requireAuth: true,
})
// Get the stream auth key
const authKey = desktop.stream.getAuthKey()
// Print the stream URL
console.log('Stream URL:', desktop.stream.getUrl({ authKey }))
// Kill the sandbox after the tasks are finished
// await desktop.kill()
Work with AgentSphere Desktop
Stream Desktop Screen
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Start the stream
desktop.stream.start()
# Get stream URL
url = desktop.stream.get_url()
print(url)
# Get stream URL and disable user interaction
url = desktop.stream.get_url(view_only=True)
print(url)
# Stop the stream
desktop.stream.stop()
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
// Start the stream
await desktop.stream.start()
// Get stream URL
const url = desktop.stream.getUrl()
console.log(url)
// Get stream URL and disable user interaction
const url = desktop.stream.getUrl({ viewOnly: true })
console.log(url)
// Stop the stream
await desktop.stream.stop()
Stream with Password Protection
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Start the stream
desktop.stream.start(
require_auth=True # Require authentication with an auto-generated key
)
# Retrieve the authentication key
auth_key = desktop.stream.get_auth_key()
# Get stream URL
url = desktop.stream.get_url(auth_key=auth_key)
print(url)
# Stop the stream
desktop.stream.stop()
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
// Start the stream
await desktop.stream.start({
requireAuth: true, // Require authentication with an auto-generated key
})
// Retrieve the authentication key
const authKey = await desktop.stream.getAuthKey()
// Get stream URL
const url = desktop.stream.getUrl({ authKey })
console.log(url)
// Stop the stream
await desktop.stream.stop()
Stream Specific Apps
[!WARNING]
- Will raise an error if the desired application is not open yet
- The stream will close once the application closes
- Creating multiple streams at the same time is not supported, you may have to stop the current stream and start a new one for each application
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Get current (active) window ID
window_id = desktop.get_current_window_id()
# Get all windows of the application
window_ids = desktop.get_application_windows("Firefox")
# Start the stream
desktop.stream.start(window_id=window_ids[0])
# Stop the stream
desktop.stream.stop()
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
// Get current (active) window ID
const windowId = await desktop.getCurrentWindowId()
// Get all windows of the application
const windowIds = await desktop.getApplicationWindows('Firefox')
// Start the stream
await desktop.stream.start({ windowId: windowIds[0] })
// Stop the stream
await desktop.stream.stop()
Mouse Control
from agentsphere_desktop import Sandbox
desktop = Sandbox()
desktop.double_click()
desktop.left_click()
desktop.left_click(x=100, y=200)
desktop.right_click()
desktop.right_click(x=100, y=200)
desktop.middle_click()
desktop.middle_click(x=100, y=200)
desktop.scroll(10) # Scroll by the amount. Positive for up, negative for down.
desktop.move_mouse(100, 200) # Move to x, y coordinates
desktop.drag((100, 100), (200, 200)) # Drag using the mouse
desktop.mouse_press("left") # Press the mouse button
desktop.mouse_release("left") # Release the mouse button
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
await desktop.doubleClick()
await desktop.leftClick()
await desktop.leftClick(100, 200)
await desktop.rightClick()
await desktop.rightClick(100, 200)
await desktop.middleClick()
await desktop.middleClick(100, 200)
await desktop.scroll(10) // Scroll by the amount. Positive for up, negative for down.
await desktop.moveMouse(100, 200) // Move to x, y coordinates
await desktop.drag([100, 100], [200, 200]) // Drag using the mouse
await desktop.mousePress('left') // Press the mouse button
await desktop.mouseRelease('left') // Release the mouse button
Keyboard Control
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Write text at the current cursor position with customizable typing speed
desktop.write("Hello, world!") # Default: chunk_size=25, delay_in_ms=75
desktop.write("Fast typing!", chunk_size=50, delay_in_ms=25) # Faster typing
# Press keys
desktop.press("enter")
desktop.press("space")
desktop.press("backspace")
desktop.press(["ctrl", "c"]) # Key combination
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
// Write text at the current cursor position with customizable typing speed
await desktop.write('Hello, world!')
await desktop.write('Fast typing!', { chunkSize: 50, delayInMs: 25 }) // Faster typing
// Press keys
await desktop.press('enter')
await desktop.press('space')
await desktop.press('backspace')
await desktop.press(['ctrl', 'c']) // Key combination
Window Control
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Get current (active) window ID
window_id = desktop.get_current_window_id()
# Get all windows of the application
window_ids = desktop.get_application_windows("Firefox")
# Get window title
title = desktop.get_window_title(window_id)
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
// Get current (active) window ID
const windowId = await desktop.getCurrentWindowId()
// Get all windows of the application
const windowIds = await desktop.getApplicationWindows('Firefox')
// Get window title
const title = await desktop.getWindowTitle(windowId)
Capture a Screenshot
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Take a screenshot and save it as "screenshot.png" locally
image = desktop.screenshot()
# Save the image to a file
with open("screenshot.png", "wb") as f:
f.write(image)
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
const image = await desktop.screenshot()
// Save the image to a file
fs.writeFileSync('screenshot.png', image)
Open a File
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Open file with default application
desktop.files.write("/home/user/index.js", "console.log('hello')") # First create the file
desktop.open("/home/user/index.js") # Then open it
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
// Open file with default application
await desktop.files.write('/home/user/index.js', "console.log('hello')") // First create the file
await desktop.open('/home/user/index.js') // Then open it
Launch Apps
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Launch the application
desktop.launch('google-chrome')
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
// Launch the application
await desktop.launch('google-chrome')
Run Bash Commands
from agentsphere_desktop import Sandbox
desktop = Sandbox()
# Run any bash command
out = desktop.commands.run("ls -la /home/user")
print(out)
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
// Run any bash command
const out = await desktop.commands.run('ls -la /home/user')
console.log(out)
Wait
from agentsphere_desktop import Sandbox
desktop = Sandbox()
desktop.wait(1000) # Wait for 1 second
import { Sandbox } from 'agentsphere-desktop'
const desktop = await Sandbox.create()
await desktop.wait(1000) // Wait for 1 second
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agentsphere_desktop-0.1.1.tar.gz.
File metadata
- Download URL: agentsphere_desktop-0.1.1.tar.gz
- Upload date:
- Size: 11.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3c59e544022468dc3f83e0e5d8c1d933ec6f40d231b6df14a1063e6f6e9eacf4
|
|
| MD5 |
a56250b7f7563ed94155e455722722e2
|
|
| BLAKE2b-256 |
0011274c9a22a4c4671d3ede62eb876c3b7ed2c11d6401ab23f16fceff56e842
|
File details
Details for the file agentsphere_desktop-0.1.1-py3-none-any.whl.
File metadata
- Download URL: agentsphere_desktop-0.1.1-py3-none-any.whl
- Upload date:
- Size: 10.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
719bab2c82d566824187408d3593e8e66f5d76fea97b8ec3fb7b9b8fc1e40bf2
|
|
| MD5 |
a5c391588b2a965e082689ec691670bb
|
|
| BLAKE2b-256 |
01fdbf6ee6238fefd3e58f31ee9b1519a45fc4fdb3f1b1e10eec7d224a941c42
|