Skip to main content

Agentsphere Desktop

Project description


title: AgentSphere Desktop description: Discover AgentSphere Desktop, a secure virtual desktop designed for AI agents performing 'Computer Use' tasks. This guide shows you how to programmatically control a complete, isolated desktop environment to launch GUI applications, all via our Python and JavaScript SDKs. # for seo optimization keywords: AgentSphere Desktop, Computer Use, virtual desktop, GUI automation, autonomous agents, stream desktop, mouse control, keyboard control, launch applications, screenshot, secure agent runtime, cloud sandbox, AI agent infra, AgentSphere alternative, build autonomous AI agent # for seo optimization

AgentSphere Desktop - Virtual Computer for Computer Use

AgentSphere Desktop is a secure virtual desktop ready for Computer Use. Each sandbox is isolated from the others and can be customized with any dependencies you need.

AgentSphere Desktop

Get Started

AgentSphere Desktop is built on top of AgentSphere Sandbox.

1. Get AgentSphere API key

  1. Sign up at AgentSphere and get your API key.

  2. Set environment variable AGENTSPHERE_API_KEY with your API key.

2. Install SDK

pip install agentsphere-desktop
npm install agentsphere-desktop

3. Create Your Desktop

from agentsphere_desktop import Sandbox

# Create a new desktop sandbox
desktop = Sandbox()

# Launch an application
desktop.launch('google-chrome')  # or vscode, firefox, etc.

# Wait 10s for the application to open
desktop.wait(10000)

# Stream the application's window
# Note: There can be only one stream at a time
# You need to stop the current stream before streaming another application
desktop.stream.start(
    window_id=desktop.get_current_window_id(), # if not provided the whole desktop will be streamed
    require_auth=True
)

# Get the stream auth key
auth_key = desktop.stream.get_auth_key()

# Print the stream URL
print('Stream URL:', desktop.stream.get_url(auth_key=auth_key))

# Kill the sandbox after the tasks are finished
# desktop.kill()
import { Sandbox } from 'agentsphere-desktop'

// Start a new desktop sandbox
const desktop = await Sandbox.create()

// Launch an application
await desktop.launch('google-chrome') // or vscode, firefox, etc.

// Wait 10s for the application to open
await desktop.wait(10000)

// Stream the application's window
// Note: There can be only one stream at a time
// You need to stop the current stream before streaming another application
await desktop.stream.start({
  windowId: await desktop.getCurrentWindowId(), // if not provided the whole desktop will be streamed
  requireAuth: true,
})

// Get the stream auth key
const authKey = desktop.stream.getAuthKey()

// Print the stream URL
console.log('Stream URL:', desktop.stream.getUrl({ authKey }))

// Kill the sandbox after the tasks are finished
// await desktop.kill()

Work with AgentSphere Desktop

Stream Desktop Screen

from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Start the stream
desktop.stream.start()

# Get stream URL
url = desktop.stream.get_url()
print(url)

# Get stream URL and disable user interaction
url = desktop.stream.get_url(view_only=True)
print(url)

# Stop the stream
desktop.stream.stop()
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

// Start the stream
await desktop.stream.start()

// Get stream URL
const url = desktop.stream.getUrl()
console.log(url)

// Get stream URL and disable user interaction
const url = desktop.stream.getUrl({ viewOnly: true })
console.log(url)

// Stop the stream
await desktop.stream.stop()

Stream with Password Protection

from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Start the stream
desktop.stream.start(
    require_auth=True  # Require authentication with an auto-generated key
)

# Retrieve the authentication key
auth_key = desktop.stream.get_auth_key()

# Get stream URL
url = desktop.stream.get_url(auth_key=auth_key)
print(url)

# Stop the stream
desktop.stream.stop()
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

// Start the stream
await desktop.stream.start({
  requireAuth: true, // Require authentication with an auto-generated key
})

// Retrieve the authentication key
const authKey = await desktop.stream.getAuthKey()

// Get stream URL
const url = desktop.stream.getUrl({ authKey })
console.log(url)

// Stop the stream
await desktop.stream.stop()

Stream Specific Apps

[!WARNING]

  • Will raise an error if the desired application is not open yet
  • The stream will close once the application closes
  • Creating multiple streams at the same time is not supported, you may have to stop the current stream and start a new one for each application
from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Get current (active) window ID
window_id = desktop.get_current_window_id()

# Get all windows of the application
window_ids = desktop.get_application_windows("Firefox")

# Start the stream
desktop.stream.start(window_id=window_ids[0])

# Stop the stream
desktop.stream.stop()
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

// Get current (active) window ID
const windowId = await desktop.getCurrentWindowId()

// Get all windows of the application
const windowIds = await desktop.getApplicationWindows('Firefox')

// Start the stream
await desktop.stream.start({ windowId: windowIds[0] })

// Stop the stream
await desktop.stream.stop()

Mouse Control

from agentsphere_desktop import Sandbox
desktop = Sandbox()

desktop.double_click()
desktop.left_click()
desktop.left_click(x=100, y=200)
desktop.right_click()
desktop.right_click(x=100, y=200)
desktop.middle_click()
desktop.middle_click(x=100, y=200)
desktop.scroll(10) # Scroll by the amount. Positive for up, negative for down.
desktop.move_mouse(100, 200) # Move to x, y coordinates
desktop.drag((100, 100), (200, 200)) # Drag using the mouse
desktop.mouse_press("left") # Press the mouse button
desktop.mouse_release("left") # Release the mouse button
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

await desktop.doubleClick()
await desktop.leftClick()
await desktop.leftClick(100, 200)
await desktop.rightClick()
await desktop.rightClick(100, 200)
await desktop.middleClick()
await desktop.middleClick(100, 200)
await desktop.scroll(10) // Scroll by the amount. Positive for up, negative for down.
await desktop.moveMouse(100, 200) // Move to x, y coordinates
await desktop.drag([100, 100], [200, 200]) // Drag using the mouse
await desktop.mousePress('left') // Press the mouse button
await desktop.mouseRelease('left') // Release the mouse button

Keyboard Control

from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Write text at the current cursor position with customizable typing speed
desktop.write("Hello, world!")  # Default: chunk_size=25, delay_in_ms=75
desktop.write("Fast typing!", chunk_size=50, delay_in_ms=25)  # Faster typing

# Press keys
desktop.press("enter")
desktop.press("space")
desktop.press("backspace")
desktop.press(["ctrl", "c"]) # Key combination
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

// Write text at the current cursor position with customizable typing speed
await desktop.write('Hello, world!')
await desktop.write('Fast typing!', { chunkSize: 50, delayInMs: 25 }) // Faster typing

// Press keys
await desktop.press('enter')
await desktop.press('space')
await desktop.press('backspace')
await desktop.press(['ctrl', 'c']) // Key combination

Window Control

from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Get current (active) window ID
window_id = desktop.get_current_window_id()

# Get all windows of the application
window_ids = desktop.get_application_windows("Firefox")

# Get window title
title = desktop.get_window_title(window_id)
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

// Get current (active) window ID
const windowId = await desktop.getCurrentWindowId()

// Get all windows of the application
const windowIds = await desktop.getApplicationWindows('Firefox')

// Get window title
const title = await desktop.getWindowTitle(windowId)

Capture a Screenshot

from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Take a screenshot and save it as "screenshot.png" locally
image = desktop.screenshot()
# Save the image to a file
with open("screenshot.png", "wb") as f:
    f.write(image)
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()
const image = await desktop.screenshot()
// Save the image to a file
fs.writeFileSync('screenshot.png', image)

Open a File

from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Open file with default application
desktop.files.write("/home/user/index.js", "console.log('hello')") # First create the file
desktop.open("/home/user/index.js") # Then open it
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

// Open file with default application
await desktop.files.write('/home/user/index.js', "console.log('hello')") // First create the file
await desktop.open('/home/user/index.js') // Then open it

Launch Apps

from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Launch the application
desktop.launch('google-chrome')
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

// Launch the application
await desktop.launch('google-chrome')

Run Bash Commands

from agentsphere_desktop import Sandbox
desktop = Sandbox()

# Run any bash command
out = desktop.commands.run("ls -la /home/user")
print(out)
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()

// Run any bash command
const out = await desktop.commands.run('ls -la /home/user')
console.log(out)

Wait

from agentsphere_desktop import Sandbox
desktop = Sandbox()

desktop.wait(1000) # Wait for 1 second
import { Sandbox } from 'agentsphere-desktop'

const desktop = await Sandbox.create()
await desktop.wait(1000) // Wait for 1 second

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentsphere_desktop-0.1.1.tar.gz (11.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agentsphere_desktop-0.1.1-py3-none-any.whl (10.0 kB view details)

Uploaded Python 3

File details

Details for the file agentsphere_desktop-0.1.1.tar.gz.

File metadata

  • Download URL: agentsphere_desktop-0.1.1.tar.gz
  • Upload date:
  • Size: 11.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.9.6

File hashes

Hashes for agentsphere_desktop-0.1.1.tar.gz
Algorithm Hash digest
SHA256 3c59e544022468dc3f83e0e5d8c1d933ec6f40d231b6df14a1063e6f6e9eacf4
MD5 a56250b7f7563ed94155e455722722e2
BLAKE2b-256 0011274c9a22a4c4671d3ede62eb876c3b7ed2c11d6401ab23f16fceff56e842

See more details on using hashes here.

File details

Details for the file agentsphere_desktop-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for agentsphere_desktop-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 719bab2c82d566824187408d3593e8e66f5d76fea97b8ec3fb7b9b8fc1e40bf2
MD5 a5c391588b2a965e082689ec691670bb
BLAKE2b-256 01fdbf6ee6238fefd3e58f31ee9b1519a45fc4fdb3f1b1e10eec7d224a941c42

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page