
The package provides a desktop environment for setting up and evaluating desktop automation tasks.


DesktopEnv: An Environment towards Human-like Computer Task Mastery

SLOGAN

Website | Paper

Overview

Updates

  • 2024-03-01:

Install

  1. Install VMware and configure the vmrun command: please refer to the guidance.

  2. Install the environment package, then download the examples and the virtual machine image:

# install the environment package
pip install desktop_env
# create a directory for the downloaded files
mkdir -p ~/.desktop_env
# download the examples and the virtual machine image
wget xxxx
wget xxxx
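
As a quick sanity check (assuming the pip install above succeeded), you can confirm in a Python interpreter that the package and the environment class are importable:

import desktop_env
from desktop_env.envs.desktop_env import DesktopEnv
print("desktop_env imported:", desktop_env.__name__)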

Quick Start

Run the following minimal example to interact with the environment:

import json
from desktop_env.envs.desktop_env import DesktopEnv

# load a task configuration (here, a GIMP example)
with open("evaluation_examples/examples/gimp/f723c744-e62c-4ae6-98d1-750d3cd7d79d.json", "r", encoding="utf-8") as f:
    example = json.load(f)

# create the environment, pointing it at your local virtual machine
env = DesktopEnv(
    path_to_vm=r"path_to_vm",
    action_space="computer_13",
    task_config=example
)
# reset the environment and get the initial observation
observation = env.reset()

# execute a single right-click action in the computer_13 action space
observation, reward, done, info = env.step({"action_type": "CLICK", "parameters": {"button": "right", "num_clicks": 1}})
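
After the single step above, a typical session keeps stepping until the episode ends. The sketch below continues from the env created in the Quick Start; it assumes the Gym convention that done marks the end of an episode and that the environment exposes a close() method (treat close() as an assumption if your version differs).

# continue interacting with the env created above
actions = [
    {"action_type": "CLICK", "parameters": {"button": "right", "num_clicks": 1}},
    # ... add further actions in the same computer_13 format
]
for action in actions:
    observation, reward, done, info = env.step(action)
    if done:
        break

env.close()  # assumed Gym-style cleanup of the VM session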

Annotation Tool Usage

We provide an annotation tool to help you annotate the examples.

Agent Usage

We provide a simple agent to interact with the environment. You can use it as a starting point to build your own agent.
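
The agent interface itself is not documented here, so the following is only a sketch of what such a starting point might look like: a hypothetical RandomClickAgent that maps observations to actions in the computer_13 format from the Quick Start. The class name and its predict() method are illustrative, not the package's API.

import random

class RandomClickAgent:
    """Hypothetical baseline agent: ignores the observation and emits a random click
    in the computer_13 action format shown in the Quick Start."""

    def predict(self, observation):
        return {
            "action_type": "CLICK",
            "parameters": {"button": random.choice(["left", "right"]), "num_clicks": 1},
        }

# wiring it into the environment loop from the Quick Start:
# agent = RandomClickAgent()
# observation = env.reset()
# for _ in range(10):
#     action = agent.predict(observation)
#     observation, reward, done, info = env.step(action)
#     if done:
#         break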

Roadmap of infra (proposed)

  • Explore VMware and whether it can be connected to and controlled through a mouse-control package
  • Explore whether Windows and macOS can be installed
    • macOS is closed source and cannot legally be installed
    • Windows is legally available and can be installed
  • Build a Gym-like Python interface for controlling the VM
  • Record human actions (mouse movement, clicks, keyboard) for annotation, with support for replay and compression
  • Build a simple task, e.g. open a browser, open a website, click a button, and close the browser
  • Set up a pipeline and build a zero-shot agent implementation for the task
  • Decide which tasks inside DesktopEnv to focus on, and start wrapping up the environment for public release
  • Start annotating examples for training and testing
  • Error handling during file passing, file opening, etc.
  • Add the accessibility tree from the OS to the observation space
  • Add pre-process and post-process action support for benchmark setup and evaluation
  • Multiprocess support, so that reinforcement learning can run more efficiently (see the sketch after this list)
  • Experiment logging and visualization system
  • Add more tasks, possibly scaling to 300 for v1.0.0, and create a dynamic leaderboard
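
For the multiprocess item above, one way the proposal could look in practice is sketched below: several DesktopEnv instances, each bound to its own VM copy, stepped in separate worker processes. The VM paths, the run_episode worker function, and the episode logic are assumptions for illustration, not part of the current package.

import json
from multiprocessing import Process
from desktop_env.envs.desktop_env import DesktopEnv

def run_episode(vm_path, example_path):
    # each worker owns one VM copy and one environment instance
    with open(example_path, "r", encoding="utf-8") as f:
        example = json.load(f)
    env = DesktopEnv(path_to_vm=vm_path, action_space="computer_13", task_config=example)
    observation = env.reset()
    observation, reward, done, info = env.step(
        {"action_type": "CLICK", "parameters": {"button": "right", "num_clicks": 1}}
    )

if __name__ == "__main__":
    # hypothetical per-worker VM copies; one process per VM
    vm_paths = ["path_to_vm_copy_1", "path_to_vm_copy_2"]
    example_path = "evaluation_examples/examples/gimp/f723c744-e62c-4ae6-98d1-750d3cd7d79d.json"
    workers = [Process(target=run_episode, args=(p, example_path)) for p in vm_paths]
    for w in workers:
        w.start()
    for w in workers:
        w.join()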

Roadmap of benchmark, tools, and resources (proposed)

  • Improve the annotation tool based on DuckTrack, making it more robust and better aligned with the accessibility tree
  • Annotate the steps of doing the task
  • Build a website for the project
  • Crawl all the resources we explored from the internet and make them easy to access
  • Set up ways for the community to contribute new examples

Citation

If you find this environment useful, please consider citing our work:

@article{DesktopEnv,
  title={},
  author={},
  journal={arXiv preprint arXiv:xxxx.xxxx},
  year={2024}
}
