Skip to main content

GPT

Project description


GPT Computer Assistant - Create intelligence for your products | Product Hunt . Static Badge . Static Badge


Dockerized Computer Use Agents with Production Ready API’s - Supports MCP


Made_with_python . pypi_downloads

|ENGLISH|简体中文|正體中文|TÜRKÇE






GPT Computer Assistant(GCA)

GCA is an innovative AI agent designed to seamlessly control systems across Windows, macOS, and Ubuntu. With its advanced capabilities, GCA enables you to delegate repetitive, logic-based tasks to an AI, drastically improving efficiency and productivity. Whether you’re a developer, analyst, or IT professional, GCA can empower you to accomplish more in less time.

Imagine this:

Task Human Time GCA Can GCA Time Optimization
Extract the tech stacks of xxx Company Sales Development Representer Human: 30 Minute Yes GCA: 5 Minute %80 Faster
Identify Relevant tables for Analysis for xxx Data Analytics Human: 20 Minute Yes GCA: 2 Minute %80 Faster
Check the logs to find core cause of this incident Technical Support Engineer Human: 20 Minute Yes GCA: 3 Minute %80 Faster
Making CloudFlare Security Settings Security Specialist Human: 1 Day Yes GCA: 10 Minute %80 Faster

These examples illustrate how GCA is revolutionizing the concept of Vertical AI Agents—tailored solutions that not only replicate human tasks but perform them at unparalleled speed and precision.



How GCA Works?

GCA is a Python-based project that runs on multiple operating systems, including Windows, macOS, and Ubuntu. It integrates external concepts, like the Model Context Protocol (MCP), along with its own modules, to interact with and control a computer efficiently. The system performs both routine and advanced tasks by mimicking human-like actions and applying computational precision.

Core Functionalities:

1. Human-like Actions:

GCA can replicate common user actions, such as:

  • Clicking: Interact with buttons or other UI elements.
  • Reading: Recognize and interpret text on the screen.
  • Scrolling: Navigate through documents or web pages.
  • Typing: Enter text into forms or other input fields.

2. Advanced Capabilities:

Through MCP and GCA’s own modules, it achieves tasks that go beyond standard human interaction, such as:

  • Updating dependencies of a project in seconds.
  • Analyzing entire database tables to locate specific data almost instantly.
  • Automating cloud security configurations with minimal input.






Prequisites

  • Python 3.10



Using GCA.dev Cloud

Installation

pip install gpt-computer-assistant

Single Instance:

from gpt_computer_assistant import cloud

# Starting instance
instance = cloud.instance()


# Asking and getting result
result = instance.request("Extract the tech stacks of gpt-computer-assitant Company", "i want a list")
print(result)


instance.close()






Self-Hosted GCA Server

Local

Installation

pip install 'gpt-computer-assistant[base]'
pip install 'gpt-computer-assistant[api]'

LLM Settings

from gpt_computer_assistant import local

# Starting instance
instance = local.instance()

# Connecting to OpenAI
instance.client.save_models("gpt-4o")
instance.client.save_openai_api_key("sk-**")

# Asking and getting result
result = instance.request("Extract the tech stacks of gpt-computer-assitant Company", "i want a list")
print(result)

instance.close()




Roadmap

Feature Status Target Release
Clear Chat History Completed Q2 2024
Long Audios Support (Split 20mb) Completed Q2 2024
Text Inputs Completed Q2 2024
Just Text Mode (Mute Speech) Completed Q2 2024
Added profiles (Different Chats) Completed Q2 2024
More Feedback About Assistant Status Completed Q2 2024
Local Model Vision and Text (With Ollama, and vision models) Completed Q2 2024
Our Customizable Agent Infrastructure Completed Q2 2024
Supporting Groq Models Completed Q2 2024
Adding Custom Tools Completed Q2 2024
Click on something on the screen (text and icon) Completed Q2 2024
New UI Completed Q2 2024
Native Applications, exe, dmg Completed Q3 2024
Collaborated Speaking Different Voice Models on long responses. Completed Q2 2024
Auto Stop Recording, when you complate talking Completed Q2 2024
Wakeup Word Completed Q2 2024
Continuously Conversations Completed Q2 2024
Adding more capability on device Completed Q2 2024
Local TTS Completed Q3 2024
Local STT Completed Q3 2024
Tray Menu Completed Q3 2024
New Line (Shift + Enter) Completed Q4 2024
Copy Pasting Text Compatibility Completed Q4 2024
Global Hotkey On the way Q3 2024
DeepFace Integration (Facial Recognition) Planned Q3 2024

Capabilities

At this time we have many infrastructure elements. We just aim to provide whole things that already in ChatGPT app.

Capability Status
Local LLM with Vision (Ollama) OK
Local text-to-speech OK
Local speech-to-text OK
Screen Read OK
Click to and Text or Icon in the screen OK
Move to and Text or Icon in the screen OK
Typing Something OK
Pressing to Any Key OK
Scrolling OK
Microphone OK
System Audio OK
Memory OK
Open and Close App OK
Open a URL OK
Clipboard OK
Search Engines OK
Writing and running Python OK
Writing and running SH OK
Using your Telegram Account OK
Knowledge Management OK
Add more tool ?

Predefined Agents

If you enable it your assistant will work with these teams:

Team Name Status
search_on_internet_and_report_team OK
generate_code_with_aim_team_ OK
Add your own one ?
Logo

Contributors

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpt_computer_assistant-0.23.19.tar.gz (2.0 MB view details)

Uploaded Source

File details

Details for the file gpt_computer_assistant-0.23.19.tar.gz.

File metadata

File hashes

Hashes for gpt_computer_assistant-0.23.19.tar.gz
Algorithm Hash digest
SHA256 66c067e88983baf40c6a3ab21acf69c096ae2aad0df720f09ab815dde1f36e24
MD5 775e9b0eb18d0d1a833605c6d7ed5edb
BLAKE2b-256 c7ad8691c3a6d6a6c23191616d510582a00c51537dbfeef10b6a52e4a18d8870

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page