GPT Computer Agent
Hi, this is an alternative project that brings the ChatGPT macOS app experience to Windows and Linux. It is a fresh and stable implementation. For now you can install it as a Python library, and we will prepare a pipeline that provides native install scripts (.exe).
Powered by Upsonic Tiger 🐅, a function hub for LLM agents.
Installation and Run
Requires Python 3.9 or later.
pip3 install 'gpt-computer-agent[base]'
computeragent
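Before installing, it can help to confirm that the interpreter meets the stated minimum. The following is a minimal sketch using only the standard library; the helper name `python_is_supported` is hypothetical and not part of gpt-computer-agent.

```python
import sys

# Minimum version stated in the installation notes above.
MIN_PYTHON = (3, 9)


def python_is_supported(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True when the (major, minor) version meets the minimum."""
    return tuple(version_info[:2]) >= minimum


if __name__ == "__main__":
    found = sys.version.split()[0]
    if python_is_supported():
        print("Python version OK:", found)
    else:
        print("Python 3.9 or later is required; found", found)
```

If the check fails, install a newer Python and run the `pip3 install` command with that interpreter.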
Wake Word | NEW
We have added Pvporcupine integration. To use it, you need to install an additional library:
pip3 install 'gpt-computer-agent[wakeword]'
After that, please enter your Pvporcupine API key and enable the wake word feature.
Agent Infrastructure
This lets you create crewai agents and use them in the gpt-computer-agent GUI and tools.
pip3 install 'gpt-computer-agent[base]'
pip3 install 'gpt-computer-agent[agentic]'
from gpt_computer_agent import Agent, start

manager = Agent(
    role='Project Manager',
    goal='understands project needs and assist coder',
    backstory="""You're a manager at a large company.""",
)

coder = Agent(
    role='Senior Python Coder',
    goal='writing python scripts and copying to clipboard',
    backstory="""You're a python developer at a large company.""",
)

start()
Adding Custom Tools
You can now add custom tools that run in the agentic infrastructure and agent processes.
from gpt_computer_agent import Tool, start

@Tool
def sum_tool(first_number: int, second_number: int) -> int:
    """Useful for when you need to sum two numbers together."""
    return first_number + second_number

start()
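Agent frameworks commonly use a tool's docstring and type hints to describe the tool to the model, which is why the example above carries both. The sketch below illustrates that idea with a hypothetical `register_tool` decorator and `TOOL_REGISTRY` dictionary. It is not gpt-computer-agent's `Tool` implementation, only an illustration of what such a decorator might record.

```python
import inspect

# Hypothetical registry: maps a tool name to the metadata an agent could read.
TOOL_REGISTRY = {}


def register_tool(func):
    """Hypothetical stand-in for a tool decorator (not the library's code)."""
    signature = inspect.signature(func)
    TOOL_REGISTRY[func.__name__] = {
        # The docstring serves as the human-readable tool description.
        "description": inspect.getdoc(func) or "",
        # Type hints tell the agent what arguments the tool expects.
        "parameters": {
            name: getattr(param.annotation, "__name__", str(param.annotation))
            for name, param in signature.parameters.items()
        },
    }
    return func  # the function itself stays callable as usual


@register_tool
def sum_tool(first_number: int, second_number: int) -> int:
    """Useful for when you need to sum two numbers together."""
    return first_number + second_number


if __name__ == "__main__":
    print(TOOL_REGISTRY["sum_tool"])
    print(sum_tool(2, 3))
```

A clear docstring and accurate type hints make it easier for an agent to pick the right tool and pass valid arguments.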
API | NEW
You can now use your GPT Computer Agent remotely while the GUI stays active. There are a few steps:
pip3 install 'gpt-computer-agent[base]'
pip3 install 'gpt-computer-agent[api]'
computeragent --api
from gpt_computer_agent.remote import remote
output = remote.input("Hi, how are you today?", screen=False, talk=False)
print(output)
remote.just_screenshot()
remote.talk("TTS test")
# Other Functionalities
remote.reset_memory()
remote.profile("default")
remote.enable_predefined_agents()
remote.disable_predefined_agents()
remote.enable_online_tools()
remote.disable_online_tools()
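When scripting against the remote API, the server started with `computeragent --api` may not be reachable at the moment a script runs. The sketch below shows one way to retry a call. The helper `call_with_retry` is hypothetical, and it assumes that a failed call raises an exception, which the text above does not confirm.

```python
import time


def call_with_retry(func, attempts=3, delay=1.0):
    """Call func() until it succeeds or the attempts run out.

    Hypothetical helper; assumes failures surface as exceptions.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            return func()
        except Exception as error:
            last_error = error
            if attempt < attempts:
                time.sleep(delay)  # wait before the next try
    raise last_error


if __name__ == "__main__":
    # Stand-in for a remote call: fails twice, then succeeds.
    state = {"calls": 0}

    def flaky():
        state["calls"] += 1
        if state["calls"] < 3:
            raise ConnectionError("server not ready")
        return "ok"

    print(call_with_retry(flaky, attempts=3, delay=0))
```

With the library installed and the API running, the same helper could wrap a call such as `call_with_retry(lambda: remote.input("Hi, how are you today?", screen=False, talk=False))`.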
Roadmap
| Feature | Status | Target Release |
|---|---|---|
| Clear Chat History | Completed | Q2 2024 |
| Long Audio Support (Split at 20 MB) | Completed | Q2 2024 |
| Text Inputs | Completed | Q2 2024 |
| Just Text Mode (Mute Speech) | Completed | Q2 2024 |
| Added profiles (Different Chats) | Completed | Q2 2024 |
| More Feedback About Agent Status | Completed | Q2 2024 |
| Local Model Vision and Text (With Ollama, and vision models) | Completed | Q2 2024 |
| Our Customizable Agent Infrastructure | Completed | Q2 2024 |
| Supporting Groq Models | Completed | Q2 2024 |
| Adding Custom Tools | Completed | Q2 2024 |
| Click on something on the screen (text and icon) | Completed | Q2 2024 |
| New UI | Completed | Q2 2024 |
| Native Applications, exe, dmg | Failed (Agentic Infra libraries not supported for now) | Q2 2024 |
| Collaborative Speaking with Different Voice Models on Long Responses | Completed | Q2 2024 |
| Auto Stop Recording When You Finish Talking | Completed | Q2 2024 |
| Wake Word | Completed | Q2 2024 |
| Continuous Conversations | Completed | Q2 2024 |
| Adding more capability on device | Planned | Q2 2024 |
| DeepFace Integration (Facial Recognition) | Planned | Q2 2024 |
Capabilities
We already have many infrastructure elements in place. Our aim is to provide everything that the ChatGPT app already offers.
| Capability | Status |
|---|---|
| Screen Read | OK |
| Click on a Text or Icon on the Screen | OK |
| Move to a Text or Icon on the Screen | OK |
| Typing Something | OK |
| Pressing Any Key | OK |
| Scrolling | OK |
| Microphone | OK |
| System Audio | OK |
| Memory | OK |
| Open and Close App | OK |
| Open a URL | OK |
| Clipboard | OK |
| Search Engines | OK |
| Writing and running Python | OK |
| Writing and running SH | OK |
| Using your Telegram Account | OK |
| Knowledge Management | OK |
| Add more tools | ? |
Predefined Agents
If you enable predefined agents, your agent will work with these teams:
| Team Name | Status |
|---|---|
| search_on_internet_and_report_team | OK |
| generate_code_with_aim_team_ | OK |
| Add your own one | ? |
Source Distribution
File details
Details for the file gpt_computer_agent-1.0.2.tar.gz.
File metadata
- Download URL: gpt_computer_agent-1.0.2.tar.gz
- Upload date:
- Size: 1.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.8.18
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 9461921f7931845c57530b0c70636af34183e6fd79ab2884a044db936154891e |
| MD5 | d2885a6bf77ef1dd4a413d696c69c3d6 |
| BLAKE2b-256 | d47609284a5b878e63bcba6561736f672340db8dba8c94dc6fc2e11ae571b2f8 |