Skip to main content

๐Ÿค– AI-Powered Browser & Computer Automation CLI with Voice, RAG, Multi-Agent & more

Project description

๐Ÿค– ClawBot Plus โ€” AI-Powered Computer Control Agent

Your PC. Your AI. Full Control.

GitHub release MIT License Stars PyPI

ClawBot Plus is a personal AI assistant that takes full control of your Windows PC. It can open apps, write code, browse the web, generate presentations, search the internet, control system settings, and talk to you โ€” all from a single CLI. Connect it to Telegram, and you can control your PC from your phone too.

If you want a personal, single-user AI that feels local, fast, and always-on โ€” this is it.

Install (recommended)

Runtime: Python 3.11+

pip install clawbot-plus
clawbot

That's it. ClawBot will auto-detect your system, scan your apps, and set up its workspace on first run.

Quick Start

# Start ClawBot
clawbot

# Choose your AI provider (Gemini, OpenAI, Groq, Ollama, GitHub Models, NVIDIA NIM, etc.)
# Choose your model
# Start giving commands!

โฏ open chrome and go to youtube
โฏ make a portfolio website
โฏ what's the weather today
โฏ /voice   # Start voice mode
โฏ switch   # Change AI model
โฏ quit     # Exit

From Source (development)

git clone https://github.com/aayushsoam/clawbot-plus.git
cd clawbot-plus
pip install -e .
clawbot

Supported AI Providers

Provider Models Auth
๐Ÿ”ท Google Gemini gemini-3.1-pro, gemini-2.5-flash, gemini-2.0-flash, etc. GEMINI_API_KEY
๐Ÿ”ต OpenAI gpt-4o, gpt-4.1-mini, o3-mini, o4-mini OPENAI_API_KEY
๐ŸŸ  Anthropic claude-3-7-sonnet, claude-3-5-sonnet, claude-3-opus ANTHROPIC_API_KEY
โšก Groq llama-3.3-70b-versatile, gemma2-9b-it GROQ_API_KEY
๐Ÿ‹ DeepSeek deepseek-chat, deepseek-reasoner DEEPSEEK_API_KEY
๐ŸŸฉ NVIDIA NIM nemotron-120b, llama-3.3-70b, deepseek-r1, phi-4 NVIDIA_API_KEY
๐ŸŒ™ Kimi (Moonshot) moonshot-v1-8k/32k/128k MOONSHOT_API_KEY
๐Ÿ™ GitHub Models gpt-4o, gpt-4.1, Codestral-2501, Llama-3.3-70B GITHUB_TOKEN
๐Ÿฆ™ Ollama (Local) qwen3-vl, llama3.3, deepseek-r1, mistral, phi3 Free (local)
โ˜๏ธ Ollama (Cloud) Same as local, custom host OLLAMA_BASE_URL

Highlights

  • ๐Ÿ–ฅ๏ธ Full Computer Control โ€” open apps, click, type, scroll, screenshot, drag & drop. ClawBot sees your screen and acts.
  • ๐Ÿ’ป Senior Developer Coding โ€” writes production-quality code with separate HTML/CSS/JS files, verifies via shell, and shows output.
  • ๐Ÿ”Š Voice Mode โ€” /voice to talk to ClawBot using your microphone. It speaks back using Edge TTS.
  • ๐Ÿ“ฑ Telegram Bot โ€” connect your Telegram bot and control your PC from your phone.
  • ๐Ÿง  Persistent Memory โ€” remembers your name, preferences, past sessions, and daily logs across restarts.
  • ๐Ÿ›ก๏ธ Anti-Loop Engine โ€” detects when the AI is stuck and forces a different approach or graceful exit.
  • ๐Ÿ” Web Search โ€” search the web and fetch pages without opening a browser.
  • ๐Ÿ“Š PPT Generation โ€” generate_ppt action creates professional PowerPoint presentations.
  • ๐ŸŽฌ Video Generation โ€” local Python or cloud API video creation via Koda-Stack.
  • ๐Ÿค– Multi-Agent Swarm โ€” spawn worker agents for complex projects (ClawTeam).
  • ๐Ÿ”’ Security Layer โ€” dangerous commands are blocked. All shell commands are audited.
  • ๐Ÿ“ฆ Skills System โ€” extensible SKILL.md-based plugin architecture for CCTV, CapCut, LaTeX, Office docs, and more.

How It Works

                            โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
                            โ”‚  You (CLI / Telegram / Voice)โ”‚
                            โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ
                                           โ”‚ (Commands, Voice, Images)
                                           โ–ผ
         โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
         โ”‚ ๐Ÿค– ClawBot Plus (Agentic Core)                                    โ”‚
         โ”‚   โ€ข Brain (LLMs)      โ€ข RAG Memory       โ€ข Anti-Loop Engine       โ”‚
         โ”‚   โ€ข Skill Plugins     โ€ข Error Healing    โ€ข Multi-Agent Team       โ”‚
         โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ
                                         โ”‚
             โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
             โ”‚                           โ”‚                           โ”‚
             โ–ผ                           โ–ผ                           โ–ผ
 โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ   โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ   โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
 โ”‚ ๐Ÿ–ฅ๏ธ System Control     โ”‚   โ”‚ ๐Ÿ’ป Code Engine        โ”‚   โ”‚ ๐ŸŒ Browser Automation โ”‚
 โ”‚ --------------------- โ”‚   โ”‚ --------------------- โ”‚   โ”‚ --------------------- โ”‚
 โ”‚ โ€ข OS APIs / PyAutoGUI โ”‚   โ”‚ โ€ข read/write_file     โ”‚   โ”‚ โ€ข browser-use Agent   โ”‚
 โ”‚ โ€ข UI Automation       โ”‚   โ”‚ โ€ข Shell (Subprocess)  โ”‚   โ”‚ โ€ข Playwright / DOM    โ”‚
 โ”‚ โ€ข Window Management   โ”‚   โ”‚ โ€ข Syntax Validation   โ”‚   โ”‚ โ€ข Scrape & Extract    โ”‚
 โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ   โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ   โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ

Key Subsystems

  • Computer Agent (clawbot/computer/agent.py) โ€” the brain. Receives tasks, plans actions, executes them, and learns.
  • CLI (clawbot/cli.py) โ€” the face. Interactive prompt with model selection, mode switching, and live task execution.
  • Telegram Bot (clawbot/computer/telegram_bot.py) โ€” remote control from your phone.
  • Voice Engine (clawbot/computer/voice.py) โ€” speech-to-text + text-to-speech loop.
  • Skills (skills/) โ€” modular capabilities loaded on-demand based on task keywords.
  • Memory (~/.clawbot/) โ€” sessions, daily logs, preferences, and persona files.

Workspace & Persona

  • Workspace root: ~/.clawbot/workspace/
  • Persona files: IDENTITY.md, SOUL.md, USER.md, AGENTS.md, BOOTSTRAP.md
  • Skills: skills/<skill_name>/SKILL.md
  • Sessions: ~/.clawbot/sessions/ (one .md per task)
  • Daily memory: ~/.clawbot/memory/ (one .md per day)

Configuration

Minimal ~/.clawbot/clawbot.json:

{
  "default_provider": "๐Ÿ”ท Google Gemini (Free)",
  "default_model": "gemini-2.5-flash"
}

CLI Commands

Command Action
clawbot Start the interactive CLI
clawbot --version Show version
clawbot devices Manage Claw3D device connections
clawbot tunnel Setup remote access tunnel
/voice Start voice control mode
/video Switch to video generation mode
/code Route tasks to OpenCode
/normal Back to standard agent mode
switch Change AI provider/model
quit Exit ClawBot

Chat Commands (Telegram)

Send these from Telegram when connected:

  • Any text โ†’ executes as a task on your PC
  • Images โ†’ analyzed by vision models
  • Voice messages โ†’ transcribed and executed

Tech Stack

  • Python 3.11+ โ€” core runtime
  • browser-use โ€” LLM classes and browser automation
  • PyAutoGUI โ€” screen control, clicks, typing
  • pywin32 โ€” Windows native API (UIAutomation)
  • Rich โ€” beautiful terminal UI
  • InquirerPy โ€” interactive prompts
  • edge-tts โ€” text-to-speech
  • SpeechRecognition โ€” voice input
  • python-telegram-bot โ€” Telegram integration
  • Pillow โ€” image processing for vision
  • httpx โ€” async HTTP client

Project Structure

clawbot/
  __init__.py
  cli.py              # Main CLI entry point
  computer/
    agent.py           # Core AI agent loop
    telegram_bot.py    # Telegram integration
    voice.py           # Voice mode
    actions.py         # System actions
    system_control.py  # OS-level controls
    window_manager.py  # Window focus/snap
    security.py        # Command auditing
    gateway.py         # Claw3D WebSocket gateway
    memory_search.py   # RAG-based memory search
    self_healing.py    # Auto-fix failed code
    workflows.py       # Macro recording/replay
    orchestrator.py    # Multi-agent coordination
  clawteam/            # Multi-agent swarm system
skills/                # Extensible skill plugins
workspace/             # Persona & identity files
agents/                # AI reviewer/planner agents

Star History

Star History Chart

Built By

ClawBot Plus was built by Aayush Soam with significant contributions from Claude (Anthropic) โ€” pair-programmed from architecture to deployment.

  • ๐Ÿ‘ค Aayush Soam โ€” Creator, Vision & Product
  • ๐Ÿค– Claude (Anthropic) โ€” Co-Developer, Architecture & Agent Engineering

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

clawbot_plus-2.2.1.tar.gz (5.7 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

clawbot_plus-2.2.1-py3-none-any.whl (5.9 MB view details)

Uploaded Python 3

File details

Details for the file clawbot_plus-2.2.1.tar.gz.

File metadata

  • Download URL: clawbot_plus-2.2.1.tar.gz
  • Upload date:
  • Size: 5.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.4

File hashes

Hashes for clawbot_plus-2.2.1.tar.gz
Algorithm Hash digest
SHA256 0ce69bcf7056eecf2b325114efb28ea8f936802795274ee7c3a5ed9f145c3622
MD5 dce0a25b70465c38f41b6eed04f200ed
BLAKE2b-256 14fbcff72c9c8c94d2ce430f1a4cbfa380578583443e156812bbc1f4d5a5bd44

See more details on using hashes here.

File details

Details for the file clawbot_plus-2.2.1-py3-none-any.whl.

File metadata

  • Download URL: clawbot_plus-2.2.1-py3-none-any.whl
  • Upload date:
  • Size: 5.9 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.4

File hashes

Hashes for clawbot_plus-2.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3c0d74b291b2e764575d073e976de052bf2b312826805b627f1c9c502908b78b
MD5 f32f9373a7624a05dc57fc3814d7ce2b
BLAKE2b-256 03a20e3a35671939c5c13aa349bc72a5c1655f7ec686da73c2b8d27874a7a46a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page