WeClaw-CUA — vision-based WeChat message capture and report CLI for LLMs

These details have not been verified by PyPI

Project links

Project description

WeClaw-CUA

Vision-based WeChat message capture and report generation from the command line.

License: Apache-2.0 Platform

Capture chats · Generate reports · Search messages · Export · Statistics

Highlights

Vision-based capture — uses screenshots + vision LLM to extract messages, no database decryption needed
Cross-platform — macOS (Accessibility API + Quartz) and Windows (UI Automation)
No API key required — stepwise mode lets the calling agent handle all LLM calls
AI-first — JSON output by default, designed for LLM agent tool calls
Fully local — all UI automation runs on your machine, no data leaves your machine
13 commands — init, run, capture, finalize, report, build-report-prompt, sessions, history, search, export, stats, unread, new-messages

How It Works

Unlike tools that decrypt WeChat's local SQLite databases, WeClaw-CUA uses a pure vision approach:

Locates the WeChat desktop window via OS-level APIs
Scans the sidebar for unread chats using vision AI
Clicks into each chat, scrolls through messages, captures screenshots
Stitches screenshots into long images (OpenCV-based template matching)
Sends stitched images to a vision LLM for structured message extraction
Post-processes and deduplicates extracted messages into clean JSON

This means WeClaw-CUA works with any WeChat version and requires no key extraction or database access.

Installation

Requires Python >= 3.10.

PyPI

The PyPI project name weclaw is an unrelated third-party package. This project publishes as weclaw-cua. When available: pip install weclaw-cua (with extras [llm], [macos] as needed). Until you see it on PyPI, use From Source below.

From Source (Recommended)

git clone https://github.com/anthropic-ai/weclaw.git
cd weclaw
python3 -m venv .venv
./.venv/bin/pip install -r requirements.txt

Then install in editable mode (pick one):

./.venv/bin/pip install -e ".[macos,llm]"   # macOS: automation + LLM deps

On Windows, omit macos:

./.venv/bin/pip install -e ".[llm]"

Other variants:

./.venv/bin/pip install -e .              # core only (stepwise, no LLM deps)
./.venv/bin/pip install -e ".[llm]"       # LLM deps (any OS)
./.venv/bin/pip install -e ".[macos]"      # macOS-only deps

The weclaw console command remains an alias for weclaw-cua.

Installation (For AI Agents)

Paste the following into Claude Code, Cursor, or any AI coding agent:

Help me install and configure WeClaw-CUA from the README "From Source" section (clone, venv, requirements.txt, pip install -e ".[macos,llm]" on macOS or ".[llm]" on Windows).

Quick Start

Step 1 — Initialize

weclaw-cua init

This creates config/config.json from the template and verifies platform prerequisites.

macOS: Grant Accessibility Permission

Before running, make sure your terminal app has Accessibility access:

Open System Settings > Privacy & Security > Accessibility
Add your terminal app (Terminal, iTerm2, or your IDE's terminal)
Restart the terminal after enabling

Windows: Match Elevation

If WeChat is running as admin, run the script elevated too (Run as Administrator).

Step 2 — Configure

Edit config/config.json:

{
  "wechat_app_name": "WeChat",
  "groups_to_monitor": ["*"],
  "sidebar_unread_only": true,
  "report_custom_prompt": "Summarize key decisions and action items.",
  "openrouter_api_key": "sk-or-YOUR-KEY-HERE",
  "llm_model": "openai/gpt-4o",
  "output_dir": "output"
}

Set your API key either in the config or via environment variable:

export OPENROUTER_API_KEY="sk-or-v1-your-key"

Step 3 — Use It

weclaw-cua run                                   # full pipeline: capture + report
weclaw-cua capture                               # capture only
weclaw-cua report                                # report from existing captures
weclaw-cua sessions                              # list captured chats
weclaw-cua history "Group A" --limit 20          # view messages
weclaw-cua search "deadline" --chat "Team"       # search

Using with AI Agents (Stepwise Mode)

WeClaw-CUA is designed for AI agents. In stepwise mode (--no-llm), WeClaw-CUA handles all UI automation while the agent handles all LLM calls. No API key needed.

How Stepwise Mode Works

Agent                          WeClaw-CUA                    WeChat
  |                              |                              |
  |-- weclaw-cua capture --no-llm -->|                              |
  |                              |-- screenshot, scroll ------->|
  |                              |-- stitch images              |
  |                              |<-- stitched images           |
  |<-- manifest.json + images ---|                              |
  |                              |                              |
  |  (agent reads manifest.json)                                |
  |  (for each task: send .png + .prompt.txt to own LLM)        |
  |  (write response to .response.txt)                          |
  |                              |                              |
  |-- weclaw-cua finalize ---------> |                              |
  |<-- messages.json ----------- |                              |
  |                              |                              |
  |-- weclaw-cua build-report-prompt |                              |
  |<-- prompt text --------------|                              |
  |  (agent sends prompt to own LLM, gets report)               |

Step-by-Step for Agents

Capture (no LLM needed):

weclaw-cua capture --no-llm --work-dir /tmp/weclaw_work

This outputs a manifest.json listing all pending vision tasks, along with .png images and .prompt.txt files.

Process vision tasks (agent's responsibility):

For each task in manifest.json:

Read the .png image and .prompt.txt
Send to the agent's own vision LLM
Write the model response to .response.txt

Finalize (produce message JSON):

weclaw-cua finalize --work-dir /tmp/weclaw_work

Get report prompt (agent calls own LLM for report):

weclaw-cua build-report-prompt

Claude Code / Cursor Configuration

Add to your CLAUDE.md or .cursor/rules/:

## WeClaw-CUA

You can use `weclaw-cua` (or the `weclaw` alias) to capture and query WeChat messages.

Stepwise workflow (you handle LLM calls):
1. `weclaw-cua capture --no-llm` — capture screenshots, no LLM needed
2. Process each task in manifest.json with your vision model
3. `weclaw-cua finalize --work-dir <dir>` — produce message JSON
4. `weclaw-cua build-report-prompt` — get report prompt, call your own LLM

Query commands (work on captured data, no LLM needed):
- `weclaw-cua sessions` — list captured chats
- `weclaw-cua history "NAME" --limit 20 --format text` — view messages
- `weclaw-cua search "KEYWORD" --chat "CHAT_NAME"` — search messages
- `weclaw-cua stats "CHAT" --format text` — statistics
- `weclaw-cua export "CHAT" --format markdown` — export chat
- `weclaw-cua new-messages` — incremental new messages

Direct Mode (Built-in LLM)

If you prefer WeClaw-CUA to handle LLM calls directly (requires OpenRouter API key):

export OPENROUTER_API_KEY="sk-or-v1-your-key"
weclaw-cua run                    # capture + report in one step
weclaw-cua capture                # capture only
weclaw-cua report                 # report from existing captures

Command Reference

`init` — First-Time Setup

weclaw-cua init                        # create config + verify permissions
weclaw-cua init --force                # overwrite existing config
weclaw-cua init --config-dir /path     # custom config directory

`run` — Full Pipeline

weclaw-cua run                         # capture + report (JSON, requires API key)
weclaw-cua run --no-llm                # stepwise: capture only, agent handles LLM
weclaw-cua run --format text           # human-readable output

`capture` — Capture Messages

weclaw-cua capture                     # capture with built-in LLM
weclaw-cua capture --no-llm            # stepwise: output images+prompts only
weclaw-cua capture --no-llm --work-dir /tmp/w  # custom work directory
weclaw-cua capture --format text       # human-readable output

`finalize` — Process Agent Responses

weclaw-cua finalize --work-dir /tmp/weclaw_work  # produce final JSON from agent responses

`report` — Generate Report

weclaw-cua report                                    # full report (requires API key)
weclaw-cua report --prompt-only                      # output prompt only (no LLM call)
weclaw-cua report --input output/GroupA.json          # from specific files
weclaw-cua report --format text                      # human-readable

`build-report-prompt` — Get Report Prompt

weclaw-cua build-report-prompt                       # output prompt for agent's own LLM
weclaw-cua build-report-prompt --input output/A.json # from specific files

`sessions` — List Captured Chats

weclaw-cua sessions                    # all captured chats (JSON)
weclaw-cua sessions --limit 10        # last 10
weclaw-cua sessions --format text     # human-readable

`history` — View Chat Messages

weclaw-cua history "Group A"                         # last 50 messages
weclaw-cua history "Group A" --limit 100 --offset 50 # pagination
weclaw-cua history "Alice" --type text               # text messages only
weclaw-cua history "Alice" --format text             # human-readable

Options: --limit, --offset, --type, --format

`search` — Search Messages

weclaw-cua search "hello"                            # global search
weclaw-cua search "hello" --chat "Alice"             # in specific chat
weclaw-cua search "meeting" --chat "A" --chat "B"    # multiple chats
weclaw-cua search "report" --type text               # text only

Options: --chat (repeatable), --limit, --offset, --type, --format

`export` — Export Chat

weclaw-cua export "Alice" --format markdown          # to stdout
weclaw-cua export "Alice" --format txt --output chat.txt  # to file
weclaw-cua export "Team" --limit 1000                # more messages

Options: --format markdown|txt, --output, --limit

`stats` — Chat Statistics

weclaw-cua stats "Group A"             # JSON stats
weclaw-cua stats "Alice" --format text # human-readable

`unread` — Scan for Unread Chats

weclaw-cua unread                      # scan sidebar via vision AI
weclaw-cua unread --limit 10           # at most 10
weclaw-cua unread --format text        # human-readable

`new-messages` — Incremental Messages

weclaw-cua new-messages                # first: save state, return all
weclaw-cua new-messages                # subsequent: only new since last

State saved at <output_dir>/last_check.json. Delete to reset.

Message Types

The --type option (on history and search):

Value	Description
text	Text messages
system	System messages
link_card	Links and shared content
image	Images
file	File attachments
recalled	Recalled messages
unsupported	Unsupported message types

System Requirements

Platform	Status	Notes
macOS (Apple Silicon)	Supported	Requires Accessibility permission
macOS (Intel)	Supported	Requires Accessibility permission
Windows	Supported	Match elevation with WeChat if needed

Python >= 3.10
WeChat Desktop — any version (vision-based, no version dependency)
OpenRouter API key — for vision LLM message extraction and report generation

Configuration

`config/config.json`

{
  "wechat_app_name": "WeChat",
  "groups_to_monitor": ["*"],
  "sidebar_unread_only": true,
  "report_custom_prompt": "Summarize key decisions and action items.",
  "openrouter_api_key": "",
  "llm_model": "openai/gpt-4o",
  "output_dir": "output"
}

Field	Description
`wechat_app_name`	Window title for WeChat (usually "WeChat")
`groups_to_monitor`	`["*"]` = all groups, or list specific chat names
`sidebar_unread_only`	`true` = only process chats with unread badges
`report_custom_prompt`	Custom instructions for the LLM report
`openrouter_api_key`	API key (or use `OPENROUTER_API_KEY` env var)
`llm_model`	LLM model identifier for report generation
`output_dir`	Directory for output JSON files

Architecture

weclaw/
├── weclaw_cli/                 # CLI layer (click commands)
│   ├── main.py                # CLI entry point
│   ├── context.py             # Config loading
│   ├── commands/              # All CLI commands (capture, finalize, report, etc.)
│   └── output/                # JSON/text formatter
│
├── algo_a/                     # Vision-based message capture
│   ├── pipeline_a_win.py     # Main pipeline (both platforms)
│   ├── capture_chat.py        # Screenshot scroll-capture engine
│   ├── extract_messages.py    # Vision LLM message extraction
│   └── ...                    # Sidebar scan, click, stitch, dedup
│
├── algo_b/                     # LLM report generation
│   ├── pipeline_b.py         # Report pipeline
│   ├── build_report_prompt.py # Prompt construction
│   └── generate_report.py    # LLM call
│
├── platform_mac/               # macOS platform layer
│   ├── driver.py              # Quartz screenshots + CGEvent
│   ├── mac_ai_driver.py       # Vision AI driver
│   └── ...                    # Window detection, OCR, stitching
│
├── platform_win/               # Windows platform layer
│   ├── driver.py              # Vision AI driver
│   └── ...                    # Window detection, UI Automation
│
├── shared/                     # Cross-cutting utilities
│   ├── platform_api.py        # PlatformDriver protocol
│   ├── vision_backend.py      # VisionBackend protocol (pluggable LLM interface)
│   ├── stepwise_backend.py    # StepwiseBackend (writes images+prompts for agent)
│   ├── vision_ai.py           # OpenRouterBackend (built-in LLM, optional)
│   ├── message_schema.py      # Message dataclass
│   └── llm_client.py          # OpenRouter text wrapper (optional)
│
├── npm/                        # npm binary distribution
│   ├── weclaw/                # Main npm package
│   ├── platforms/             # Per-platform binary packages
│   └── scripts/build.py      # PyInstaller build script
│
├── config/                     # Configuration
├── scripts/                    # Debug & utility scripts
├── pyproject.toml              # Python package config
├── entry.py                    # PyInstaller entry point
└── run.sh                      # Shell entry point

Data Flow

weclaw-cua run / weclaw-cua capture
  │
  ├─ algo_a (vision capture)
  │   ├─ find WeChat window (OS API)
  │   ├─ scan sidebar for unread (vision AI)
  │   ├─ for each chat:
  │   │   ├─ click into chat
  │   │   ├─ scroll + capture screenshots
  │   │   ├─ stitch into long image
  │   │   ├─ vision LLM → structured JSON
  │   │   └─ post-process + dedup
  │   └─ write JSON files to output/
  │
  └─ algo_b (report generation)
      ├─ load message JSONs
      ├─ build report prompt
      ├─ call LLM
      └─ output report text

License

Apache License 2.0

Disclaimer

This project is a local UI automation tool for personal use only:

Read-only — captures what is visible on screen, does not modify WeChat data
No database access — uses pure vision, no decryption or memory scanning
No cloud transmission — all automation runs locally; only LLM API calls leave your machine (to your configured provider)
Use at your own risk — for personal learning and research purposes only

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.2

May 18, 2026

0.2.1

May 13, 2026

0.2.0

May 4, 2026

0.1.5

Apr 10, 2026

This version

0.1.4

Apr 10, 2026

0.1.0

Apr 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

weclaw_cua-0.1.4.tar.gz (146.9 kB view details)

Uploaded Apr 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

weclaw_cua-0.1.4-py3-none-any.whl (190.4 kB view details)

Uploaded Apr 10, 2026 Python 3

File details

Details for the file weclaw_cua-0.1.4.tar.gz.

File metadata

Download URL: weclaw_cua-0.1.4.tar.gz
Upload date: Apr 10, 2026
Size: 146.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for weclaw_cua-0.1.4.tar.gz
Algorithm	Hash digest
SHA256	`72321c88ee0304df900ebcf11d60e5bd047a027f9d9ecc07bdc75d580fba687f`
MD5	`be68fcc7c6edb646b90f2ec76a5f1a99`
BLAKE2b-256	`a4d9350acdf1e289a29a119efabc72556ad316c2fbc8f7747ab1e37404a167a7`

See more details on using hashes here.

File details

Details for the file weclaw_cua-0.1.4-py3-none-any.whl.

File metadata

Download URL: weclaw_cua-0.1.4-py3-none-any.whl
Upload date: Apr 10, 2026
Size: 190.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for weclaw_cua-0.1.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`28e2f283b3f0b42b06392c4d9a118615c0dd934c57d36bf88a79c8dc6a9e3504`
MD5	`3de62e1960b6320ad61ec530df480316`
BLAKE2b-256	`87c8e7a3aca747fc41ce2e2016ac485b5435374f838983d74d7724317440443d`

See more details on using hashes here.

weclaw-cua 0.1.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

WeClaw-CUA

Highlights

How It Works

Installation

PyPI

From Source (Recommended)

Installation (For AI Agents)

Quick Start

Step 1 — Initialize

macOS: Grant Accessibility Permission

Windows: Match Elevation

Step 2 — Configure

Step 3 — Use It

Using with AI Agents (Stepwise Mode)

How Stepwise Mode Works

Step-by-Step for Agents

Claude Code / Cursor Configuration

Direct Mode (Built-in LLM)

Command Reference

init — First-Time Setup

run — Full Pipeline

capture — Capture Messages

finalize — Process Agent Responses

report — Generate Report

build-report-prompt — Get Report Prompt

sessions — List Captured Chats

history — View Chat Messages

search — Search Messages

export — Export Chat

stats — Chat Statistics

unread — Scan for Unread Chats

new-messages — Incremental Messages

Message Types

System Requirements

Configuration

config/config.json

Architecture

Data Flow

License

Disclaimer

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`init` — First-Time Setup

`run` — Full Pipeline

`capture` — Capture Messages

`finalize` — Process Agent Responses

`report` — Generate Report

`build-report-prompt` — Get Report Prompt

`sessions` — List Captured Chats

`history` — View Chat Messages

`search` — Search Messages

`export` — Export Chat

`stats` — Chat Statistics

`unread` — Scan for Unread Chats

`new-messages` — Incremental Messages

`config/config.json`