Python automation toolkit for VRChat (Windows / Linux)

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

GesonAnko

These details have not been verified by PyPI

Project description

vrcpilot

English | 日本語

Python automation toolkit for the VRChat desktop client on Windows / Linux. It can launch, focus, capture, OCR, detect image templates, and synthesize input through both a typed Python API and the vrcpilot CLI.

Features

Process control — launch VRChat (vrcpilot.launch; direct-spawn by default, --via-steam for the legacy Steam route), detect running PIDs, and terminate the process.
Window control — focus / unfocus the VRChat window and check its foreground state (Win32 / X11 / XWayland).
Screen capture — Capture / CaptureLoop for streaming video frames and take_screenshot for one-off captures that round-trip through YAML.
Audio capture — Speaker / SpeakerLoop for VRChat-only audio (native PipeWire pipeline on Linux; proc-tap process loopback on Windows).
Unified recording — vrcpilot record writes MP4 (video and/or audio) or WAV (audio only) to a file, or streams a self-describing Matroska (MKV) byte stream to stdout for piping into ffmpeg etc.
OCR — pluggable OCREngine ABC with the default RapidOCREngine. ocr() returns word-level results in VRChat window-local coordinates that feed straight into mouse.move().
Image-template detection — TemplateDetectEngine using OpenCV TM_CCOEFF_NORMED. Detections use the same coordinate schema as OCR.
Synthetic input — keyboard / mouse input via pydirectinput on Windows and inputtino + /dev/uinput on Linux. Input is sent only while VRChat is focused.
Non-ASCII text injection — vrcpilot.clipboard sends arbitrary Unicode strings through clipboard + Ctrl+V.
OSC — OscSender and the vrcpilot osc CLI fire VRChat's /input/*, /chatbox/*, and /avatar/parameters/* messages over UDP.
Virtual mic output — Stream WAV files or live float32 chunks (e.g. an LLM agent's TTS) into VRChat through VB-Audio Virtual Cable on Windows, or through the VRCPilotMic PipeWire sink on Linux (one-time setup via vrcpilot linux-mic register). CLI subcommand vrcpilot mic accepts a WAV file or raw s16le over stdin.
CLI front-end — subcommands such as vrcpilot launch / screenshot / record / ocr / detect / mouse / keyboard / paste / mic / ..., with tab completion via argcomplete.

Installation

Python 3.12 or later is required.

On Linux, install inputtino-python into the same Python environment before installing vrcpilot. See the Linux requirements below for the native build packages and /dev/uinput permissions. uv tool install creates an isolated environment; on Linux, use the --with inputtino-python example below.

# Linux only: install inputtino before vrcpilot
pip install "inputtino-python @ git+https://github.com/games-on-whales/inputtino.git@stable#subdirectory=bindings/python"

# Library + CLI
pip install vrcpilot

# Install with OCR support
pip install "vrcpilot[ocr]"

# Install as an isolated CLI tool
uv tool install vrcpilot

# Install as an isolated CLI tool on Linux
uv tool install --with "inputtino-python @ git+https://github.com/games-on-whales/inputtino.git@stable#subdirectory=bindings/python" vrcpilot

# Install from source for development
git clone https://github.com/MLShukai/vrcpilot
cd vrcpilot
uv sync --all-extras

Pre-release builds (0.X.Yrc1, 0.X.Ya1, etc.) are excluded from pip install by default. To opt in to a pre-release, use pip install --pre vrcpilot or uv tool install --prerelease=allow vrcpilot (and the same --prerelease=allow flag for the Linux uv tool install --with inputtino-python variant above).

Platform Requirements

Windows

No additional system packages are required. pywin32 and pydirectinput are installed automatically as dependencies.

For vrcpilot mic only, install VB-Audio Virtual Cable. After installation, Windows sound settings expose a playback device named CABLE Input and a recording device named CABLE Output. Open VRChat's Audio settings and select "CABLE Output (VB-Audio Virtual Cable)" as the microphone input device — vrcpilot mic writes to CABLE Input and VRChat picks the audio up through CABLE Output. The dependency is not needed if you do not use vrcpilot mic.

Linux

An X11 or XWayland session is required. Wayland-native sessions are not supported. In that environment, focus() / unfocus() emit a RuntimeWarning and return False.

Check your session type with:

echo $XDG_SESSION_TYPE   # x11 or wayland
echo $DISPLAY            # OK if this has a value, including through XWayland

inputtino-python is built natively from git, so install the following system packages before pip install:

sudo apt-get install -y cmake build-essential pkg-config libevdev-dev
sudo usermod -aG input "$USER"   # write access to /dev/uinput; log out and back in

If the uinput kernel module is disabled, load it with sudo modprobe uinput.

Also note that the distribution name differs from the import name. On PyPI it is inputtino-python; in Python, import it as inputtino.

Audio (for `vrcpilot mic`)

For vrcpilot mic and the Mic Python API on Linux, you also need:

pipewire + pipewire-pulse (PulseAudio compatibility layer)
libpulse0 (soundcard links against it via CFFI)
Run vrcpilot linux-mic register once after installation to create the persistent VRCPilotMic PipeWire sink.

Then in VRChat's Audio settings, select Monitor of VRCPilot Virtual Mic as the microphone input.

umu-launcher (for direct-spawn launch)

When vrcpilot launch runs without --via-steam (the default on Linux), it direct-spawns VRChat under umu-launcher, so umu-run must be on PATH.

On Debian / Ubuntu, grab the latest .deb from the official release page https://github.com/Open-Wine-Components/umu-launcher/releases/latest (look for an asset such as python3-umu-launcher_*.deb) and install it:

# Debian / Ubuntu — replace <file>.deb with the asset name from the release page
sudo dpkg -i <file>.deb
sudo apt-get install -f

For other distributions or building from source, see the official README: https://github.com/Open-Wine-Components/umu-launcher.

Alternatively, pass vrcpilot launch --via-steam to let Steam manage Proton itself; in that mode umu-launcher is not required.

macOS

Not supported. import vrcpilot raises ImportError on sys.platform values other than "win32" and "linux".

Quick Start (CLI)

The CLI is the quickest entry point for driving VRChat. The basic pipeline is: screenshot emits a Screenshot as YAML, then ocr / detect consume it from stdin or --screenshot.

OCR / detect results expose window-local coordinates under pos.bbox, and vrcpilot mouse move X Y consumes the same window-local frame. Feed pos.bbox in directly — no manual translation is needed.

# Launch VRChat in desktop mode and wait until startup completes
vrcpilot launch --no-vr --screen-width 1280 --screen-height 720 --wait-timeout 60

# Screenshot -> OCR -> save visualization PNG in one line
vrcpilot screenshot | vrcpilot ocr --viz /tmp/viz.png > /tmp/ocr.yaml

# Pass the same pipeline to image-template detection
vrcpilot screenshot | vrcpilot detect -q assets/button.png > /tmp/det.yaml

# Move the mouse and click (VRChat window-local coordinates)
vrcpilot mouse move 600 360
vrcpilot mouse click left

# Press a key (--duration defaults to 0.1s, the lower bound VRChat reliably accepts)
vrcpilot keyboard press w --duration 1.0

# Input non-ASCII text (clipboard + Ctrl+V)
vrcpilot paste "こんにちは、VRChat！"

# Record 10 seconds of VRChat video + audio to MP4
vrcpilot record -o /tmp/vrc.mp4 --duration 10

# Stream a self-describing MKV from VRChat into ffmpeg
vrcpilot record --duration 5 | ffmpeg -i - -c copy /tmp/vrc.mkv

# Play a WAV file into VRChat's mic
# (Windows: requires VB-Cable; Linux: run `vrcpilot linux-mic register` first)
vrcpilot mic -i greeting.wav

# Terminate (idempotent)
vrcpilot terminate

See vrcpilot --help and vrcpilot <subcommand> --help for all options.

Quick Start (Python API)

from time import sleep

import vrcpilot

# launch() waits up to wait_timeout seconds (default 30s) until VRChat's PID appears.
# None means VRChat was not detected within that time.
pid = vrcpilot.launch(no_vr=True, screen_width=1280, screen_height=720)
if pid is None:
    raise RuntimeError("VRChat did not start before launch() timed out")
sleep(45)  # extra warm-up wait: shaders / avatar loading / network sync

try:
    # Capture one frame (None on a recoverable failure)
    shot = vrcpilot.take_screenshot()
    if shot is None:
        raise RuntimeError("could not capture the VRChat screen")

    # OCR all visible words (uses a cached RapidOCREngine when engine is omitted)
    result = vrcpilot.ocr(shot)
    for word in result.words:
        print(word.text, word.bbox)

    # Move the cursor to the center of the first word and left-click
    # word.bbox is window-local, which is exactly what mouse.move expects.
    if result.words:
        x, y, w, h = result.words[0].bbox
        vrcpilot.mouse.move(int(x + w / 2), int(y + h / 2))
        vrcpilot.mouse.click(vrcpilot.MouseButton.LEFT)

    # Press a key
    vrcpilot.keyboard.press(vrcpilot.Key.W, duration=1.0)
finally:
    vrcpilot.terminate()

Stream audio chunks (e.g. from an LLM agent's TTS) into VRChat's mic:

import numpy as np
import vrcpilot

def tts_chunks():  # yield float32 NDArray chunks; (N,) mono or (N, C) multi-channel
    yield np.zeros(48000, dtype=np.float32)  # 1s of silence as a placeholder

with vrcpilot.Mic(sample_rate=48000, channels=1) as mic:
    for chunk in tts_chunks():
        mic.play(chunk)

CLI Subcommands

Subcommand	Purpose
`launch`	Start VRChat (direct-spawn by default; `--via-steam` for the Steam route). Supports `--no-vr`, `--screen-{width,height}`, `--wait-timeout`, and more
`pid`	List running VRChat PIDs, one per line
`terminate`	Terminate VRChat (idempotent)
`focus`	Bring the VRChat window to the foreground
`unfocus`	Send the VRChat window to the bottom of the z-order
`screenshot`	Capture one frame and emit a `Screenshot` YAML to stdout (PNG path or inline base64)
`record`	Record VRChat video and/or audio. `-o file.mp4` / `file.wav` for files; otherwise streams self-describing MKV to stdout
`mouse`	`move` / `click` / `scroll` (VRChat window-local coordinates)
`keyboard`	`press` (`--duration` defaults to 0.1s)
`paste`	Input text through clipboard + Ctrl+V (non-ASCII safe)
`ocr`	Run OCR on a `Screenshot` YAML (stdin pipe or `--screenshot <path>`)
`detect`	Template-search a `Screenshot` YAML with a query image. `-q query.png` / `--threshold` / `--top-k`
`osc`	Send VRChat OSC messages: `send` / `axis` / `tap` / `hold` / `chatbox` / `typing` / `avatar`
`mic`	Stream WAV / raw s16le PCM into a virtual mic device (Windows + VB-Cable, Linux + PipeWire); defaults to reading stdin
`linux-mic`	Register / unregister / inspect the `VRCPilotMic` PipeWire virtual mic (Linux only)

Shell Completion

vrcpilot supports tab completion through argcomplete. The following items can be completed:

Subcommands (launch / pid / terminate / focus / unfocus / screenshot / record / mouse / keyboard / paste / ocr / detect / osc / mic / linux-mic)
Options (--steam-path, etc.)
Options that take file paths (.exe for --steam-path, .png for --query, etc.)

Requirements

Install for development with uv sync, or install with uv tool install vrcpilot, and make sure register-python-argcomplete is available on PATH.
If you do not want to add it to your global PATH, replace register-python-argcomplete ... in the commands below with uv run register-python-argcomplete ....

One-Line Setup (Development Repository)

Right after cloning, source / dot-source the bundled bootstrap script if you want to complete "create venv -> activate -> register completion" in one line.

bash: . ./clicomp.sh
pwsh: . .\CliComp.ps1

The script performs the following steps:

Activate an existing .venv, if present
Run just setup if vrcpilot is not on PATH, then activate again
Register vrcpilot completion in the current session with register-python-argcomplete

If you run it in a subshell, such as bash clicomp.sh or .\CliComp.ps1, neither the venv nor completion settings will remain in the parent shell. Be sure to source / dot-source it (the script rejects normal execution). To make it persistent, add the following line to your shell startup file.

# ~/.bashrc
. /path/to/vrcpilot/clicomp.sh

# $PROFILE
. C:\path\to\vrcpilot\CliComp.ps1

Bash / Git Bash

To enable completion for the current session only:

eval "$(register-python-argcomplete vrcpilot)"

To make it persistent, add the line above to ~/.bashrc (or ~/.bash_profile in Git Bash).

PowerShell

Both Windows PowerShell 5.1 and pwsh 7.x are supported, though pwsh 7.x is recommended for development.

To enable completion for the current session only:

register-python-argcomplete --shell powershell vrcpilot | Out-String | Invoke-Expression

To make it persistent, add the Invoke-Expression line above to your PowerShell profile.

code $PROFILE   # notepad $PROFILE is also fine
# Append the Invoke-Expression line above to the end of the file and save it
# Open a new session, or reload with `. $PROFILE`

Troubleshooting

If completion does not work, see the argcomplete documentation: https://kislyuk.github.io/argcomplete/.

Documentation

Tutorial / playbook: docs/usage.md — task-based walkthrough (launch -> observe -> click -> teardown)
CLI reference: docs/cli.md — all subcommands, flags, and exit codes. Same content as vrcpilot --help / vrcpilot <subcommand> --help
Python API reference: docs/python-api.md — every symbol exposed as vrcpilot.<name>
Changelog: CHANGELOG.md
Contributing guide: CONTRIBUTING.md

License

Published under the MIT license.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

GesonAnko

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.0

May 26, 2026

This version

0.3.0

May 25, 2026

0.2.1

May 23, 2026

0.2.0

May 22, 2026

0.2.0rc1 pre-release

May 22, 2026

0.1.0

May 15, 2026

0.1.0rc1 pre-release

May 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vrcpilot-0.3.0.tar.gz (609.0 kB view details)

Uploaded May 25, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vrcpilot-0.3.0-py3-none-any.whl (146.9 kB view details)

Uploaded May 25, 2026 Python 3

File details

Details for the file vrcpilot-0.3.0.tar.gz.

File metadata

Download URL: vrcpilot-0.3.0.tar.gz
Upload date: May 25, 2026
Size: 609.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vrcpilot-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`1332ab0bb156f8870cf62cea736fe1f9a8271072714943de984cfef56c563b16`
MD5	`264fa4db989c6a8a9884b91b755aad04`
BLAKE2b-256	`6f8263b7e38404883bcdd1e1378f6e1da3a38219f48bdd2d181c1d79a5538dbb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vrcpilot-0.3.0.tar.gz:

Publisher: publish.yml on MLShukai/vrcpilot

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vrcpilot-0.3.0.tar.gz
- Subject digest: 1332ab0bb156f8870cf62cea736fe1f9a8271072714943de984cfef56c563b16
- Sigstore transparency entry: 1628231804
- Sigstore integration time: May 25, 2026
Source repository:
- Permalink: MLShukai/vrcpilot@d6cd98d43dc4a28cba2ccde1bb4d77dd8abe70d3
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/MLShukai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@d6cd98d43dc4a28cba2ccde1bb4d77dd8abe70d3
- Trigger Event: push

File details

Details for the file vrcpilot-0.3.0-py3-none-any.whl.

File metadata

Download URL: vrcpilot-0.3.0-py3-none-any.whl
Upload date: May 25, 2026
Size: 146.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vrcpilot-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ac7ec74a84c8c93dbdd648689c0e467a271fa0a38257f28267a79628c0d7178c`
MD5	`94c0fe76afd4bf08f7a24683e2b7ac76`
BLAKE2b-256	`d2f0bcdb04da5e50eac01dabeb6a219c806cad941ad5030c0900c49432920588`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vrcpilot-0.3.0-py3-none-any.whl:

Publisher: publish.yml on MLShukai/vrcpilot

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vrcpilot-0.3.0-py3-none-any.whl
- Subject digest: ac7ec74a84c8c93dbdd648689c0e467a271fa0a38257f28267a79628c0d7178c
- Sigstore transparency entry: 1628231856
- Sigstore integration time: May 25, 2026
Source repository:
- Permalink: MLShukai/vrcpilot@d6cd98d43dc4a28cba2ccde1bb4d77dd8abe70d3
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/MLShukai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@d6cd98d43dc4a28cba2ccde1bb4d77dd8abe70d3
- Trigger Event: push

vrcpilot 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

vrcpilot

Features

Installation

Platform Requirements

Windows

Linux

Audio (for vrcpilot mic)

umu-launcher (for direct-spawn launch)

macOS

Quick Start (CLI)

Quick Start (Python API)

CLI Subcommands

Shell Completion

Requirements

One-Line Setup (Development Repository)

Bash / Git Bash

PowerShell

Troubleshooting

Documentation

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Audio (for `vrcpilot mic`)