Multi-service, multi-platform optical character recognition

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

FallingLuma

These details have not been verified by PyPI

Project links

Sponsor

Project description

OwOCR

OwOCR is a text recognition tool that continuously scans for images and performs OCR (Optical Character Recognition) on them. Its main focus is Japanese, but it works for many other languages.

Demo

Visual novel

https://github.com/user-attachments/assets/f2196b23-f2e7-4521-820a-88c8bedb9d8e

Manga

https://github.com/user-attachments/assets/f061854d-d20f-43e8-8c96-af5a0ea26f43

Installation/basic usage (easy Windows/macOS packages)

Easy to install Windows and macOS packages can be downloaded here.

On Windows, just extract the zip anywhere and double click on "owocr".
On macOS, just double click on the dmg and drag the owocr app to the Application folder, like most macOS apps. You will be asked to grant two permissions to owocr the first time it starts (Accessibility and Screen Capture), just follow the prompts to do so.
A "Log Viewer" window will shop up, showing information messages. A tray icon in the macOS menu bar/Windows task bar will show up. You can close the log viewer if you want.
By default owocr monitors the clipboard for images and outputs recognized text back to the clipboard. You can change this from the configuration, accessible from the tray icon.
With a left click on the tray icon you can pause/unpause on Windows, from the right click menu (left click on macOS) you can change the engine, pause/unpause, change the screen capture area selection, take a screenshot of the selected screen/window, launch the configuration, and reopen the log viewer if you closed it. The icon will be dimmed to show when owocr is paused.
In these versions all the OCR engines and features are already available, you don't need to install anything else. The tray icon is always enabled and can't be turned off.

Installation (terminal+Python, all operating systems)

OwOCR has been tested on Python 3.11 to 3.14. It can be installed with pip install owocr after you install Python. You also need to have one or more OCR engines, check the list below for instructions. I recommend installing at least Google Lens on any operating system, and OneOCR if you are on Windows. Bing is pre-installed, Apple Vision and Live Text come pre-installed on macOS.

Usage (terminal)

owocr

This default behavior monitors the clipboard for images and outputs recognized text back to the clipboard.

owocr_config

This opens the interface where you can change all the options.

From the terminal window you can pause/unpause with p or terminate with t/q, switch between engines with s or the engine-specific keys (from the engine list below).
The tray icon can also be used as explained above.
All command-line options and their descriptions can be viewed with: owocr -h.

Main features

Multiple input sources: clipboard, folders, websockets, unix domain socket, screen capture, OBS
Multiple output destinations: clipboard, text files, and websockets
Integrates well with Windows, macOS and Linux, supporting operating system features like notifications and a tray icon
Capture from specific screen areas, windows, of areas within windows (window capture is only supported on Windows/macOS/Wayland). This also tries to capture entire sentences and filter all repetitions. If you use an online engine like Lens I recommend setting a secondary local engine (OneOCR on Windows, Apple Live Text on macOS and meikiocr on Linux). With this "two pass" system only the changed areas are sent to the online service, allowing for both speed and accuracy
Control from the tray icon or the terminal window
Control from anywhere through keyboard shortcuts: you can set hotkeys for pausing, switching engines, taking a screenshot of the selected screen/window and changing the screen capture area selection
Read from a unix domain socket /tmp/owocr.sock on macOS/Linux
Furigana filter, works by default with Japanese text (both vertical and horizontal)

Manual configuration

The configuration file is stored in ~/.config/owocr_config.ini on Linux/macOS, or C:\Users\yourusername\.config\owocr_config.ini on Windows.
A sample config file is available at: owocr_config.ini

Notes about Linux support

While I've done all I could to support Linux (specifically Wayland), not everything might work with all setups. Specifically:

There are two ways of reading images from and writing text to the clipboard on Wayland. One requires a compositor which supports the extension "ext-data-control" and this should work out of the box with owocr by default. ext_data_control compatibility chart (worth noting GNOME/Mutter doesn't support it, but e.g. KDE/KWin does).
The alternative is through wl-clipboard (preinstalled in most distributions), but this will try to steal your focus constantly (due to Wayland's security design), limiting usability.
To switch to wl-clipboard, enable wayland_use_wlclipboard in owocr_config -> Advanced.
Reading from screen capture works on Wayland. The way it's designed is that your monitor/monitor selection/window selection in the operating system popup counts as a "virtual screen" to owocr.
By default the automatic coordinate selector will be launched to select one/more areas, as explained above.
Using "whole screen" 1 in the configuration/owocr -r=screencapture -sa=screen_1 will use the whole selection.
Using manual window names is not supported and will be ignored.
Keyboard combos/keyboard inputs in the coordinate selector might not work on Wayland. From my own testing they work on KDE (if you enable keyboard access in "Legacy X11 App Support" under "Application Permissions") but not GNOME. A workaround involves running pynput with the uinput backend, but this requires exposing your input devices (they will be accessible without root):
sudo chmod u+s $(which dumpkeys)
sudo usermod -a -G $(stat -c %G /dev/input/event0) $(whoami)
Then launch owocr with: PYNPUT_BACKEND_KEYBOARD=uinput owocr -r screencapture or add PYNPUT_BACKEND_KEYBOARD=uinput to your environment variables.
The tray icon requires installing this extension on GNOME (works out of the box on KDE)
X11 partially works but uses more resources for scanning the clipboard and doesn't support window capturing at all (only screens/screen selections).

Supported engines

Local

Chrome Screen AI - Recommended - Possibly the best local engine to date. Setup will be done automatically, but you can also manually download the zip for your operating system from this link and extract it to the C:/Users/yourusername/.config/screen_ai folder (Windows) or the ~/.config/screen_ai folder (macOS/Linux). → Terminal: install with pip install "owocr[screenai]", key: j
OneOCR - Windows 10/11 only - Recommended - One of the best local engines to date. On Windows 11, setup will be done automatically if you have Snipping Tool from the Microsoft Store (it's installed by default). On Windows 10 you need to copy 3 system files from Windows 11 to use it, refer to the readme here. It can also be used by installing oneocr on a Windows virtual machine and running the server there (oneocr_serve) and specifying the IP address of the Windows VM/machine in the config file. → Terminal: install with pip install "owocr[oneocr]", key: z
Apple Live Text (VisionKit framework) - macOS only - Recommended - One of the best local engines to date. It should be the same as Vision except that in Sonoma Apple added vertical text reading. → Terminal key: d
Apple Vision framework - macOS only - Older version of Live Text. → Terminal key: a
meikiocr - Recommended - Comparable to OneOCR in accuracy and CPU latency. It's limited to 64 text lines and 48 characters per line. → Terminal: install with pip install "owocr[meikiocr]", if you have a Nvidia GPU you can do pip uninstall onnxruntime && pip install onnxruntime-gpu which makes it the fastest OCR available. Key: k
NDLOCR-Lite - An engine from the Japanese National Diet Library's Lab, mostly suited to documents and book scans. → Terminal: install with pip install "owocr[ndlocrlite]", Key: h
Manga OCR (with optional comic-text-detector as segmenter) → Terminal: install with pip install "owocr[mangaocr]", keys: m (regular, ideal for small text areas), n (segmented, ideal for manga panels/larger images with multiple text areas)
WinRT OCR: Windows 10/11 only - It can also be used by installing winocr on a Windows virtual machine and running the server there (winocr_serve) and specifying the IP address of the Windows VM/machine in the config file. → Terminal: install with pip install "owocr[winocr]", key: w
EasyOCR → Terminal: install with pip install "owocr[easyocr]", key: e
RapidOCR → Terminal: install with pip install "owocr[rapidocr]", key: r

Cloud

Google Lens - Recommended - Arguably the best OCR engine to date. → Terminal: install with pip install "owocr[lens]", key: l
Bing - Recommended - Close second best. → Terminal key: b
Google Vision - You need a service account .json file named google_vision.json in user directory/.config/ → Terminal: install with pip install "owocr[gvision]", key: g
Azure Document Intelligence - You need to specify an api key and an endpoint in the config file → Terminal: install with pip install "owocr[azure]", key: v
OCRSpace - You need to specify an api key in the config file. → Terminal key: o

Links

Acknowledgments

This uses code from/references these people/projects:

Viola for working on the Google Lens implementation (twice!) and helping with the pyobjc VisionKit code!
@rtr46 for contributing a big overhaul allowing for coordinate support and JSON output, and for the initial Chrome Screen AI implementation!
@bpwhelan for contributing code for other language support and for his ideas (like two pass processing) originally implemented in the Game Sentence Miner fork of owocr
@bropines for the Bing code (Github issue)
@ronaldoussoren for helping with the pyobjc VisionKit code
Manga OCR for inspiring and being the project owocr was originally derived from
Mokuro for the comic text detector integration code
ocrmac for the Apple Vision framework API
ccylin2000_lipboard_monitor for the Windows clipboard polling code
vicky for the demo videos in this readme!
nao for the awesome icon!
Steffo for all his help in automating packaging/distribution with Github Actions!
kamperemu for the OBS source implementation!

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

FallingLuma

These details have not been verified by PyPI

Project links

Sponsor

Release history Release notifications | RSS feed

1.26.8

Mar 28, 2026

1.26.7

Mar 22, 2026

1.26.6

Mar 22, 2026

1.26.5

Mar 17, 2026

1.26.4

Mar 17, 2026

1.26.3

Mar 15, 2026

1.26.2

Mar 8, 2026

This version

1.26.1

Mar 6, 2026

1.26.0

Feb 26, 2026

1.25.5

Feb 23, 2026

1.25.4

Feb 22, 2026

1.25.3

Feb 22, 2026

1.25.2

Feb 21, 2026

1.25.1

Feb 20, 2026

1.25.0

Feb 20, 2026

1.24.4

Feb 13, 2026

1.24.3

Feb 8, 2026

1.24.2

Feb 6, 2026

1.24.1

Feb 5, 2026

1.24.0

Feb 5, 2026

1.23.4

Jan 27, 2026

1.23.3

Jan 25, 2026

1.23.2

Jan 25, 2026

1.23.1

Jan 23, 2026

1.23

Jan 23, 2026

1.22.8

Jan 18, 2026

1.22.7

Jan 14, 2026

1.22.6

Jan 13, 2026

1.22.5

Jan 13, 2026

1.22.4

Jan 13, 2026

1.22.3

Jan 12, 2026

1.22.2

Jan 10, 2026

1.22.1

Jan 6, 2026

1.22

Jan 5, 2026

1.21.6

Dec 24, 2025

1.21.5

Dec 22, 2025

1.21.4

Dec 22, 2025

1.21.3

Dec 20, 2025

1.21.2

Dec 18, 2025

1.21.1

Dec 18, 2025

1.21

Dec 14, 2025

1.20.3

Dec 9, 2025

1.20.2

Dec 5, 2025

1.20.1

Nov 30, 2025

1.20

Nov 23, 2025

1.19.5

Oct 25, 2025

1.19.4

Oct 23, 2025

1.19.3

Oct 22, 2025

1.19.2

Oct 22, 2025

1.19.1

Oct 22, 2025

1.19

Oct 22, 2025

1.18.7

Oct 20, 2025

1.18.6

Oct 19, 2025

1.18.5

Oct 18, 2025

1.18.4

Oct 18, 2025

1.18.3

Oct 18, 2025

1.18.2

Oct 18, 2025

1.18.1

Oct 17, 2025

1.18

Oct 17, 2025

1.17.7

Oct 16, 2025

1.17.6

Oct 14, 2025

1.17.5

Oct 13, 2025

1.17.4

Oct 13, 2025

1.17.3

Oct 13, 2025

1.17.2

Oct 13, 2025

1.17

Oct 9, 2025

1.16.2

Oct 6, 2025

1.16.1

Oct 5, 2025

1.16

Oct 5, 2025

1.15.1

Sep 20, 2025

1.15

Sep 19, 2025

1.14.7

Jul 25, 2025

1.14.6

Jul 10, 2025

1.14.5

Jul 6, 2025

1.14.4

Jul 6, 2025

1.14.3

Jul 3, 2025

1.14.2

Jun 16, 2025

1.14.1

Jun 16, 2025

1.14

Jun 15, 2025

1.13.12

May 3, 2025

1.13.10

Apr 9, 2025

1.13.9

Apr 7, 2025

1.13.8

Apr 7, 2025

1.13.7

Apr 7, 2025

1.13.6

Apr 6, 2025

1.13.5

Apr 6, 2025

1.13.4

Apr 6, 2025

1.13.3

Apr 5, 2025

1.13.2

Apr 5, 2025

1.13.1

Apr 5, 2025

1.13

Apr 5, 2025

1.12.2

Apr 3, 2025

1.12.1

Apr 2, 2025

1.12

Apr 2, 2025

1.11

Mar 31, 2025

1.10

Mar 29, 2025

1.9.1

Feb 1, 2025

1.9.0

Dec 21, 2024

1.8.4

Dec 17, 2024

1.8.3

Dec 13, 2024

1.8.2

Dec 13, 2024

1.8.1

Dec 12, 2024

1.8.0

Nov 16, 2024

1.7.6

Oct 15, 2024

1.7.5

Jun 24, 2024

1.7.4

May 18, 2024

1.7.3

Mar 8, 2024

1.7.2

Mar 4, 2024

1.7.1

Mar 4, 2024

1.7

Mar 2, 2024

1.6

Feb 18, 2024

1.5.2

Feb 9, 2024

1.5.1

Feb 8, 2024

1.5

Feb 8, 2024

1.4

Jan 30, 2024

1.3

Jan 29, 2024

1.2

Jan 24, 2024

1.1

Jan 23, 2024

1.0

Jan 22, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

owocr-1.26.1.tar.gz (332.3 kB view details)

Uploaded Mar 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

owocr-1.26.1-py3-none-any.whl (380.0 kB view details)

Uploaded Mar 6, 2026 Python 3

File details

Details for the file owocr-1.26.1.tar.gz.

File metadata

Download URL: owocr-1.26.1.tar.gz
Upload date: Mar 6, 2026
Size: 332.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for owocr-1.26.1.tar.gz
Algorithm	Hash digest
SHA256	`5a5aef5247f4e7136ec6f5526e57481097dd420201a864123dc4985c1c17c7c8`
MD5	`d8a110d5d5d64cb3fdc09b4e4550527f`
BLAKE2b-256	`e85978d7c149d68291c37ec51f36f9d570e5913c66af6f4a423c59e0c51ae955`

See more details on using hashes here.

Provenance

The following attestation bundles were made for owocr-1.26.1.tar.gz:

Publisher: release.yml on AuroraWright/owocr

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: owocr-1.26.1.tar.gz
- Subject digest: 5a5aef5247f4e7136ec6f5526e57481097dd420201a864123dc4985c1c17c7c8
- Sigstore transparency entry: 1049694957
- Sigstore integration time: Mar 6, 2026
Source repository:
- Permalink: AuroraWright/owocr@6a7910b6da37e46eb2d497667ea7d1731fc8a470
- Branch / Tag: refs/heads/master
- Owner: https://github.com/AuroraWright
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6a7910b6da37e46eb2d497667ea7d1731fc8a470
- Trigger Event: workflow_dispatch

File details

Details for the file owocr-1.26.1-py3-none-any.whl.

File metadata

Download URL: owocr-1.26.1-py3-none-any.whl
Upload date: Mar 6, 2026
Size: 380.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for owocr-1.26.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`543300966a5e0908439b18bbb153b8ca88d42aca51efd84d7942336108870e23`
MD5	`8a378976ecc1208425710c48bccd7ed7`
BLAKE2b-256	`c791f0178252cdca4f1d18ac66153d239dd39fae41ac57f73e88eb532579d50b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for owocr-1.26.1-py3-none-any.whl:

Publisher: release.yml on AuroraWright/owocr

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: owocr-1.26.1-py3-none-any.whl
- Subject digest: 543300966a5e0908439b18bbb153b8ca88d42aca51efd84d7942336108870e23
- Sigstore transparency entry: 1049694959
- Sigstore integration time: Mar 6, 2026
Source repository:
- Permalink: AuroraWright/owocr@6a7910b6da37e46eb2d497667ea7d1731fc8a470
- Branch / Tag: refs/heads/master
- Owner: https://github.com/AuroraWright
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6a7910b6da37e46eb2d497667ea7d1731fc8a470
- Trigger Event: workflow_dispatch

owocr 1.26.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

OwOCR

Demo

Visual novel

Manga

Installation/basic usage (easy Windows/macOS packages)

Installation (terminal+Python, all operating systems)

Usage (terminal)

Main features

Manual configuration

Notes about Linux support

Supported engines

Local

Cloud

Links

Acknowledgments

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance