High-speed, high-accuracy, local OCR for Japanese video games.

These details have not been verified by PyPI

Project links

Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Image Recognition

Project description

meikiocr

high-speed, high-accuracy, local ocr for japanese video games.

meikiocr is a python-based ocr pipeline that combines state-of-the-art detection and recognition models to provide an unparalleled open-source solution for extracting japanese text from video games and similar rendered content.

original image	ocr result

ナルホド
こ、こんなにドキドキするの、
小学校の学級裁判のとき以来です。

live demo

the easiest way to see meikiocr in action is to try the live demo hosted on hugging face spaces. no installation required!

try the meikiocr live demo here

core features

high accuracy: purpose-built and trained on japanese video game text, meikiocr significantly outperforms general-purpose ocr tools like paddleocr or easyocr on this specific domain.
high speed: the architecture is pareto-optimal, delivering exceptional performance on both cpu and gpu.
fully local & private: unlike cloud-based services, meikiocr runs entirely on your machine, ensuring privacy and eliminating api costs or rate limits.
cross-platform: it works wherever onnx runtime runs, providing a much-needed local ocr solution for linux users.
open & free: both the code and the underlying models are freely available under permissive licenses.

performance & benchmarks

meikiocr is built from two highly efficient models that establish a new pareto front for japanese text recognition. this means they offer a better accuracy/latency tradeoff than any other known open-weight model.

detection (cpu)	detection (gpu)

recognition (cpu)	recognition (gpu)

installation

pip install meikiocr

for nvidia gpu users (recommended)

for a massive performance boost, you can install the gpu-enabled version of the onnx runtime. this will be detected automatically by the script.

pip install meikiocr
pip uninstall onnxruntime
pip install onnxruntime-gpu

usage

this is how meikiocr can be called. you can also run demo.py for additional visual output.

import cv2
import numpy as np
from urllib.request import urlopen
from meikiocr import MeikiOCR

IMAGE_URL = "https://huggingface.co/spaces/rtr46/meikiocr/resolve/main/example.jpg"

with urlopen(IMAGE_URL) as resp:
    image = cv2.imdecode(np.asarray(bytearray(resp.read()), dtype="uint8"), cv2.IMREAD_COLOR)

ocr = MeikiOCR() # Initialize the OCR pipeline
results = ocr.run_ocr(image) # Run the full OCR pipeline
print('\n'.join([line['text'] for line in results if line['text']]))

adjusting thresholds

you can adjust the confidence thresholds for both the text line detection and the character recognition models. lowering the thresholds results in more detected text lines and characters, while higher values prevent false positives.

MeikiOCR().run_ocr(self, image, det_threshold=0.8, rec_threshold=0.2) # less, but more confident text boxes and characters returned

running dedicated detection

if you only care about the position of the text and not the content you can run the detection by itself, which is faster than running the whole ocr pipeline:

MeikiOCR().run_detection(self, image, det_threshold=0.8, rec_threshold=0.2) # only returns text line coordinates (for horizontal and vertical text lines)

in the same way you can also run_recognition by itself on images of precropped (horizontal) text lines.

how it works

meikiocr is a two-stage pipeline:

text detection: the meiki.text.detect.v0 model first identifies the bounding boxes of all horizontal text lines in the image.
text recognition: each detected text line is then cropped and processed in a batch by the meiki.text.recognition.v0 model, which recognizes the individual characters within it.

limitations

while meikiocr is state-of-the-art for its niche, it's important to understand its design constraints:

domain specific: it is highly optimized for rendered text from video games and may not perform well on handwritten or complex real-world scene text.
horizontal text only: it does not currently support vertical text.
architectural limits: the detection model is capped at finding 64 text boxes, and the recognition model can process up to 48 characters per line. these limits are sufficient for over 99% of video game scenarios but may be a constraint for other use cases.

advanced usage & potential

the meiki_ocr.py script provides a straightforward implementation of a post-processing pipeline that selects the most confident prediction for each character. however, the raw output from the recognition model is richer and can be used for more advanced applications. for example, one could build a language-aware post-processing step using n-grams to correct ocr mistakes by considering alternative character predictions.

this opens the door for meikiocr to be integrated into a variety of projects.

license

this project is licensed under the apache 2.0 license. see the license file for details.

Project details

These details have not been verified by PyPI

Project links

Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Image Recognition

Release history Release notifications | RSS feed

0.3.4

Apr 11, 2026

0.3.3

Apr 1, 2026

0.3.2

Mar 21, 2026

0.3.1

Feb 25, 2026

0.3.0

Feb 25, 2026

0.2.0

Jan 6, 2026

0.1.4

Dec 16, 2025

0.1.3

Nov 23, 2025

0.1.2

Nov 7, 2025

This version

0.1.1

Nov 6, 2025

0.1.0

Nov 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

meikiocr-0.1.1.tar.gz (11.9 kB view details)

Uploaded Nov 6, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

meikiocr-0.1.1-py3-none-any.whl (12.1 kB view details)

Uploaded Nov 6, 2025 Python 3

File details

Details for the file meikiocr-0.1.1.tar.gz.

File metadata

Download URL: meikiocr-0.1.1.tar.gz
Upload date: Nov 6, 2025
Size: 11.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for meikiocr-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`b6431673a74166243d7649900d32c1aa7ab61812037f5ce66ec8ddf3c5bff817`
MD5	`f40164a4428fb043157b7eefab2d2762`
BLAKE2b-256	`acb9ee010e82a65fd5908e50e8eefebad5a50a417dcfa8116e2c1ddb36533089`

See more details on using hashes here.

File details

Details for the file meikiocr-0.1.1-py3-none-any.whl.

File metadata

Download URL: meikiocr-0.1.1-py3-none-any.whl
Upload date: Nov 6, 2025
Size: 12.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.5

File hashes

Hashes for meikiocr-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ff98c8ab7ff2abb8cce2a7c27fbfdf182dddddedbdbb36b37758b3799a32023e`
MD5	`e7b7217e4c54f1824896b1f81b7f8b45`
BLAKE2b-256	`c0a7358cc004611fe5be39a649aa14b4548b8338089a901d369f0079a9fd74e9`

See more details on using hashes here.

meikiocr 0.1.1

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

meikiocr

live demo

core features

performance & benchmarks

installation

for nvidia gpu users (recommended)

usage

adjusting thresholds

running dedicated detection

how it works

limitations

advanced usage & potential

license

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes