OCR for Japanese manga
Project description
Manga OCR
Optical character recognition for Japanese text, with the main focus being Japanese manga. It uses a custom end-to-end model built with Transformers' Vision Encoder Decoder framework.
Manga OCR can be used as a general purpose printed Japanese OCR, but its main goal was to provide a high quality text recognition, robust against various scenarios specific to manga:
- both vertical and horizontal text
- text with furigana
- text overlaid on images
- wide variety of fonts and font styles
- low quality images
Unlike many OCR models, Manga OCR supports recognizing multi-line text in a single forward pass, so that text bubbles found in manga can be processed at once, without splitting them into lines.
See also:
- Poricom, a GUI reader, which uses manga-ocr
- mokuro, a tool, which uses manga-ocr to generate an HTML overlay for manga
- Xelieu's guide, a comprehensive guide on setting up a reading and mining workflow with manga-ocr/mokuro (and many other useful tips)
- Development code, including code for training and synthetic data generation: link
- Description of synthetic data generation pipeline + examples of generated images: link
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
manga-ocr-0.1.11.tar.gz
(66.3 kB
view hashes)
Built Distribution
manga_ocr-0.1.11-py3-none-any.whl
(62.9 kB
view hashes)
Close
Hashes for manga_ocr-0.1.11-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3e9a0373de1c9e1b6e5a34206ffe7c4c1b5823f2ba456143754e3e722c4b87a9 |
|
MD5 | 074c6e1b1fd179b4829071455e1f4b58 |
|
BLAKE2b-256 | 087797416057713f3d894da2d052135806f32b929601fda94c72034b753ca07c |