MyLittleOCR API Library

Overview

MyLittleOCR is a unified wrapper for several popular OCR libraries, providing a consistent API that allows developers to seamlessly integrate and switch between different OCR engines without altering their application logic.

Features

Unified API: A consistent interface for multiple OCR engines, simplifying OCR integration.
Multiple OCR Backends: Supports popular OCR libraries including Tesseract, EasyOCR, PaddleOCR, WeChat OCR, Surya, and RapidOCR.
Flexible Switching: Easily switch between OCR backends based on project requirements.
Customizable: Fine-tune accuracy and performance by adjusting parameters for each OCR engine.

Installation

Install MyLittleOCR using pip:

pip install my-little-ocr

Supported OCR Libraries

Below is a list of supported OCR libraries along with their licenses, project URLs, and usage examples with optional parameters.

OCR Engine	License	Project URL
Tesseract	Apache 2.0	Tesseract
EasyOCR	Apache 2.0	EasyOCR
WeChat OCR	Unknown	WeChat OCR
Surya	GPL 3.0	Surya
RapidOCR	Apache 2.0	RapidOCR

Tesseract

License: Apache 2.0
Project URL: Tesseract

To use Tesseract, first make sure the Tesseract binary is installed on your system; it is available from the Tesseract project. Then install the Python wrapper:

pip install pytesseract

Usage with Optional Parameters

The TesseractEngine class can be instantiated with the following optional parameters:

tesseract_command: Path to the Tesseract executable. If not provided, it attempts to find it automatically.
default_langs: A list of language codes or names. Default is ["eng", "chi_sim"].

Note: You can use language names like 'English', 'eng', or 'en'. The program automatically converts them using the iso639 library.
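
As a rough illustration of the normalization described in the note above, the sketch below maps names and codes to a Tesseract-style code. It uses a small hand-written lookup table instead of the iso639 library, and `LANGUAGE_ALIASES` and `to_tesseract_code` are hypothetical names, not part of MyLittleOCR.

```python
# Illustrative only: a tiny stand-in for the iso639-based conversion.
# The alias table and function name are assumptions, not MyLittleOCR's API.
LANGUAGE_ALIASES = {
    "eng": ("english", "eng", "en"),
    "chi_sim": ("chinese", "chi_sim", "ch_sim", "zh"),
    "kor": ("korean", "kor", "ko"),
}

# Invert the table once so each lookup is a single dict access.
_ALIAS_TO_CODE = {
    alias: code
    for code, aliases in LANGUAGE_ALIASES.items()
    for alias in aliases
}


def to_tesseract_code(lang: str) -> str:
    """Map a language name or code to a Tesseract-style code."""
    key = lang.strip().lower()
    try:
        return _ALIAS_TO_CODE[key]
    except KeyError:
        raise ValueError(f"Unsupported language: {lang!r}") from None


codes = [to_tesseract_code(name) for name in ("English", "eng", "en", "zh")]
print(codes)
```

Treating "Chinese" and "zh" as Simplified Chinese is a choice made for this sketch only; the real conversion may behave differently.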

EasyOCR

License: Apache 2.0
Project URL: EasyOCR

Install EasyOCR using:

pip install easyocr

Usage with Optional Parameters

The EasyOCREngine class accepts the following optional parameters:

default_langs: A list of language codes or names. Default is ["ch_sim", "en"].
Additional parameters supported by EasyOCR's Reader class (see EasyOCR Documentation).

WeChat OCR

License: Unknown. This engine relies on components from WeChat, which is closed source. Use it with caution and do not use it for commercial purposes. It is supported only on Windows.
Project URL: WeChat OCR

Install WeChat OCR using:

pip install wechat_ocr

Usage

The WechatOCREngine class takes no optional parameters at instantiation.

Surya

License: GPL 3.0
Project URL: Surya

Install Surya using:

pip install surya-ocr

Usage with Optional Parameters

The SuryaEngine class can be instantiated with the following optional parameters:

default_langs: A list of language codes or names. Default is ["en", "zh", "_math"].
Additional parameters can be passed via **kwargs.

RapidOCR

License: Apache 2.0
Project URL: RapidOCR

Install RapidOCR using:

pip install rapidocr_onnxruntime

Usage with Optional Parameters

The RapidOCREngine class accepts the following optional parameters:

det_model: Detection model path or name. Default is "ch_PP-OCRv4_det_infer.onnx".
rec_model: Recognition model path or name. Default is "ch_PP-OCRv4_rec_infer.onnx".
Additional parameters supported by RapidOCR (see RapidOCR API Documentation).

Note: The models will be automatically downloaded if not present. You can specify custom model paths as needed.

Quick Start

Here's an example of how to use the MyLittleOCR API to extract text from an image:

from ocr_engines import get_engine_instance

# Get an instance of the desired OCR engine (e.g., 'tesseract', 'easyocr', 'paddleocr', 'wechat_ocr', 'surya', 'rapidocr')
ocr_engine = get_engine_instance('tesseract')

# Extract text from an image
ocr_result = ocr_engine.ocr('/path/to/image.jpg')

# Convert the OCR result to a list and print it
print("OCR Result:", ocr_result.to_list())

Ways to Interact with the API

There are two main ways to interact with the API in MyLittleOCR:

1. Engine Management-Based API Interaction

Get All Engines: Use get_all_engines() to retrieve all registered OCR engines as a mapping of engine names to engine classes.

from ocr_engines import get_all_engines

engines = get_all_engines()
for engine_name, engine_class in engines.items():
    print(f"Engine Name: {engine_name}")
    engine_instance = engine_class()
    result = engine_instance.ocr('/path/to/image.jpg')
    print(result.to_list())
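
When looping over every registered engine, some backends may not be installed on the current machine. The sketch below shows one way to skip engines that fail to instantiate. It uses two stand-in classes so that it runs without any OCR backend; whether the real engines raise at instantiation is an assumption, and `run_available_engines` is a hypothetical helper.

```python
from typing import Any, Dict, Tuple


class WorkingEngine:
    """Stand-in for an engine whose backend is installed."""

    def ocr(self, img: str) -> list:
        return ["hello"]


class MissingBackendEngine:
    """Stand-in for an engine whose backend package is absent."""

    def __init__(self) -> None:
        raise ImportError("backend package is not installed")


def run_available_engines(
    engines: Dict[str, type], image_path: str
) -> Tuple[Dict[str, Any], Dict[str, str]]:
    """Run every engine that can be created; record the rest as skipped."""
    results: Dict[str, Any] = {}
    skipped: Dict[str, str] = {}
    for name, engine_class in engines.items():
        try:
            engine = engine_class()
        except Exception as exc:  # broad on purpose: any setup failure skips
            skipped[name] = str(exc)
            continue
        results[name] = engine.ocr(image_path)
    return results, skipped


results, skipped = run_available_engines(
    {"working": WorkingEngine, "missing": MissingBackendEngine},
    "/path/to/image.jpg",
)
print(results)
print(skipped)
```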

Get Engine Instance: Use get_engine_instance(engine_name, **kwargs) to get an instance of a specific OCR engine with optional parameters.
```
from ocr_engines import get_engine_instance

engine_instance = get_engine_instance('easyocr')
```

Get Engine Class: Use get_engine_class(engine_name) to get the class of a specific OCR engine.

from ocr_engines import get_engine_class

EasyOCREngine = get_engine_class('easyocr')
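
The three functions above fit a common name-to-class registry pattern. The following is a minimal sketch of that pattern, not MyLittleOCR's actual implementation: the `register_engine` decorator and `DummyEngine` class are hypothetical, and the function bodies are guesses at the behavior described in this section.

```python
from typing import Any, Callable, Dict

# Module-level registry: engine name -> engine class.
_ENGINES: Dict[str, type] = {}


def register_engine(name: str) -> Callable[[type], type]:
    """Class decorator that records an engine class under a name."""

    def decorator(cls: type) -> type:
        _ENGINES[name] = cls
        return cls

    return decorator


def get_all_engines() -> Dict[str, type]:
    """Return a copy so callers cannot mutate the registry."""
    return dict(_ENGINES)


def get_engine_class(name: str) -> type:
    try:
        return _ENGINES[name]
    except KeyError:
        raise ValueError(f"Unknown OCR engine: {name!r}") from None


def get_engine_instance(name: str, **kwargs: Any) -> Any:
    """Look up the class and instantiate it with optional parameters."""
    return get_engine_class(name)(**kwargs)


@register_engine("dummy")
class DummyEngine:
    def __init__(self, default_langs=None):
        self.default_langs = default_langs or ["en"]

    def ocr(self, img: str) -> str:
        return f"text from {img}"


engine = get_engine_instance("dummy", default_langs=["en", "ko"])
print(type(engine).__name__, engine.default_langs)
```

With this shape, switching backends only changes the name passed to `get_engine_instance`; the calling code stays the same.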

2. Direct Import from Specific Library

Directly import the engine class from the specific OCR engine module, then instantiate and use it.

from ocr_engines.easyocr_engine import EasyOCREngine

engine_instance = EasyOCREngine(
    default_langs=['English', 'Korean'],
    gpu=False
)
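
In the call above, default_langs belongs to the wrapper while gpu is an option for EasyOCR's Reader. One plausible way for a wrapper to separate the two is sketched below; `split_engine_options` is a hypothetical helper, and the real EasyOCREngine may handle its arguments differently.

```python
from typing import Any, Dict, List, Optional, Tuple

# Default language list for EasyOCREngine as stated in this README.
EASYOCR_DEFAULT_LANGS = ["ch_sim", "en"]


def split_engine_options(
    default_langs: Optional[List[str]] = None, **reader_kwargs: Any
) -> Tuple[List[str], Dict[str, Any]]:
    """Separate the wrapper's language list from pass-through options."""
    if default_langs is None:
        langs = list(EASYOCR_DEFAULT_LANGS)
    else:
        langs = list(default_langs)
    # Everything else is left untouched for the underlying reader.
    return langs, reader_kwargs


langs, reader_kwargs = split_engine_options(
    default_langs=["English", "Korean"], gpu=False
)
print(langs, reader_kwargs)
```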

Working with OCR Results

The OCRResult class represents OCR results and provides methods to process and filter them.

Initialization

Create an OCRResult instance by providing a list of OCRItem instances.

from my_little_ocr.base_engine.base_ocr_engine import OCRResult, OCRItem

ocr_items = [
    OCRItem(
        text="example",
        box=[[0, 0], [1, 0], [1, 1], [0, 1]],
        confidence=0.9
    )
]
ocr_result = OCRResult(ocr_items=ocr_items)
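
The box in the example above is a list of four corner points. If an axis-aligned rectangle is more convenient, for instance to crop a region, a small helper like the one below can derive it. This sketch assumes each point is an [x, y] pair; `bounding_rect` is a hypothetical helper, not part of MyLittleOCR.

```python
from typing import Sequence, Tuple


def bounding_rect(
    box: Sequence[Sequence[float]],
) -> Tuple[float, float, float, float]:
    """Return (x_min, y_min, x_max, y_max) for a list of [x, y] corners."""
    xs = [point[0] for point in box]
    ys = [point[1] for point in box]
    return min(xs), min(ys), max(xs), max(ys)


# A slightly skewed quadrilateral, as OCR engines often produce.
box = [[10, 20], [110, 25], [108, 60], [8, 55]]
rect = bounding_rect(box)
print(rect)
```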

Filtering Results

Use filter_by_confidence(confidence_threshold) to filter OCR results based on confidence scores.

filtered_result = ocr_result.filter_by_confidence(0.8)

Converting Results to List

Use to_list(text_only=False) to convert OCR results to a list. Set text_only=True to retrieve only the text content.

text_list = ocr_result.to_list(text_only=True)
full_list = ocr_result.to_list(text_only=False)

These methods provide flexible ways to manage and utilize OCR results.
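
To make the behavior of these two methods concrete, here is a self-contained sketch built on plain dataclasses. `SketchItem` and `SketchResult` are stand-ins, not the real OCRItem and OCRResult classes. Two details are assumptions: items whose confidence equals the threshold are kept, and the full list is returned as (text, box, confidence) tuples.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class SketchItem:
    text: str
    box: List[List[float]]
    confidence: float


@dataclass
class SketchResult:
    ocr_items: List[SketchItem] = field(default_factory=list)

    def filter_by_confidence(self, threshold: float) -> "SketchResult":
        # Assumption: items exactly at the threshold are kept.
        kept = [item for item in self.ocr_items if item.confidence >= threshold]
        return SketchResult(ocr_items=kept)

    def to_list(self, text_only: bool = False) -> list:
        if text_only:
            return [item.text for item in self.ocr_items]
        # Assumption: the full form is a (text, box, confidence) tuple.
        return [(item.text, item.box, item.confidence) for item in self.ocr_items]


unit_box = [[0, 0], [1, 0], [1, 1], [0, 1]]
result = SketchResult(
    ocr_items=[
        SketchItem("clear", unit_box, 0.95),
        SketchItem("blurry", unit_box, 0.40),
    ]
)
confident = result.filter_by_confidence(0.8)
print(confident.to_list(text_only=True))
```

Because filtering returns a new result object, the calls can be chained, as in `result.filter_by_confidence(0.8).to_list(text_only=True)`.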

BaseOCREngine Class

Below is the abstract base class for all OCR engines:

from abc import ABC, abstractmethod
from typing import Union
from PIL import Image
import numpy as np

ImageLike = Union[str, bytes, np.ndarray, Image.Image]

class BaseOCREngine(ABC):
    """
    Abstract base class for OCR engines.
    """

    ocr_engine_name: str = "Base OCR Engine"

    @abstractmethod
    def ocr(self, img: ImageLike) -> OCRResult:
        """
        Performs OCR on the given image.

        Args:
            img (ImageLike): The image to perform OCR on.

        Returns:
            OCRResult: The OCR result.
        """
        pass

The input img supports multiple formats:

File path as a string
Bytes
NumPy array (np.ndarray)
PIL Image (Image.Image)
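
Adding a new engine amounts to subclassing the base class and implementing ocr(). The sketch below shows the idea with a local stand-in base class so that it runs without MyLittleOCR, Pillow, or NumPy. `SketchBaseEngine` and `FixedTextEngine` are hypothetical, only the string and bytes inputs are handled, and a plain list of strings stands in for OCRResult.

```python
from abc import ABC, abstractmethod
from typing import List, Union


class SketchBaseEngine(ABC):
    """Local stand-in that mirrors the shape of the base class above."""

    ocr_engine_name: str = "Base OCR Engine"

    @abstractmethod
    def ocr(self, img: Union[str, bytes]) -> List[str]:
        """Perform OCR on the given image."""


class FixedTextEngine(SketchBaseEngine):
    """Toy engine that returns a fixed string instead of running OCR."""

    ocr_engine_name = "Fixed Text Engine"

    def __init__(self, text: str = "hello") -> None:
        self.text = text

    def ocr(self, img: Union[str, bytes]) -> List[str]:
        # Dispatch on the input type, as a real engine would before
        # loading the image from a path or decoding it from memory.
        if isinstance(img, str):
            source = "path"
        elif isinstance(img, (bytes, bytearray)):
            source = "bytes"
        else:
            raise TypeError(f"Unsupported image type: {type(img).__name__}")
        return [f"{self.text} ({source})"]


engine = FixedTextEngine()
print(engine.ocr("/path/to/image.jpg"))
print(engine.ocr(b"\x89PNG"))
```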

Contributing

Contributions are welcome! If you'd like to add support for more OCR libraries or suggest improvements, feel free to open an issue or submit a pull request.

License

This project is licensed under the MIT License.

Note: Individual OCR engines may have different licenses. Please refer to their respective project pages for more details.

Acknowledgments

We would like to thank the following libraries that make this project possible:

Pydantic
GitPython
iso639-lang

And all the OCR libraries mentioned above.

Credits

Thanks to all the contributors and maintainers of the OCR libraries that were used in this project.

Version 0.1.0, released Oct 4, 2024
