No project description provided

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: Apache Software License
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Project description

MyOCR - Advanced OCR Pipeline Builder

MyOCR is a highly extensible and customizable framework for building OCR systems. Engineers can easily train, integrate deep learning models into custom OCR pipelines for real-world applications.

🌟 Key Features:

⚡️ End-to-End OCR Development Framework – Designed for developers to build and integrate detection, recognition, and custom OCR models in a unified and flexible pipeline.

🛠️ Modular & Extensible – Mix and match components - swap models, predictors, or input output processors with minimal changes.

🔌 Developer-Friendly by Design - Clean Python APIs, prebuilt pipelines and processors, and straightforward customization for training and inference.

🚀 Production-Ready Performance – ONNX runtime support for fast CPU/GPU inference, support various ways of deployment.

📣 Updates

🔥2025.05.03 inner refactoring & update docs
2025.04.28 release MyOCR alpha version:
- Release image detection, class, recognition models
- All components can work together

🛠️ Installation

📦 Requirements

Python 3.11+
Optional: CUDA 12.6+ (Recommended for GPU acceleration, but CPU mode is also supported)

📥 Install Dependencies

# Clone the code from GitHub
git clone https://github.com/robbyzhaox/myocr.git
cd myocr

# create venv
uv venv

# Install dependencies
pip install -e .

# Development environment installation
pip install -e ".[dev]"

# Download pre-trained model weights
mkdir -p ~/.MyOCR/models/
Download weights from: https://drive.google.com/drive/folders/1RXppgx4XA_pBX9Ll4HFgWyhECh5JtHnY
# Alternative download link: https://pan.baidu.com/s/122p9zqepWfbEmZPKqkzGBA?pwd=yq6j

🚀 Quick Start

🖥️ Local Inference

Basic OCR Recognition

from myocr.pipelines import CommonOCRPipeline

# Initialize common OCR pipeline (using GPU)
pipeline = CommonOCRPipeline("cuda:0")  # Use "cpu" for CPU mode

# Perform OCR recognition on an image
result = pipeline("path/to/your/image.jpg")
print(result)

Structured OCR Output (Example: Invoice Information Extraction)

config chat_bot in myocr.pipelines.config.structured_output_pipeline.yaml

chat_bot:
  model: qwen2.5:14b
  base_url: http://127.0.0.1:11434/v1
  api_key: 'key'

from pydantic import BaseModel, Field
from myocr.pipelines import StructuredOutputOCRPipeline

# Define output data model, refer to:
from myocr.pipelines.response_format import InvoiceModel

# Initialize structured OCR pipeline
pipeline = StructuredOutputOCRPipeline("cuda:0", InvoiceModel)

# Process image and get structured data
result = pipeline("path/to/invoice.jpg")
print(result.to_dict())

🐳 Docker Deployment

The framework provides support for Docker deployment, which can be built and run using the following commands:

Run the Docker Container

docker run -d -p 8000:8000 robbyzhaox/myocr:cpu-0.1.0

Accessing API Endpoints (Docker)

IMAGE_PATH="your_image.jpg"

BASE64_IMAGE=$(base64 -w 0 "$IMAGE_PATH")  # Linux
#BASE64_IMAGE=$(base64 -i "$IMAGE_PATH" | tr -d '\n') # macOS

curl -X POST \
  -H "Content-Type: application/json" \
  -d "{\"image\": \"${BASE64_IMAGE}\"}" \
  http://localhost:8000/ocr

🔗 Using Rest API

The framework provides a simple Flask API service that can be called via HTTP interface:

# Start the service default port: 5000
python main.py

API endpoints:

GET /ping: Check if the service is running properly
POST /ocr: Basic OCR recognition
POST /ocr-json: Structured OCR output

We also have a UI for these endpoints, please refer to doc-insight-ui

Star History

🎖 Contribution Guidelines

We welcome any form of contribution, including but not limited to:

Submitting bug reports
Adding new features
Improving documentation
Optimizing performance

📄 License

This project is open-sourced under the Apache 2.0 License, see the LICENSE file for details.

Project details

These details have not been verified by PyPI

Project links

Development Status
- 3 - Alpha
Intended Audience
- Science/Research
License
- OSI Approved :: Apache Software License
Programming Language
- Python :: 3
Topic
- Scientific/Engineering :: Artificial Intelligence

Release history Release notifications | RSS feed

0.1.1

May 17, 2025

0.1.0

May 14, 2025

0.1.0b0 pre-release

May 12, 2025

0.1.0a4 pre-release

May 8, 2025

0.1.0a3 pre-release

May 7, 2025

0.1.0a2 pre-release

May 7, 2025

0.1.0a1 pre-release

May 6, 2025

This version

0.1.0a0 pre-release

May 4, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

myocr_kit-0.1.0a0.tar.gz (62.4 kB view details)

Uploaded May 4, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

myocr_kit-0.1.0a0-py3-none-any.whl (67.2 kB view details)

Uploaded May 4, 2025 Python 3

File details

Details for the file myocr_kit-0.1.0a0.tar.gz.

File metadata

Download URL: myocr_kit-0.1.0a0.tar.gz
Upload date: May 4, 2025
Size: 62.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for myocr_kit-0.1.0a0.tar.gz
Algorithm	Hash digest
SHA256	`db305e9184fb35bc2f73e190da7e9638fb9753ffe340b170cb1f86706d2a570d`
MD5	`869a9ef167a45a29bf0358227e393f92`
BLAKE2b-256	`cc5d6026945549cce37844627480023fcb093a2e88007023b2462d91e9088d9f`

See more details on using hashes here.

File details

Details for the file myocr_kit-0.1.0a0-py3-none-any.whl.

File metadata

Download URL: myocr_kit-0.1.0a0-py3-none-any.whl
Upload date: May 4, 2025
Size: 67.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.11.12

File hashes

Hashes for myocr_kit-0.1.0a0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2274628c7802561c575f53e31a8f6047e58c06f9bc3aff014a887aaee34b6511`
MD5	`5389701d353e4cc05d63b77a4dcb6a02`
BLAKE2b-256	`0a86cc809920957e1416950b04db4046f21602b8b0f63ced6e0beac03b8da1e5`

See more details on using hashes here.

myocr-kit 0.1.0a0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

MyOCR - Advanced OCR Pipeline Builder

🌟 Key Features:

📣 Updates

🛠️ Installation

📦 Requirements

📥 Install Dependencies

🚀 Quick Start

🖥️ Local Inference

Basic OCR Recognition

Structured OCR Output (Example: Invoice Information Extraction)

🐳 Docker Deployment

Run the Docker Container

Accessing API Endpoints (Docker)

🔗 Using Rest API

Star History

🎖 Contribution Guidelines

📄 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes