Skip to main content

Python Tamil OCR package

Project description

OCR Tamil - Easy, Accurate and Simple to use Tamil OCR

LICENSE HuggingSpace

OCR Tamil can help you extract text from signboard, nameplates, storefronts etc., from Natural Scenes with high accuracy. This version of OCR is much more robust to tilted text compared to the Tesseract, Paddle OCR and Easy OCR as they are primarily built to work on the documents texts and not on natural scenes. This model is work in progress, feel free to contribute!!!

Currently supports two languages (English + Tamil). Accuracy of the model can be improved by adjusting the Text detection model as per your requirements. Achieved the accuracy of around >95% (98% NED) in validation set

Comparison between Tesseract OCR and OCR Tamil

Input Image OCR TAMIL Tesseract
teaser வாழ்கவளமுடன் க்‌ க்கஸாரகளள௮ஊகஎளமுடன்‌
teaser ரெடிமேட்ஸ் NO OUTPUT
teaser கோபி NO OUTPUT
teaser தாம்பரம் NO OUTPUT
teaser நெடுஞ்சாலைத் NO OUTPUT
teaser அண்ணாசாலை NO OUTPUT

Obtained Tesseract results using the huggingface space with Tamil as language

How to Install and Use OCR Tamil

Tested using Python 3.10 on Windows & Linux (Ubuntu 22.04) Machines

Pip

  1. Using PIP install pip install ocr_tamil
  2. Use the below code for text recognition at word level by inserting the image_path

Text Recognition

from ocr_tamil.ocr import OCR
image_path = r"test_images\1.jpg" # insert your own path here (step 2 file location)
ocr = OCR()
texts = ocr.predict(image_path)
with open("output.txt","w",encoding="utf-8") as f:
    f.write(texts)
>>>> நெடுஞ்சாலைத்
teaser

Text Detect + Recognition

  1. Use the below code for text detection and recognition by inserting the image_path
from ocr_tamil.ocr import OCR
image_path = r"test_images\0.jpg" # insert your own path here
ocr = OCR(detect=True)
texts = ocr.predict(image_path)
with open("output.txt","w",encoding="utf-8") as f:
    f.write(texts)

>>>> கொடைக்கானல் Kodaikanal 
teaser

Github

  1. Clone the repository
  2. Pip install the required modules using pip install -r requirements.txt
  3. Run the below code by providing the image path

Text Recognition

from ocr_tamil.ocr import OCR

image_path = r"test_images\1.jpg" # insert your own path here
ocr = OCR()
texts = ocr.predict(image_path)
with open("output.txt","w",encoding="utf-8") as f:
    f.write(texts)

>>>> நெடுஞ்சாலைத்

Text Detect + Recognition

from ocr_tamil.ocr import OCR

image_path = r"test_images\0.jpg" # insert your own path here
ocr = OCR(detect=True)
texts = ocr.predict(image_path)
with open("output.txt","w",encoding="utf-8") as f:
    f.write(texts)

>>>> கொடைக்கானல் Kodaikanal 

Detailed Medium tutorial can be found here.

Huggingface spaces🤗 demo can be found here

Applications

  1. Navigating autonomous vehicles based on the signboards
  2. License plate recognition

Limitations

  1. Unable to read the text if they are present in rotated forms

teaser teaser

  1. Currently supports Only English and Tamil Language

  2. Document Text reading capability is limited. Auto identification of Paragraph, line are not supported along with Text detection model inability to detect and crop the Tamil text leads to accuracy decrease (WORKAROUND Can use your own text detection model along with OCR tamil text recognition model)

teaser

Cropped Text from Text detection Model

teaser

Character **இ** missing due to text detection model error

**?**யற்கை மூலிகைகளில் இருந்து ஈர்த்தெடுக்கக்கப்பட்ட விரிய உட்பொருட்களை உள்ளடக்கி எந்த இரசாயன சேர்க்கைகளும் **?**ல்லாமல் உருவாக்கப்பட்ட **?**ந்தியாவின் முதல் சித்த தயாரிப்பு

Thanks to the below contibuters for making awesome Text detection and text recognition models

Text detection - CRAFT TEXT DECTECTION

Text recognition - PARSEQ

@InProceedings{bautista2022parseq,
  title={Scene Text Recognition with Permuted Autoregressive Sequence Models},
  author={Bautista, Darwin and Atienza, Rowel},
  booktitle={European Conference on Computer Vision},
  pages={178--196},
  month={10},
  year={2022},
  publisher={Springer Nature Switzerland},
  address={Cham},
  doi={10.1007/978-3-031-19815-1_11},
  url={https://doi.org/10.1007/978-3-031-19815-1_11}
}
@inproceedings{baek2019character,
  title={Character Region Awareness for Text Detection},
  author={Baek, Youngmin and Lee, Bado and Han, Dongyoon and Yun, Sangdoo and Lee, Hwalsuk},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  pages={9365--9374},
  year={2019}
}

CITATION

@InProceedings{GnanaPrasath,
  title={Tamil OCR},
  author={Gnana Prasath D},
  month={01},
  year={2024},
  url={https://github.com/gnana70/tamil_ocr}
}

logo

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ocr_tamil-0.1.5.tar.gz (43.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ocr_tamil-0.1.5-py3-none-any.whl (56.0 kB view details)

Uploaded Python 3

File details

Details for the file ocr_tamil-0.1.5.tar.gz.

File metadata

  • Download URL: ocr_tamil-0.1.5.tar.gz
  • Upload date:
  • Size: 43.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for ocr_tamil-0.1.5.tar.gz
Algorithm Hash digest
SHA256 95e2cf2ad2c187835ac1e45803b8b41f2c160c3ab03927bf2dbf3a97d316cc45
MD5 b33d7b01f7dc546290a0af50276b094c
BLAKE2b-256 e32bf124117763f862d08dea27815b6d3f9f4c773d7f0405694e02eaf5b0c380

See more details on using hashes here.

File details

Details for the file ocr_tamil-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: ocr_tamil-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 56.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for ocr_tamil-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 c4e88294264e47911ca0ab534ae5fff8e82583570652641f717db895bbd5756b
MD5 79f4202d9dfc7cec7805ab81b83b25de
BLAKE2b-256 9c563d2b5b6a44bfbc356ffce3b417209f94974cb7e0c224c7ca098ba5df2e0d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page