Skip to main content

A Kannada OCR

Project description

AksharaJaana

AksharaJaana is a package which uses tesseract ocr in the backend to convert the read-only kannada text to editable format. A Special feature of this is it can separate columns in the page and thus making it easier to read and edit. Do consider using this package if necessary and feel free to mail me for any clarifications.

Happy coding and installing.

To see the python package visit https://pypi.org/project/AksharaJaana/

The Requirements

Conda environment is preferred for the smooth use

  • AksharaJaana (pip package), check out the latest version available
  • Tesseract
  • poppler

Details for Installation

Ubuntu

Open terminal and execute below commands.

  1. Installing tesseract-ocr in the system

    sudo apt-get update -y 
    sudo apt-get install -y tesseract-ocr 
    
  2. Installing poppler in the system

    sudo apt-get install -y poppler-utils 
    
  3. Installing python and pip (if pip is not installed)

    sudo apt install python==3.6.9
    
  4. Installing packages for AksharaJaana

    pip install AksharaJaana
    

Windows

  1. Installing tesseract-ocr in the system

    • Download tesseract
      • go to the website
      • click on tesseract-ocr-w64-setup-v5.0.0-alpha.20200328.exe (64 bit).
    • Install tesseract for Kannada Language and Script
      • open the downloaded file and click next and accept the agreement.
      • Next you will give an option to choose the languages.
      • Choose kannada in both script and language
    • Add tesseract to Path
      • Check if this folder C:\Program Files\Tesseract-OCR\ is present. If yes, follow below procedure
      • Add C:\Program Files\Tesseract-OCR\ to your system PATH by doing the following
        1. Click on the Windows start button, search for Edit the system environment variables, click on Environment Variables
        2. Under System variables, look for and double-click on PATH, click on New.
        3. then add C:\Program Files\Tesseract-OCR\, click OK.
      • if folder is not present, manually add the folder tesseract-ocr to the Program Files in the C drive which must be present at the download section (after extraction) and follow the same procedure
      • See complete docs.
  2. Installing poppler in the system

    • Download Poppler
      • go to this page
      • click on poppler-0.54_x86
    • Unzip the file and copy files to C:\Users\Program Files\poppler-0.68.0_x86
    • Add poppler to path
      • Add C:\Program Files\poppler-0.68.0_x86\bin to your system PATH by doing the following:
        1. Click on the Windows start button, search for Edit the system environment variables, click on Environment Variables
        2. under System variables, look for and double-click on PATH, click on New
        3. then add C:\Users\Program Files\poppler-0.68.0_x86\bin, click OK.
  3. Installing python and pip in the system (If pip is not installed)

  4. Installing packages for AksharaJaana

    • open command prompt

      pip install AksharaJaana
      
  5. Reboot the system before starting to use

Python Script

import AksharaJaana.main as ak 
text = ak.ocr_engine("Your file Path") 
print(text) 

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

AksharaJaana-1.0.0.1.tar.gz (6.2 kB view details)

Uploaded Source

Built Distribution

AksharaJaana-1.0.0.1-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file AksharaJaana-1.0.0.1.tar.gz.

File metadata

  • Download URL: AksharaJaana-1.0.0.1.tar.gz
  • Upload date:
  • Size: 6.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.41.0 importlib-metadata/1.6.0 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.9

File hashes

Hashes for AksharaJaana-1.0.0.1.tar.gz
Algorithm Hash digest
SHA256 3f99e946373d7cdc76ac48ecf1c7e0f2a83f735742a6ea4022ca62d7aa71ba72
MD5 a1e07e513091e6940046ac8674d714e4
BLAKE2b-256 3ce89750fcb4ae3f875564b5f25cd5aaecfe918b24167c85c72ac4a152c86e57

See more details on using hashes here.

File details

Details for the file AksharaJaana-1.0.0.1-py3-none-any.whl.

File metadata

  • Download URL: AksharaJaana-1.0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.3 readme-renderer/34.0 requests/2.27.1 requests-toolbelt/0.9.1 urllib3/1.26.9 tqdm/4.41.0 importlib-metadata/1.6.0 keyring/23.4.1 rfc3986/1.5.0 colorama/0.4.4 CPython/3.6.9

File hashes

Hashes for AksharaJaana-1.0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1f408fee5f00290c59c4c3a8f0487ff88c24637389eac48f8c594cfe1546219f
MD5 3699080417c3d9e0f2df99feeb8f6314
BLAKE2b-256 e6094182b733cd7bdcfa6e0880ea4628b4a9b5939870b3a7f0559d3c49743180

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page