Skip to main content

Kannada OCR with column Separation

Project description

AksharaJaana

AksharaJaana is the package which uses tesseract ocr in backend to convert the kannada text to editable format.You can use following sample code in ubuntu.The Special feature of this is it can separate columns in page

The Requirements

OpenCV >= 3.2

pip install opencv-python

Numpy

pip install numpy

pdf2image

pip install pdf2image

pip install poppler

Sample Code

Installing the AksharaJaana

pip install AksharaJaana

Python Script

Its in test.py

import AksharaJaana.main as ak

text = ak.ocr_engine('/home/navaneeth/Desktop/NandD/OCR_kannada/CamScanner 06-28-2020 12.12.10.pdf')

from AksharaJaana.utils import utils

u = utils()

u.write_as_RTF(text, saving_path='/home/navaneeth/Desktop/1.rtf')

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

AksharaJaana-0.1.2.2.tar.gz (4.5 kB view details)

Uploaded Source

Built Distribution

AksharaJaana-0.1.2.2-py3-none-any.whl (5.3 kB view details)

Uploaded Python 3

File details

Details for the file AksharaJaana-0.1.2.2.tar.gz.

File metadata

  • Download URL: AksharaJaana-0.1.2.2.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191203 requests-toolbelt/0.9.1 tqdm/4.41.0 CPython/3.6.9

File hashes

Hashes for AksharaJaana-0.1.2.2.tar.gz
Algorithm Hash digest
SHA256 7a6ba15b6d884970e2acc6a5364ce154b4078700fc24ae153328ac58f5153fee
MD5 f171c937455ae249a1ca446857cda84d
BLAKE2b-256 0eb74f0998b6d582fae75480c5004a0e04d883417b21a22c2bcc5aace8c53e2e

See more details on using hashes here.

File details

Details for the file AksharaJaana-0.1.2.2-py3-none-any.whl.

File metadata

  • Download URL: AksharaJaana-0.1.2.2-py3-none-any.whl
  • Upload date:
  • Size: 5.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191203 requests-toolbelt/0.9.1 tqdm/4.41.0 CPython/3.6.9

File hashes

Hashes for AksharaJaana-0.1.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 b5ba6810f79dbb33b8ee04d168fc56e536ca83c56f5606fa2458e6a944179019
MD5 50d0dee8292a61d3fdc7589956b48370
BLAKE2b-256 9d9e74f0ed2664609b13aee7de235b5ac65ffd81c1b4dfd5f5bad217d23c3834

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page