Extract information from PAN and Aadhaar card images

Project description

Pan Aadhar OCR

Extract Text from PAN and Aadhaar Cards

Pan Aadhar OCR is a Python package that takes an image of a valid PAN or Aadhaar document, extracts the text from it, and returns the information in JSON format.

  • Easy to use
  • Returns information in JSON
  • Runs faster on a GPU
  • Also runs on a CPU if no GPU is available, only more slowly
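
To see which of the last two points applies on a given machine, a check along these lines may help. It is only a sketch: it assumes the GPU path goes through PyTorch (which EasyOCR is built on), and `gpu_available` is a hypothetical helper, not part of this package. Whether the package exposes its own GPU switch is not covered here.

```python
import importlib.util


def gpu_available() -> bool:
    """Return True if PyTorch is installed and reports a usable CUDA device."""
    # Assumption: GPU acceleration comes from PyTorch, which EasyOCR depends on.
    # If torch is not installed there is nothing to query, so report False.
    if importlib.util.find_spec("torch") is None:
        return False
    import torch

    return bool(torch.cuda.is_available())


print("GPU available:", gpu_available())
```

If this prints `False`, extraction should still work, just on the slower CPU path.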

Tech

Pan Aadhar OCR uses a number of open source projects to work properly:

  • EasyOCR - Ready-to-use OCR with 80+ supported languages and all popular writing scripts, including Latin, Chinese, Arabic, Devanagari, Cyrillic, and more.
  • Python - Python is a programming language that lets you work quickly and integrate systems more effectively.
  • OpenCV - An open source computer vision library that is free for commercial use. Versions up to 4.4.0 are released under the BSD 3-Clause License; 4.5.0 and later use the Apache 2.0 License.

Installation

This library requires Python 3.6+. You also need to install Tesseract on your system. On a Debian- or Ubuntu-based Linux system, run:

sudo apt install tesseract-ocr

On Windows, download the Tesseract installer, run it, and add the install directory to your PATH.
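
Before installing the package, it can be worth confirming that the `tesseract` executable is actually reachable. The snippet below is a minimal sketch using only the standard library; `find_tesseract` is a hypothetical helper name, not something this package provides.

```python
import shutil
from typing import Optional


def find_tesseract() -> Optional[str]:
    """Return the full path of the tesseract executable, or None if it is not on PATH."""
    return shutil.which("tesseract")


path = find_tesseract()
if path is None:
    print("tesseract not found on PATH; install it or fix PATH first")
else:
    print("tesseract found at", path)
```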

Install the package.

pip install pan-aadhar-ocr

Then import the package.

from pan_aadhar_ocr import Pan_Info_Extractor

Create an instance of the extractor.

extractor = Pan_Info_Extractor()

Pass the image to the extractor to get the results.

extractor.info_extractor('/content/pan test.jpeg')

This returns a result like the following:

{
    "Pan_number": "EKAPS0276J", 
    "Name": "John Kevin Doe",
    "Father_Name": "Kevin Doe",
    "DOB": "31/10/1992"
} 
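
OCR output can contain misread characters, so it may be useful to sanity-check a result before relying on it. The sketch below is not part of the package: `check_pan_result` is a hypothetical helper, and it assumes the result arrives either as a JSON string or as a dict with the keys shown above. It checks that the PAN matches the standard pattern of five letters, four digits, and one letter, and that the date of birth parses as DD/MM/YYYY.

```python
import json
import re
from datetime import datetime

# PAN format: five uppercase letters, four digits, one uppercase letter.
PAN_PATTERN = re.compile(r"[A-Z]{5}[0-9]{4}[A-Z]")


def check_pan_result(result):
    """Return a list of problems found in an extractor result (empty if none)."""
    # Assumption: the result may be a JSON string or an already-parsed dict.
    data = json.loads(result) if isinstance(result, str) else dict(result)
    problems = []
    if not PAN_PATTERN.fullmatch(str(data.get("Pan_number") or "")):
        problems.append("Pan_number does not match the PAN format")
    try:
        datetime.strptime(str(data.get("DOB") or ""), "%d/%m/%Y")
    except ValueError:
        problems.append("DOB is not a valid DD/MM/YYYY date")
    return problems


sample = (
    '{"Pan_number": "EKAPS0276J", "Name": "John Kevin Doe", '
    '"Father_Name": "Kevin Doe", "DOB": "31/10/1992"}'
)
print(check_pan_result(sample))  # prints [] for the sample above
```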

Download files

Source Distribution

pan_aadhar_ocr-1.0.4.tar.gz (4.5 kB)

File details

Details for the file pan_aadhar_ocr-1.0.4.tar.gz.

File metadata

  • Download URL: pan_aadhar_ocr-1.0.4.tar.gz
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.7.3 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.59.0 CPython/3.8.8

File hashes

Hashes for pan_aadhar_ocr-1.0.4.tar.gz
Algorithm     Hash digest
SHA256        1603bff4f1b7791e496df1f216b3729da30f8ee58fd67f710375541156e9ad00
MD5           4f5a499b5da9e62e7a1d500adf9ddf90
BLAKE2b-256   c3625a621fb66edd2188cb18273b86f378e48148b791a4870ac21a3984436563
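
A downloaded archive can be compared against the SHA256 digest listed above. The following is a minimal sketch using the standard library; `sha256_of` and `matches_expected` are hypothetical helper names, and the archive is assumed to be saved locally under its original file name.

```python
import hashlib

# SHA256 digest listed above for pan_aadhar_ocr-1.0.4.tar.gz.
EXPECTED_SHA256 = "1603bff4f1b7791e496df1f216b3729da30f8ee58fd67f710375541156e9ad00"


def sha256_of(path: str) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True if the file's SHA256 digest equals the expected digest."""
    return sha256_of(path) == expected.lower()


# Example call, assuming the archive is in the current directory:
# matches_expected("pan_aadhar_ocr-1.0.4.tar.gz")
```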
