Skip to main content

Detects and extracts the features in a aadhaar card

Project description

Use to extract details from a image of a aadhaar card

Trained using YOLOv8

Dataset

Check out the Github Repo for more info

Note : Requires external installation of tesseract , and setting the environment variable of the same

How to use :

from Aadhaar_extractor.Extractor import AadhaarExtractor

#create a object of the AadhaarExtractor class by passing the image of the aadhaar card as argument


obj = AadhaarExtractor("aadhaar.jpg")

#use obj.extract() for extracting the data 

extractedData = obj.extract()

The extract() method will return , a list of all the fields detected .

This module currently detects five fields:

  1. Aadhaar Number (aadhaar_no)
  2. Date of Birth (dob)
  3. Gender (gender)
  4. Name (name)
  5. Address (address)

Each field is member of the list(detected field), has four fields in the following order

  1. The coordinates of the detected field in x1,y1,x2,y2 format (type=list)

  2. The name of the field

  3. Confidence of detection

  4. Extracted text from the detected field

Note:All the data used to train , were publicly available in the internet

This project is far from complete, and can be made much better, feel free to contribute

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Aadhaar_extractor-0.0.5.tar.gz (5.6 MB view hashes)

Uploaded Source

Built Distribution

Aadhaar_extractor-0.0.5-py3-none-any.whl (5.6 MB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page