NumPlateVision

Extract a number plate from an image and recognise the characters on it

Project description

About

This module contains a number plate reader, using a YOLO model for number plate extraction and a CNN trained from scratch on an OCR dataset for character recognition.

How to setup

Create a Python virtual environment: python3 -m venv venv
Install the package: pip install NumPlateVision

Example run

To detect and read a number plate from an image, here is an example script

Example notebook

ML data processing pipeline

The number plate of a car can be located with an object detection model. This project uses YOLO (You Only Look Once), a deep CNN that is widely used for object detection. The model used for number plate extraction is hosted on Hugging Face.

The plate is then read by extracting each character, passing it to a character recognition CNN, and concatenating the predictions into a string.

Each of these stages is explained in more detail below.

Stage 1 - Plate extraction (YOLO)

The model predicts a bounding box around what it thinks is a number plate. Given a chosen confidence threshold, we keep the predicted bounding box and use it to crop the number plate from the original image.
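The cropping step can be sketched as follows. This is a minimal illustration, not the package's own code: the function name `crop_plate`, the `(x1, y1, x2, y2, conf)` detection format, and the default threshold of 0.5 are all assumptions made for the example.

```python
import numpy as np


def crop_plate(image, detections, min_conf=0.5):
    """Crop the most confident detection from `image`.

    `detections` is assumed to be an iterable of (x1, y1, x2, y2, conf)
    tuples in pixel coordinates. Returns None if nothing passes `min_conf`.
    """
    candidates = [d for d in detections if d[4] >= min_conf]
    if not candidates:
        return None
    x1, y1, x2, y2, _ = max(candidates, key=lambda d: d[4])
    height, width = image.shape[:2]
    # Clamp the box to the image so slicing never goes out of range.
    x1, x2 = max(0, int(x1)), min(width, int(x2))
    y1, y2 = max(0, int(y1)), min(height, int(y2))
    return image[y1:y2, x1:x2]
```

Returning None when no box passes the threshold lets the caller decide whether to skip the image or retry with a lower threshold.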

Original

Extracted

Stage 2 - Character segmentation (OpenCV)

Most number plates share one property: the characters are dark on a lighter background. Therefore, after sharpening edges with the Laplacian operator and applying Otsu thresholding, we obtain a binary image with the characters as the foreground and everything else as the background.

Processed

Stage 3 - Character extraction (OpenCV)

To extract a character, we can use OpenCV's contour detection, which identifies foreground objects within an image. A bounding box is then computed around each contour, allowing us to crop that portion of the image.

Contours

Stage 4 - Character recognition (CNN from scratch, see below)

Once we have the characters, we can feed each one into the model, obtain the predictions, and concatenate them into a string.
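As an illustration, the prediction and concatenation step might look like the sketch below. The label ordering (digits followed by upper-case letters), the name `read_plate`, and the `predict_proba` callable standing in for the trained CNN are all assumptions; the characters are also assumed to have been resized to a common shape beforehand.

```python
import string

import numpy as np

# Assumed class ordering: 0-9 followed by A-Z (36 classes).
LABELS = string.digits + string.ascii_uppercase


def read_plate(char_images, predict_proba):
    """Classify each character image and join the predictions into a string.

    `predict_proba` is a hypothetical callable that maps a batch of images
    to an array of class probabilities with shape (n_images, n_classes).
    """
    if len(char_images) == 0:
        return ""
    batch = np.stack(char_images)
    probabilities = np.asarray(predict_proba(batch))
    return "".join(LABELS[i] for i in probabilities.argmax(axis=1))
```

Batching the characters into one array means the model is called once per plate rather than once per character.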

Prediction

CNN built from scratch for character recognition (See stage 4 above)

Training

The neural network is trained on the standard OCR dataset, which contains 50k images of characters.

To increase the number of examples, each training image is augmented 5 times using a mix of rotation, translation and zooming, giving a training set of 100k images.

Constants

Constants used during training:

Loss: Categorical crossentropy
Epochs: 10
Optimiser: Adam
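For reference, categorical cross-entropy for one-hot labels can be written in a few lines of NumPy. This sketch only illustrates what the loss measures; the function name and the `eps` clipping value are assumptions, and training would normally use the deep learning framework's built-in version.

```python
import numpy as np


def categorical_crossentropy(y_true, y_prob, eps=1e-7):
    """Mean cross-entropy between one-hot labels and predicted probabilities."""
    y_true = np.asarray(y_true, dtype=np.float64)
    # Clip to avoid taking log(0) for classes predicted with zero probability.
    y_prob = np.clip(np.asarray(y_prob, dtype=np.float64), eps, 1.0)
    return float(np.mean(-np.sum(y_true * np.log(y_prob), axis=1)))
```

The loss is zero when the model assigns probability 1 to the correct class and grows as that probability shrinks.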

Model evaluation

After training, the model attains an accuracy of 96.7% on the test set.

Looking at the loss and accuracy per epoch we see that there are no signs of overfitting:

eval

The confusion matrix shows strong results overall. However, classes 0, 4, and 24 were confused with the numbers 0 and 3 and the letter P; adding more training data or augmentation could help improve accuracy.
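A confusion matrix and its most frequent off-diagonal entries can be computed as in the sketch below. The function names are hypothetical and the integer class indices are assumed to run from 0 to `num_classes - 1`.

```python
import numpy as np


def confusion_matrix(y_true, y_pred, num_classes):
    """Rows are true classes, columns are predicted classes."""
    matrix = np.zeros((num_classes, num_classes), dtype=np.int64)
    np.add.at(matrix, (np.asarray(y_true), np.asarray(y_pred)), 1)
    return matrix


def top_confusions(matrix, k=3):
    """Return up to k (true, predicted, count) triples with the most errors."""
    errors = matrix.copy()
    np.fill_diagonal(errors, 0)  # ignore correct predictions
    order = np.argsort(errors, axis=None)[::-1][:k]
    pairs = [np.unravel_index(i, errors.shape) for i in order]
    return [(int(t), int(p), int(errors[t, p]))
            for t, p in pairs if errors[t, p] > 0]
```

Overall accuracy is `np.trace(matrix) / matrix.sum()`, and the triples from `top_confusions` point to the class pairs that would benefit most from extra training data or augmentation.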

In addition, looking at a few test examples:

example_test

Project details

Release history

0.1.3 (this version): Aug 23, 2024
0.1.2: Aug 23, 2024
0.1.1: Aug 23, 2024
0.1.0: Aug 23, 2024

Download files

Source Distribution

numplatevision-0.1.3.tar.gz (3.6 MB)

Uploaded Aug 23, 2024 Source

Built Distribution

NumPlateVision-0.1.3-py3-none-any.whl (3.6 MB)

Uploaded Aug 23, 2024 Python 3

File details

Details for the file numplatevision-0.1.3.tar.gz.

File metadata

Download URL: numplatevision-0.1.3.tar.gz
Upload date: Aug 23, 2024
Size: 3.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for numplatevision-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`8aafe9a240198e09f64a0d662167a5afc4543c4951bef89e0a317537ba6e7537`
MD5	`15c0121a9eb91e1cd31acee6645e915b`
BLAKE2b-256	`a64ad2f4080d65d26ee3f4d62d80ef9c6f68ed1a969958221daf5264ffbcf750`

File details

Details for the file NumPlateVision-0.1.3-py3-none-any.whl.

File metadata

Download URL: NumPlateVision-0.1.3-py3-none-any.whl
Upload date: Aug 23, 2024
Size: 3.6 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.11.4

File hashes

Hashes for NumPlateVision-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3a8f50e8fef9457552a95205f066fb7297c22a8de335aec9affe5d398f4fa3d9`
MD5	`3e4c8aadf3c42a3ebfdd7c53803ef556`
BLAKE2b-256	`b5507899987cb09375aa48275375d5d80147e61892a9fe35a7f4577e7ee98e35`
