Robust and Straight-Forward solution for reading difficult and tricky QR codes within images in Python. Supported by a YOLOv8 QR Segmentation model.
Project description
QReader
QReader is a Robust and Straight-Forward solution for reading difficult and tricky QR codes within images in Python. Powered by a YOLOv8 model.
Behind the scenes, the library is composed by two main building blocks: A YOLOv8 QR Detector model trained to detect and segment QR codes (also offered as stand-alone), and the Pyzbar QR Decoder. Using the information extracted from this QR Detector, QReader transparently applies, on top of Pyzbar, different image preprocessing techniques that maximize the decoding rate on difficult images.
Installation
To install QReader, simply run:
pip install qreader
You may need to install some additional pyzbar dependencies:
On Windows:
Rarely, you can see an ugly ImportError related with lizbar-64.dll
. If it happens, install the vcredist_x64.exe from the Visual C++ Redistributable Packages for Visual Studio 2013
On Linux:
sudo apt-get install libzbar0
On Mac OS X:
brew install zbar
NOTE: If you're running QReader in a server with very limited resources, you may want to install the CPU version of PyTorch, before installing QReader. To do so, run: pip install torch --no-cache-dir
(Thanks to @cjwalther for his advice).
Usage
QReader is a very simple and straight-forward library. For most use cases, you'll only need to call detect_and_decode
:
from qreader import QReader
import cv2
# Create a QReader instance
qreader = QReader()
# Get the image that contains the QR code
image = cv2.cvtColor(cv2.imread("path/to/image.png"), cv2.COLOR_BGR2RGB)
# Use the detect_and_decode function to get the decoded QR data
decoded_text = qreader.detect_and_decode(image=image)
detect_and_decode
will return a tuple
containing the decoded string of every QR found in the image.
NOTE: Some entries can be
None
, it will happen when a QR have been detected but couldn't be decoded.
API Reference
QReader(model_size = 's', min_confidence = 0.5, reencode_to = 'shift-jis', weights_folder = None)
This is the main class of the library. Please, try to instantiate it just once to avoid loading the model every time you need to detect a QR code.
model_size
: str. The size of the model to use. It can be 'n' (nano), 's' (small), 'm' (medium) or 'l' (large). Larger models are more accurate but slower. Default: 's'.min_confidence
: float. The minimum confidence of the QR detection to be considered valid. Values closer to 0.0 can get more False Positives, while values closer to 1.0 can lose difficult QRs. Default (and recommended): 0.5.reencode_to
: str | None. The encoding to reencode theutf-8
decoded QR string. If None, it won't re-encode. If you find some characters being decoded incorrectly, try to set a Code Page that matches your specific charset. Recommendations that have been found useful:- 'shift-jis' for Germanic languages
- 'cp65001' for Asian languages (Thanks to @nguyen-viet-hung for the suggestion)
weights_folder
: str|None. Folder where the detection model will be downloaded. If None, it will be downloaded to the default qrdet package internal folder, making sure that it gets correctly removed when uninstalling. You could need to change it when working in environments like AWS Lambda where only /tmp folder is writable, as issued in #21. Default:None
(<qrdet_package>/.model).
QReader.detect_and_decode(image, return_detections = False)
This method will decode the QR codes in the given image and return the decoded strings (or None, if any of them was detected but not decoded).
-
image
: np.ndarray. The image to be read. It is expected to be RGB or BGR (uint8). Format (HxWx3). -
return_detections
: bool. IfTrue
, it will return the full detection results together with the decoded QRs. If False, it will return only the decoded content of the QR codes. -
is_bgr
: boolean. IfTrue
, the received image is expected to be BGR instead of RGB. -
Returns: tuple[str | None] | tuple[tuple[dict[str, np.ndarray | float | tuple[float | int, float | int]]], str | None]]: A tuple with all detected QR codes decodified. If
return_detections
isFalse
, the output will look like:('Decoded QR 1', 'Decoded QR 2', None, 'Decoded QR 4', ...)
. Ifreturn_detections
isTrue
it will look like:(('Decoded QR 1', {'bbox_xyxy': (x1_1, y1_1, x2_1, y2_1), 'confidence': conf_1}), ('Decoded QR 2', {'bbox_xyxy': (x1_2, y1_2, x2_2, y2_2), 'confidence': conf_2, ...}), ...)
. Look QReader.detect() for more information about detections format.
QReader.detect(image)
This method detects the QR codes in the image and returns a tuple of dictionaries with all the detection information.
-
image
: np.ndarray. The image to be read. It is expected to be RGB or BGR (uint8). Format (HxWx3). -
is_bgr
: boolean. IfTrue
, the received image is expected to be BGR instead of RGB. -
Returns: tuple[dict[str, np.ndarray|float|tuple[float|int, float|int]]]. A tuple of dictionaries containing all the information of every detection. Contains the following keys.
Key | Value Desc. | Value Type | Value Form |
---|---|---|---|
confidence |
Detection confidence | float |
conf. |
bbox_xyxy |
Bounding box | np.ndarray (4) | [x1, y1, x2, y2] |
cxcy |
Center of bounding box | tuple[float , float ] |
(x, y) |
wh |
Bounding box width and height | tuple[float , float ] |
(w, h) |
polygon_xy |
Precise polygon that segments the QR | np.ndarray (N, 2) | [[x1, y1], [x2, y2], ...] |
quad_xy |
Four corners polygon that segments the QR | np.ndarray (4, 2) | [[x1, y1], ..., [x4, y4]] |
padded_quad_xy |
quad_xy padded to fully cover polygon_xy |
np.ndarray (4, 2) | [[x1, y1], ..., [x4, y4]] |
image_shape |
Shape of the input image | tuple[int , int ] |
(h, w) |
NOTE:
- All
np.ndarray
values are of typenp.float32
- All keys (except
confidence
andimage_shape
) have a normalized ('n') version. For example,bbox_xyxy
represents the bbox of the QR in image coordinates [[0., im_w], [0., im_h]], whilebbox_xyxyn
contains the same bounding box in normalized coordinates [0., 1.].bbox_xyxy[n]
andpolygon_xy[n]
are clipped toimage_shape
. You can use them for indexing without further management
NOTE: Is this the only method you will need? Take a look at QRDet.
QReader.decode(image, detection_result)
This method decodes a single QR code on the given image, described by a detection result.
Internally, this method will run the pyzbar decoder, using the information of the detection_result
, to apply different image preprocessing techniques that heavily increase the detecoding rate.
-
image
: np.ndarray. NumPy Array with theimage
that contains the QR to decode. The image is expected to be inuint8
format [HxWxC], RGB. -
detection_result
: dict[str, np.ndarray|float|tuple[float|int, float|int]]. One of the detection dicts returned by the detect method. Note that QReader.detect() returns atuple
of thesedict
. This method expects just one of them. -
Returns: str | None. The decoded content of the QR code or
None
if it couldn't be read.
Usage Tests
The following code will try to decode these images containing QRs with QReader, pyzbar and OpenCV.
from qreader import QReader
from cv2 import QRCodeDetector, imread
from pyzbar.pyzbar import decode
# Initialize the three tested readers (QRReader, OpenCV and pyzbar)
qreader_reader, cv2_reader, pyzbar_reader = QReader(), QRCodeDetector(), decode
for img_path in ('test_mobile.jpeg', 'test_draw_64x64.jpeg'):
# Read the image
img = imread(img_path)
# Try to decode the QR code with the three readers
qreader_out = qreader_reader.detect_and_decode(image=img)
cv2_out = cv2_reader.detectAndDecode(img=img)[0]
pyzbar_out = pyzbar_reader(image=img)
# Read the content of the pyzbar output (double decoding will save you from a lot of wrongly decoded characters)
pyzbar_out = tuple(out.data.data.decode('utf-8').encode('shift-jis').decode('utf-8') for out in pyzbar_out)
# Print the results
print(f"Image: {img_path} -> QReader: {qreader_out}. OpenCV: {cv2_out}. pyzbar: {pyzbar_out}.")
The output of the previous code is:
Image: test_mobile.jpeg -> QReader: ('https://github.com/Eric-Canas/QReader'). OpenCV: . pyzbar: ().
Image: test_draw_64x64.jpeg -> QReader: ('https://github.com/Eric-Canas/QReader'). OpenCV: . pyzbar: ().
Note that QReader internally uses pyzbar as decoder. The improved detection-decoding rate that QReader achieves comes from the combination of different image pre-processing techniques and the YOLOv8 based QR detector that is able to detect QR codes in harder conditions than classical Computer Vision methods.
Benchmark
Rotation Test
Method | Max Rotation Degrees |
---|---|
Pyzbar | 17º |
OpenCV | 46º |
QReader | 79º |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file qreader-3.14.tar.gz
.
File metadata
- Download URL: qreader-3.14.tar.gz
- Upload date:
- Size: 19.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c9affd2c08735a7b4ff93d725afd30211f9172a67c25b937610d6efd934f03db |
|
MD5 | 556b6666e5bcc99ebec7b3d218dab825 |
|
BLAKE2b-256 | 5e4020427de21a94402023f6c1d71fa2ce77f718ddbaf5a3cb21b67c7516a060 |
File details
Details for the file qreader-3.14-py3-none-any.whl
.
File metadata
- Download URL: qreader-3.14-py3-none-any.whl
- Upload date:
- Size: 17.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9f16ab161af69f50ab85b0182f2d56e4437f6a3be89124ae605c3c2d12d547e6 |
|
MD5 | 3ead33aa1f71fde976c6af7feb3488e7 |
|
BLAKE2b-256 | fcde042ff6da70f823b9d192d4548e6311be104edaf844312ad1bc8f3df8af45 |