A tool for extracting a text line image from the contour with different methods
Line image extractor
This is a command-line tool and a Python library for extracting text line images from page images. It is built by Teklia and freely available as open source under the MIT licence.
It supports different extraction methods:
- boundingRect - bounding rectangle of the line polygon
- polygon - exact polygon
- min_area_rect - minimum area rectangle containing the polygon
- deskew_polygon - deskew the polygon
- deskew_min_area_rect - deskew the minimum area rectangle
- skew_polygon - skew the polygon (rotate by some angle)
- skew_min_area_rect - skew the minimum area rectangle (rotate by some angle)
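To make the geometry behind these methods more concrete, here is a minimal pure-Python sketch of two of the underlying quantities: the axis-aligned bounding rectangle of a line polygon, and a rough skew estimate taken from its longest edge. This is not the library's implementation. The helper names `bounding_rect` and `longest_edge_angle` are hypothetical, and the `(x, y, width, height)` convention is an assumption chosen for illustration.

```python
import math

# Example polygon taken from the usage snippet in this README (closed ring).
polygon = [(241, 1169), (2287, 1251), (2252, 1190), (244, 1091), (241, 1169)]


def bounding_rect(points):
    """Axis-aligned bounding rectangle as (x, y, width, height) (hypothetical helper)."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return min(xs), min(ys), max(xs) - min(xs), max(ys) - min(ys)


def longest_edge_angle(points):
    """Angle in degrees of the longest polygon edge, a rough proxy for line skew."""
    edges = zip(points, points[1:])
    (x0, y0), (x1, y1) = max(edges, key=lambda e: math.dist(e[0], e[1]))
    return math.degrees(math.atan2(y1 - y0, x1 - x0))


print(bounding_rect(polygon))                 # (241, 1091, 2046, 160)
print(round(longest_edge_angle(polygon), 1))  # 2.3
```

For this polygon the bounding rectangle is 160 pixels high although the line itself is only about 80 pixels thick, because the line is tilted by roughly 2 degrees. That gap is what the minimum-area and deskew methods are meant to reduce. Note that the sign of the angle depends on the direction in which the longest edge is traversed.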
Install the stable version of the library from PyPI:
pip install teklia-line-image-extractor
Install the library in development mode:
pip install -e .
Test extraction:
line-image-extractor -i tests/data/page_img.jpg -o out.jpg -p tests/data/line_polygon.json -e deskew_min_area_rect --color
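To compare methods on the same line, the command above can be repeated with a different `-e` value each time. The sketch below only builds and prints the commands, reusing the flags shown above. It assumes that `-e` accepts each method name exactly as written in the list of extraction methods. The two skew methods are left out because they rotate by an angle and may need an option that is not documented here. `build_command` is a hypothetical helper, not part of the package.

```python
import shlex

# Method names as listed in this README; that -e accepts each of them is an assumption.
METHODS = [
    "boundingRect",
    "polygon",
    "min_area_rect",
    "deskew_polygon",
    "deskew_min_area_rect",
]


def build_command(method,
                  image="tests/data/page_img.jpg",
                  polygon="tests/data/line_polygon.json"):
    """Return the argv list for one extraction run (hypothetical helper)."""
    return [
        "line-image-extractor",
        "-i", image,
        "-o", f"out_{method}.jpg",
        "-p", polygon,
        "-e", method,
        "--color",
    ]


for method in METHODS:
    # Print a shell-ready command line for each method.
    print(shlex.join(build_command(method)))
```

To actually run a command, the list returned by `build_command` can be passed to `subprocess.run(cmd, check=True)` once the package is installed.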
How to use it as a library:
from pathlib import Path
import numpy as np
from line_image_extractor.extractor import extract, read_img, save_img
from line_image_extractor.image_utils import polygon_to_bbox
from line_image_extractor.image_utils import Extraction
page_img = read_img(Path("tests/data/page_img.jpg"))
polygon = np.asarray([[241, 1169], [2287, 1251], [2252, 1190], [244, 1091], [241, 1169]])
bbox = polygon_to_bbox(polygon)
extracted_img = extract(
    page_img, polygon, bbox, Extraction.polygon
)
save_img("line_output.jpg", extracted_img)