Page segmentation and segmentation evaluation
Project description
ocrd_segment
This repository aims to provide a number of OCR-D-compliant processors for layout analysis and evaluation.
Installation
In your virtual environment, run:
pip install .
Usage
- exporting page images (including results from preprocessing like cropping/masking, deskewing, dewarping or binarization) along with region polygon coordinates and metadata, also MS-COCO:
- exporting region images (including results from preprocessing like cropping/masking, deskewing, dewarping or binarization) along with region polygon coordinates and metadata:
- exporting line images (including results from preprocessing like cropping/masking, deskewing, dewarping or binarization) along with line polygon coordinates and metadata:
- importing layout segmentations from other formats (mask images, MS-COCO JSON annotation):
- repairing layout segmentations (input file groups N >= 1, based on heuristics implemented using Shapely):
- ocrd-segment-repair :construction: (much to be done)
- comparing different layout segmentations (input file groups N = 2, compute the distance between two segmentations, e.g. automatic vs. manual):
- ocrd-segment-evaluate :construction: (very early stage)
- pattern-based segmentation (input file groups N=1, based on a PAGE template, e.g. from Aletheia, and some XSLT or Python to apply it to the input file group)
ocrd-segment-via-template
:construction: (unpublished)
- data-driven segmentation (input file groups N=1, based on a statistical model, e.g. Neural Network)
ocrd-segment-via-model
:construction: (unpublished)
For detailed description on input/output and parameters, see ocrd-tool.json
Testing
None yet.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ocrd_segment-0.1.13.tar.gz
(2.2 MB
view hashes)
Built Distribution
ocrd_segment-0.1.13-py3-none-any.whl
(183.2 kB
view hashes)
Close
Hashes for ocrd_segment-0.1.13-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a3fd0a867c4bd9acccbb682e577b9b814d494ce1b693774be20588c6bebdfc8b |
|
MD5 | 80a3d05104720ce4e449e00f7aaa4296 |
|
BLAKE2b-256 | a31c3f0c9e66c00a6783c157111b3ce027b70cdb80eb761a54ced3f982bb411c |