Page segmentation and segmentation evaluation
Project description
# ocrd_segment
This repository aims to provide a number of [OCR-D-compliant processors](https://ocr-d.github.io/cli) for layout analysis and evaluation.
## Installation
In your virtual environment, run: `bash pip install . `
## Usage
extracting page images (including results from preprocessing like cropping, deskewing or binarization) along with region polygon coordinates and metadata: - [ocrd-segment-extract-regions](ocrd_segment/extract_regions.py)
extracting line images (including results from preprocessing like cropping, deskewing, dewarping or binarization) along with line polygon coordinates and metadata: - [ocrd-segment-extract-lines](ocrd_segment/extract_lines.py)
comparing different layout segmentations (input file groups N = 2, compute the distance between two segmentations, e.g. automatic vs. manual): - [ocrd-segment-evaluate](ocrd_segment/evaluate.py) :construction: (very early stage)
repairing layout segmentations (input file groups N >= 1, based on heuristics implemented using Shapely): - [ocrd-segment-repair](ocrd_segment/repair.py) :construction: (much to be done)
pattern-based segmentation (input file groups N=1, based on a PAGE template, e.g. from Aletheia, and some XSLT or Python to apply it to the input file group) - ocrd-segment-via-template :construction: (unpublished)
data-driven segmentation (input file groups N=1, based on a statistical model, e.g. Neural Network) - ocrd-segment-via-model :construction: (unpublished)
For detailed description on input/output and parameters, see [ocrd-tool.json](ocrd_segment/ocrd-tool.json)
## Testing
None yet.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ocrd_segment-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9b549066f46f26a147b726066712a423f9fcf64b8274dd8285447c564f361783 |
|
MD5 | e9bc6112469e53afd56563d862000228 |
|
BLAKE2b-256 | 90344825c12fa6e8238ce350fc766f6aaa0d591705c8f426160eb59ec7513541 |