Skip to main content

Perform various operations on ALTO xml files

Project description

ALTO Tools

:snake: tools for performing various operations on ALTO XML files


Installation

Clone the repository, enter it and run

pip install .

Usage

alto-tools <INPUT> [OPTION] 

INPUT should be the path to an ALTO file or directory containing ALTO files.

Output is sent to stdout.

OPTION Description
-t --text Extract UTF-8 encoded text content
-c --confidence Extract mean OCR word confidence score
-i --illustrations Extract bounding box coordinates of <Illustration> elements
-g --graphics Extract bounding box coordinates of <GraphicalElement> elements

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

alto-tools-0.1.0.tar.gz (9.5 kB view hashes)

Uploaded Source

Built Distribution

alto_tools-0.1.0-py3-none-any.whl (9.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page