Perform various operations on ALTO xml files
Project description
ALTO Tools
:snake: tools for performing various operations on ALTO XML files
Installation
Clone the repository, enter it and run
pip install .
Usage
alto-tools <INPUT> [OPTION]
INPUT
should be the path to an ALTO file or directory containing ALTO files.
Output is sent to stdout
.
OPTION | Description |
---|---|
-t --text |
Extract UTF-8 encoded text content |
-c --confidence |
Extract mean OCR word confidence score |
-i --illustrations |
Extract bounding box coordinates of <Illustration> elements |
-g --graphics |
Extract bounding box coordinates of <GraphicalElement> elements |
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
alto-tools-0.1.0.tar.gz
(9.5 kB
view hashes)
Built Distribution
Close
Hashes for alto_tools-0.1.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 9015c74d0bd089da52ae6d8d69903c2b562061ef21803b1f39cec1587a468691 |
|
MD5 | 3a460e80393f33bc46b47cb3e3cd3f90 |
|
BLAKE2b-256 | 50a8397269efadf94ece214951c2678b06afd290fad091f5cf48c0cff5c92c13 |