Amazon Textract Helper tools for pretty printing
Textract-PrettyPrinter
Provides functions to format the output received from Textract into more easily consumable formats, including CSV and Markdown.
Install
> python -m pip install amazon-textract-prettyprinter
Make sure your environment is set up with AWS credentials through configuration files, environment variables, or an attached role (https://docs.aws.amazon.com/cli/latest/userguide/cli-chap-configure.html).
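Before calling Textract, it can help to check whether one of these credential sources is visible. The following is a minimal, standard-library-only sketch. The helper name `find_aws_credential_sources` is hypothetical and not part of this package. It only looks at the standard environment variables and the default `~/.aws` file locations, does not validate the credentials, and cannot detect an attached role, so an empty result does not prove that credentials are missing.

```python
import os
from pathlib import Path
from typing import List, Mapping, Optional


def find_aws_credential_sources(env: Optional[Mapping[str, str]] = None,
                                home: Optional[Path] = None) -> List[str]:
    """Return the credential sources that appear to be configured.

    Hypothetical helper for illustration; it does not validate anything.
    """
    env = os.environ if env is None else env
    home = Path.home() if home is None else home
    found = []
    # Static keys supplied through environment variables.
    if env.get("AWS_ACCESS_KEY_ID") and env.get("AWS_SECRET_ACCESS_KEY"):
        found.append("environment variables")
    # A named profile selected through the environment.
    if env.get("AWS_PROFILE"):
        found.append("AWS_PROFILE")
    # Default locations of the shared configuration files.
    for name in ("credentials", "config"):
        if (home / ".aws" / name).is_file():
            found.append(f"~/.aws/{name}")
    return found


if __name__ == "__main__":
    sources = find_aws_credential_sources()
    print(sources or "no static credentials found (an attached role may still apply)")
```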
Samples
Get FORMS and TABLES as CSV
from textractcaller.t_call import call_textract, Textract_Features
from textractprettyprinter.t_pretty_print import Pretty_Print_Table_Format, Textract_Pretty_Print, get_string
# input_document is not defined in this sample; set it to the document to analyze before running.
textract_json = call_textract(input_document=input_document, features=[Textract_Features.FORMS, Textract_Features.TABLES])
print(get_string(textract_json=textract_json,
                 table_format=Pretty_Print_Table_Format.csv,
                 output_type=[Textract_Pretty_Print.TABLES, Textract_Pretty_Print.FORMS]))
Get string for TABLES using the get_string method
from textractcaller.t_call import call_textract, Textract_Features
from textractprettyprinter.t_pretty_print import Textract_Pretty_Print, get_string
textract_json = call_textract(input_document=input_document, features=[Textract_Features.TABLES])
# output_type is passed as a list here, matching the FORMS and TABLES sample.
get_string(textract_json=textract_json, output_type=[Textract_Pretty_Print.TABLES])
Print out tables in LaTeX format
from textractcaller.t_call import call_textract, Textract_Features
# Import the names this sample actually uses.
from textractprettyprinter.t_pretty_print import Pretty_Print_Table_Format, get_tables_string
textract_json = call_textract(input_document=input_document, features=[Textract_Features.FORMS, Textract_Features.TABLES])
print(get_tables_string(textract_json=textract_json, table_format=Pretty_Print_Table_Format.latex))
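To make the target formats concrete, the sketch below renders the same small table as CSV and as a Markdown table using only the Python standard library. It illustrates the output styles and is not this package's implementation. The function names and sample rows are made up, and the exact strings returned by `get_string` or `get_tables_string` may differ.

```python
import csv
import io
from typing import List, Sequence


def rows_to_csv(rows: Sequence[Sequence[str]]) -> str:
    """Render rows as CSV text, quoting cells only where needed."""
    buffer = io.StringIO()
    csv.writer(buffer, lineterminator="\n").writerows(rows)
    return buffer.getvalue()


def rows_to_markdown(rows: Sequence[Sequence[str]]) -> str:
    """Render rows as a Markdown table, treating the first row as the header."""
    header, *body = rows
    lines: List[str] = [
        "| " + " | ".join(header) + " |",
        "|" + "|".join("---" for _ in header) + "|",
    ]
    lines += ["| " + " | ".join(row) + " |" for row in body]
    return "\n".join(lines) + "\n"


# Made-up sample data standing in for one extracted table.
sample_rows = [["Item", "Price"], ["Coffee, large", "3.50"], ["Tea", "2.00"]]

if __name__ == "__main__":
    print(rows_to_csv(sample_rows))
    print(rows_to_markdown(sample_rows))
```

The CSV writer quotes the cell that contains a comma, which is the main reason to prefer a CSV library over joining cells by hand.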