Reusable code for Google Document-AI services
Project description
Instructions
Installing
- Install using
pip
command
Use the pip command to install this package:
pip install reusable-code-for-document-ai
Basic Usage
NOTE: Our system will support only image file and pdf file.
from reusable_code.process_document import CustomGoogleDocAIProcessor
processor = GoogleDocAIProcessor(
location="your_location",
processor_name="your_processor_name",
processor_options=your_processor_options
)
document = processor.process_document("path_to_your_document", extract_page_number)
path_to_your_document
= your pdf or image file path.
extract_page_number
= Which page you want to extract, page number.
processor options
NOTE: Additional configurations for Document OCR Processor (Optional).
# For more information: https://cloud.google.com/document-ai/docs/enterprise-document-ocr
process_options = documentai.ProcessOptions(
ocr_config=documentai.OcrConfig(
enable_native_pdf_parsing=True,
enable_image_quality_scores=True,
enable_symbol=True,
# OCR Add Ons https://cloud.google.com/document-ai/docs/ocr-add-ons
premium_features=documentai.OcrConfig.PremiumFeatures(
compute_style_info=True,
enable_math_ocr=False, # Enable to use Math OCR Model
enable_selection_mark_detection=True,
),
)
)
Follow this pattern for set processor name
your_processor_name = projects/<PROJECT_ID>/locations/us/processors/<processor-id>/processorVersions/<version-id>
Make sure to replace your_location
, your_processor_name
, your_processor_options
, and path_to_your_document
with appropriate values.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for reusable-code-for-document-ai-0.1.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 934c6c07e42c01bb765abc90631f2ff27db06f82fa58a212ec86f8434f95989d |
|
MD5 | 5bddb9ef0e9f923f5be965f0785f1e9f |
|
BLAKE2b-256 | 28fdd3ecb20c72babda26e052c0746d8e926d97b6bed80520ad59bdc4dcc362e |
Close
Hashes for reusable_code_for_document_ai-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | b879837ac147a13db5fed60ae2a12e2bac52aa58627990046d83e7902f4272df |
|
MD5 | 31b09f769418abbbaae7875d85f977a2 |
|
BLAKE2b-256 | 49dc02b2067e8c46758f4772315d14c044c7c6e20bdbd7e569f667071f36c648 |