Skip to main content

Reusable code for Google Document-AI services

Project description

Instructions

Installing

  1. Install using pip command

Use the pip command to install this package:

pip install reusable-code-for-document-ai

Basic Usage

NOTE: Our system will support only image file and pdf file.

from reusable_code.process_document import CustomGoogleDocAIProcessor

processor = GoogleDocAIProcessor(
    location="your_location",
    processor_name="your_processor_name",
    processor_options=your_processor_options
)

document = processor.process_document("path_to_your_document", extract_page_number)

path_to_your_document = your pdf or image file path.

extract_page_number = Which page you want to extract, page number.

processor options

NOTE: Additional configurations for Document OCR Processor (Optional).

# For more information: https://cloud.google.com/document-ai/docs/enterprise-document-ocr

process_options = documentai.ProcessOptions(
    ocr_config=documentai.OcrConfig(
        enable_native_pdf_parsing=True,
        enable_image_quality_scores=True,
        enable_symbol=True,
        # OCR Add Ons https://cloud.google.com/document-ai/docs/ocr-add-ons
        premium_features=documentai.OcrConfig.PremiumFeatures(
            compute_style_info=True,
            enable_math_ocr=False,  # Enable to use Math OCR Model
            enable_selection_mark_detection=True,
        ),
    )
)

Follow this pattern for set processor name

your_processor_name = projects/<PROJECT_ID>/locations/us/processors/<processor-id>/processorVersions/<version-id>

Make sure to replace your_location, your_processor_name, your_processor_options, and path_to_your_document with appropriate values.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

reusable-code-for-document-ai-0.1.1.tar.gz (3.6 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file reusable-code-for-document-ai-0.1.1.tar.gz.

File metadata

File hashes

Hashes for reusable-code-for-document-ai-0.1.1.tar.gz
Algorithm Hash digest
SHA256 934c6c07e42c01bb765abc90631f2ff27db06f82fa58a212ec86f8434f95989d
MD5 5bddb9ef0e9f923f5be965f0785f1e9f
BLAKE2b-256 28fdd3ecb20c72babda26e052c0746d8e926d97b6bed80520ad59bdc4dcc362e

See more details on using hashes here.

File details

Details for the file reusable_code_for_document_ai-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for reusable_code_for_document_ai-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b879837ac147a13db5fed60ae2a12e2bac52aa58627990046d83e7902f4272df
MD5 31b09f769418abbbaae7875d85f977a2
BLAKE2b-256 49dc02b2067e8c46758f4772315d14c044c7c6e20bdbd7e569f667071f36c648

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page