Skip to main content

This module will return whether PDF is Digital, Non-Digital or Mixed.

Project description

# After installation use syntax :

## Load library
#
from digital_nondigital_pdf_extraction import pdf_extractor

result, digital_pages, nondigital_pages = pdf_extractor.digital_nondigital_classifier("scansmpl.pdf")
# scansmpl.pdf replace with your file name.

print(result)
print(digital_pages)
print(nondigital_pages)
# It will return result either Digital, Non-Digital or Mixed based on document.
# digital_pages - digital page numbers present in pdf.
# nondigital_pages - nondigital page numbers present in pdf.

data = pdf_extractor.digital_nondigital_extractor("scansmpl.pdf", digital_pages, nondigital_pages)
print(data)
# Your result

Thanks and Enjoy !!!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Built Distribution

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page