
This module reports whether a PDF is Digital, Non-Digital, or Mixed.

Project description

# After installation, use the following syntax:

## Load the library
from digital_nondigital_pdf_extraction import pdf_extractor

result, digital_pages, nondigital_pages = pdf_extractor.digital_nondigital_classifier("scansmpl.pdf")
# Replace scansmpl.pdf with your own file name.

print(result)
print(digital_pages)
print(nondigital_pages)
# result is one of Digital, Non-Digital, or Mixed, depending on the document.
# digital_pages - page numbers of the digital pages in the PDF.
# nondigital_pages - page numbers of the non-digital pages in the PDF.

data = pdf_extractor.digital_nondigital_extractor("scansmpl.pdf", digital_pages, nondigital_pages)
print(data)
# data is the extraction result for your PDF.
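
The relationship between the three labels and the two page lists can be sketched as follows. This is an illustration only, not the package's own implementation: `label_from_pages` is a hypothetical helper, and it assumes that both page lists are plain lists of page numbers and that the labels are exactly the strings "Digital", "Non-Digital", and "Mixed".

```python
from typing import List


def label_from_pages(digital_pages: List[int], nondigital_pages: List[int]) -> str:
    """Hypothetical helper: derive a document-level label from two page lists.

    Assumption: a document is "Mixed" when it has at least one page of
    each kind; the real package may decide differently.
    """
    if digital_pages and nondigital_pages:
        return "Mixed"
    if digital_pages:
        return "Digital"
    if nondigital_pages:
        return "Non-Digital"
    # Neither list has entries, so there is nothing to label.
    raise ValueError("no pages were classified")


print(label_from_pages([1, 2, 3], []))
print(label_from_pages([], [1, 2]))
print(label_from_pages([1, 3], [2]))
```

A check like this can be useful for sanity-testing the values returned by the classifier, for example to confirm that a "Mixed" result comes with entries in both page lists.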

Thanks, and enjoy!
