This module classifies a PDF as Digital, Non-Digital or Mixed.

Project description

# After installation, use the following syntax:

## Load library
from digital_nondigital_pdf_extraction import pdf_extractor

# Replace scansmpl.pdf with the path to your own PDF file.
result, digital_pages, nondigital_pages = pdf_extractor.digital_nondigital_classifier("scansmpl.pdf")

print(result)
print(digital_pages)
print(nondigital_pages)
# result - Digital, Non-Digital or Mixed, depending on the document.
# digital_pages - page numbers of the digital pages in the PDF.
# nondigital_pages - page numbers of the non-digital pages in the PDF.

data = pdf_extractor.digital_nondigital_extractor("scansmpl.pdf", digital_pages, nondigital_pages)
print(data)
# data holds the extraction result.
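
The relationship between the three labels and the two page lists can be illustrated with a small standalone sketch. This is not the package's implementation: `classify_pages`, `page_texts` and `min_chars` are hypothetical names, and the sketch assumes that a page counts as digital when it carries extractable text and that page numbers start at 1. Neither assumption is confirmed by the package.

```python
from typing import List, Sequence, Tuple


def classify_pages(
    page_texts: Sequence[str], min_chars: int = 1
) -> Tuple[str, List[int], List[int]]:
    """Hypothetical helper: label a document from the text found on each page.

    page_texts holds the text extracted from each page, in page order.
    A page with at least min_chars non-whitespace-trimmed characters is
    treated as digital (an assumption made for this sketch only).
    """
    digital_pages: List[int] = []
    nondigital_pages: List[int] = []

    # Page numbers are assumed to be 1-based.
    for page_number, text in enumerate(page_texts, start=1):
        if len(text.strip()) >= min_chars:
            digital_pages.append(page_number)
        else:
            nondigital_pages.append(page_number)

    # Both lists populated -> Mixed; only one populated -> that label.
    if digital_pages and nondigital_pages:
        result = "Mixed"
    elif digital_pages:
        result = "Digital"
    else:
        result = "Non-Digital"
    return result, digital_pages, nondigital_pages


if __name__ == "__main__":
    # Pages 1 and 3 carry text, page 2 does not.
    print(classify_pages(["Invoice 42", "", "Total: 10"]))
    # → ('Mixed', [1, 3], [2])
```

In this sketch a document with no pages at all falls through to Non-Digital with two empty lists. Whether the package behaves the same way is not stated.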

Thanks, and enjoy!

Download files

Source Distribution

digital_nondigital_pdf_extraction-1.0.0.tar.gz (3.0 kB)

Built Distribution

digital_nondigital_pdf_extraction-1.0.0-py3-none-any.whl (3.5 kB)

File details

Details for the file digital_nondigital_pdf_extraction-1.0.0.tar.gz.

File metadata

  • Download URL: digital_nondigital_pdf_extraction-1.0.0.tar.gz
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.8

File hashes

Hashes for digital_nondigital_pdf_extraction-1.0.0.tar.gz

  • SHA256: 555e83d2a39d9218319c7c009e3a0731ac186acd3679cb5ed4357818aa30010a
  • MD5: 306c069fcd5f478830488cfef716df83
  • BLAKE2b-256: 7d2a929f7abc9828a9fbfea0c620115fe262eb2174bba7c8cd3da86a75b432c9

File details

Details for the file digital_nondigital_pdf_extraction-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: digital_nondigital_pdf_extraction-1.0.0-py3-none-any.whl
  • Size: 3.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.8

File hashes

Hashes for digital_nondigital_pdf_extraction-1.0.0-py3-none-any.whl

  • SHA256: ff1b2c1079b07b02f4bc806be32bb872bfc2c33ab52528db5d7ce3232cea9057
  • MD5: 3f1f49113348dcba0b201153dad5bcc1
  • BLAKE2b-256: 586fa0799b177b002ba0fcffa489fe13e31bffcf8af4b5dd897e0f5161dfbaec
