Skip to main content

This module will return whether PDF is Digital, Non-Digital or Mixed.

Project description

# After installation use syntax :

## Load library
#
from digital_nondigital_pdf_extraction import pdf_extractor

result, digital_pages, nondigital_pages = pdf_extractor.digital_nondigital_classifier("scansmpl.pdf")
# scansmpl.pdf replace with your file name.

print(result)
print(digital_pages)
print(nondigital_pages)
# It will return result either Digital, Non-Digital or Mixed based on document.
# digital_pages - digital page numbers present in pdf.
# nondigital_pages - nondigital page numbers present in pdf.

data = pdf_extractor.digital_nondigital_extractor("scansmpl.pdf", digital_pages, nondigital_pages)
print(data)
# Your result

Thanks and Enjoy !!!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

digital_nondigital_pdf_extraction-1.0.1.tar.gz (3.0 kB view details)

Uploaded Source

Built Distribution

File details

Details for the file digital_nondigital_pdf_extraction-1.0.1.tar.gz.

File metadata

  • Download URL: digital_nondigital_pdf_extraction-1.0.1.tar.gz
  • Upload date:
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.8

File hashes

Hashes for digital_nondigital_pdf_extraction-1.0.1.tar.gz
Algorithm Hash digest
SHA256 f7e22cb388234f41238eccd2422b9fc47468ceb08c34e59583af0a56c4e0c140
MD5 c29493e7c30f55ebb7e36f717a430a78
BLAKE2b-256 9fcaccac3b7908fbefd3dc1d19d052853e4a8dea8eb0c5fbc14c8f98f7e32765

See more details on using hashes here.

File details

Details for the file digital_nondigital_pdf_extraction-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: digital_nondigital_pdf_extraction-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 3.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.8

File hashes

Hashes for digital_nondigital_pdf_extraction-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7989cada0d000cc21be3b3a892a4a69b0a01c221e1ebb56051a4e445dcd07a92
MD5 c9489b01c1e5694be96da94526365148
BLAKE2b-256 b5ed90c319f7b4e8897db6cafc79d148a8102c47a0511f2af084cd696ddd755b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page