This module reports whether a PDF is Digital, Non-Digital, or Mixed.

Project description

# After installation, use the following syntax:

## Load the library
#
from digital_nondigital_pdf_extraction import pdf_extractor

# Replace "scansmpl.pdf" with the name of your own file.
result, digital_pages, nondigital_pages = pdf_extractor.digital_nondigital_classifier("scansmpl.pdf")

# result - "Digital", "Non-Digital", or "Mixed", depending on the document.
# digital_pages - page numbers of the digital pages in the PDF.
# nondigital_pages - page numbers of the non-digital pages in the PDF.
print(result)
print(digital_pages)
print(nondigital_pages)

# Pass the page lists from the classifier to the extractor.
data = pdf_extractor.digital_nondigital_extractor("scansmpl.pdf", digital_pages, nondigital_pages)
print(data)
# data is your extraction result.
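
The three return values from the classifier can be checked and summarised before they are handed to the extractor. The following is a minimal sketch, not part of the package: `describe_pdf` and `KNOWN_LABELS` are hypothetical names, the exact label strings are assumed from the description above, and the page lists are assumed to be iterables of page numbers. The classifier is passed in as an argument so the sketch runs without the library installed.

```python
from typing import Callable, Iterable, Tuple

# Assumed label strings, taken from the project description; the
# package may spell them differently.
KNOWN_LABELS = ("Digital", "Non-Digital", "Mixed")

# Shape of the classifier's return value as shown in the usage above.
Classification = Tuple[str, Iterable[int], Iterable[int]]


def describe_pdf(path: str, classify: Callable[[str], Classification]) -> str:
    """Classify a PDF and return a one-line summary (hypothetical helper)."""
    result, digital_pages, nondigital_pages = classify(path)
    if result not in KNOWN_LABELS:
        # Fail loudly rather than guess what an unknown label means.
        raise ValueError(f"unexpected classification label: {result!r}")
    return (
        f"{path}: {result} "
        f"(digital pages: {list(digital_pages)}; "
        f"non-digital pages: {list(nondigital_pages)})"
    )


def fake_classifier(path: str) -> Classification:
    """Stand-in for the real classifier so the sketch runs on its own."""
    return "Mixed", [1, 3], [2]


print(describe_pdf("scansmpl.pdf", fake_classifier))
```

With the package installed, `pdf_extractor.digital_nondigital_classifier` could be passed in place of `fake_classifier`, assuming it returns the same three values shown in the usage above.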

Thanks, and enjoy!

Download files

Source Distribution

digital_nondigital_pdf_extraction-1.0.2.tar.gz (3.0 kB)

Built Distribution

digital_nondigital_pdf_extraction-1.0.2-py3-none-any.whl (3.5 kB)

File details

Details for the file digital_nondigital_pdf_extraction-1.0.2.tar.gz.

File metadata

  • Download URL: digital_nondigital_pdf_extraction-1.0.2.tar.gz
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.8

File hashes

Hashes for digital_nondigital_pdf_extraction-1.0.2.tar.gz
Algorithm    Hash digest
SHA256       fb0b6257cf964912543c32396a69fbcc2f547b1291bff97af06cac20a331c10c
MD5          3ea1404d266e1e7426690d8b4dbd3b70
BLAKE2b-256  ebc7f573e9d926ff38b68d3b761a8c59750ff540208cbbef545da962ff7dd4fc
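
A downloaded file can be checked against the SHA256 digest listed above using only the Python standard library. This is a minimal sketch under stated assumptions: `sha256_of_stream` and `file_matches_sha256` are hypothetical helper names, and the local path in the usage note is wherever you saved the download.

```python
import hashlib
from typing import BinaryIO


def sha256_of_stream(stream: BinaryIO, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a binary stream, read in chunks."""
    digest = hashlib.sha256()
    # iter() with a sentinel keeps reading until read() returns b"".
    for chunk in iter(lambda: stream.read(chunk_size), b""):
        digest.update(chunk)
    return digest.hexdigest()


def file_matches_sha256(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with an expected hex string."""
    with open(path, "rb") as handle:
        return sha256_of_stream(handle) == expected_hex.strip().lower()
```

For example, `file_matches_sha256("digital_nondigital_pdf_extraction-1.0.2.tar.gz", "fb0b6257cf964912543c32396a69fbcc2f547b1291bff97af06cac20a331c10c")` should return `True` for an intact copy of the source distribution saved in the current directory.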


File details

Details for the file digital_nondigital_pdf_extraction-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: digital_nondigital_pdf_extraction-1.0.2-py3-none-any.whl
  • Size: 3.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/3.10.0 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.8.8

File hashes

Hashes for digital_nondigital_pdf_extraction-1.0.2-py3-none-any.whl
Algorithm    Hash digest
SHA256       7ac207538ba84ea3789fb050aa5dbd653c5c6a6b340ca6a445f44db34ac96896
MD5          a7255ffe17a1464fa1c4b511798a21c9
BLAKE2b-256  3b22c4538c7780f8bb5d3eea8313607d92c29943c1d885d44d2676300cdff7dc

