Skip to main content

PDF text and table search

Project description

PDFScraper

CLI program for searching text and tables inside of PDF documents and displaying results in HTML. It combines Pdfminer.six, Camelot and Tesseract OCR in a single program, which is simple to use.

How to install

Using pip

After installing the dependencies you can simply use pip to install PDFScraper:

$ pip install PDFScraper

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PDFScraper-1.0.2.tar.gz (8.5 kB view details)

Uploaded Source

Built Distribution

PDFScraper-1.0.2-py3-none-any.whl (11.1 kB view details)

Uploaded Python 3

File details

Details for the file PDFScraper-1.0.2.tar.gz.

File metadata

  • Download URL: PDFScraper-1.0.2.tar.gz
  • Upload date:
  • Size: 8.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.3

File hashes

Hashes for PDFScraper-1.0.2.tar.gz
Algorithm Hash digest
SHA256 583da5bf51e1f3c656f9b3c46ebb826aab2af6c2d52ae28d4f6729687db4ec44
MD5 adf1d423276a731f6b16514a0f080f41
BLAKE2b-256 1570eb099fe1ec975a647684f2ca6e2e2b757a4a9d18f196574a16d53bb6b885

See more details on using hashes here.

File details

Details for the file PDFScraper-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: PDFScraper-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 11.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.3

File hashes

Hashes for PDFScraper-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 5179002357fccebebc127f72b7dc3b4198ece34015507ccc303ed51da5e5d343
MD5 e81316377c82d85e7c575a918ed64b89
BLAKE2b-256 8c342507f2c5abcae97f155490319ad10b726da5db1d6fc7834cacddc30956b8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page