Skip to main content

PDF text and table search

Project description

PDFScraper

CLI program for searching text and tables inside of PDF documents and displaying results in HTML. It combines Pdfminer.six, Camelot and Tesseract OCR in a single program, which is simple to use.

How to install

Using pip

After installing the dependencies you can simply use pip to install PDFScraper:

$ pip install PDFScraper

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PDFScraper-1.0.4.tar.gz (8.9 kB view details)

Uploaded Source

Built Distribution

PDFScraper-1.0.4-py3-none-any.whl (13.1 kB view details)

Uploaded Python 3

File details

Details for the file PDFScraper-1.0.4.tar.gz.

File metadata

  • Download URL: PDFScraper-1.0.4.tar.gz
  • Upload date:
  • Size: 8.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.3

File hashes

Hashes for PDFScraper-1.0.4.tar.gz
Algorithm Hash digest
SHA256 129e803e871c94a379ea3618b99e0a627e36c0bc96dfc79846d976967915d3a8
MD5 1f7c8a660133344389d083cffc69c603
BLAKE2b-256 c89d0d54b2a866a94fdc25f76df290369b0ee0cfe5bdf55abb3e4663bee11dfb

See more details on using hashes here.

File details

Details for the file PDFScraper-1.0.4-py3-none-any.whl.

File metadata

  • Download URL: PDFScraper-1.0.4-py3-none-any.whl
  • Upload date:
  • Size: 13.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.1 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.8.3

File hashes

Hashes for PDFScraper-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 c4698a9265249c99a59f50b58b6657e2c0198dd83c6bbb07fc6afa6dc59303bd
MD5 6414413d0b8cbe2cd339a5b03c1c8efc
BLAKE2b-256 b690576de138a1b439f85a0418c9b89d51448c64bfac34110640d38f8b74e0ee

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page