# pcu_pdf (Apache Tika parser for PCU project)

PDF parser component (Apache Tika) for the PCU project. Given the path to a PDF file, it returns the file's textual content.

See the [PCU project][pcu].

[pcu]: https://github.com/zevio/pcu_core


## Usage in another project

To use this module in another Python project, install it first:

    pip install pcu-pdf

Then add this import line at the top of your Python file:

    from pcu_pdf import pcu_pdf

You can now use pcu_pdf's functions, for example:

    pcu_pdf.PDFParser("path/to/pdf/file")
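
The call above takes a file path, so it can help to validate the path before handing it to the parser. The following is a minimal sketch, not part of pcu_pdf: `parse_pdf` and its `parser` argument are hypothetical names introduced for illustration, and the sketch assumes only that `pcu_pdf.PDFParser` accepts a path string as shown above. Whatever the parser returns is passed through unchanged.

```python
from pathlib import Path
from typing import Callable, Optional


def parse_pdf(path: str, parser: Optional[Callable[[str], object]] = None) -> object:
    """Check that `path` points to an existing .pdf file, then parse it.

    Hypothetical helper: `parser` defaults to pcu_pdf.PDFParser, but any
    callable that accepts a path string can be supplied instead.
    """
    pdf_path = Path(path)
    if pdf_path.suffix.lower() != ".pdf":
        raise ValueError(f"not a PDF file: {pdf_path}")
    if not pdf_path.is_file():
        raise FileNotFoundError(f"no such file: {pdf_path}")
    if parser is None:
        # Imported lazily so the path checks above still work in
        # environments where pcu-pdf is not installed.
        from pcu_pdf import pcu_pdf
        parser = pcu_pdf.PDFParser
    # Assumption: the parser is called with a plain string path.
    return parser(str(pdf_path))
```

Calling `parse_pdf("path/to/pdf/file.pdf")` then fails early with a clear error if the file is missing or has the wrong extension, instead of surfacing a less specific failure from the parser.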

## Test

To test your installation, go to the pcu_pdf/ directory and run the Makefile's test target:

    make test
