Skip to main content

PDF parser component (Apache Tika) for PCU project

Project description

# pcu_pdf (Apache Tika parser for PCU project)

PDF parser component (Apache Tika) for PCU project. From the path of a PDF file, get its textual content.

[Check PCU project][pcu].

[pcu]: https://github.com/zevio/pcu_core


## Usage in another project

If you wish to import this module in another Python project, please install it :

pip install pcu-pdf

Then, add this import line at the beginning of your Python file :

from pcu_pdf import pcu_pdf

You can now use pcu_pdf’s functions, for example :

pcu_pdf.PDFParser(“path/to/pdf/file”)

## Test

To test your installation, go to pcu_pdf/ directory and execute the Makefile with the following command line :

make test

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pcu_pdf-1.2.1.tar.gz (2.6 kB view hashes)

Uploaded Source

Built Distribution

pcu_pdf-1.2.1-py3-none-any.whl (17.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page