Skip to main content

PDF parser component (Apache Tika) for PCU project

Project description

# pcu_pdf (Apache Tika parser for PCU project)

PDF parser component (Apache Tika) for PCU project. From the path of a PDF file, get its textual content.

[Check PCU project][pcu].

[pcu]: https://github.com/zevio/pcu_core


## Usage in another project

If you wish to import this module in another Python project, please install it :

pip install pcu-pdf

Then, add this import line at the beginning of your Python file :

from pcu_pdf import pcu_pdf

You can now use pcu_pdf’s functions, for example :

pcu_pdf.PDFParser(“path/to/pdf/file”)

## Test

To test your installation, go to pcu_pdf/ directory and execute the Makefile with the following command line :

make test

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pcu_pdf-1.2.2.tar.gz (56.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pcu_pdf-1.2.2-py3-none-any.whl (56.9 MB view details)

Uploaded Python 3

File details

Details for the file pcu_pdf-1.2.2.tar.gz.

File metadata

  • Download URL: pcu_pdf-1.2.2.tar.gz
  • Upload date:
  • Size: 56.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.22.0 CPython/3.6.5

File hashes

Hashes for pcu_pdf-1.2.2.tar.gz
Algorithm Hash digest
SHA256 e0933f277944277445c4f1bff85e3e6b2613bdd301ce976a94705f7f24ad31bc
MD5 28e38523d0fa876af89199382a6fe948
BLAKE2b-256 a75caab5343bc42256fa7b29ef67eb1d90966b284250f07636cf3d73c944baea

See more details on using hashes here.

File details

Details for the file pcu_pdf-1.2.2-py3-none-any.whl.

File metadata

  • Download URL: pcu_pdf-1.2.2-py3-none-any.whl
  • Upload date:
  • Size: 56.9 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.18.4 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.22.0 CPython/3.6.5

File hashes

Hashes for pcu_pdf-1.2.2-py3-none-any.whl
Algorithm Hash digest
SHA256 3fda5a13094a995a5f53075e6b7f6a5b2436f089e9a992705fb39064e70a8fd9
MD5 784826aa0e2a603b3bb3b09c54f9d3e1
BLAKE2b-256 8857c7edc7448be0fbf63822f2d9f2d1c743ea9bf9ae4dbc6a53db28905cba59

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page