Extract text from a given PDF

winzy-pdf-to-text

Installation

First, install Winzy and confirm that it runs in your environment.

Then install this plugin in the same environment as your Winzy application.

pip install winzy-pdf-to-text

Usage

winzy pdf2text example.pdf -p 1

This extracts all text from page 1 and writes it to standard output.
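Because the text goes to standard output, it can be captured from a script. The following is a minimal sketch, assuming the `winzy` executable is on your PATH. The helper names `build_command` and `extract_pages` are hypothetical and are not part of the plugin; only the `pdf2text` subcommand and the `-p` option shown above are used.

```python
import subprocess


def build_command(pdf_path, pages):
    """Build the argument list for the command line shown in this README."""
    # Hypothetical helper: it only assembles `winzy pdf2text <file> -p <pages>`.
    return ["winzy", "pdf2text", str(pdf_path), "-p", str(pages)]


def extract_pages(pdf_path, pages):
    """Run the command and return whatever it writes to standard output."""
    result = subprocess.run(
        build_command(pdf_path, pages),
        capture_output=True,  # collect stdout and stderr instead of printing them
        text=True,            # decode the output to str
        check=True,           # raise CalledProcessError on a non-zero exit status
    )
    return result.stdout


# Example call (requires winzy and this plugin to be installed):
# text = extract_pages("example.pdf", "1")
```

Passing the arguments as a list avoids shell quoting problems with file names that contain spaces.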

You can also provide a page range:

winzy pdf2text example.pdf -p 3-6

This extracts text from pages 3 to 5. As described here, the end of the range (6) is not included.
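To make the range semantics concrete, here is a small sketch of how a page specification such as `1` or `3-6` could be expanded into page numbers. It is an illustration under the assumption stated above (the end of the range is excluded), not the plugin's actual parser. The function name `parse_page_spec` and its `end_inclusive` switch are hypothetical.

```python
def parse_page_spec(spec, end_inclusive=False):
    """Expand "1" or "3-6" into a list of 1-based page numbers.

    Assumption: by default the end of a range is excluded, matching the
    README's statement that "3-6" yields pages 3 to 5.
    """
    start_text, sep, end_text = spec.strip().partition("-")
    start = int(start_text)
    if start < 1:
        raise ValueError(f"page numbers start at 1: {spec!r}")
    if not sep:
        # A single page number, for example "1".
        return [start]
    end = int(end_text) + (1 if end_inclusive else 0)
    if end <= start:
        raise ValueError(f"empty or reversed page range: {spec!r}")
    return list(range(start, end))


print(parse_page_spec("1"))
print(parse_page_spec("3-6"))
```

If the plugin turns out to treat the end of the range as inclusive, calling the sketch with `end_inclusive=True` gives the other interpretation.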

Development

To set up this plugin locally, first check out the code. Then create a new virtual environment:

cd winzy-pdf-to-text
python -m venv venv
source venv/bin/activate

Now install the dependencies and test dependencies:

pip install -e '.[test]'

To run the tests:

python -m pytest


Download files

Source Distribution

winzy_pdf_to_text-0.1.0.tar.gz (7.0 kB)

Uploaded Source

Built Distribution

winzy_pdf_to_text-0.1.0-py3-none-any.whl (7.3 kB)

Uploaded Python 3

File details

Details for the file winzy_pdf_to_text-0.1.0.tar.gz.

File metadata

  • Download URL: winzy_pdf_to_text-0.1.0.tar.gz
  • Upload date:
  • Size: 7.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for winzy_pdf_to_text-0.1.0.tar.gz
Algorithm Hash digest
SHA256 fe5220c82079df598dc86be41be11eb57d0c2b3117cb556dca967085b0f7af60
MD5 ee2fe617c6110ee4a1784384a4a25e83
BLAKE2b-256 182ed5cae77250230f498ec9c997f5e8b598090b82cc41f3d4c63c55dc5ce7bd
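
A downloaded file can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library. The helper names `sha256_of_file` and `matches_sha256` are hypothetical, and the example assumes the archive has already been downloaded to the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_sha256(path, expected_hex):
    """Compare a file's digest with an expected hex string (case-insensitive)."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Example call, using the SHA256 value listed for the source distribution:
# matches_sha256(
#     "winzy_pdf_to_text-0.1.0.tar.gz",
#     "fe5220c82079df598dc86be41be11eb57d0c2b3117cb556dca967085b0f7af60",
# )
```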

Provenance

The following attestation bundles were made for winzy_pdf_to_text-0.1.0.tar.gz:

Publisher: publish.yml on sukhbinder/winzy-pdf-to-text

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file winzy_pdf_to_text-0.1.0-py3-none-any.whl.

File hashes

Hashes for winzy_pdf_to_text-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1fae0c40ba5d3467e2714c792e8f834adc1199b93615aefc456c65f63879ff23
MD5 57aa03188a098208a651b9cbf17ba470
BLAKE2b-256 849ec9dd12ceda81ea7a9f9286a831ceec278748dc2f11b94f0ad7f46e31fc56

Provenance

The following attestation bundles were made for winzy_pdf_to_text-0.1.0-py3-none-any.whl:

Publisher: publish.yml on sukhbinder/winzy-pdf-to-text
