
Convert PDF to structured text using PaddleOCR


pyconverters_paddleocr



Installation

Install the package from PyPI with pip:

pip install pyconverters_paddleocr
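To confirm that the installation succeeded, the installed version can be read back from the package metadata. The following is a minimal sketch that uses only the standard library; the helper name `installed_version` is hypothetical and not part of this package.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version("pyconverters_paddleocr")
    print(found if found else "pyconverters_paddleocr is not installed")
```

Run the script with the same interpreter that pip installed into, otherwise it may report the package as missing.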

Developing

Pre-requisites

You will need to install uv (for package management and building):

pip install uv

Clone the repository and move into it:

git clone https://github.com/oterrier/pyconverters_paddleocr
cd pyconverters_paddleocr

Install the dependencies (including the test extra):

uv sync --extra test

Running the test suite

uv run pytest

Linting

uv run ruff check .
uv run ruff format --check .

Building the documentation

uv run --extra docs sphinx-build docs docs/_build

The built documentation is available at docs/_build/index.html.

SBOM & vulnerability check

Install the SBOM dependencies:

uv sync --extra sbom

Generate a CycloneDX SBOM from the current environment:

uv run cyclonedx-py environment -o sbom.cdx.json --output-format json
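To sanity-check the generated file, a short script can list the components it records. This is a sketch, assuming `sbom.cdx.json` follows the CycloneDX JSON layout with a top-level `components` array whose entries carry `name` and `version`; the function name `list_components` is hypothetical.

```python
import json
from pathlib import Path


def list_components(sbom: dict) -> list[tuple[str, str]]:
    """Return sorted (name, version) pairs from a CycloneDX JSON document."""
    components = sbom.get("components", [])
    return sorted((c.get("name", "?"), c.get("version", "?")) for c in components)


if __name__ == "__main__":
    path = Path("sbom.cdx.json")  # file name taken from the command above
    if path.exists():
        sbom = json.loads(path.read_text(encoding="utf-8"))
        for name, version in list_components(sbom):
            print(f"{name}=={version}")
    else:
        print(f"{path} not found; generate it first")
```

An empty listing usually means the SBOM was generated from an environment with no installed packages, which is worth checking before publishing the file.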

Audit dependencies for known vulnerabilities:

uv run pip-audit --format json --output audit-report.json
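The JSON report can then be summarised in a few lines. The sketch below assumes the report is an object with a `dependencies` list in which each audited entry has `name`, `version`, and a `vulns` list of advisories carrying an `id`; check this against the report your pip-audit version actually writes. The function name `vulnerable_packages` is hypothetical.

```python
import json
from pathlib import Path


def vulnerable_packages(report: dict) -> dict[str, list[str]]:
    """Map 'name==version' to the advisory IDs reported for that package."""
    findings = {}
    for dep in report.get("dependencies", []):
        # Entries that could not be audited are assumed to lack a "vulns" key.
        vulns = dep.get("vulns", [])
        if vulns:
            key = f"{dep['name']}=={dep.get('version', '?')}"
            findings[key] = [v["id"] for v in vulns]
    return findings


if __name__ == "__main__":
    path = Path("audit-report.json")  # file name taken from the command above
    if path.exists():
        report = json.loads(path.read_text(encoding="utf-8"))
        for package, ids in vulnerable_packages(report).items():
            print(f"{package}: {', '.join(ids)}")
    else:
        print(f"{path} not found; run pip-audit first")
```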

pip-audit already exits with a non-zero status when it finds a known vulnerability. To also fail when a dependency cannot be audited (useful in CI):

uv run pip-audit --strict

