Convert PDF to structured text using MinerU

pyconverters_mineru

Installation

Install the package from PyPI with pip:

pip install pyconverters_mineru
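After installing, one way to confirm that the distribution is visible to the current interpreter is to query its metadata. The sketch below uses only the standard library; the helper name `installed_version` is hypothetical and not part of this package.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None
```

Calling `installed_version("pyconverters_mineru")` returns the version string when the package is installed in the active environment, and `None` otherwise.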

Developing

Pre-requisites

You will need to install uv (for package management and building):

pip install uv

Clone the repository and change into it:

git clone https://github.com/oterrier/pyconverters_mineru
cd pyconverters_mineru

Install dependencies

uv sync --extra test

Running the test suite

uv run pytest

Linting

uv run ruff check .
uv run ruff format --check .

Building the documentation

uv run --extra docs sphinx-build docs docs/_build

The built documentation is available at docs/_build/index.html.

SBOM & vulnerability check

Install the SBOM dependencies:

uv sync --extra sbom

Generate a CycloneDX SBOM from the current environment:

uv run cyclonedx-py environment -o sbom.cdx.json --output-format json
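To inspect the generated SBOM, a minimal sketch such as the following lists the recorded components. It assumes the file follows the CycloneDX JSON layout, with a top-level `components` array whose entries carry `name` and `version`; the helper names are hypothetical.

```python
import json
from pathlib import Path
from typing import Any, Dict, List, Tuple


def load_json(path: str) -> Dict[str, Any]:
    """Read a JSON document such as sbom.cdx.json from disk."""
    return json.loads(Path(path).read_text(encoding="utf-8"))


def components_from_sbom(sbom: Dict[str, Any]) -> List[Tuple[str, str]]:
    """Return sorted (name, version) pairs for every component in the SBOM.

    Assumes the CycloneDX JSON layout: a top-level "components" list.
    Missing fields are reported as empty strings rather than raising.
    """
    components = sbom.get("components", [])
    return sorted((c.get("name", ""), c.get("version", "")) for c in components)
```

For example, `components_from_sbom(load_json("sbom.cdx.json"))` gives a sorted list that can be diffed between two builds.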

Audit dependencies for known vulnerabilities:

uv run pip-audit --format json --output audit-report.json

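pip-audit exits with a non-zero status whenever it finds a known vulnerability. To also fail when a dependency cannot be audited (useful in CI), add --strict:

uv run pip-audit --strict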
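The JSON report written to audit-report.json can be summarised with a short script. The sketch below assumes the report layout used by recent pip-audit releases (a top-level `dependencies` list whose entries have `name`, `version`, and `vulns`) and also accepts a bare list; check the layout against the file your pip-audit version produces. The function name is hypothetical.

```python
from typing import Any, Dict, List


def vulnerable_packages(report: Any) -> Dict[str, List[str]]:
    """Map "name==version" to the advisory IDs reported for that package.

    Assumption: the report is either {"dependencies": [...]} or a bare list
    of dependency entries, each with an optional "vulns" list of {"id": ...}.
    """
    deps = report.get("dependencies", []) if isinstance(report, dict) else report
    findings: Dict[str, List[str]] = {}
    for dep in deps:
        vulns = dep.get("vulns") or []
        if vulns:
            key = f"{dep.get('name', '?')}=={dep.get('version', '?')}"
            findings[key] = [v.get("id", "?") for v in vulns]
    return findings
```

Load the report with `json.load` and pass the result in; an empty dictionary means no entry in the report listed a vulnerability.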

Download files

Source Distribution

pyconverters_mineru-1.6.50.tar.gz (9.7 kB)

Uploaded Source

Built Distribution

pyconverters_mineru-1.6.50-py3-none-any.whl (9.2 kB)

Uploaded Python 3

File details

Details for the file pyconverters_mineru-1.6.50.tar.gz.

File metadata

  • Download URL: pyconverters_mineru-1.6.50.tar.gz
  • Upload date:
  • Size: 9.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.10

File hashes

Hashes for pyconverters_mineru-1.6.50.tar.gz
Algorithm Hash digest
SHA256 1eaa4ffe7a77e7fa2d622b6796ae214745c5e035f01743cd9315dbf97f120b11
MD5 066780e07025f00e9049e39f00df1a77
BLAKE2b-256 80b3576e208403c6cd4fd5a6cd0be6bd6798f75cf75c7c7ea5a22b9bde3120f0
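A downloaded file can be checked against the SHA256 digest listed above before installing it. The following is a minimal sketch using only `hashlib`; the helper names are hypothetical, and the file is read in chunks so large archives do not need to fit in memory.

```python
import hashlib
from pathlib import Path
from typing import Iterable, Iterator


def sha256_hex(chunks: Iterable[bytes]) -> str:
    """Return the hex SHA256 digest of a sequence of byte chunks."""
    digest = hashlib.sha256()
    for chunk in chunks:
        digest.update(chunk)
    return digest.hexdigest()


def read_chunks(path: str, chunk_size: int = 1 << 20) -> Iterator[bytes]:
    """Yield the contents of a file in chunks of at most chunk_size bytes."""
    with Path(path).open("rb") as fh:
        while True:
            chunk = fh.read(chunk_size)
            if not chunk:
                return
            yield chunk


def file_matches(path: str, expected_sha256: str) -> bool:
    """Compare a file's SHA256 digest with an expected hex digest."""
    return sha256_hex(read_chunks(path)) == expected_sha256.strip().lower()
```

For the source distribution, this would be called as `file_matches("pyconverters_mineru-1.6.50.tar.gz", "1eaa4ffe7a77e7fa2d622b6796ae214745c5e035f01743cd9315dbf97f120b11")`; the same helper applies to the wheel with its own digest.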

File details

Details for the file pyconverters_mineru-1.6.50-py3-none-any.whl.

File metadata

  • Download URL: pyconverters_mineru-1.6.50-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.10

File hashes

Hashes for pyconverters_mineru-1.6.50-py3-none-any.whl
Algorithm Hash digest
SHA256 29c61d00e097ef542a7b1fd0fb0af90d0c5766b958a40c2c43877ff351f7d605
MD5 48de861990d3eddc51c2a4385cddb531
BLAKE2b-256 623cca67c9ffc482175a110d4fff407ae2c12043a9941b91f7a9afc3933aa371
