Docling-serve remote converter for pgx-docling-parser

pgx-docling-parser-serve

This package provides a remote document converter for pgx-docling-parser. Instead of processing documents locally, it sends them to a docling-serve instance through the docling-serve API.

Installation

Install this package directly from PyPI:

pip install pgx-docling-parser-serve

This also installs pgx-docling-parser-core automatically as a dependency.

Configuration

Configure the converter using environment variables:

  • PAPERLESS_DOCLING_SERVE_URL: URL of the docling-serve instance (default: http://docling-serve:5000)
  • PAPERLESS_DOCLING_SERVE_TIMEOUT: Request timeout in seconds (default: 300.0)
  • PAPERLESS_DOCLING_POLL_MAX_ATTEMPTS: Maximum number of polling attempts (default: 60)
  • PAPERLESS_DOCLING_POLL_INTERVAL: Polling interval in seconds (default: 5.0)
  • PAPERLESS_DOCLING_PDF_CONVERSION_MODE: PDF conversion mode, one of easyocr, tesseract, or granite_docling (default: easyocr)
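
As a rough illustration of how these variables and their defaults fit together, the sketch below reads them into a small settings object. The names ServeSettings and load_settings are hypothetical and are not part of this package's API; only the variable names, defaults, and allowed modes come from the list above.

```python
import os
from dataclasses import dataclass

# Allowed values for PAPERLESS_DOCLING_PDF_CONVERSION_MODE, per the list above.
VALID_MODES = {"easyocr", "tesseract", "granite_docling"}


@dataclass(frozen=True)
class ServeSettings:
    """Hypothetical container for the documented settings (not the package's API)."""
    url: str
    timeout: float
    poll_max_attempts: int
    poll_interval: float
    pdf_conversion_mode: str


def load_settings(env=None):
    """Read the documented variables from a mapping, falling back to the documented defaults."""
    env = os.environ if env is None else env
    mode = env.get("PAPERLESS_DOCLING_PDF_CONVERSION_MODE", "easyocr")
    if mode not in VALID_MODES:
        raise ValueError(f"unsupported conversion mode: {mode!r}")
    return ServeSettings(
        url=env.get("PAPERLESS_DOCLING_SERVE_URL", "http://docling-serve:5000"),
        timeout=float(env.get("PAPERLESS_DOCLING_SERVE_TIMEOUT", "300.0")),
        poll_max_attempts=int(env.get("PAPERLESS_DOCLING_POLL_MAX_ATTEMPTS", "60")),
        poll_interval=float(env.get("PAPERLESS_DOCLING_POLL_INTERVAL", "5.0")),
        pdf_conversion_mode=mode,
    )
```

Calling load_settings() with no argument reads os.environ; passing a plain dict makes it easy to check a configuration without touching the real environment. Rejecting unknown modes is a choice made for this sketch; how the package itself handles an invalid value is not documented here.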

How It Works

The converter follows a three-step workflow:

  1. Create Task: POST to /v1/convert/file/async to create an async conversion task
  2. Poll Status: Repeatedly GET /v1/status/poll/{task_id} until the task completes (success or failure)
  3. Fetch Result: GET /v1/result/{task_id} to retrieve the converted document

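Polling retries up to PAPERLESS_DOCLING_POLL_MAX_ATTEMPTS times, waiting PAPERLESS_DOCLING_POLL_INTERVAL seconds between attempts. With the defaults (60 attempts, 5.0 seconds apart), the converter waits roughly 300 seconds for a task before giving up, not counting the time each request takes.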
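
The polling step can be sketched as a bounded retry loop. This is a minimal illustration, not the package's implementation: poll_until_done is a hypothetical name, get_status stands in for whatever performs the GET on /v1/status/poll/{task_id}, and the sketch assumes it returns the task state as a plain string, with "success" and "failure" as the terminal values.

```python
import time

# Terminal task states, taken from step 2 of the workflow above.
TERMINAL_STATES = {"success", "failure"}


def poll_until_done(get_status, max_attempts=60, interval=5.0, sleep=time.sleep):
    """Call get_status() until it reports a terminal state or attempts run out.

    get_status is a caller-supplied function (an assumption of this sketch)
    that returns the current task state as a string.
    """
    for attempt in range(1, max_attempts + 1):
        status = get_status()
        if status in TERMINAL_STATES:
            return status
        # Wait between attempts, but not after the final one.
        if attempt < max_attempts:
            sleep(interval)
    raise TimeoutError(f"task did not finish after {max_attempts} polling attempts")
```

Passing the sleep function as a parameter keeps the loop testable without real delays. A caller would typically fetch the result from /v1/result/{task_id} only when the returned state is "success".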
