Skip to main content

No project description provided

Project description

fast-pdf-extract

A Rust backed PDF text extraction library for Python.

Features

  • Detect and remove headers and footers
  • Clean bilingual PDFs
  • Mark headings in bold (basic markdown)
  • High accuracy
  • Peformance

Development

uv sync --only-dev

# run tests
python -m unittest

# publishing
maturin build --release
maturin publish

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fast_pdf_extract-0.5.1.tar.gz (6.7 MB view details)

Uploaded Source

Built Distribution

fast_pdf_extract-0.5.1-cp310-cp310-macosx_11_0_arm64.whl (3.6 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file fast_pdf_extract-0.5.1.tar.gz.

File metadata

  • Download URL: fast_pdf_extract-0.5.1.tar.gz
  • Upload date:
  • Size: 6.7 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: maturin/1.8.2

File hashes

Hashes for fast_pdf_extract-0.5.1.tar.gz
Algorithm Hash digest
SHA256 6e73de2fca07452d24ef44bca7f2625ec835a22acdba7e9b3b946953cccebe56
MD5 edbdaf0611468c2f6b15a662a71b5cc9
BLAKE2b-256 2c040b6b7675a1457b7e570049838e283d97be08266b957a9cc8be07200c9a4a

See more details on using hashes here.

File details

Details for the file fast_pdf_extract-0.5.1-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for fast_pdf_extract-0.5.1-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 27c73938947db26cb16bd0a897abb938770403e2d5c6660c31b8be2efadf8cd0
MD5 d1936bf164d8720af68ecce04b270e73
BLAKE2b-256 5bc0ee946c7c8f5aaf7100e888b7ea48c1268fdba031285220d36565afb0047e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page