pdfmd

Convert PDF files to Markdown using pymupdf4llm.

Installation

# Install from PyPI
pip install pdfmd

# Or as a uv tool
uv tool install pdfmd

For development:

git clone https://github.com/3n3a/pdfmd.git
cd pdfmd
uv sync

Usage

pdfmd input.pdf                          # writes input.md
pdfmd input.pdf -o output.md             # explicit output path
pdfmd input.pdf -o -                     # stdout
pdfmd input.pdf -p "1,3,5-8"            # specific pages (1-indexed)
pdfmd input.pdf --images --dpi 200       # extract images at 200 DPI

Run pdfmd --help for all options.
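
The `-p "1,3,5-8"` form above takes 1-indexed pages and inclusive ranges. As a rough illustration of how such a spec can be expanded, here is a minimal sketch. `parse_page_spec` is a hypothetical helper, not pdfmd's actual implementation, and the conversion to zero-based indices assumes the underlying library numbers pages from zero.

```python
def parse_page_spec(spec: str) -> list[int]:
    """Expand a spec such as "1,3,5-8" into sorted, zero-based page indices.

    Hypothetical helper for illustration; pdfmd's real parsing may differ.
    """
    pages: set[int] = set()
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas such as "1,,3"
        if "-" in part:
            start_text, end_text = part.split("-", 1)
            start, end = int(start_text), int(end_text)
        else:
            start = end = int(part)
        if start < 1 or end < start:
            raise ValueError(f"invalid page range: {part!r}")
        # Convert from 1-indexed (user-facing) to 0-indexed (assumed library-facing).
        pages.update(range(start - 1, end))
    return sorted(pages)


print(parse_page_spec("1,3,5-8"))  # [0, 2, 4, 5, 6, 7]
```

Collecting into a set before sorting means overlapping entries such as `"2,2-3"` yield each page only once.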

Build & Publish

uv build      # creates dist/*.whl and dist/*.tar.gz
uv publish    # publish to PyPI

Download files

Source Distribution

pdfmd-0.2.1.tar.gz (3.2 kB view details)

Uploaded Source

Built Distribution

pdfmd-0.2.1-py3-none-any.whl (3.0 kB view details)

Uploaded Python 3

File details

Details for the file pdfmd-0.2.1.tar.gz.

File metadata

  • Download URL: pdfmd-0.2.1.tar.gz
  • Size: 3.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pdfmd-0.2.1.tar.gz
Algorithm    Hash digest
SHA256       40283b2ecbd5ca50e770ef03eaa3fa28dc91343fae4984286f566676b0cbc255
MD5          5b38cc6686226b537619f2f5051881f3
BLAKE2b-256  5da2ae8188cc89fac9a5c345381a13cebb478caa611309fc1de0a44806601e23
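
A downloaded archive can be checked against the published SHA256 digest before it is installed. The following is a minimal sketch using only the standard library. The local file name and the helper names `sha256_of` and `matches_digest` are assumptions for illustration; the expected digest is copied from the listing above.

```python
import hashlib
from pathlib import Path

# SHA256 digest published for pdfmd-0.2.1.tar.gz (copied from the listing above).
EXPECTED_SHA256 = "40283b2ecbd5ca50e770ef03eaa3fa28dc91343fae4984286f566676b0cbc255"


def sha256_of(path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_digest(path, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with an expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed location of the downloaded sdist; adjust to where it was saved.
    archive = Path("pdfmd-0.2.1.tar.gz")
    if archive.exists():
        print("digest OK" if matches_digest(archive, EXPECTED_SHA256) else "digest MISMATCH")
    else:
        print(f"{archive} not found; download it first")
```

Reading in chunks keeps memory use constant, which matters little for a 3.2 kB archive but makes the helper reusable for larger files.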

Provenance

The following attestation bundles were made for pdfmd-0.2.1.tar.gz:

Publisher: publish.yml on 3n3a/pdfmd

File details

Details for the file pdfmd-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: pdfmd-0.2.1-py3-none-any.whl
  • Size: 3.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pdfmd-0.2.1-py3-none-any.whl
Algorithm    Hash digest
SHA256       8a7ef735e4732f6fcebbea9b857a6708907e64ddea8e3b6312910f5f630fc42c
MD5          a0474bb8bda740f4f19c35503b9dcfbc
BLAKE2b-256  10a0fa6b2aa0c0489950e68bd49ba08c49533119cfae70bb610c82cd6e04255a

Provenance

The following attestation bundles were made for pdfmd-0.2.1-py3-none-any.whl:

Publisher: publish.yml on 3n3a/pdfmd
