Skip to main content

A high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Project description

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.9 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pymupdf-1.25.4.tar.gz (69.8 MB view details)

Uploaded Source

Built Distributions

pymupdf-1.25.4-cp39-abi3-win_amd64.whl (16.6 MB view details)

Uploaded CPython 3.9+Windows x86-64

pymupdf-1.25.4-cp39-abi3-win32.whl (15.1 MB view details)

Uploaded CPython 3.9+Windows x86

pymupdf-1.25.4-cp39-abi3-musllinux_1_2_x86_64.whl (21.3 MB view details)

Uploaded CPython 3.9+musllinux: musl 1.2+ x86-64

pymupdf-1.25.4-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (20.0 MB view details)

Uploaded CPython 3.9+manylinux: glibc 2.17+ x86-64

pymupdf-1.25.4-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (19.5 MB view details)

Uploaded CPython 3.9+manylinux: glibc 2.17+ ARM64

pymupdf-1.25.4-cp39-abi3-macosx_11_0_arm64.whl (18.6 MB view details)

Uploaded CPython 3.9+macOS 11.0+ ARM64

pymupdf-1.25.4-cp39-abi3-macosx_10_9_x86_64.whl (19.4 MB view details)

Uploaded CPython 3.9+macOS 10.9+ x86-64

File details

Details for the file pymupdf-1.25.4.tar.gz.

File metadata

  • Download URL: pymupdf-1.25.4.tar.gz
  • Upload date:
  • Size: 69.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.4.tar.gz
Algorithm Hash digest
SHA256 5f189466b68901055a9ddc77dc1c91cba081a60964f0caa6ff5b9b87001a0194
MD5 88c441cb5dd1644c571320b8aa0774f9
BLAKE2b-256 2556d7de0325125621a3d095eb43ce35f2e036cd4c0489ff5e8cae816f1cd8b9

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.4-cp39-abi3-win_amd64.whl.

File metadata

  • Download URL: pymupdf-1.25.4-cp39-abi3-win_amd64.whl
  • Upload date:
  • Size: 16.6 MB
  • Tags: CPython 3.9+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.4-cp39-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 af8fa1f7920b3b27343b2511cb9fc44baf2bad8b948db0b0f9d5b2fa589cb305
MD5 cfb32d75e0d019349439a90782c48dbd
BLAKE2b-256 c39f2872a29766dfe65fb97268233503cc9f7ba6e776eedbdb88ed47216aef84

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.4-cp39-abi3-win32.whl.

File metadata

  • Download URL: pymupdf-1.25.4-cp39-abi3-win32.whl
  • Upload date:
  • Size: 15.1 MB
  • Tags: CPython 3.9+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.4-cp39-abi3-win32.whl
Algorithm Hash digest
SHA256 d90e6087f9b153999fa9f2c4709488fb0355de0302ec502a76c4fc0c0a0c535e
MD5 f084c2885c81fbed7a220ee6b3bd15a1
BLAKE2b-256 0e0a3eb1fab9e788790be32d8ca43235f59652d44252168bfba457c5496948fe

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.4-cp39-abi3-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.4-cp39-abi3-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 a8c23242d209512437187dc728001fc7c65c2dd9d37b50071f34c5cd745194a3
MD5 9d033c1840cbb1590b748aa778a47c20
BLAKE2b-256 09f83ba60c9cbb01c9aacc2e1c35fc7640456306da99cdea913954f33a0c9c74

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.4-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.4-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 72817aa5f5023632346eee4aad0c32018d7879cf9a72e1820054dc9d8c48150f
MD5 a77ddb6cdddfac6ba339da3c9c050bdc
BLAKE2b-256 36ae9b3651d457698ff5e19c5a663c51703cc990892b35cf4a779c36a1a1bf64

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.4-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.4-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 bd16e264438b01bf6f6b8722deb49df20c0a9a7be2b4a8e29dcb7b5183d8b42a
MD5 f17410ddda571478f260f13fa70f5ce9
BLAKE2b-256 412f7ca622fff06e879fd4adcde8b1bdbe9989ff2f36fc0607481581f4a3caa1

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.4-cp39-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.4-cp39-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 72ac2a460e15a4b75fb22d0bdeb9eb9b3bd6ab44d5288e56f85ff9e4241d0830
MD5 1adc3d8bed9b7c636489c7896b0b10bb
BLAKE2b-256 ae760757056bdcf273de4934681b84acde6e0b61a46b1755038e8d786ac6b368

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.4-cp39-abi3-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.4-cp39-abi3-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 b39ab5fc0cc793f2a93908cb4dc13182e61e9ace3476850ebf0e505f65355475
MD5 1e87c5279a58c2980bf44be433278f1b
BLAKE2b-256 59facd82d78401bed6452ea07e2a3ffd6c30eddb635b888ea5f2f699c2968e80

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page