Skip to main content

A high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Project description

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.9 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pymupdf-1.25.1.tar.gz (61.0 MB view details)

Uploaded Source

Built Distributions

pymupdf-1.25.1-cp39-abi3-win_amd64.whl (16.6 MB view details)

Uploaded CPython 3.9+ Windows x86-64

pymupdf-1.25.1-cp39-abi3-win32.whl (15.1 MB view details)

Uploaded CPython 3.9+ Windows x86

pymupdf-1.25.1-cp39-abi3-musllinux_1_2_x86_64.whl (21.1 MB view details)

Uploaded CPython 3.9+ musllinux: musl 1.2+ x86-64

pymupdf-1.25.1-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (20.0 MB view details)

Uploaded CPython 3.9+ manylinux: glibc 2.17+ x86-64

pymupdf-1.25.1-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (19.5 MB view details)

Uploaded CPython 3.9+ manylinux: glibc 2.17+ ARM64

pymupdf-1.25.1-cp39-abi3-macosx_11_0_arm64.whl (18.6 MB view details)

Uploaded CPython 3.9+ macOS 11.0+ ARM64

pymupdf-1.25.1-cp39-abi3-macosx_10_9_x86_64.whl (19.4 MB view details)

Uploaded CPython 3.9+ macOS 10.9+ x86-64

File details

Details for the file pymupdf-1.25.1.tar.gz.

File metadata

  • Download URL: pymupdf-1.25.1.tar.gz
  • Upload date:
  • Size: 61.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.1.tar.gz
Algorithm Hash digest
SHA256 6725bec0f37c2380d926f792c262693c926af7cc1aa5aa2b8207e771867f015a
MD5 615135b1c130d0a9b988ad2d2122b135
BLAKE2b-256 c38876c076c152be6d29a792defc3b3bff73de7f690e55f978b66adf6dbb8a1a

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.1-cp39-abi3-win_amd64.whl.

File metadata

  • Download URL: pymupdf-1.25.1-cp39-abi3-win_amd64.whl
  • Upload date:
  • Size: 16.6 MB
  • Tags: CPython 3.9+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.1-cp39-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 e2b0b73c0aab0f863e5132c93cfa4607e8129feb1afa3d544b2cf7f172c50b5a
MD5 3e87531aaa0dc69b057f2cadc7aee1f4
BLAKE2b-256 46728c5bbf817aacebe21a454f3ade8ee4b5b17afe698bb73d65c4ca23a89a87

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.1-cp39-abi3-win32.whl.

File metadata

  • Download URL: pymupdf-1.25.1-cp39-abi3-win32.whl
  • Upload date:
  • Size: 15.1 MB
  • Tags: CPython 3.9+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.1-cp39-abi3-win32.whl
Algorithm Hash digest
SHA256 fc7dbc1aa9e298a4c81084e389c9623c26fcaa232c71efaa073af150069e2221
MD5 247c7d1ff8dd9faff77831807bf5d2f2
BLAKE2b-256 a1d1440b267842a1374f8d55c508302882f2ef7dd0f859514f060e1618ef97aa

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.1-cp39-abi3-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.1-cp39-abi3-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 a687bd387589e30abd810a78a23341f57f43fa16a4d8d8c0b870bb6d89607343
MD5 4e55d10d1c43441143adddeb1765dfa1
BLAKE2b-256 8ce31a7a8400f1688c3c782478635ca929f85facd266157e4b90d650766bc49d

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.1-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.1-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 b63f8e9e65b0bda48f9217efd4d2a8c6d7a739dd28baf460c1ae78439b9af489
MD5 8456ab017260e29be694b581549a0780
BLAKE2b-256 77157bf672afb99002ad813aeb4886cc601bb9a4629210d9a3906a8d5650a941

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.1-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.1-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 a39afbd80381f43e30d6eb2ec4613f465f507ac2b76070abdd2da8724f32ef36
MD5 e23e0638f85a749ada11938ef395cdf5
BLAKE2b-256 32bfd7697604ea2b1fe299c7bdf4b57e3549693ce73f75c44e890cfd34837d23

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.1-cp39-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.1-cp39-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 15e6f4013ad0a029a2221920f9d2081f56dc43259dabfdf5cad7fbf1cee4b5a7
MD5 8eba43408a26fe0ecbaec826863e26e5
BLAKE2b-256 0eb62ad245dcbbb1abae9eeb8de5049b27c12c9ee8590c6c769499e386164bd6

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.1-cp39-abi3-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.1-cp39-abi3-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 793f9f6d51029e97851c711b3f6d9fe912313d95a306fbe8b1866f301d0e2bd3
MD5 455c06618762f1378950177a4f8a7671
BLAKE2b-256 927be7205ea48f547122c226a34f5452bc72915b6d06d7925970b8dd3493baf1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page