Skip to main content

A high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Project description

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.9 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pymupdf-1.25.3.tar.gz (67.3 MB view details)

Uploaded Source

Built Distributions

pymupdf-1.25.3-cp39-abi3-win_amd64.whl (16.5 MB view details)

Uploaded CPython 3.9+ Windows x86-64

pymupdf-1.25.3-cp39-abi3-win32.whl (15.1 MB view details)

Uploaded CPython 3.9+ Windows x86

pymupdf-1.25.3-cp39-abi3-musllinux_1_2_x86_64.whl (21.1 MB view details)

Uploaded CPython 3.9+ musllinux: musl 1.2+ x86-64

pymupdf-1.25.3-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (20.0 MB view details)

Uploaded CPython 3.9+ manylinux: glibc 2.17+ x86-64

pymupdf-1.25.3-cp39-abi3-macosx_11_0_arm64.whl (18.6 MB view details)

Uploaded CPython 3.9+ macOS 11.0+ ARM64

pymupdf-1.25.3-cp39-abi3-macosx_10_9_x86_64.whl (19.3 MB view details)

Uploaded CPython 3.9+ macOS 10.9+ x86-64

File details

Details for the file pymupdf-1.25.3.tar.gz.

File metadata

  • Download URL: pymupdf-1.25.3.tar.gz
  • Upload date:
  • Size: 67.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.3.tar.gz
Algorithm Hash digest
SHA256 b640187c64c5ac5d97505a92e836da299da79c2f689f3f94a67a37a493492193
MD5 c412d9b99906111f41f1aa0aa8f9a0f1
BLAKE2b-256 0647b61c1c44b87cbdaeecdec3f43ce524ed6b3c72172bc6184eb82c94fbc43d

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.3-cp39-abi3-win_amd64.whl.

File metadata

  • Download URL: pymupdf-1.25.3-cp39-abi3-win_amd64.whl
  • Upload date:
  • Size: 16.5 MB
  • Tags: CPython 3.9+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.3-cp39-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 4fb357438c9129fbf939b5af85323434df64e36759c399c376b62ad6da95498c
MD5 783f9d5d40ad9f5a142a8833587b3492
BLAKE2b-256 09e0d72e88a1d5e23aa381fd463057dc3d0fb29090e1e7308a870c334716579c

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.3-cp39-abi3-win32.whl.

File metadata

  • Download URL: pymupdf-1.25.3-cp39-abi3-win32.whl
  • Upload date:
  • Size: 15.1 MB
  • Tags: CPython 3.9+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.3-cp39-abi3-win32.whl
Algorithm Hash digest
SHA256 bca72e6089f985d800596e22973f79cc08af6cbff1d93e5bda9248326a03857c
MD5 9ae31f3246a561d4995304783d40b187
BLAKE2b-256 835371faaaf91c56f2883b13f3dd849bf2697f012eb35eb7b952d62734cff41f

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.3-cp39-abi3-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.3-cp39-abi3-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 a5de51efdbe4d486b6c1111c84e8a231cbfb426f3d6ff31ab530ad70e6f39756
MD5 4ae02b035d9048c0cfa3e73afe45705c
BLAKE2b-256 42ee3c449b0de061440ba1ac984aa845315e9e2dca0ff2003c5adfc6febff203

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.3-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.3-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 46d90c4f9e62d1856e8db4b9f04a202ff4a7f086a816af73abdc86adb7f5e25a
MD5 00b98102acbb2b8f14f8ccaa05cefd11
BLAKE2b-256 52debd1418e31f73d37b8381cd5deacfd681e6be702b8890e123e83724569ee1

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.3-cp39-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.3-cp39-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 6ef753005b72ebfd23470f72f7e30f61e21b0b5e748045ec5b8f89e6e3068d62
MD5 d061c1105f403ec51600a1dc9a669b2e
BLAKE2b-256 14624e12126db174c8cfbf692281cda971cc4046c5f5226032c2cfaa6f83e08d

See more details on using hashes here.

File details

Details for the file pymupdf-1.25.3-cp39-abi3-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.3-cp39-abi3-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 96878e1b748f9c2011aecb2028c5f96b5a347a9a91169130ad0133053d97915e
MD5 a802220dfa1725dc4abbb8b0d5bf23e3
BLAKE2b-256 619b98ef4b98309e9db3baa9fe572f0e61b6130bb9852d13189970f35b703499

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page