Skip to main content

A high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Project description

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.10 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pymupdf-1.26.6.tar.gz (84.3 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pymupdf-1.26.6-cp310-abi3-win_amd64.whl (18.4 MB view details)

Uploaded CPython 3.10+Windows x86-64

pymupdf-1.26.6-cp310-abi3-win32.whl (17.2 MB view details)

Uploaded CPython 3.10+Windows x86

pymupdf-1.26.6-cp310-abi3-manylinux_2_28_x86_64.whl (24.1 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.28+ x86-64

pymupdf-1.26.6-cp310-abi3-manylinux_2_28_aarch64.whl (23.5 MB view details)

Uploaded CPython 3.10+manylinux: glibc 2.28+ ARM64

pymupdf-1.26.6-cp310-abi3-macosx_11_0_arm64.whl (22.5 MB view details)

Uploaded CPython 3.10+macOS 11.0+ ARM64

pymupdf-1.26.6-cp310-abi3-macosx_10_9_x86_64.whl (23.2 MB view details)

Uploaded CPython 3.10+macOS 10.9+ x86-64

File details

Details for the file pymupdf-1.26.6.tar.gz.

File metadata

  • Download URL: pymupdf-1.26.6.tar.gz
  • Upload date:
  • Size: 84.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for pymupdf-1.26.6.tar.gz
Algorithm Hash digest
SHA256 a2b4531cd4ab36d6f1f794bb6d3c33b49bda22f36d58bb1f3e81cbc10183bd2b
MD5 27f4e3814c7e669267fd274e8a93d1bc
BLAKE2b-256 ecd7a6f0e03a117fa2ad79c4b898203bb212b17804f92558a6a339298faca7bb

See more details on using hashes here.

File details

Details for the file pymupdf-1.26.6-cp310-abi3-win_amd64.whl.

File metadata

  • Download URL: pymupdf-1.26.6-cp310-abi3-win_amd64.whl
  • Upload date:
  • Size: 18.4 MB
  • Tags: CPython 3.10+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for pymupdf-1.26.6-cp310-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 ce02ca96ed0d1acfd00331a4d41a34c98584d034155b06fd4ec0f051718de7ba
MD5 6dd9eb10d1ed02255f746960d4890787
BLAKE2b-256 f9e8989f4eaa369c7166dc24f0eaa3023f13788c40ff1b96701f7047421554a8

See more details on using hashes here.

File details

Details for the file pymupdf-1.26.6-cp310-abi3-win32.whl.

File metadata

  • Download URL: pymupdf-1.26.6-cp310-abi3-win32.whl
  • Upload date:
  • Size: 17.2 MB
  • Tags: CPython 3.10+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.11

File hashes

Hashes for pymupdf-1.26.6-cp310-abi3-win32.whl
Algorithm Hash digest
SHA256 8f3ef05befc90ca6bb0f12983200a7048d5bff3e1c1edef1bb3de60b32cb5274
MD5 46a7c19bc14a156a1237d35704fb1ebe
BLAKE2b-256 ab9d30f7fcb3776bfedde66c06297960debe4883b1667294a1ee9426c942e94d

See more details on using hashes here.

File details

Details for the file pymupdf-1.26.6-cp310-abi3-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.26.6-cp310-abi3-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 7777d0b7124c2ebc94849536b6a1fb85d158df3b9d873935e63036559391534c
MD5 88ae052a706d111bda91c99052dc1f8e
BLAKE2b-256 4fbf6e02e3d84b32c137c71a0a3dcdba8f2f6e9950619a3bc272245c7c06a051

See more details on using hashes here.

File details

Details for the file pymupdf-1.26.6-cp310-abi3-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for pymupdf-1.26.6-cp310-abi3-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 617ba69e02c44f0da1c0e039ea4a26cf630849fd570e169c71daeb8ac52a81d6
MD5 ead924218d410ebf787e9547c5a85cc8
BLAKE2b-256 e4c8377cf20e31f58d4c243bfcf2d3cb7466d5b97003b10b9f1161f11eb4a994

See more details on using hashes here.

File details

Details for the file pymupdf-1.26.6-cp310-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for pymupdf-1.26.6-cp310-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 6844cd2396553c0fa06de4869d5d5ecb1260e6fc3b9d85abe8fa35f14dd9d688
MD5 2b0f82e0f99b510f9031d0c4412edce6
BLAKE2b-256 eca011adb742d18142bd623556cd3b5d64649816decc5eafd30efc9498657e76

See more details on using hashes here.

File details

Details for the file pymupdf-1.26.6-cp310-abi3-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.26.6-cp310-abi3-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 e46f320a136ad55e5219e8f0f4061bdf3e4c12b126d2740d5a49f73fae7ea176
MD5 b34d945f428a5479bc27c48ee30113ea
BLAKE2b-256 9e5cdec354eee5fe4966c715f33818ed4193e0e6c986cf8484de35b6c167fb8e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page