Skip to main content

A high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Project description

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.9 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PyMuPDF-1.24.12.tar.gz (52.9 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

PyMuPDF-1.24.12-cp39-abi3-win_amd64.whl (16.0 MB view details)

Uploaded CPython 3.9+Windows x86-64

PyMuPDF-1.24.12-cp39-abi3-win32.whl (14.7 MB view details)

Uploaded CPython 3.9+Windows x86

PyMuPDF-1.24.12-cp39-abi3-musllinux_1_2_x86_64.whl (20.7 MB view details)

Uploaded CPython 3.9+musllinux: musl 1.2+ x86-64

PyMuPDF-1.24.12-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (19.6 MB view details)

Uploaded CPython 3.9+manylinux: glibc 2.17+ x86-64

PyMuPDF-1.24.12-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (19.0 MB view details)

Uploaded CPython 3.9+manylinux: glibc 2.17+ ARM64

PyMuPDF-1.24.12-cp39-abi3-macosx_11_0_arm64.whl (18.2 MB view details)

Uploaded CPython 3.9+macOS 11.0+ ARM64

PyMuPDF-1.24.12-cp39-abi3-macosx_10_9_x86_64.whl (18.9 MB view details)

Uploaded CPython 3.9+macOS 10.9+ x86-64

File details

Details for the file PyMuPDF-1.24.12.tar.gz.

File metadata

  • Download URL: PyMuPDF-1.24.12.tar.gz
  • Upload date:
  • Size: 52.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for PyMuPDF-1.24.12.tar.gz
Algorithm Hash digest
SHA256 ba6d212d7a286b6fab9024c669aa314edfcbdd5b45fa6f5ea5d463a9e7576e52
MD5 bd8f553c25c8fc5e2f40e35de84c6754
BLAKE2b-256 cfcc073855527996078f4f42d6022f00fdb050127715aaa585b32eaf470ae698

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.12-cp39-abi3-win_amd64.whl.

File metadata

  • Download URL: PyMuPDF-1.24.12-cp39-abi3-win_amd64.whl
  • Upload date:
  • Size: 16.0 MB
  • Tags: CPython 3.9+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for PyMuPDF-1.24.12-cp39-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 9017bb5c05e3480c1f5a15671be9208363e0a31307445480ac5a810672f7342e
MD5 1ac5bc2509d5e175327332cbb7fdf4c3
BLAKE2b-256 4f5bb59a5036832340fd7892d05843c23a2e30e0ae173aea82b1585cc6d5d78c

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.12-cp39-abi3-win32.whl.

File metadata

  • Download URL: PyMuPDF-1.24.12-cp39-abi3-win32.whl
  • Upload date:
  • Size: 14.7 MB
  • Tags: CPython 3.9+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for PyMuPDF-1.24.12-cp39-abi3-win32.whl
Algorithm Hash digest
SHA256 2d099328fefdf0ee53a4146db9b05aab120b6e4d68cc4510c07a706fed19625a
MD5 9d6f0e46620cb41a6c5a2521143d924e
BLAKE2b-256 4074faa38b31d6ce43298e435b55a4b354f89aea620d042176e1579927f050c8

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.12-cp39-abi3-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.12-cp39-abi3-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 f021d5060f946662345d62564fb06c4289d8522d4a249543e26de5f9ff3c9d06
MD5 564f31cb5c0202e91426ef2dba0a97f1
BLAKE2b-256 cbc5e90f5cfd61fe8c9419bcd4be0b6a1b8fb00d7d21810514c1f85f1cc41011

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.12-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.12-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 dd8e8a24601c4483ed8a3699922fe48ad0179d28121816a131ba8c4534ad5682
MD5 ce769811eafc63de4423b8c522c28d0e
BLAKE2b-256 d4bb85711eac0dbb45b5dcd77e77386564dc3da845f72f759d491ac12b7f8177

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.12-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.12-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 d4a387a0e98a63b1ccc326032ffe7d3982d3d92495b661ca89ea634048e11f4e
MD5 8174fb62d8f1168c0a69fc23d4acca87
BLAKE2b-256 5de5ba0adaf6d4b820c40c6b0067127f06be59619b17302b4f76c636db803bfd

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.12-cp39-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.12-cp39-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 deac1546a80afc0930ffefb9714ff8aec54210ed2b1d8ecdd44e4c2b9e0ee839
MD5 61cbf9e228a2913807f69fdd38394c87
BLAKE2b-256 f9f806046824f3bc45ebeebc0add62bc50e04cc6cf29b533d105941794bd59e9

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.12-cp39-abi3-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.12-cp39-abi3-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 2228a46edc53bf8f83cbe5a8cf0a83bb8cf406614e6976906d3655355e079ec5
MD5 970b1ea8f639271db087abaa2ec61b3e
BLAKE2b-256 08b5fc669d9b98f8e25ef77484c595a67901e97a0888af26133e769552bbd4c5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page