Skip to main content

A high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Project description

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.9 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PyMuPDF-1.24.14.tar.gz (56.2 MB view details)

Uploaded Source

Built Distributions

PyMuPDF-1.24.14-cp39-abi3-win_amd64.whl (16.3 MB view details)

Uploaded CPython 3.9+ Windows x86-64

PyMuPDF-1.24.14-cp39-abi3-win32.whl (15.0 MB view details)

Uploaded CPython 3.9+ Windows x86

PyMuPDF-1.24.14-cp39-abi3-musllinux_1_2_x86_64.whl (21.0 MB view details)

Uploaded CPython 3.9+ musllinux: musl 1.2+ x86-64

PyMuPDF-1.24.14-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (19.8 MB view details)

Uploaded CPython 3.9+ manylinux: glibc 2.17+ x86-64

PyMuPDF-1.24.14-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (19.3 MB view details)

Uploaded CPython 3.9+ manylinux: glibc 2.17+ ARM64

PyMuPDF-1.24.14-cp39-abi3-macosx_11_0_arm64.whl (18.4 MB view details)

Uploaded CPython 3.9+ macOS 11.0+ ARM64

PyMuPDF-1.24.14-cp39-abi3-macosx_10_9_x86_64.whl (19.2 MB view details)

Uploaded CPython 3.9+ macOS 10.9+ x86-64

File details

Details for the file PyMuPDF-1.24.14.tar.gz.

File metadata

  • Download URL: PyMuPDF-1.24.14.tar.gz
  • Upload date:
  • Size: 56.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for PyMuPDF-1.24.14.tar.gz
Algorithm Hash digest
SHA256 0eed9f998525eaf39706dbf2d0cf3162150f0f526e4a36b1748ffa50bde581ae
MD5 14d0e006a814dc14293a63bcfd37ca53
BLAKE2b-256 e06b6bd735144a190d26dcc23f98b4aae0e09b259cc4c87bba266a39b7b91f56

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.14-cp39-abi3-win_amd64.whl.

File metadata

  • Download URL: PyMuPDF-1.24.14-cp39-abi3-win_amd64.whl
  • Upload date:
  • Size: 16.3 MB
  • Tags: CPython 3.9+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for PyMuPDF-1.24.14-cp39-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 3d1f1ec2fe0249484afde7a0fc02589f19aaeb47c42939d23ae1d012aa1bc59b
MD5 9f3785fc51f830217c8b12e7baa08e5f
BLAKE2b-256 25b282d70d9f5aea5a33e770f37e6db43ed08b5dc71b3526c5d7051689d1031e

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.14-cp39-abi3-win32.whl.

File metadata

  • Download URL: PyMuPDF-1.24.14-cp39-abi3-win32.whl
  • Upload date:
  • Size: 15.0 MB
  • Tags: CPython 3.9+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for PyMuPDF-1.24.14-cp39-abi3-win32.whl
Algorithm Hash digest
SHA256 60a7ee7db3e0d3a4dcbe6df2781ba4487acb7e515c64ea9c857504f44effcb25
MD5 555bc2024dc1386ed780d28b40a4374b
BLAKE2b-256 872b46af7461bd299c3f52bc5455332cc82608cea1667cd692652505fdf9308e

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.14-cp39-abi3-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.14-cp39-abi3-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 2d1b5c47df2f8055de5dedfbd3189c742188261a8c257f406378382adac94cff
MD5 ca4223fe41a6d1b92f3b895f4613ef2f
BLAKE2b-256 77ed40eb23cf5e91de0510dfedb7d9feedeab5ce9691544ad09599e124a0a333

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.14-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.14-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 0de4f5ed903c2be6d0abcccdc796368939b51ce03916eb53292916e3b6ea65d6
MD5 03a5e151d2558606373ee5ced9657e30
BLAKE2b-256 27bf203d06c68660d5535db65b6c54cacd35b950945c11c1c4546d674f270892

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.14-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.14-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 37f24108e2e18150fb8d512dcccdfa1e3d9b9dd203ffaa7ffb959bb20aea40b4
MD5 d082cdd78de3b6904591dfa18fa3b5d3
BLAKE2b-256 5169518e6c088e20a5ded1fc658d4aec1e54c0f98f2d62d91362bd4231df9ecf

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.14-cp39-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.14-cp39-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 755906af4b4d693552ae5469ba682075853f4dc8a70639affd1bd6c049c5d900
MD5 686e176c895c06f26af46a1147a9726b
BLAKE2b-256 2c118d6f4c8fca86b93759e430c4b0b7b66f8067d58893d6fe0a193420d14453

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.14-cp39-abi3-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.14-cp39-abi3-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 b3ad7a4f4b607ff97f2e1b8111823dd3797dbb381ec851c3ae4695fea6f68478
MD5 fe93a1099180dadf8cfc61aa9973350a
BLAKE2b-256 62ce972b080c526af80577ffaa49676c05361ba152de94de3af339a2f3ac07c2

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page