PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord in the #pymupdf channel.

Installation

PyMuPDF requires Python 3.9 or later. Install it using pip:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try PyMuPDF without installing it by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf  # import the PyMuPDF library
doc = pymupdf.open("example.pdf")  # open a document
for page in doc:  # iterate over the document's pages
    text = page.get_text()  # get the page's plain text as a Python str
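
As a slightly fuller sketch of the loop above, the helpers below collect the text of every page into a single string. The names `collect_text` and `extract_pdf_text` are illustrative and not part of PyMuPDF. The only library calls assumed are `pymupdf.open`, `page.get_text` and `doc.close`.

```python
def collect_text(pages, separator="\n"):
    """Join the plain text of each page-like object (anything with get_text())."""
    return separator.join(page.get_text() for page in pages)


def extract_pdf_text(path):
    """Open a document with PyMuPDF and return the text of all its pages."""
    # Imported lazily so that collect_text() stays usable without the library.
    import pymupdf

    doc = pymupdf.open(path)
    try:
        return collect_text(doc)
    finally:
        doc.close()  # release the file handle even if extraction fails
```

Calling `extract_pdf_text("example.pdf")` would return the text of the whole document. Keeping `collect_text` separate from the file handling makes the joining logic easy to check with stand-in page objects.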

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pymupdf-1.25.0.tar.gz (60.8 MB)

Uploaded: Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

pymupdf-1.25.0-cp39-abi3-win_amd64.whl (16.6 MB)

Uploaded: CPython 3.9+, Windows x86-64

pymupdf-1.25.0-cp39-abi3-win32.whl (15.1 MB)

Uploaded: CPython 3.9+, Windows x86

pymupdf-1.25.0-cp39-abi3-musllinux_1_2_x86_64.whl (21.1 MB)

Uploaded: CPython 3.9+, musllinux: musl 1.2+ x86-64

pymupdf-1.25.0-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (20.0 MB)

Uploaded: CPython 3.9+, manylinux: glibc 2.17+ x86-64

pymupdf-1.25.0-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (19.5 MB)

Uploaded: CPython 3.9+, manylinux: glibc 2.17+ ARM64

pymupdf-1.25.0-cp39-abi3-macosx_11_0_arm64.whl (18.6 MB)

Uploaded: CPython 3.9+, macOS 11.0+ ARM64

pymupdf-1.25.0-cp39-abi3-macosx_10_9_x86_64.whl (19.4 MB)

Uploaded: CPython 3.9+, macOS 10.9+ x86-64
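
To make the wheel file names above easier to read, here is a minimal sketch that splits a name into its fields. It assumes the standard wheel layout `name-version[-build]-python-abi-platform.whl`; the function `parse_wheel_name` is a hypothetical helper written for this illustration, not part of pip or PyMuPDF.

```python
def parse_wheel_name(filename):
    """Split a wheel file name into its dash-separated fields."""
    suffix = ".whl"
    if not filename.endswith(suffix):
        raise ValueError(f"not a wheel file name: {filename!r}")
    parts = filename[: -len(suffix)].split("-")
    # Five fields normally, six when an optional build tag follows the version.
    if len(parts) not in (5, 6):
        raise ValueError(f"unexpected number of fields in {filename!r}")
    python_tag, abi_tag, platform_tag = parts[-3:]
    return {
        "name": parts[0],
        "version": parts[1],
        "python": python_tag,  # e.g. cp39 = CPython 3.9
        "abi": abi_tag,  # e.g. abi3 = CPython stable ABI
        "platforms": platform_tag.split("."),  # several tags may be dot-joined
    }


info = parse_wheel_name(
    "pymupdf-1.25.0-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl"
)
print(info["python"], info["abi"], info["platforms"])
```

The `cp39-abi3` pair means the wheel is built against the stable ABI starting at CPython 3.9, which is why a single file per platform covers Python 3.9 and later.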

File details

Details for the file pymupdf-1.25.0.tar.gz.

File metadata

  • Download URL: pymupdf-1.25.0.tar.gz
  • Upload date:
  • Size: 60.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.0.tar.gz
Algorithm Hash digest
SHA256 9e5a33816e4b85ed6a01545cada2b866fc280a3b6478bb8e19c364532adf6692
MD5 623f71e73af72bf54e50f7ac851f8613
BLAKE2b-256 d29eec6139116b551922789eb72e710371ddd770a2236fbd5302c2a58670ebbc

See more details on using hashes here.
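
One way to use the digests listed above is to recompute the SHA256 of a downloaded file and compare it with the published value. The sketch below uses only Python's standard `hashlib` module; the helper names `sha256_of_file` and `matches_published_hash` are illustrative.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so large archives need not fit in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, `matches_published_hash("pymupdf-1.25.0.tar.gz", "9e5a33816e4b85ed6a01545cada2b866fc280a3b6478bb8e19c364532adf6692")` should return True for an intact download of the source distribution, assuming the file was saved under that name in the current directory.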

File details

Details for the file pymupdf-1.25.0-cp39-abi3-win_amd64.whl.

File metadata

  • Download URL: pymupdf-1.25.0-cp39-abi3-win_amd64.whl
  • Upload date:
  • Size: 16.6 MB
  • Tags: CPython 3.9+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.0-cp39-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 148800b9b14f2f48b5fc8f9213aeb94e272f1af4b533deeabe0e561d3bd334bf
MD5 1237b7fb7d9fce2d221b3cac5a2b9a3b
BLAKE2b-256 95d0aa79cc0c65ca6e8faf17cb44d779c1511a0da6525d617deb105b1ead0d98

File details

Details for the file pymupdf-1.25.0-cp39-abi3-win32.whl.

File metadata

  • Download URL: pymupdf-1.25.0-cp39-abi3-win32.whl
  • Upload date:
  • Size: 15.1 MB
  • Tags: CPython 3.9+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.11.10

File hashes

Hashes for pymupdf-1.25.0-cp39-abi3-win32.whl
Algorithm Hash digest
SHA256 cada9ebc14ae99253639e9d752f2e1cb8a62af813626c4391c62d39c41f8a43a
MD5 a5e0886f697162e3ada472c6c2caa66c
BLAKE2b-256 cafe31322d459dd7bb868c859d1a8f24f020f5c949496001f930a76f3190b830

File details

Details for the file pymupdf-1.25.0-cp39-abi3-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.0-cp39-abi3-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 e9d9e15af0fe389e15d605574f953df304d6037ce7cb4dca8b7cdec34d0694a3
MD5 3d7f025f16596097d14d759c925de0d8
BLAKE2b-256 4f99ecd6edc233367a5820015bb98d098be781f9d3ef973b66197c0990c9f1c8

File details

Details for the file pymupdf-1.25.0-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.0-cp39-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 62b8673a2348c1b63874d2b1e93ee29e66892e7ca8311df9c7f4a9aa1d89caed
MD5 8af0f625d040690e9f026be3efeed14b
BLAKE2b-256 6bea08041590cc1a8c66f5fefc3edd0b57f36b9225524586848d27ab470430e7

File details

Details for the file pymupdf-1.25.0-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.0-cp39-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 2e0eea2fcd7cbf84c8a18ec6744d9f92b81552df2974e40498baf41ec0dc557b
MD5 6ec9b211864faf06fd41b1160037191b
BLAKE2b-256 68acb512a88537b03f099b81ec66118969c82196d81af7ee710f231a2e18ab0a

File details

Details for the file pymupdf-1.25.0-cp39-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.0-cp39-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 227ea293734b92fc1f49a01ffebe0f3d212bc0ec3be7e2db172088b8eaff5a4a
MD5 1ed6bd62a83cbd306525b858930230e7
BLAKE2b-256 8b7f9811212db9cc72757b40a0224fe3ed4a1b057c72546a224ab61d8981a56f

File details

Details for the file pymupdf-1.25.0-cp39-abi3-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for pymupdf-1.25.0-cp39-abi3-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 d12a4a7d4456b2c09afb674428be2021b87a3c4afbae7f717d5d7e805a30a989
MD5 ea260c5c3ea4d880afb559b5a5a907f3
BLAKE2b-256 bdfec7810f3a960979963640dd7e8f5b485671fc423f11691df776ca3a093080
