Skip to main content

A high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Project description

PyMuPDF

PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.

Community

Join us on Discord here: #pymupdf

Installation

PyMuPDF requires Python 3.8 or later, install using pip with:

pip install PyMuPDF

There are no mandatory external dependencies. However, some optional features become available only if additional packages are installed.

You can also try without installing by visiting PyMuPDF.io.

Usage

Basic usage is as follows:

import pymupdf # imports the pymupdf library
doc = pymupdf.open("example.pdf") # open a document
for page in doc: # iterate the document pages
  text = page.get_text() # get plain text encoded as UTF-8

Documentation

Full documentation can be found on pymupdf.readthedocs.io.

Optional Features

  • fontTools for creating font subsets.
  • pymupdf-fonts contains some nice fonts for your text output.
  • Tesseract-OCR for optical character recognition in images and document pages.

About

PyMuPDF adds Python bindings and abstractions to MuPDF, a lightweight PDF, XPS, and eBook viewer, renderer, and toolkit. Both PyMuPDF and MuPDF are maintained and developed by Artifex Software, Inc.

PyMuPDF was originally written by Jorj X. McKie.

License and Copyright

PyMuPDF is available under open-source AGPL and commercial license agreements. If you determine you cannot meet the requirements of the AGPL, please contact Artifex for more information regarding a commercial license.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PyMuPDF-1.24.11.tar.gz (51.2 MB view details)

Uploaded Source

Built Distributions

PyMuPDF-1.24.11-cp38-abi3-win_amd64.whl (16.0 MB view details)

Uploaded CPython 3.8+ Windows x86-64

PyMuPDF-1.24.11-cp38-abi3-win32.whl (14.7 MB view details)

Uploaded CPython 3.8+ Windows x86

PyMuPDF-1.24.11-cp38-abi3-musllinux_1_2_x86_64.whl (20.7 MB view details)

Uploaded CPython 3.8+ musllinux: musl 1.2+ x86-64

PyMuPDF-1.24.11-cp38-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl (19.6 MB view details)

Uploaded CPython 3.8+ manylinux: glibc 2.17+ x86-64

PyMuPDF-1.24.11-cp38-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl (19.0 MB view details)

Uploaded CPython 3.8+ manylinux: glibc 2.17+ ARM64

PyMuPDF-1.24.11-cp38-abi3-macosx_11_0_arm64.whl (18.2 MB view details)

Uploaded CPython 3.8+ macOS 11.0+ ARM64

PyMuPDF-1.24.11-cp38-abi3-macosx_10_9_x86_64.whl (18.9 MB view details)

Uploaded CPython 3.8+ macOS 10.9+ x86-64

File details

Details for the file PyMuPDF-1.24.11.tar.gz.

File metadata

  • Download URL: PyMuPDF-1.24.11.tar.gz
  • Upload date:
  • Size: 51.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.13

File hashes

Hashes for PyMuPDF-1.24.11.tar.gz
Algorithm Hash digest
SHA256 6e45e57f14ac902029d4aacf07684958d0e58c769f47d9045b2048d0a3d20155
MD5 d53ecd74044fa614805f4af8561af4f0
BLAKE2b-256 d4a33edbb6be649e311107b320141cae0353d4cc9c6593eba7691f16c53c9c71

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.11-cp38-abi3-win_amd64.whl.

File metadata

  • Download URL: PyMuPDF-1.24.11-cp38-abi3-win_amd64.whl
  • Upload date:
  • Size: 16.0 MB
  • Tags: CPython 3.8+, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.13

File hashes

Hashes for PyMuPDF-1.24.11-cp38-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 745ce77532702d6ddeeecb47306d3669629aa5ff82708318cd652881f493b0ba
MD5 69972669eb84f06146e02686ea84a896
BLAKE2b-256 f60399895f003d7ff59c83d524aeccecff4e1ee1f39a7724f88acfda4f67b8bc

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.11-cp38-abi3-win32.whl.

File metadata

  • Download URL: PyMuPDF-1.24.11-cp38-abi3-win32.whl
  • Upload date:
  • Size: 14.7 MB
  • Tags: CPython 3.8+, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.10.13

File hashes

Hashes for PyMuPDF-1.24.11-cp38-abi3-win32.whl
Algorithm Hash digest
SHA256 6fda6c7ed7e6ad74d9cfac5c3837ef42efd58c506440e2513a0a200bc3c4dbc0
MD5 7350f5dbf39137ebcd3362221d758bdd
BLAKE2b-256 286ba0247598f06585d84ae9927d6ed191d89d38686ad6bf0dadc0ed699a77e7

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.11-cp38-abi3-musllinux_1_2_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.11-cp38-abi3-musllinux_1_2_x86_64.whl
Algorithm Hash digest
SHA256 9b7ac5b8ec3daec17f2e830962ed091610e576a5e531d2fe28c437fbd69b1969
MD5 c65053e855341eb5927b773a1286b393
BLAKE2b-256 6a929ff020892560f80433876ec904c0f2669d1d69403adf412565e54a946615

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.11-cp38-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.11-cp38-abi3-manylinux2014_x86_64.manylinux_2_17_x86_64.whl
Algorithm Hash digest
SHA256 2efb793644df99db0fe2468149048175cf25c5803997828efc9152aca838f5f2
MD5 6bca90bade44d2c5171bba9f07223775
BLAKE2b-256 ca21ad66778ad2485f87ef1d5a36f17ec8d4aee8ce247c8e46c673eff776a877

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.11-cp38-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.11-cp38-abi3-manylinux2014_aarch64.manylinux_2_17_aarch64.whl
Algorithm Hash digest
SHA256 32fd013e3c844f105c0a6a43ee82acc7cd0c900f6ff14f5eed9492840bbcbdd9
MD5 7b71f4ec21fb50c9464a6d1a3a17ecbc
BLAKE2b-256 553d84bd559129d2ff07267baae0bde0c6f4f49232408b547971f7a2e1534cb9

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.11-cp38-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.11-cp38-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 20c8eb65b855a33411246d6697a3f3166727fe2d8585753cf0db648730104be6
MD5 dfb2d7ba0fb30dc29193ac3418c75612
BLAKE2b-256 16f88396ca7218622cb3600c919b320a24f05b7c14bd81eea03f3f2182844a06

See more details on using hashes here.

File details

Details for the file PyMuPDF-1.24.11-cp38-abi3-macosx_10_9_x86_64.whl.

File metadata

File hashes

Hashes for PyMuPDF-1.24.11-cp38-abi3-macosx_10_9_x86_64.whl
Algorithm Hash digest
SHA256 24c35ba9e731027ff24566b90d4986e9aac75e1ce47589b25de51e3c687ddb73
MD5 436663b712e83e38635c53fb79fb59ff
BLAKE2b-256 f575b059d603530d99926de2b6a64314f3534e2149ee5496142de550c66907ac

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page