Skip to main content

Read and write PDFs with Python, powered by qpdf

Project description

pikepdf is a Python library allowing creation, manipulation and repair of PDF files. It is provides a wrapper around QPDF.

Python + QPDF = “py” + “qpdf” = “pyqpdf”, which looks like a dyslexia test. Say it out loud, and it sounds like “pikepdf”.

This package is in pre-alpha.

Python 3.5 and 3.6 are fully supported.

Features:

  • Editing, manipulation and transformation of existing PDFs

  • Based on the mature, proven QPDF C++ library

  • Can read and write PDFs with any type of PDF encryption (except public key)

  • Supports all PDF compression filters

  • Supports PDF object streams

  • Supports PDF 1.3 through 1.7

  • Can manipulate PDF/A and other types of PDF without losing their metadata marker

  • Can create “fast web view” (linearized) PDFs

  • Automatically recovers and repairs damaged PDFs

  • Implements more of the PDF specification than existing Python PDF tools

  • For convenience, renders PDF pages or embedded PDF images in Jupyter notebooks and IPython

This library is similar to PyPDF2 and pdfrw – it provides low level access to PDF features and allows editing and content transformation of existing PDFs, and requires some knowledge of the PDF specification.

Python 2.7 and earlier versions of Python 3 are not currently supported but support is probably not difficult to achieve. Pull requests are welcome.

Installation

On Unix (Linux, macOS)

Binary wheels are available for x86-64 Linux platforms and Intel macOS. 32-bit wheels will be added if anyone needs them.

  • pip install pikepdf

From source

A C++11 compliant compiler is required, which includes most recent versions of GCC (4.8 and up) and clang (3.3 and up). A C++14 compiler is recommended.

libqpdf 7.0.0 is required at compile time and runtime. Many platforms have not updated to this version, so you may need to install this program without a package manager.

  • clone this repository

  • install libjpeg, zlib and qpdf on your platform, including headers

  • pip install ./pikepdf

On Windows (Requires Visual Studio 2015)

Windows is not currently part of continuous integration, so this might not work.

  • For Python 3.5:

    • clone this repository

    • pip install ./pikepdf

pikepdf requires a C++11 compliant compiler (i.e. Visual Studio 2015 on Windows). Running a regular pip install command will detect the version of the compiler used to build Python and attempt to build the extension with it. We must force the use of Visual Studio 2015.

::
  • clone this repository

  • “%VS140COMNTOOLS%....VCvcvarsall.bat” x64

  • set DISTUTILS_USE_SDK=1

  • set MSSdk=1

  • pip install ./python_example

Note that this requires the user building python_example to have registry edition rights on the machine, to be able to run the vcvarsall.bat script.

Windows runtime requirements

On Windows, the Visual C++ 2015 redistributable packages are a runtime requirement for this project. It can be found here.

Building the documentation

Documentation for the example project is generated using Sphinx. Sphinx has the ability to automatically inspect the signatures and documentation strings in the extension module to generate beautiful documentation in a variety formats. The following command generates HTML-based reference documentation; for other formats please refer to the Sphinx manual:

  • cd pikepdf/docs

  • make html

About Python 2.7

The author’s priority is building a great PDF library for Python for future applications, which means there isn’t time to target Python 2.7. Currently the C++ source compiles and links correctly, so all that is necessary is backporting Python 3 source files.

It was recently confirmed that the C++ code base compiles and links with Python 2.7. One would need to backport the Python source files and fix any test suite regressions. Pull requests are welcome.

License

pikepdf is provided under the Mozilla Public License 2.0 license (MPL) that can be found in the LICENSE file. By using, distributing, or contributing to this project, you agree to the terms and conditions of this license.

Informally, MPL 2.0 is a not a “viral” license. It may be combined with other work, including commercial software. However, you must disclose your modifications to pikepdf in source code form. In other works, fork this repository on Github or elsewhere and commit your contributions there, and you’ve satisfied the license.

The tests/resources/copyright file describes licensing terms for the test suite and the provenance of test resources.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pikepdf-0.1.3.tar.gz (965.5 kB view details)

Uploaded Source

Built Distributions

pikepdf-0.1.3-cp36-cp36m-manylinux1_x86_64.whl (6.0 MB view details)

Uploaded CPython 3.6m

pikepdf-0.1.3-cp36-cp36m-macosx_10_6_intel.whl (889.4 kB view details)

Uploaded CPython 3.6m macOS 10.6+ intel

pikepdf-0.1.3-cp35-cp35m-manylinux1_x86_64.whl (6.0 MB view details)

Uploaded CPython 3.5m

pikepdf-0.1.3-cp35-cp35m-macosx_10_6_intel.whl (889.4 kB view details)

Uploaded CPython 3.5m macOS 10.6+ intel

File details

Details for the file pikepdf-0.1.3.tar.gz.

File metadata

  • Download URL: pikepdf-0.1.3.tar.gz
  • Upload date:
  • Size: 965.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pikepdf-0.1.3.tar.gz
Algorithm Hash digest
SHA256 42fdedd23a27318a8d4396b4c2a9765de04f9c178d2f3b227762bb19e81f7fc8
MD5 3d7a2d239e1ec80b50f2b49d45ede458
BLAKE2b-256 1c183866e1dbee27e644b8020e006e67ace3fa34cb66c7d58265044f3df5cbcb

See more details on using hashes here.

File details

Details for the file pikepdf-0.1.3-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for pikepdf-0.1.3-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 9ed09ee6a1d81232792b0c417edda5d5121946cf231da3780a52947aa3490a95
MD5 95bb3faf385d43c7c14f64a0e7684064
BLAKE2b-256 b05dc5d1780f06a119fcd739559fef156d6667dd5c8f2be16436b5ce7b78f956

See more details on using hashes here.

File details

Details for the file pikepdf-0.1.3-cp36-cp36m-macosx_10_6_intel.whl.

File metadata

File hashes

Hashes for pikepdf-0.1.3-cp36-cp36m-macosx_10_6_intel.whl
Algorithm Hash digest
SHA256 8364bb57aab3a5629b861759e209b180c18a5f9f27a72461b86c62486fb33367
MD5 e77954efa3c0dc6065acedb9ac4c6dd3
BLAKE2b-256 5c957a42d71a0df7fcfb1520ce0b6e919d6f881709bbb99315672cff4829551b

See more details on using hashes here.

File details

Details for the file pikepdf-0.1.3-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

File hashes

Hashes for pikepdf-0.1.3-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 892fd8bcc3e80000ef8e7eb7997ed35d9d05208663462671fd1a4217a15ac57d
MD5 36c26ac050653e5f1a2284e4bd3fb99c
BLAKE2b-256 c01f452d9fb7683b515ee5701cf9b11a5179496977a80d646e664dc6d088dbcf

See more details on using hashes here.

File details

Details for the file pikepdf-0.1.3-cp35-cp35m-macosx_10_6_intel.whl.

File metadata

File hashes

Hashes for pikepdf-0.1.3-cp35-cp35m-macosx_10_6_intel.whl
Algorithm Hash digest
SHA256 29aef590412f136bc4b04df64100bd825e576e8e13e599409a43b0b89ace21fa
MD5 3757005235a04b8025af045fb4e09f3d
BLAKE2b-256 1a6fb77a617938926d2fcb94d88ef2da90a4a120ba106b7296fa6f324c7ea189

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page