Skip to main content

A set of utility classes and functions to process documents with Python

Project description

document-clipper Documentation Status Updates

A set of utility classes and functions to process documents with Python


The document_clipper package uses libraries that relies on several command-line tools included in the poppler-utils package such as: - pdftohtml - pdfimages - pftocairo

Before attempting to use document_clipper, please install the poppler-utils package.

For instance, in Ubuntu, you may do so by running the following command:

$ sudo apt-get install poppler-utils

Then, you may install document_clipper as usual via Python package managers, such as PIP:

$ pip install document_clipper


  • Fetch the number of pages associated to a PDF file.

  • Extract the coordinates and dimensions of a given text located in a PDF file.

  • Combine multiple PDFs into a single PDF.

  • Combine multiple PDF and image files into a single PDF.

  • Generate a new PDF file containing a subset of a provided source PDF file’s pages. Rotations can be applied to each page individually.

  • Optionally fix the document(s) involved in the slicing/merging processes beforehand.


0.1.0 (2017-06-27)

  • First release on PyPI.

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

document_clipper-1.2.1.tar.gz (2.2 MB view hashes)

Uploaded Source

Built Distribution

document_clipper-1.2.1-py2.py3-none-any.whl (11.6 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page