Skip to main content

A set of utility classes and functions to process documents with Python

Project description

document-clipper

https://img.shields.io/pypi/v/document_clipper.svg https://img.shields.io/travis/reclamador/document_clipper.svg Documentation Status Updates https://coveralls.io/repos/github/reclamador/document_clipper/badge.svg?branch=master

A set of utility classes and functions to process documents with Python

Installation

The document_clipper package uses libraries that relies on several command-line tools included in the poppler-utils package such as: - pdftohtml - pdfimages - pftocairo

Before attempting to use document_clipper, please install the poppler-utils package.

For instance, in Ubuntu, you may do so by running the following command:

$ sudo apt-get install poppler-utils

Then, you may install document_clipper as usual via Python package managers, such as PIP:

$ pip install document_clipper

Features

  • Fetch the number of pages associated to a PDF file.
  • Extract the coordinates and dimensions of a given text located in a PDF file.
  • Combine multiple PDFs into a single PDF.
  • Combine multiple PDF and image files into a single PDF.
  • Generate a new PDF file containing a subset of a provided source PDF file’s pages. Rotations can be applied to each page individually.
  • Optionally fix the document(s) involved in the slicing/merging processes beforehand.

History

0.1.0 (2017-06-27)

  • First release on PyPI.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for document_clipper, version 1.2.0
Filename, size File type Python version Upload date Hashes
Filename, size document_clipper-1.2.0-py2.py3-none-any.whl (10.9 kB) File type Wheel Python version 3.5 Upload date Hashes View hashes
Filename, size document_clipper-1.2.0.tar.gz (2.2 MB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page