Skip to main content

A library for converting HTML files into PDF. The tool uses Chrome to render the HTML and print it into a pdf file.

Project description

Pdfy

DOI

Pdfy is a Python library for converting HTML (and anything Chrome can render) into PDF. It uses Chrome printing functionality, so the PDFs will be rendered exactly as printed in the browser.

Installation

To install the library, you need to run.

pip install pdfy

Additionally, you will need to install Chrome Driver.

Usage

Using the library is as easy as:

from pdfy import Pdfy
p = Pdfy()
p.html_to_pdf("html_file.htm", pdf_path="pdf_file.pdf")

More control over the PDF layout

If you need to have more control over the layout, you can pass additional parameters to html_to_pdf

options = {"paperWidth": 8.3, "paperHeight":11.7}
p.html_to_pdf("html_file.htm", pdf_path="pdf_file.pdf" options=options)

The full list of parameters is available on Chrome's Developer site.

Not saving the PDF

In the absence of the pdf_path argument, the html_to_pdf function will return the PDF as a base64 encoded string.

pdf = p.html_to_pdf("html_file.htm")

Multiple instances

The library will run Chrome in the background in the remote debug mode. This means that if your project requires multiple initialized Pdfy objects, you might need to change the port used for debugging. This can be done by passing the port number to Pdfy() as follows:

p = Pdfy(debug_port=9222) #9222 is the default port

Credits

This library is released under the Apache 2.0 License.

(C) Copyright 2018-2022 Mika Hämäläinen

Need for NLP solutions for your business?

Rootroo logo

My company, Rootroo offers consulting related to multilingual NLP tasks. We have a strong academic background in the state-of-the-art AI solutions for every NLP need. Just contact us, we won't bite.

Cite

@software{mika_hamalainen_2020_4108770,
  author       = {Mika Hämäläinen and
                  Hiromu Hota and
                  Mike and
                  Mirza Delic},
  title        = {mikahama/pdfy 1.0.50},
  month        = oct,
  year         = 2020,
  publisher    = {Zenodo},
  version      = {1.0.50},
  doi          = {10.5281/zenodo.4108770},
  url          = {https://doi.org/10.5281/zenodo.4108770}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

pdfy-1.2.0-py3-none-any.whl (7.4 kB view details)

Uploaded Python 3

File details

Details for the file pdfy-1.2.0-py3-none-any.whl.

File metadata

  • Download URL: pdfy-1.2.0-py3-none-any.whl
  • Upload date:
  • Size: 7.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.10

File hashes

Hashes for pdfy-1.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4f2925b9da61313531dc66cb28a533f640eaadf2802434be487a9ff1a0a9336c
MD5 620b7bc518ba33ef97e74b4540e8dc1a
BLAKE2b-256 aa15ee2cdb2ce9de8e3c90e05e26a76091c638817b79d18dd8b58be0507daeaa

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page