Package acting as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or files.
HTML2Image
HTML2Image is a lightweight Python package that acts as a wrapper around the headless mode of existing web browsers to generate images from URLs and from HTML+CSS strings or files.
This package has been tested on Windows, Ubuntu (desktop and server) and macOS. It is currently a work in progress. If you encounter any problems or difficulties while using it, feel free to open an issue on the GitHub page of this project. Feedback is also welcome!
Principle
Most web browsers have a headless mode, a way to run them without displaying any graphical interface. Headless mode is mainly used for automated testing, but it also comes in handy if you want to take screenshots of web pages that are exact replicas of what you would see on your screen if you were using the browser yourself.
However, for the sake of taking screenshots, headless mode is not very convenient to use. HTML2Image aims to hide the inconveniences of the browsers' headless modes while adding useful features, such as creating an image from as little as a string.
For more information about headless modes:
- (Chrome) https://developers.google.com/web/updates/2017/04/headless-chrome
- (Firefox) https://developer.mozilla.org/en-US/docs/Mozilla/Firefox/Headless_mode
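To give a sense of what driving headless mode by hand involves, here is a minimal sketch that assembles a Chrome command line from the `--headless`, `--screenshot` and `--window-size` switches. The helper name `build_headless_command` and the executable name `google-chrome` are assumptions made for illustration, and this is not necessarily how HTML2Image calls the browser internally.

```python
from typing import List, Tuple


def build_headless_command(
    url: str,
    output_file: str,
    size: Tuple[int, int] = (1920, 1080),
    executable: str = "google-chrome",  # assumed name; it differs per platform
) -> List[str]:
    """Return the argument list for a headless Chrome screenshot."""
    width, height = size
    return [
        executable,
        "--headless",
        f"--screenshot={output_file}",
        f"--window-size={width},{height}",
        url,
    ]


command = build_headless_command("https://www.python.org", "python_org.png", (500, 200))
print(" ".join(command))
```

Passing such a list to `subprocess.run` would launch the browser, provided the executable is on the PATH. Keeping the arguments as a list avoids shell quoting issues with URLs.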
Installation
HTML2Image is published on PyPI and can be installed through pip:
pip install --upgrade html2image
In addition to this package, at least one of the following browsers must be installed on your machine:
- Google Chrome (Windows, macOS)
- Chromium Browser (Linux)
Usage
First, import the package and instantiate it:
from html2image import Html2Image
hti = Html2Image()
Multiple arguments can be passed to the constructor:
- browser: Browser that will be used, set by default to 'chrome' (the only browser supported by HTML2Image at the moment).
- chrome_path and firefox_path: The path or the command that can be used to find the executable of a specific browser.
- output_path: Path to the folder to which screenshots will be saved. Default is the current working directory of your Python program.
- size: 2-tuple representing the size of the screenshots that will be taken. Default value is (1920, 1080).
- temp_path: Path that will be used by html2image to put together different resources loaded with the load_str and load_file methods. Default value is %TEMP%/html2image on Windows, and /tmp/html2image on Linux and macOS.
Example:
hti = Html2Image(size=(500, 200))
You can also change these values later:
hti.size = (500, 200)
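The documented temp_path default (a folder named html2image inside the system temporary directory) can be approximated with the standard library. This is only a sketch: the helper name `default_temp_path` is hypothetical, and `tempfile.gettempdir()` may resolve to a different directory than the ones listed above, for example when `TMPDIR` is set.

```python
import os
import tempfile


def default_temp_path() -> str:
    """Approximate the documented default: <system temp dir>/html2image."""
    # gettempdir() honours environment overrides such as TMPDIR, TEMP and TMP,
    # so the result is not guaranteed to be /tmp or %TEMP%.
    return os.path.join(tempfile.gettempdir(), "html2image")


print(default_temp_path())
```

The returned value could be passed explicitly, as in `Html2Image(temp_path=default_temp_path())`, if you want the location to be visible in your own code.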
Then take a screenshot
The screenshot method is the basis of this package; most of the time, you won't need to use anything else. It can take screenshots of a lot of things:
- URLs via the url parameter;
- HTML and CSS files via the html_file and css_file parameters;
- HTML and CSS strings via the html_str and css_str parameters;
- and "other" types of files via the other_file parameter (try it with .svg files!).
And you can also (optional):
- Change the size of the screenshots using the size parameter;
- Save the screenshots as a specific name using the save_as parameter.
N.B.: The screenshot method returns a list containing the path(s) of the screenshot(s) it took.
A few examples
- URL to image
hti.screenshot(url='https://www.python.org', save_as='python_org.png')
- HTML & CSS strings to image
html = """<h1> An interesting title </h1> This page will be red"""
css = "body {background: red;}"
hti.screenshot(html_str=html, css_str=css, save_as='red_page.png')
- HTML & CSS files to image
hti.screenshot(
html_file='blue_page.html', css_file='blue_background.css',
save_as='blue_page.png'
)
- Other files to image
hti.screenshot(other_file='star.svg')
- Change the screenshots' size
hti.screenshot(other_file='star.svg', size=(500, 500))
- Change the directory to which the screenshots are saved
hti = Html2Image(output_path='my_screenshot_folder')
OR
hti.output_path = 'my_screenshot_folder'
N.B.: The output path will be changed for all future screenshots.
Use lists in place of any parameters while using the screenshot method
- Screenshot multiple objects using only one filename, or one filename per file:
# create three files from one filename
hti.screenshot(html_str=['A', 'B', 'C'], save_as='ABC.png')
# outputs ABC_0.png, ABC_1.png, ABC_2.png
# create three files from different filenames
hti.screenshot(html_str=['A', 'B', 'C'], save_as=['A.png', 'B.png', 'C.png'])
# outputs A.png, B.png, C.png
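The numbering shown above (ABC_0.png, ABC_1.png, ABC_2.png) can be reproduced with a few lines of standard-library code. The helper `expand_filenames` is hypothetical and only mirrors the documented output; leaving a single filename unnumbered is an assumption based on the earlier single-screenshot examples.

```python
import os
from typing import List


def expand_filenames(save_as: str, count: int) -> List[str]:
    """Derive `count` numbered filenames from a single one (hypothetical helper)."""
    if count == 1:
        # Assumption: a single screenshot keeps the name it was given.
        return [save_as]
    stem, extension = os.path.splitext(save_as)
    return [f"{stem}_{index}{extension}" for index in range(count)]


print(expand_filenames("ABC.png", 3))
# ['ABC_0.png', 'ABC_1.png', 'ABC_2.png']
```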
- Take multiple screenshots with the same size
# take four screenshots with a resolution of 100*50
hti.screenshot(
html_str=['A', 'B', 'C', 'D'],
size=(100, 50)
)
- Take multiple screenshots with different sizes
# take four screenshots with different resolutions from three given sizes
hti.screenshot(
html_str=['A', 'B', 'C', 'D'],
size=[(100, 50), (100, 100), (50, 50)]
)
# respectively 100*50, 100*100, 50*50, 50*50
# if not enough sizes are given, the last size in the list will be repeated
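The "repeat the last size" rule can be illustrated independently of the package. The sketch below uses a hypothetical helper, `pad_sizes`, to show the rule as described. Truncating a list that is too long is an extra assumption.

```python
from typing import List, Tuple

Size = Tuple[int, int]


def pad_sizes(sizes: List[Size], count: int) -> List[Size]:
    """Repeat the last size until there is one size per screenshot."""
    if not sizes:
        raise ValueError("at least one size is required")
    missing = max(0, count - len(sizes))
    # Assumption: extra sizes beyond `count` are simply ignored.
    return sizes[:count] + [sizes[-1]] * missing


print(pad_sizes([(100, 50), (100, 100), (50, 50)], 4))
# [(100, 50), (100, 100), (50, 50), (50, 50)]
```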
- Apply CSS string(s) to multiple HTML string(s)
# screenshot two html strings and apply css strings on both
hti.screenshot(
html_str=['A', 'B'],
css_str='body {background: red;}'
)
# screenshot two html strings and apply multiple css strings on both
hti.screenshot(
html_str=['A', 'B'],
css_str=['body {background: red;}', 'body {font-size: 50px;}']
)
# screenshot one html string and apply multiple css strings on it
hti.screenshot(
html_str='A',
css_str=['body {background: red;}', 'body {font-size: 50px;}']
)
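One way to picture the behaviour above is that every CSS string ends up attached to every HTML string. The sketch below models that with a hypothetical `compose_pages` helper that inlines the styles in a `<style>` tag. How HTML2Image actually combines the resources is not stated here, so treat this as an illustration of the semantics only.

```python
from typing import List, Union


def compose_pages(
    html_str: Union[str, List[str]],
    css_str: Union[str, List[str]],
) -> List[str]:
    """Return one document per HTML string, each carrying every CSS string."""
    # Accept either a single string or a list, as the screenshot method does.
    html_items = [html_str] if isinstance(html_str, str) else list(html_str)
    css_items = [css_str] if isinstance(css_str, str) else list(css_str)
    stylesheet = "\n".join(css_items)
    return [f"<style>{stylesheet}</style>{html}" for html in html_items]


pages = compose_pages(["A", "B"], ["body {background: red;}", "body {font-size: 50px;}"])
print(len(pages))
```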
- Retrieve the path of the generated file(s)
The screenshot method returns a list containing the path(s) of the screenshot(s):
paths = hti.screenshot(
html_str=['A', 'B', 'C'],
save_as="letters.png",
)
print(paths)
# >>> ['D:\\myFiles\\letters_0.png', 'D:\\myFiles\\letters_1.png', 'D:\\myFiles\\letters_2.png']
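Because the return value is a plain list of paths, it can be checked with ordinary file-system code. A minimal sketch, assuming you want to confirm that each reported file was actually written (the helper name `missing_screenshots` is hypothetical):

```python
from pathlib import Path
from typing import Iterable, List


def missing_screenshots(paths: Iterable[str]) -> List[str]:
    """Return the paths that do not point to an existing file."""
    return [path for path in paths if not Path(path).is_file()]
```

For example, `missing_screenshots(hti.screenshot(html_str=['A', 'B']))` would return an empty list if both images were written.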
Using the CLI
HTML2Image comes with a Command Line Interface which you can use to generate screenshots from files and URLs on the go.
The CLI is a work in progress and may be subject to changes.
You can call it by typing hti or html2image into a terminal.
argument | description | example |
---|---|---|
-h, --help | Shows the help message | hti -h |
-U, --urls | Screenshots a list of URLs | hti -U https://www.python.org |
-H, --html | Screenshots a list of HTML files | hti -H file.html |
-C, --css | Attaches CSS files to the HTML ones | hti -H file.html -C style.css |
-O, --other | Screenshots a list of files of type "other" | hti -O star.svg |
-S, --save-as | A list of the screenshot filename(s) | hti -O star.svg -S star.png |
-s, --size | A list of the screenshot size(s) | hti -O star.svg -s 50,50 |
-o, --output_path | Changes the output path of the screenshots (default is the current working directory) | hti star.svg -o screenshot_dir |
-q, --quiet | Disables all CLI output | hti --quiet |
-v, --verbose | Shows more details, which can help with debugging | hti --verbose |
--chrome_path | Specifies a different Chrome path | |
--temp_path | Specifies a different temp path (where the files are loaded) | |
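When calling the CLI from a script, it can help to build the argument list programmatically. The sketch below uses only flags from the table above. The helper `build_hti_command` is hypothetical, and passing several list values as separate space-delimited arguments is an assumption.

```python
from typing import List, Optional, Sequence, Tuple


def build_hti_command(
    urls: Sequence[str],
    save_as: Optional[Sequence[str]] = None,
    size: Optional[Tuple[int, int]] = None,
    output_path: Optional[str] = None,
) -> List[str]:
    """Assemble an `hti` invocation for a list of URLs."""
    command = ["hti", "-U", *urls]
    if save_as:
        command += ["-S", *save_as]
    if size:
        # Sizes are written as WIDTH,HEIGHT, as in the `-s 50,50` example.
        command += ["-s", f"{size[0]},{size[1]}"]
    if output_path:
        command += ["-o", output_path]
    return command


print(build_hti_command(["https://www.python.org"], ["python_org.png"], (500, 200)))
```

The resulting list can be handed to `subprocess.run` once the package, and therefore the hti entry point, is installed.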
Testing
Only basic testing is available at the moment. To run the tests, run pytest at the root of the project:
python -m pytest
TODO List
- A nice CLI (Currently in a WIP state)
- A better way to name the CLI's output files?
- Support of other browsers, such as Firefox
- More extensive doc + comments
- PDF generation?
- Testing on push/PR with GitHub Actions
- Use threads or multiprocessing to speed up screenshot taking