Headless chrome/chromium automation library (unofficial port of puppeteer)

Project description

Unofficial Python port of puppeteer, the JavaScript (headless) Chrome/Chromium browser automation library.

WORK IN PROGRESS

Currently not all features are tested.

Installation

Pyppeteer requires Python 3.6+.

Install with pip from PyPI:

python3 -m pip install pyppeteer

Or install the latest version from GitHub:

python3 -m pip install -U git+https://github.com/miyakogi/pyppeteer.git@dev
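
Because pyppeteer requires Python 3.6 or newer, it can help to check the interpreter before installing. The following is a minimal sketch; `meets_requirement` and `MIN_VERSION` are hypothetical names used for illustration and are not part of pyppeteer.

```python
import sys

# Minimum version stated in this README.
MIN_VERSION = (3, 6)


def meets_requirement(version_info=sys.version_info):
    """Return True if the given version tuple satisfies the 3.6+ requirement."""
    # Compare only (major, minor); the patch level does not matter here.
    return tuple(version_info[:2]) >= MIN_VERSION


if __name__ == "__main__":
    print("Python version OK:", meets_requirement())
```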

Usage

The code below opens a web page and takes a screenshot.

import asyncio
from pyppeteer.launcher import launch

async def main():
    browser = launch()
    page = await browser.newPage()
    await page.goto('http://example.com')
    await page.screenshot({'path': 'example.png'})
    browser.close()

asyncio.get_event_loop().run_until_complete(main())

Pyppeteer has almost the same API as puppeteer. More APIs are listed in the documentation.

Puppeteer’s documentation is also useful for pyppeteer users.

Differences between puppeteer and pyppeteer

Pyppeteer aims to be as similar to puppeteer as possible, but some differences between Python and JavaScript make that difficult.

The differences between puppeteer and pyppeteer are listed below.

Keyword argument for options

Puppeteer uses an object (a dictionary in Python) to pass options to functions and methods. Pyppeteer accepts both a dictionary and keyword arguments for options.

Dictionary style option (similar to puppeteer):

browser = launch({'headless': True})

Keyword argument style option (more pythonic, isn’t it?):

browser = launch(headless=True)
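
One way a function can accept both styles is to merge an optional dictionary with its keyword arguments. The sketch below is illustrative only: `merge_options` and `fake_launch` are hypothetical names, and the rule that keyword arguments override dictionary entries is an assumption made for this example, not a description of pyppeteer's internals.

```python
from typing import Any, Dict, Optional


def merge_options(options: Optional[Dict[str, Any]] = None,
                  **kwargs: Any) -> Dict[str, Any]:
    """Combine a dictionary of options with keyword arguments."""
    # Copy first so the caller's dictionary is never modified.
    merged = dict(options or {})
    # Assumption for this sketch: keyword arguments take precedence.
    merged.update(kwargs)
    return merged


def fake_launch(options: Optional[Dict[str, Any]] = None,
                **kwargs: Any) -> Dict[str, Any]:
    """Hypothetical stand-in for launch(); it only returns the merged options."""
    return merge_options(options, **kwargs)
```

With this sketch, `fake_launch({'headless': True})` and `fake_launch(headless=True)` produce the same options dictionary.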

Element selector method name ($ -> querySelector)

In Python, $ cannot be used in a method name. So pyppeteer uses Page.querySelector() instead of Page.$(), and ElementHandle.querySelector() instead of ElementHandle.$(). Pyppeteer also provides shorthands for these methods: Page.J() and ElementHandle.J().
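
The language-level reason can be checked directly, and a shorthand like J can be modelled as a plain class-level alias. This is a sketch only: `FakePage` is a hypothetical stand-in that does not talk to a browser, and it does not claim to show how pyppeteer defines its shorthand.

```python
class FakePage:
    """Hypothetical stand-in for a page object; no browser is involved."""

    def querySelector(self, selector: str) -> str:
        # A real implementation would look the element up in the page.
        return f"<element matching {selector!r}>"

    # Class-level alias: J refers to the same function as querySelector.
    J = querySelector


# "$" is not a valid Python identifier, while "J" is.
print("$".isidentifier())
print("J".isidentifier())

page = FakePage()
print(page.J("h1") == page.querySelector("h1"))
```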

Argument of Page.evaluate() / ElementHandle.evaluate()

Puppeteer’s version of evaluate() takes a raw JavaScript function, but pyppeteer takes a string containing the JavaScript function.

Example to get element’s inner text:

element = await page.querySelector('h1')
title = await element.evaluate('(element) => element.textContent')
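
Since the function is passed as a string, it can be assembled in Python before the call. The following is a minimal sketch assuming you want to read an arbitrary property; `property_getter` is a hypothetical helper, not part of pyppeteer. It uses `json.dumps` so the property name is embedded as a properly quoted JavaScript string literal.

```python
import json


def property_getter(prop: str) -> str:
    """Build the source of a JavaScript arrow function that reads one property."""
    # json.dumps yields a quoted and escaped string literal.
    return f"(element) => element[{json.dumps(prop)}]"


print(property_getter("textContent"))
```

The resulting string could then be passed to `element.evaluate()` in the same way as the literal string in the example above.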

Credits

This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.

Download files

Source Distribution

pyppeteer-0.0.7.tar.gz (52.5 kB)

Uploaded Source

File details

Details for the file pyppeteer-0.0.7.tar.gz.

File metadata

  • Download URL: pyppeteer-0.0.7.tar.gz
  • Upload date:
  • Size: 52.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pyppeteer-0.0.7.tar.gz

  • SHA256: 9085c38fdff84664820bac7826e42502ea376fda26837b6fd901304b2239b7bd
  • MD5: 73dfe2b0b9626447e0743bf12f1faae9
  • BLAKE2b-256: 48696a7ed70fa0a13bc4d789df5e4b7482d9437fc6272f8287525e3d22cedd65
