A general collection of robot-evading tweaks for pyppeteer-ng
Project description
pyppeteerstealth
A bunch of hacks from different websites
Note!
- Does not yet pass https://arh.antoinevastel.com/bots/areyouheadless please help!
- Probably wont help where sites are using https://developer.mozilla.org/en-US/docs/Web/API/User-Agent_Client_Hints_API (
navigator.userAgentData.brands
etc)
If you know what is missing, please make a PR!!!
If you compare loading https://arh.antoinevastel.com/bots/ in your application, versus in your browser you might be able to see what is required to get the fingerprint closer to a "normal" browser (further away from a "headless" browser)
This is intended to be used with https://github.com/dgtlmoon/pyppeteer-ng and is also part of the https://changedetection.io project.
browser = await pyppeteer_instance.connect(browserWSEndpoint="ws://127.0.0.1:3000",
ignoreHTTPSErrors=True
)
self.page = (pages := await browser.pages) and len(pages) or await browser.newPage()
# Should be set with setUserAgent
user_agent = next((value for key, value in request_headers.items() if key.lower().strip() == 'user-agent'), DEFAULT_USER_AGENT)
await self.page.setUserAgent(user_agent)
try:
from pyppeteerstealth import inject_evasions_into_page
except ImportError:
logger.debug("pyppeteerstealth module not available, skipping")
pass
else:
await inject_evasions_into_page(self.page)
response = await self.page.goto("https://example.com", waitUntil="load")
Last report from https://bot.sannysoft.com/
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pyppeteerstealth-0.0.4.tar.gz
(9.9 kB
view hashes)
Built Distribution
Close
Hashes for pyppeteerstealth-0.0.4-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d046533db308f68c78021d148e213f0e2d05bd602381ce5857e0909281df2b7b |
|
MD5 | c29b56330d59f8e33640ccd9571d2f65 |
|
BLAKE2b-256 | b6c05007d96b836a394af82a1f98ba4c8984d87fe29d76a3769424d98314aafd |