WeRead HTML Scanner
Project description
WeReadScan (html-scan variant)
About
This branch is a html-scan variant of WeReadScan, integrating script developed by Sec-ant.
Thanks for Sec-ant, this variant of WeReadScan can be more efficient.
More detail about Sec-ant's project, you can visit https://github.com/Sec-ant/weread-scraper
Get started
pip install WeReadScan-HTML
This package needs selenium, so you should have some basis of selenium.
Demo
Talk is cheap, just show you the code.
"""
demo.py
The demo of WeReadScan.py
Copyright 2023 by Algebra-FUN
ALL RIGHTS RESERVED.
"""
from selenium.webdriver import Edge
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.edge.options import Options
from WeReadScan import WeRead
options = Options()
options.add_argument("--disable-blink-features=AutomationControlled")
options.add_argument('disable-infobars')
options.add_argument('log-level=3')
options.add_argument("headless")
# service = Service("/Users/creator/Downloads/msedgedriver")
# launch Webdriver
print('Webdriver launching...')
driver = Edge(options=options)
# driver = Edge(service=service, options=options)
print('Webdriver launched.')
with WeRead(driver,debug=True) as weread:
weread.login() #? login for grab the whole book
weread.scan2html('https://weread.qq.com/web/reader/2c632ef071a486a92c60226kc81322c012c81e728d9d180')
weread.scan2html('https://weread.qq.com/web/reader/a9c32f40717db77aa9c9171kc81322c012c81e728d9d180')
Usage
Just code as demo show.
Stargazers over time
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
WeReadScan-HTML-0.1.1.tar.gz
(4.8 kB
view hashes)
Built Distribution
Close
Hashes for WeReadScan_HTML-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 071b2cf29985ed9d8fae09298d82a2c922844055040b2e0f169a913d82c82542 |
|
MD5 | 822afadffa0ce40ae7299ac66a3b08bf |
|
BLAKE2b-256 | 46007c6eaa22a21e18abacc2686bb295969f39c25655ae38a329d452a557b73d |