WeRead HTML Scanner
Project description
WeReadScan (html-scan variant)
About
This branch is a html-scan variant of WeReadScan, integrating script developed by Sec-ant.
Thanks for Sec-ant, this variant of WeReadScan can be more efficient.
More detail about Sec-ant's project, you can visit https://github.com/Sec-ant/weread-scraper
Get started
pip install WeReadScan-HTML
This package needs selenium, so you should have some basis of selenium.
Demo
Talk is cheap, just show you the code.
"""
demo.py
The demo of WeReadScan.py
Copyright 2023 by Algebra-FUN
ALL RIGHTS RESERVED.
"""
from selenium.webdriver import Edge
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.edge.options import Options
from WeReadScan import WeRead
options = Options()
options.add_argument("--disable-blink-features=AutomationControlled")
options.add_argument('disable-infobars')
options.add_argument('log-level=3')
options.add_argument("headless")
# service = Service("/Users/creator/Downloads/msedgedriver")
# launch Webdriver
print('Webdriver launching...')
driver = Edge(options=options)
# driver = Edge(service=service, options=options)
print('Webdriver launched.')
with WeRead(driver,debug=True) as weread:
weread.login() #? login for grab the whole book
weread.scan2html('https://weread.qq.com/web/reader/2c632ef071a486a92c60226kc81322c012c81e728d9d180')
weread.scan2html('https://weread.qq.com/web/reader/a9c32f40717db77aa9c9171kc81322c012c81e728d9d180')
Usage
Just code as demo show.
Stargazers over time
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
WeReadScan-HTML-0.1.1.tar.gz
(4.8 kB
view details)
Built Distribution
File details
Details for the file WeReadScan-HTML-0.1.1.tar.gz
.
File metadata
- Download URL: WeReadScan-HTML-0.1.1.tar.gz
- Upload date:
- Size: 4.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 98f3ca240b5d855a316f80b763c21aca7667d7d356c77883784aef6caf1a6d82 |
|
MD5 | f76fd4cc5004428b4b304311c3f96639 |
|
BLAKE2b-256 | 61576d5d67be352b2af593e1eddbcd549fb24855d70330267644a1ad504cad74 |
File details
Details for the file WeReadScan_HTML-0.1.1-py3-none-any.whl
.
File metadata
- Download URL: WeReadScan_HTML-0.1.1-py3-none-any.whl
- Upload date:
- Size: 5.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 071b2cf29985ed9d8fae09298d82a2c922844055040b2e0f169a913d82c82542 |
|
MD5 | 822afadffa0ce40ae7299ac66a3b08bf |
|
BLAKE2b-256 | 46007c6eaa22a21e18abacc2686bb295969f39c25655ae38a329d452a557b73d |