Skip to main content

WeRead HTML Scanner

Project description

WeReadScan (html-scan variant)

GitHub last commit GitHub code size in bytes GitHub top language

About

This branch is a html-scan variant of WeReadScan, integrating script developed by Sec-ant.

Thanks for Sec-ant, this variant of WeReadScan can be more efficient.

More detail about Sec-ant's project, you can visit https://github.com/Sec-ant/weread-scraper

Get started

pip install WeReadScan-HTML

This package needs selenium, so you should have some basis of selenium.

Demo

Talk is cheap, just show you the code.

"""
demo.py
The demo of WeReadScan.py
Copyright 2023 by Algebra-FUN
ALL RIGHTS RESERVED.
"""


from selenium.webdriver import Edge
from selenium.webdriver.chrome.service import Service
from selenium.webdriver.edge.options import Options

from WeReadScan import WeRead


options = Options()
options.add_argument("--disable-blink-features=AutomationControlled")
options.add_argument('disable-infobars')
options.add_argument('log-level=3')
options.add_argument("headless")

# service = Service("/Users/creator/Downloads/msedgedriver")

# launch Webdriver
print('Webdriver launching...')
driver = Edge(options=options)
# driver = Edge(service=service, options=options)
print('Webdriver launched.')

with WeRead(driver,debug=True) as weread:
    weread.login() #? login for grab the whole book
    weread.scan2html('https://weread.qq.com/web/reader/2c632ef071a486a92c60226kc81322c012c81e728d9d180')
    weread.scan2html('https://weread.qq.com/web/reader/a9c32f40717db77aa9c9171kc81322c012c81e728d9d180')

Usage

Just code as demo show.

Stargazers over time

Stargazers over time

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

WeReadScan-HTML-0.1.1.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

WeReadScan_HTML-0.1.1-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file WeReadScan-HTML-0.1.1.tar.gz.

File metadata

  • Download URL: WeReadScan-HTML-0.1.1.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.6

File hashes

Hashes for WeReadScan-HTML-0.1.1.tar.gz
Algorithm Hash digest
SHA256 98f3ca240b5d855a316f80b763c21aca7667d7d356c77883784aef6caf1a6d82
MD5 f76fd4cc5004428b4b304311c3f96639
BLAKE2b-256 61576d5d67be352b2af593e1eddbcd549fb24855d70330267644a1ad504cad74

See more details on using hashes here.

File details

Details for the file WeReadScan_HTML-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for WeReadScan_HTML-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 071b2cf29985ed9d8fae09298d82a2c922844055040b2e0f169a913d82c82542
MD5 822afadffa0ce40ae7299ac66a3b08bf
BLAKE2b-256 46007c6eaa22a21e18abacc2686bb295969f39c25655ae38a329d452a557b73d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page