Web Utils for browsing and scraping
Project description
WebU
Web Utils for browsing and scraping.
Install
pip install webu --upgrade
默认安装现在只包含最基础的运行依赖:requests 和 tclogger。
浏览器自动化、FastAPI 服务、面板、代理池、验证码识别、MongoDB、Hugging Face 等重依赖都拆到了可选 extras 里,按需安装即可。
Common Installs
基础能力,适合 LLMClient、GeminiClient 这类纯 HTTP 客户端:
pip install -U webu
HTML / 搜索结果解析,适合 webu.google_api.parser、webu.gemini.parser:
pip install -U "webu[parsing]"
DrissionPage 浏览器能力,适合 webu.browsers.chrome 和 webu.searches.*:
pip install -U "webu[browser]"
嵌入向量客户端,适合 webu.embed:
pip install -U "webu[embed]"
CAPTCHA 自动解题,适合 webu.captcha:
pip install -U "webu[captcha]"
playwright install chromium
Service Installs
Google Search API 服务 ggsc:
pip install -U "webu[google-api]"
playwright install chromium
如果还需要内置 Dash 面板:
pip install -U "webu[google-api,google-api-panel]"
playwright install chromium
如果需要自动处理 reCAPTCHA 图片题,再额外加上 captcha:
pip install -U "webu[google-api,captcha]"
playwright install chromium
Google Docker / HF Spaces 工具 ggdk:
pip install -U "webu[google-docker]"
playwright install chromium
如果还需要内置 Dash 面板:
pip install -U "webu[google-docker,google-docker-panel]"
playwright install chromium
Google Hub 调度服务 gghb:
pip install -U "webu[google-hub]"
如果还需要内置 Dash 面板:
pip install -U "webu[google-hub,google-hub-panel]"
Gemini 浏览器服务端能力:
pip install -U "webu[gemini]"
playwright install chromium
Proxy API 服务 pxsc:
pip install -U "webu[proxy-api]"
WARP API 服务 cfwp:
pip install -U "webu[warp-api]"
Cloudflare Tunnel 工具 cftn:
pip install -U "webu[cf-tunnel]"
IPv6 相关能力:
pip install -U "webu[ipv6]"
安装全部功能:
pip install -U "webu[all]"
playwright install chromium
Extra Summary
| Extra | 适用模块 / 命令 | 说明 |
|---|---|---|
parsing |
webu.google_api.parser, webu.gemini.parser |
仅安装 HTML 解析相关依赖 |
browser / searches |
webu.browsers.chrome, webu.searches.* |
DrissionPage + 虚拟显示 |
embed |
webu.embed |
仅安装 numpy |
captcha |
webu.captcha |
Playwright + OpenCV + numpy + httpx |
fastapi |
webu.fastapis.* |
FastAPI / Uvicorn / Pydantic |
dashboard |
Google API / Hub 面板 | Dash + A2WSGI |
proxy |
webu.proxy_api.*, webu.google_api.proxy_manager |
aiohttp / SOCKS / MongoDB |
cf-tunnel |
webu.cf_tunnel.*, cftn |
Cloudflare Tunnel CLI 相关依赖 |
gemini |
webu.gemini.* |
Gemini 浏览器服务端所需依赖 |
google-api |
webu.google_api.*, ggsc |
Google 搜索服务本体,不含 CAPTCHA 图像解题和 Dash 面板 |
google-api-panel |
Google API panel | Google API 的 Dash 面板依赖 |
google-docker |
webu.google_docker.*, ggdk |
Google Docker / HF Spaces 工具本体,不含 Dash 面板 |
google-docker-panel |
Google Docker panel | Google Docker 的 Dash 面板依赖 |
google-hub |
webu.google_hub.*, gghb |
Hub 调度服务本体,不含 Dash 面板 |
google-hub-panel |
Google Hub panel | Google Hub 的 Dash 面板依赖 |
proxy-api |
webu.proxy_api.*, pxsc |
代理采集、校验、服务 |
warp-api |
webu.warp_api.*, cfwp |
WARP 管理服务 |
ipv6 |
webu.ipv6.* |
IPv6 路由、会话、服务 |
all |
全部模块 | 安装所有可选依赖 |
dev |
测试 / 开发 | pytest + pytest-asyncio |
Combining Extras
可以一次安装多个功能组:
pip install -U "webu[google-api,google-api-panel,captcha,google-hub]"
Notes
playwright只是 Python 包;首次使用浏览器相关功能后,仍需执行playwright install chromium。google-api/google-hub/google-docker现在即使未安装 Dash 也能启动服务,只是不会挂载 panel。import webu和若干子包入口现在采用惰性导入,不会再因为未安装某个可选依赖就把整个包导入失败。- 如果只需要某个轻量子模块,尽量直接安装对应 extra,不要默认使用
webu[all]。
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file webu-1.3.4.tar.gz.
File metadata
- Download URL: webu-1.3.4.tar.gz
- Upload date:
- Size: 307.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7bc0e06934ddea092555988abd32af62d5325990d7ac94ab7228d724b31fda5b
|
|
| MD5 |
799933b19bb8fbb0aa8c8d07a0dae35f
|
|
| BLAKE2b-256 |
e32f49a4439e8b167bbfa23c9c0077b17767feb046e764711d30c5fb36bf9da9
|
File details
Details for the file webu-1.3.4-py3-none-any.whl.
File metadata
- Download URL: webu-1.3.4-py3-none-any.whl
- Upload date:
- Size: 343.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.25
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4f016388ca9b64f238a00d822ce6f838c1c641b89d13ca860fd9315db186aa20
|
|
| MD5 |
bbcd5e42ada979c1edaa7bf6dc5ab6a9
|
|
| BLAKE2b-256 |
2c911decbbfcf93bd546fab8982ac6781500f16f878188a83ccf17cae235c665
|