An asyncio-based spider module, similar to scrapy.
Project description
NHM-SPIDER
A crawler implemented asynchronously with asyncio. Spiders are written the same way as in scrapy.
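To make the "scrapy-style on asyncio" idea concrete, here is a minimal sketch using only the standard library. It does not use the nhm-spider API: `Request`, `Response`, `DemoSpider`, `fake_fetch`, and `crawl` are hypothetical names invented for illustration, and the download step is simulated so no network access happens.

```python
import asyncio
from dataclasses import dataclass
from typing import Callable


@dataclass
class Request:
    url: str
    callback: Callable  # async generator function that takes a Response


@dataclass
class Response:
    url: str
    status: int
    text: str


async def fake_fetch(request: Request) -> Response:
    """Stand-in for a real HTTP client; nothing is downloaded here."""
    await asyncio.sleep(0)  # yield control, as a real download would
    return Response(url=request.url, status=200, text=f"body of {request.url}")


class DemoSpider:
    start_urls = ["https://example.com/a", "https://example.com/b"]

    async def start_requests(self):
        for url in self.start_urls:
            yield Request(url, callback=self.parse)

    async def parse(self, response: Response):
        # Yield a dict as an "item"; a follow-up Request could be yielded too.
        yield {"url": response.url, "status": response.status}


async def crawl(spider, concurrency: int = 2) -> list:
    queue: asyncio.Queue = asyncio.Queue()
    items = []

    async for request in spider.start_requests():
        queue.put_nowait(request)

    async def worker():
        while True:
            request = await queue.get()
            try:
                response = await fake_fetch(request)
                async for result in request.callback(response):
                    if isinstance(result, Request):
                        queue.put_nowait(result)  # schedule follow-up request
                    else:
                        items.append(result)  # collect scraped item
            finally:
                queue.task_done()

    workers = [asyncio.create_task(worker()) for _ in range(concurrency)]
    await queue.join()  # returns once every queued request is processed
    for task in workers:
        task.cancel()
    await asyncio.gather(*workers, return_exceptions=True)
    return items


if __name__ == "__main__":
    print(asyncio.run(crawl(DemoSpider())))
```

The callback-per-request shape mirrors how scrapy spiders are usually written, while `asyncio.Queue` and worker tasks stand in for the scheduler and downloader.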
Quick start
Installation
# Install the project dependencies with uv
uv sync
# Or install the package directly
pip install nhm-spider
If you need Playwright, install its browsers:
playwright install
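After installing, a quick check from Python can confirm which distributions are actually present in the current environment. This is a minimal sketch using the standard library's `importlib.metadata`; the helper name `installed_version` is made up for this example.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # playwright is optional, so "not installed" is a valid result for it.
    for name in ("nhm-spider", "playwright"):
        print(name, installed_version(name) or "not installed")
```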
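- Displays statistics after the spider finishes: items, errors, the number of 200 responses, and so on.
- Displays the modules enabled when the spider starts: middleware, pipeline, and so on.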
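The two displays described above could be backed by something as small as a counter and a formatter. The sketch below is an assumption about the general shape, not the nhm-spider implementation: `CrawlStats`, `format_enabled`, and the key names such as `status/200` are hypothetical.

```python
from collections import Counter


class CrawlStats:
    """Hypothetical statistics collector printed when a crawl ends."""

    def __init__(self):
        self.counts = Counter()

    def record_response(self, status: int) -> None:
        self.counts[f"status/{status}"] += 1

    def record_item(self) -> None:
        self.counts["items"] += 1

    def record_error(self) -> None:
        self.counts["errors"] += 1

    def summary(self) -> str:
        # One "key: value" line per counter, sorted by key for stable output.
        return "\n".join(f"{key}: {value}" for key, value in sorted(self.counts.items()))


def format_enabled(groups: dict) -> str:
    """Render the enabled modules per group, for display at startup."""
    lines = []
    for kind, names in groups.items():
        lines.append(f"{kind}: {', '.join(names) if names else '(none)'}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(format_enabled({"middleware": ["RetryMiddleware"], "pipeline": []}))
    stats = CrawlStats()
    stats.record_response(200)
    stats.record_item()
    print(stats.summary())
```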
Publishing
# Update the version number in pyproject.toml
# Update the lock file
uv lock
# Build the distribution packages
uv run --module build
# Upload to PyPI
uv run --module twine upload dist/*
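Before uploading, it can help to confirm that `dist/` contains the files you expect for the new version. The sketch below derives the expected file names from the project name and version. The helper names are hypothetical, and the `py3-none-any` wheel tag is an assumption that holds for pure-Python wheels like this one.

```python
import re
from pathlib import Path


def normalize(name: str) -> str:
    """Normalize a project name the way distribution file names do."""
    return re.sub(r"[-_.]+", "_", name).lower()


def expected_artifacts(name: str, version: str) -> list:
    stem = f"{normalize(name)}-{version}"
    # Assumption: a pure-Python wheel, hence the "py3-none-any" tag.
    return [f"{stem}.tar.gz", f"{stem}-py3-none-any.whl"]


def missing_artifacts(dist_dir: str, name: str, version: str) -> list:
    """Return the expected file names that are not present in dist_dir."""
    present = {path.name for path in Path(dist_dir).glob("*")}
    return [item for item in expected_artifacts(name, version) if item not in present]


if __name__ == "__main__":
    print(missing_artifacts("dist", "nhm-spider", "4.0.3"))
```

For version 4.0.3 this yields `nhm_spider-4.0.3.tar.gz` and `nhm_spider-4.0.3-py3-none-any.whl`; an empty list from `missing_artifacts` means both were built.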
Download files
Source Distribution
nhm_spider-4.0.3.tar.gz (19.0 kB)
Built Distribution
nhm_spider-4.0.3-py3-none-any.whl (28.1 kB)
File details
Details for the file nhm_spider-4.0.3.tar.gz.
File metadata
- Download URL: nhm_spider-4.0.3.tar.gz
- Upload date:
- Size: 19.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 7a22e5a7d2a00bfd8f952c7590123ed935b176fd5af2e5116556dcc005684f2e |
| MD5 | 9b567ffac0507e8bacbc990ee5156c5e |
| BLAKE2b-256 | 04e02bc7218fc151ad2b49252107317c9bff8e1742001ca758556b6c678eae78 |
File details
Details for the file nhm_spider-4.0.3-py3-none-any.whl.
File metadata
- Download URL: nhm_spider-4.0.3-py3-none-any.whl
- Upload date:
- Size: 28.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.2
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 3a47c1718cc7c13c5707331e8ec91c9af38384d5322ed8b288b2f843d1308982 |
| MD5 | 3a8a810e36de43f82011b1e1dffaca75 |
| BLAKE2b-256 | c1e5f6fdb22704818f4c937bae2ab21967f7b1fbce15c21de5b181678dee6e89 |
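The published digests can be used to verify a downloaded file. A minimal sketch with the standard library's `hashlib` follows; the function names `sha256_of` and `matches` are made up for this example.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path: str, expected_hex: str) -> bool:
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, `matches("nhm_spider-4.0.3.tar.gz", "7a22e5a7d2a00bfd8f952c7590123ed935b176fd5af2e5116556dcc005684f2e")` should return `True` for an intact copy of the source distribution.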