Skip to main content

base on asyncio, spider module like scrapy.

Project description

NHM-SPIDER

使用asyncio异步方式实现的爬虫,写法与scrapy相同。

快速开始

安装

# 使用 uv 安装项目依赖
uv sync

# 或者直接安装包
pip install nhm-spider

安装playwright,如果需要使用的话:

playwright install
  • 爬虫结束后的统计数据的展示,items,errors,200的次数等。
  • 爬虫开始时开启的模块的展示,middleware,pipeline等。

发布

# 更新版本号在 pyproject.toml 中

# 更新lock文件
uv lock

# 构建分发包
uv run --module build

# 上传到 PyPI
uv run --module twine upload dist/*

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nhm_spider-4.0.3.tar.gz (19.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nhm_spider-4.0.3-py3-none-any.whl (28.1 kB view details)

Uploaded Python 3

File details

Details for the file nhm_spider-4.0.3.tar.gz.

File metadata

  • Download URL: nhm_spider-4.0.3.tar.gz
  • Upload date:
  • Size: 19.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for nhm_spider-4.0.3.tar.gz
Algorithm Hash digest
SHA256 7a22e5a7d2a00bfd8f952c7590123ed935b176fd5af2e5116556dcc005684f2e
MD5 9b567ffac0507e8bacbc990ee5156c5e
BLAKE2b-256 04e02bc7218fc151ad2b49252107317c9bff8e1742001ca758556b6c678eae78

See more details on using hashes here.

File details

Details for the file nhm_spider-4.0.3-py3-none-any.whl.

File metadata

  • Download URL: nhm_spider-4.0.3-py3-none-any.whl
  • Upload date:
  • Size: 28.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for nhm_spider-4.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 3a47c1718cc7c13c5707331e8ec91c9af38384d5322ed8b288b2f843d1308982
MD5 3a8a810e36de43f82011b1e1dffaca75
BLAKE2b-256 c1e5f6fdb22704818f4c937bae2ab21967f7b1fbce15c21de5b181678dee6e89

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page