Skip to main content

Scrapy Tools

Project description

概述

一句话介绍:用于扩展 Scrapy 功能来解放双手。

安装

可以直接输入以下命令:

.. code:: bash

pip install gzspidertools


可选安装1,安装数据库相关的所有依赖:

.. code:: bash

pip install gzspidertools[database]


可选安装2,通过以下命令安装所有依赖:

.. code:: bash

pip install gzspidertools[all]

注:详细的安装介绍请查看 安装指南_。

用法

开发人员只需根据命令生成示例模板,再配置相关设置即可。

查看库版本

gzcmd version

创建项目

gzcmd startproject <project_name>

进入项目根目录

cd <project_name>

替换(覆盖)为真实的配置 .conf 文件:

这里是为了演示方便,正常情况是直接在 VIT 中的 .conf 文件填上你需要的配置即可

cp /root/mytemp/.conf DemoSpider/VIT/.conf

生成爬虫脚本

gzcmd genspider <spider_name> <example.com>

运行脚本

scrapy crawl <spider_name>

注:也可以使用 gzcmd crawl <spider_name>

=======

uv

uv 安装更新

pip install --upgrade uv

uv sync

build

uv build

uv publish

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gzspidertools-0.0.30.tar.gz (89.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gzspidertools-0.0.30-py3-none-any.whl (130.2 kB view details)

Uploaded Python 3

File details

Details for the file gzspidertools-0.0.30.tar.gz.

File metadata

  • Download URL: gzspidertools-0.0.30.tar.gz
  • Upload date:
  • Size: 89.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.2

File hashes

Hashes for gzspidertools-0.0.30.tar.gz
Algorithm Hash digest
SHA256 c8b5a293df96831e76040f49aae63fccd2b6a6caad58613047253075656feb9f
MD5 80a6a854d1b167cfa455ee9cd01602f6
BLAKE2b-256 c3f7d88d42dbef4d8b116bc2f01980922de97466c307d77e6ea4fdde5cc05dc3

See more details on using hashes here.

File details

Details for the file gzspidertools-0.0.30-py3-none-any.whl.

File metadata

File hashes

Hashes for gzspidertools-0.0.30-py3-none-any.whl
Algorithm Hash digest
SHA256 066bda36b193e8d3a50c449bc8e1011b28d0ae48d3924fcdd27307c04af606c5
MD5 269b339f5201c12bedb396ab995cb2f5
BLAKE2b-256 c3d967026d09f7a30a9ed5d48c9ae0febc422efe3ee8f624c97e6a38fae0afb8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page