Skip to main content

pansihub command-line tools.

Project description

pancli

pancli (also known as Pansi Cli), is a tool for running scrapy spider or interacting to ScrapyDD server.

pancli give some useful supplymentary functions to scrapy command line, some of the functions are:

  1. Execute spider from a egg packaged project.
  2. Install package required libraries at run-time if necessary.
  3. Package a scrapy project into egg.

Installation

pancli is available on pypi, you can install it by simply run:

pip install pancli

Usages

Run a spider

Run a spider is very easy by executing pancli crawl {spider_name}, the command is compatiable with scrapy crawl

One special and the most wonderful parameter is the -f parameter, which specifies a FIGURE file in which all settings/parameters/plugins of a spider can be writen as a simple JSON/YAML file. If you run spider very often, FIGURE file can save tons of time.

FIGURE fields:

  • spider: the target spider name which is the same in the scrapy list command.
  • settings: (dict) settings can be used to populate all settings at runtime, not only the literal/string values, but list/dicts

And other parameters, the more detail documentation is coming soon.

package a spider

A packaged spider is extremely portable, with one spider package and a FIGURE file, you can easily crawl the whole internet.

When your current dir is in the scope of scrapy project(with any ancient folder which contains a scrapy.cfg file), you can easily run the following commmand to build a spider package.

pancli package

If you haven't create setup.py for the project, this command will help you create one.

And this command is inspired by scrapyd-client

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

pancli-0.1.9-py3-none-any.whl (20.3 kB view details)

Uploaded Python 3

File details

Details for the file pancli-0.1.9-py3-none-any.whl.

File metadata

  • Download URL: pancli-0.1.9-py3-none-any.whl
  • Upload date:
  • Size: 20.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.31.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.8.5

File hashes

Hashes for pancli-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 0724d0ae456caebbaffdb7c2fc6e34f0e95d51d4fd92ef83984ff1660ffc8ff6
MD5 407d4f08e4454ff1dad61afcb8ac4351
BLAKE2b-256 1e3159745ebf539bcd4349af4d40a01c4523dc8c164a13b4dfb6c164517d9860

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page