Skip to main content

A Scrapy extension that report your log from your scraped data.

Project description

Scrapy Log Report Extension

A Scrapy extension that report your log from your scraped data.

Usage

This Scrapy extension provides a way to report your log from your scraped data. It will generate a report every LOGSTATS_INTERVAL seconds, and send it to your log server.

# log report demo
{
  "items_add": 0,
  "pages_add": 0,
  "items_rate": 0,
  "pages_rate": 0,
  "items_count": 0,
  "pages_count": 0,
  "spider_name": "douban",
  "log_count/INFO": 8,
  "log_count/DEBUG": 1,
  "log_count/WARNING": 2,
  "item_scraped_add_count": 0,
  "response_received_add_count": 0
}

Installation

First, pip install this package:

$ pip install masterai-scrapy-extensions

Usage

Enable the extension in your project's settings.py file, by adding the following lines:

EXTENSIONS = {
    "masterai_scrapy_extensions.logreport.ReportStats": 100,
}
#
LOGSTATS_INTERVAL = 60
# set the URL to your log server
# method POST is used to send the report data
LOGREPORT_URL = "http://127.0.0.1:5000/api/v1/task/worker/status"

That's all! Now run your job and have a look at the field stats.

Settings

The settings below can be defined as any other Scrapy settings, as described on Scrapy docs.

  • LOGREPORT_URL: set the interval in seconds to generate the report.
  • LOGSTATS_INTERVAL: set the URL to your log server,method POST is used to send the report data.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

masterai_scrapy_extensions-0.1.0.tar.gz (3.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

masterai_scrapy_extensions-0.1.0-py3-none-any.whl (3.5 kB view details)

Uploaded Python 3

File details

Details for the file masterai_scrapy_extensions-0.1.0.tar.gz.

File metadata

File hashes

Hashes for masterai_scrapy_extensions-0.1.0.tar.gz
Algorithm Hash digest
SHA256 9e24ceda13beb42ca7a0c0dc8a04b242f89639d884cf2a563e832a949c754a45
MD5 1fcbee16cd33cc289d3f07a03bc07b7d
BLAKE2b-256 34bc8eb3f0ca4ed2f7400d5f3f8af968635b1fcd1bb0cc381eee833e6955f093

See more details on using hashes here.

File details

Details for the file masterai_scrapy_extensions-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for masterai_scrapy_extensions-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a614e13f7dacf651b727e9aa46c5360c5fcc211010be022ec443721a7ff0dfaa
MD5 508bf0362af03f94a9b4394321a90945
BLAKE2b-256 7275481367b405d42aa57790ae2d264572f4e75638d726a01e3c4e96d71878f7

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page