Scrapy Log Report Extension

A Scrapy extension that reports log and scraping statistics for your spiders to a log server.

Usage

This Scrapy extension reports log and scraping statistics for your running spider. Every LOGSTATS_INTERVAL seconds it generates a report and sends it to your log server with an HTTP POST request to LOGREPORT_URL.

# log report demo
{
  "items_add": 0,
  "pages_add": 0,
  "items_rate": 0,
  "pages_rate": 0,
  "items_count": 0,
  "pages_count": 0,
  "spider_name": "douban",
  "log_count/INFO": 8,
  "log_count/DEBUG": 1,
  "log_count/WARNING": 2,
  "item_scraped_add_count": 0,
  "response_received_add_count": 0
}
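
The field names in the demo suggest a per-interval delta (`*_add`), a rate (`*_rate`), and a running total (`*_count`). As a rough sketch only, assuming the extension follows the convention of Scrapy's built-in LogStats extension, where rates are expressed per minute, the numbers could be derived as follows. The class `IntervalCounter` and its `snapshot` method are hypothetical and not part of this package.

```python
from dataclasses import dataclass


@dataclass
class IntervalCounter:
    """Hypothetical helper: tracks a running total and derives delta and rate."""

    interval: float  # seconds between reports, e.g. LOGSTATS_INTERVAL
    previous: int = 0  # total seen at the previous report

    def snapshot(self, total: int) -> dict:
        # Delta since the last report.
        added = total - self.previous
        # Assumption: per-minute rate, as in Scrapy's built-in LogStats.
        rate = added * 60.0 / self.interval
        self.previous = total
        return {"add": added, "rate": rate, "count": total}


counter = IntervalCounter(interval=60)
first = counter.snapshot(total=120)   # 120 items during the first interval
second = counter.snapshot(total=150)  # 30 more during the second interval
print(first)
print(second)
```

With a 60-second interval the delta and the per-minute rate coincide; with a shorter interval the rate is scaled up accordingly.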

Installation

Install the package with pip:

$ pip install masterai-scrapy-extensions

Usage

Enable the extension in your project's settings.py file, by adding the following lines:

EXTENSIONS = {
    "masterai_scrapy_extensions.logreport.ReportStats": 100,
}
# interval in seconds between two reports
LOGSTATS_INTERVAL = 60
# URL of your log server; the report data is sent with an HTTP POST request
LOGREPORT_URL = "http://127.0.0.1:5000/api/v1/task/worker/status"

# log color
from masterai_scrapy_extensions import logcolor

logcolor.log_color_init()

That's all! Now run your job and have a look at the reports arriving at your log server.
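
For local testing it can help to have something listening at LOGREPORT_URL. The following is a minimal sketch of such a receiver using only the Python standard library. It assumes the report arrives as a UTF-8 JSON object in the POST body, which this page does not state explicitly; the names `parse_report`, `ReportHandler`, and `serve` are hypothetical.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer


def parse_report(body: bytes) -> dict:
    """Decode a report body. Assumption: the body is a UTF-8 JSON object."""
    report = json.loads(body.decode("utf-8"))
    if not isinstance(report, dict):
        raise ValueError("expected a JSON object")
    return report


class ReportHandler(BaseHTTPRequestHandler):
    """Hypothetical handler that prints each report it receives."""

    def do_POST(self):
        # The request path is ignored in this sketch.
        length = int(self.headers.get("Content-Length", 0))
        try:
            report = parse_report(self.rfile.read(length))
        except ValueError:
            # Covers malformed JSON and invalid UTF-8 as well.
            self.send_response(400)
            self.end_headers()
            return
        print(report.get("spider_name"), report.get("items_count"))
        self.send_response(200)
        self.end_headers()


def serve(host: str = "127.0.0.1", port: int = 5000) -> None:
    """Listen on the host and port used in the LOGREPORT_URL example."""
    HTTPServer((host, port), ReportHandler).serve_forever()
```

Calling `serve()` in a separate terminal before starting the crawl should print one line per received report, provided the body format matches the assumption above.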

Settings

The settings below can be defined like any other Scrapy setting, as described in the Scrapy docs.

  • LOGREPORT_URL: the URL of your log server. The report data is sent with an HTTP POST request.
  • LOGSTATS_INTERVAL: the interval in seconds between two reports.
  • COLORLOG_FORMAT: the format of colored log messages.
  • COLORLOG_COLORS: the colors used for log output.
  • COLORLOG_DATEFORMAT: the date format of colored log messages.
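
To make the role of LOGREPORT_URL concrete, here is a sketch of what a POST request carrying a report might look like, built with the standard library. The JSON encoding and the `Content-Type` header are assumptions, since the package may encode the data differently, and `build_report_request` is a hypothetical helper, not part of the package.

```python
import json
import urllib.request


def build_report_request(url: str, report: dict) -> urllib.request.Request:
    """Build (but do not send) a POST request carrying the report.

    Assumption: the body is JSON; the package may use another encoding.
    """
    body = json.dumps(report).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


request = build_report_request(
    "http://127.0.0.1:5000/api/v1/task/worker/status",
    {"spider_name": "douban", "items_count": 0, "pages_count": 0},
)
print(request.get_method(), request.full_url)
# Sending it would be: urllib.request.urlopen(request, timeout=5)
```

Building the request separately from sending it keeps the sketch usable without a running log server.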
