Skip to main content

A Scrapy extension that report your log from your scraped data.

Project description

Scrapy Log Report Extension

A Scrapy extension that report your log from your scraped data.

Usage

This Scrapy extension provides a way to report your log from your scraped data. It will generate a report every LOGSTATS_INTERVAL seconds, and send it to your log server.

# log report demo
{
  "items_add": 0,
  "pages_add": 0,
  "items_rate": 0,
  "pages_rate": 0,
  "items_count": 0,
  "pages_count": 0,
  "spider_name": "douban",
  "log_count/INFO": 8,
  "log_count/DEBUG": 1,
  "log_count/WARNING": 2,
  "item_scraped_add_count": 0,
  "response_received_add_count": 0
}

Installation

First, pip install this package:

$ pip install masterai-scrapy-extensions

Usage

Enable the extension in your project's settings.py file, by adding the following lines:

EXTENSIONS = {
    "masterai_scrapy_extensions.logreport.ReportStats": 100,
}
#
LOGSTATS_INTERVAL = 60
# set the URL to your log server
# method POST is used to send the report data
LOGREPORT_URL = "http://127.0.0.1:5000/api/v1/task/worker/status"

# log color
from masterai_scrapy_extensions import logcolor

logcolor.log_color_init()

That's all! Now run your job and have a look at the field stats.

Settings

The settings below can be defined as any other Scrapy settings, as described on Scrapy docs.

  • LOGREPORT_URL: set the interval in seconds to generate the report.
  • LOGSTATS_INTERVAL: set the URL to your log server,method POST is used to send the report data.
  • COLORLOG_FORMAT: log color format.
  • COLORLOG_COLORS: log colors.
  • COLORLOG_DATEFORMAT: log color date format.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

masterai_scrapy_extensions-2024.7.29.tar.gz (4.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file masterai_scrapy_extensions-2024.7.29.tar.gz.

File metadata

File hashes

Hashes for masterai_scrapy_extensions-2024.7.29.tar.gz
Algorithm Hash digest
SHA256 b4fe988edec3a545ccbeb49f73c21cc98364bfba79eabd1479a08133f6016ff8
MD5 cc2d5e1d42a7321941c89dbb0bd8ddca
BLAKE2b-256 f7beaf1690cdb0c2b80328d7422c34e387a52fb816e2077c5eea8a6df5264232

See more details on using hashes here.

File details

Details for the file masterai_scrapy_extensions-2024.7.29-py3-none-any.whl.

File metadata

File hashes

Hashes for masterai_scrapy_extensions-2024.7.29-py3-none-any.whl
Algorithm Hash digest
SHA256 7f226802aeec02432587790e37fa6980c3ef4aa1ec6b19b9f70b96a6ef4bdced
MD5 ad34a290cf7859e305c22c8cc23cbdab
BLAKE2b-256 2c9763924846ac8201e47adb7a9d8f8db2e53d431a1beef2743289e139a7ff60

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page