Skip to main content

A simple package for crawling bol.com

Project description

bolcom_crawler

This is a really simple crawler that makes use of Scrapy to crawl bol.com.

Usage

The Crawler instance has two functions you can use, crawl_products and crawl_category. See an example below.

from bol_crawler.crawler import Crawler
crawler = Crawler()

# to crawl products
products = crawler.crawl_products(
    [
        'https://www.bol.com/nl/p/lg-34gl750-b-ultragear-gaming-monitor/9200000115819731',
    ]
)

# to crawl a category
products = crawler.crawl_category(
    [
        'https://www.bol.com/nl/l/gaming-toetsenborden/N/18214/', 0  # the 0 value is how often you want to go to the next page. 0 is just crawling the first page
    ]
)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bol-crawler-0.0.1.tar.gz (1.3 kB view details)

Uploaded Source

Built Distribution

bol_crawler-0.0.1-py3-none-any.whl (14.0 kB view details)

Uploaded Python 3

File details

Details for the file bol-crawler-0.0.1.tar.gz.

File metadata

  • Download URL: bol-crawler-0.0.1.tar.gz
  • Upload date:
  • Size: 1.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.7.3

File hashes

Hashes for bol-crawler-0.0.1.tar.gz
Algorithm Hash digest
SHA256 5b3d555be4ca381a9c3e5961e0f5e15bc6dc08e01a210b352bb90da5a78547c4
MD5 2576fd80d4af29a269d43ff0f8855430
BLAKE2b-256 ccc651827b56d8a3c32a00f89cf11bc491bc80bc6b1297ac84bca731ef75aebc

See more details on using hashes here.

File details

Details for the file bol_crawler-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: bol_crawler-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 14.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.7.3

File hashes

Hashes for bol_crawler-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 147b9a04b8cb06f8a920c7ef0d3fd9e0e1caae9cc8cd2e186c2ad9691adbad27
MD5 18dec37c0ff05a04191f047b96177e31
BLAKE2b-256 8b044d2a8dc29303392a37d87b467b543e3217efbcff243def962dc17fdb5338

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page