A simple package for crawling bol.com
Project description
bolcom_crawler
This is a really simple crawler that makes use of Scrapy to crawl bol.com.
Usage
The Crawler instance has two functions you can use, crawl_products and crawl_category. See an example below.
from bol_crawler.crawler import Crawler
crawler = Crawler()
# to crawl products
products = crawler.crawl_products(
[
'https://www.bol.com/nl/p/lg-34gl750-b-ultragear-gaming-monitor/9200000115819731',
]
)
# to crawl a category
products = crawler.crawl_category(
[
'https://www.bol.com/nl/l/gaming-toetsenborden/N/18214/', 0 # the 0 value is how often you want to go to the next page. 0 is just crawling the first page
]
)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
bol-crawler-0.0.1.tar.gz
(1.3 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file bol-crawler-0.0.1.tar.gz.
File metadata
- Download URL: bol-crawler-0.0.1.tar.gz
- Upload date:
- Size: 1.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.7.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5b3d555be4ca381a9c3e5961e0f5e15bc6dc08e01a210b352bb90da5a78547c4
|
|
| MD5 |
2576fd80d4af29a269d43ff0f8855430
|
|
| BLAKE2b-256 |
ccc651827b56d8a3c32a00f89cf11bc491bc80bc6b1297ac84bca731ef75aebc
|
File details
Details for the file bol_crawler-0.0.1-py3-none-any.whl.
File metadata
- Download URL: bol_crawler-0.0.1-py3-none-any.whl
- Upload date:
- Size: 14.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.40.0 CPython/3.7.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
147b9a04b8cb06f8a920c7ef0d3fd9e0e1caae9cc8cd2e186c2ad9691adbad27
|
|
| MD5 |
18dec37c0ff05a04191f047b96177e31
|
|
| BLAKE2b-256 |
8b044d2a8dc29303392a37d87b467b543e3217efbcff243def962dc17fdb5338
|