Crawler for Shopee from all country
Project description
Shopee-crawler
This is a light crawling tool.
High performance.
Easy to use.
Requirement
python>=3.7
Install
- Using
setup.py
git clone https://github.com/lthoangg/shopee-crawler
python/python3 setup.py install
- Using
pip
pip install shopee-crawler
How to use
Examples:
from shopee_crawler import Crawler
crawler = Crawler()
crawler.set_origin(origin="shopee.vn") # Input your root Shopee website of your country that you want to crawl
data = crawler.crawl_by_shop_url(shop_url='shop_url')
data = crawler.crawl_by_cat_url(cat_url='cat_url')
data = crawler.crawl_by_search(keyword='keyword')
data = crawler.crawl_cat_list()
# print(data)
Usage
-
About 12.000 rows (products) in 2-3 seconds (32 workers)
-
About 12.000 rows (products) in 5-6 seconds (16 workers)
Test yourself
import time
from shopee_crawler import Crawler
crawler = Crawler()
crawler.set_origin(origin="shopee.vn")
keyword = 'điện thoại samsung'
# Start time
start = time.time()
# Crawl
data = crawler.crawl_by_search(keyword=keyword)
# End time
end = time.time()
# Time crawling
print("Time : ",end - start, "seconds")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
shopee_crawler-0.2.3.tar.gz
(6.5 kB
view details)
File details
Details for the file shopee_crawler-0.2.3.tar.gz
.
File metadata
- Download URL: shopee_crawler-0.2.3.tar.gz
- Upload date:
- Size: 6.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.11.1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e3de4a9dacec0ba45429d4626f113b51af0022b584ebc615f4d85650790629c5 |
|
MD5 | 4de8b16db94da9e6eead553cf2dc987a |
|
BLAKE2b-256 | 8f0c07f755fb04f1b4a1a6d530d547b0f2221925c7e9326b5c8803d9f40e91ac |