Crawler for shopee
Project description
Shopee-crawler
This is a crawl tool.
Requirement
python>=3.7
Install
- Using
setup.py
git clone https://github.com/lthoangg/shopee-crawler
python/python3 setup.py install
- Using
pip
pip install shopee-crawler
How to use
- Crawl by shop url
from shopee_crawler import crawl_by_shop_url
data = crawl_by_shop_url(shop_url='shop_url')
# print(data)
- Crawl by category url
from shopee_crawler import crawl_by_cat_url
data = crawl_by_cat_url(cat_url='cat_url')
# print(data)
- Crawl by keyword (search)
from shopee_crawler import crawl_by_search
data = crawl_by_search(keyword='keyword')
# print(data)
Usage
-
About 12000 rows (products) in 2-3 seconds (32 workers)
-
About 12000 rows (products) in 5-6 seconds (16 workers)
Test yourself
import time
from shopee_crawler.crawler import crawl_by_search
keyword='điện thoại samsung'
# Start time
start = time.time()
# Crawl
crawl_by_search(keyword=keyword)
# End time
end = time.time()
# Time crawling
print("Time : ",end - start, "seconds")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
shopee_crawler-0.1.4.tar.gz
(5.2 kB
view details)
Built Distribution
File details
Details for the file shopee_crawler-0.1.4.tar.gz
.
File metadata
- Download URL: shopee_crawler-0.1.4.tar.gz
- Upload date:
- Size: 5.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.1 CPython/3.9.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 93d37cbf1756c7d67ffdf6009682660365a7776d5f8de1232b2fc5e043eeea4d |
|
MD5 | dfabd36f73fd4cb202f7a393b9068db5 |
|
BLAKE2b-256 | 14c32923425eb7ec48b5967f574f6b3ef6bce325f61f363257b0e786c18cfdcb |
File details
Details for the file shopee_crawler-0.1.4-py3-none-any.whl
.
File metadata
- Download URL: shopee_crawler-0.1.4-py3-none-any.whl
- Upload date:
- Size: 8.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.1 CPython/3.9.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4cd252000a9452406c8af95a3280cf692b695504c6c83537f397cf396a5e6530 |
|
MD5 | 9049e815f93a8ad0fdd02925390ec923 |
|
BLAKE2b-256 | d954139f3a54807dbef82abb4de21faa5d9bee3c777dd1efc459be64d6625298 |