Skip to main content

Crawler for Shopee from all country

Project description

Shopee-crawler

This is a light crawling tool.

High performance.

Easy to use.


Requirement

python>=3.7

Install

  • Using setup.py
git clone https://github.com/lthoangg/shopee-crawler
python/python3 setup.py install
  • Using pip
pip install shopee-crawler

How to use

Examples:

from shopee_crawler import Crawler

crawler = Crawler()
crawler.set_origin(origin="shopee.vn") # Input your root Shopee website of your country that you want to crawl

data = crawler.crawl_by_shop_url(shop_url='shop_url')

data = crawler.crawl_by_cat_url(cat_url='cat_url')

data = crawler.crawl_by_search(keyword='keyword')

data = crawler.crawl_cat_list()
# print(data)

Usage

  • About 12.000 rows (products) in 2-3 seconds (32 workers)

  • About 12.000 rows (products) in 5-6 seconds (16 workers)

Test yourself

import time
from shopee_crawler import Crawler

crawler = Crawler()
crawler.set_origin(origin="shopee.vn")

keyword = 'điện thoại samsung'

# Start time
start = time.time()

# Crawl
data = crawler.crawl_by_search(keyword=keyword)

# End time
end = time.time()

# Time crawling
print("Time : ",end - start, "seconds")

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shopee_crawler-0.2.3.tar.gz (6.5 kB view details)

Uploaded Source

File details

Details for the file shopee_crawler-0.2.3.tar.gz.

File metadata

  • Download URL: shopee_crawler-0.2.3.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.11.1

File hashes

Hashes for shopee_crawler-0.2.3.tar.gz
Algorithm Hash digest
SHA256 e3de4a9dacec0ba45429d4626f113b51af0022b584ebc615f4d85650790629c5
MD5 4de8b16db94da9e6eead553cf2dc987a
BLAKE2b-256 8f0c07f755fb04f1b4a1a6d530d547b0f2221925c7e9326b5c8803d9f40e91ac

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page