Crawler for Shopee

Shopee-crawler

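Shopee-crawler is a Python tool for crawling product data from Shopee by shop URL, category URL, or search keyword.

High performance: crawls run across multiple workers.

Easy to use: each crawl is a single function call.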


Requirements

python>=3.7

Install

  • Using setup.py
git clone https://github.com/lthoangg/shopee-crawler
cd shopee-crawler
# use "python" instead of "python3" if that is how Python 3 is invoked on your system
python3 setup.py install
  • Using pip
pip install shopee-crawler

How to use

  • Crawl by shop url
from shopee_crawler import crawl_by_shop_url

data = crawl_by_shop_url(shop_url='shop_url')
# print(data)
  • Crawl by category url
from shopee_crawler import crawl_by_cat_url

data = crawl_by_cat_url(cat_url='cat_url')
# print(data)
  • Crawl by keyword (search)
from shopee_crawler import crawl_by_search

data = crawl_by_search(keyword='keyword')
# print(data)
  • Crawl all categories
from shopee_crawler import crawl_cat_list

data = crawl_cat_list()
# print(data)
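
Each of the four entry points above takes a single target. To run one of them over several targets, a small wrapper along the following lines may help. This is only a sketch: crawl_many is a hypothetical helper, not part of shopee-crawler, and it assumes nothing about what the crawl function returns beyond storing it.

```python
from typing import Any, Callable, Dict, Iterable, Tuple


def crawl_many(
    crawl_fn: Callable[..., Any],
    arg_name: str,
    targets: Iterable[str],
) -> Tuple[Dict[str, Any], Dict[str, Exception]]:
    """Call crawl_fn once per target, passing the target as keyword `arg_name`.

    Hypothetical helper (not part of shopee-crawler). Returns two dicts:
    results keyed by target, and the exceptions raised for failed targets.
    """
    results: Dict[str, Any] = {}
    errors: Dict[str, Exception] = {}
    for target in targets:
        try:
            results[target] = crawl_fn(**{arg_name: target})
        except Exception as exc:  # keep going if one target fails
            errors[target] = exc
    return results, errors
```

For example, crawl_many(crawl_by_search, 'keyword', ['laptop', 'tai nghe']) would call crawl_by_search(keyword=...) once per keyword and collect any failures separately instead of stopping at the first one.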

Performance

  • About 12,000 rows (products) in 2-3 seconds with 32 workers

  • About 12,000 rows (products) in 5-6 seconds with 16 workers
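
The figures above suggest that throughput scales with the number of workers. The source does not show how shopee-crawler distributes work internally, so the following is only a generic illustration of the idea using Python's standard concurrent.futures.ThreadPoolExecutor. Here fetch_page is a hypothetical stand-in that sleeps instead of making a network request.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import List


def fetch_page(page: int) -> List[int]:
    """Stand-in for one network request (hypothetical): sleeps, then returns fake rows."""
    time.sleep(0.02)  # simulated network latency
    return [page * 10 + i for i in range(3)]


def crawl_pages(pages: int, workers: int) -> List[int]:
    """Fetch `pages` pages using `workers` threads and flatten the rows in page order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # map() preserves input order even though the requests overlap in time
        batches = pool.map(fetch_page, range(pages))
        return [row for batch in batches for row in batch]


def timed_crawl(pages: int, workers: int) -> float:
    """Return the wall-clock seconds taken by crawl_pages."""
    start = time.perf_counter()
    crawl_pages(pages, workers)
    return time.perf_counter() - start


if __name__ == "__main__":
    for workers in (1, 16, 32):
        print(f"{workers:>2} workers: {timed_crawl(32, workers):.2f} s")
```

Because the work is I/O-bound (mostly waiting on responses), threads can overlap the waits, so more workers shorten the total time until the network or the server becomes the limit.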

Test it yourself

import time
from shopee_crawler.crawler import crawl_by_search

keyword = 'điện thoại samsung'  # Vietnamese for "Samsung phone"

# Start time (perf_counter is a clock intended for measuring intervals)
start = time.perf_counter()

# Crawl
crawl_by_search(keyword=keyword)

# End time
end = time.perf_counter()

# Elapsed crawling time
print(f"Time: {end - start:.2f} seconds")
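
A single timing can vary a lot with network conditions. The following sketch repeats the measurement and reports the best and mean times. time_call is a hypothetical helper, not part of shopee-crawler, and it accepts any callable so it can be tried without network access.

```python
import statistics
import time
from typing import Any, Callable, Dict


def time_call(fn: Callable[..., Any], repeats: int = 3, **kwargs: Any) -> Dict[str, float]:
    """Run fn(**kwargs) `repeats` times and summarise the elapsed seconds."""
    if repeats < 1:
        raise ValueError("repeats must be at least 1")
    elapsed = []
    for _ in range(repeats):
        start = time.perf_counter()
        fn(**kwargs)
        elapsed.append(time.perf_counter() - start)
    return {"best": min(elapsed), "mean": statistics.mean(elapsed)}
```

With the library installed, time_call(crawl_by_search, repeats=3, keyword='điện thoại samsung') would time the same call as the script above, three times over.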


