Skip to main content

crawl the name and price of each product from e-commerce websites in Taiwan.

Project description

papaprice 爬爬價

爬MOMO、PCHOME、ETMALL、SHOPEE、YAHOO 商品名稱與價格

from papaprice import Etmall, Momo, Pchome, Shopee, Yahoo

# https://www.etmall.com.tw/i/2886010
etmall = Etmall()
print('etmall', etmall.query('2886010'))
# etmall ('Apple iPhone 12 Pro 128G 智慧型 5G 手機', 32388)

# https://www.momoshop.com.tw/goods/GoodsDetail.jsp?i_code=8169274
momo = Momo()
print('momo', momo.query('8169274'))
# momo ('【Apple 蘋果】iPhone 12 Pro 128G 6.1吋(超值殼貼組)', 33000)

# https://24h.pchome.com.tw/prod/DYAJIM-A900B724R
pchome = Pchome()
print('pchome', pchome.query('DYAJIM-A900B724R'))
# pchome ('Apple iPhone 12 Pro (128G)-太平洋藍(MGMN3TA/A)', 32900)

# https://shopee.tw/--i.54598032.5857088290
shopee = Shopee()
print('shopee', shopee.query('54598032.5857088290'))
# shopee ('APPLE iPhone 12 Pro 128G 支援5G上網/A14 CPU 送門市現場保貼服務兌換券', 32940)

# https://tw.buy.yahoo.com/gdsale/-9205108.html
yahoo = Yahoo()
print('yahoo', yahoo.query('9205108'))
# yahoo ('Apple iPhone 12 PRO 128G 6.1吋智慧型手機', 32600)

支援 proxies 使用,避免大量查詢被封鎖。

proxies = {
    'https':'https://your.proxy.link',
    'http':'http://your.proxy.link',
}

# https://www.etmall.com.tw/i/2886010
etmall = Etmall(proxies)
print('etmall', etmall.query('2886010'))
# etmall ('Apple iPhone 12 Pro 128G 智慧型 5G 手機', 32388)

proxies 設定,以服務供應商 Bright Data 為例 ->連結

clientname = '你在Bright Data的用戶名' #注意:每種不同的Proxy服務,有不同的用戶名。
password = '對應的密碼'
proxy = f'https://{username}:{pwssword}@zproxy.lum-superproxy.io:22225'
proxies = {'https':proxy}

# https://www.etmall.com.tw/i/2886010
etmall = Etmall(proxies)
print('etmall', etmall.query('2886010'))
# etmall ('Apple iPhone 12 Pro 128G 智慧型 5G 手機', 32388)

未來版本

目前僅可透過已知的商品編號進行爬取,未來預計加入關鍵字搜尋功能,透過購物網站的搜尋功能進行爬取。

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

papaprice-0.0.6.tar.gz (6.5 kB view details)

Uploaded Source

Built Distribution

papaprice-0.0.6-py3-none-any.whl (6.9 kB view details)

Uploaded Python 3

File details

Details for the file papaprice-0.0.6.tar.gz.

File metadata

  • Download URL: papaprice-0.0.6.tar.gz
  • Upload date:
  • Size: 6.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.0

File hashes

Hashes for papaprice-0.0.6.tar.gz
Algorithm Hash digest
SHA256 e0c88b6ee04c20399caaa1da8f9c58736218445f7c905fe47d1ee447df4f7f98
MD5 043c0b0905628ad37baf9eceee8a4f39
BLAKE2b-256 e4bf9fe9ebe5d547796eda05be7aa74aa275e4dcdfe8fedaa70f864ee264ac60

See more details on using hashes here.

File details

Details for the file papaprice-0.0.6-py3-none-any.whl.

File metadata

  • Download URL: papaprice-0.0.6-py3-none-any.whl
  • Upload date:
  • Size: 6.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.0

File hashes

Hashes for papaprice-0.0.6-py3-none-any.whl
Algorithm Hash digest
SHA256 6293b7af4d64ec032fbc30851aef2c85f248a63e5b2e8d2ea4d7bb6ba7bc5e49
MD5 6726af6dbcad407fcdeecb55a5740267
BLAKE2b-256 491b86a66f2fc2d5b029b33faf3be50ec372fa618f1bab15713d50a767b9f409

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page