crawl the name and price of each product from e-commerce websites in Taiwan.
Project description
papaprice 爬爬價
爬MOMO、PCHOME、ETMALL、SHOPEE、YAHOO 商品名稱與價格
from papaprice import Etmall, Momo, Pchome, Shopee, Yahoo
# https://www.etmall.com.tw/i/2886010
etmall = Etmall()
print('etmall', etmall.query('2886010'))
# etmall ('Apple iPhone 12 Pro 128G 智慧型 5G 手機', 32388)
# https://www.momoshop.com.tw/goods/GoodsDetail.jsp?i_code=8169274
momo = Momo()
print('momo', momo.query('8169274'))
# momo ('【Apple 蘋果】iPhone 12 Pro 128G 6.1吋(超值殼貼組)', 33000)
# https://24h.pchome.com.tw/prod/DYAJIM-A900B724R
pchome = Pchome()
print('pchome', pchome.query('DYAJIM-A900B724R'))
# pchome ('Apple iPhone 12 Pro (128G)-太平洋藍(MGMN3TA/A)', 32900)
# https://shopee.tw/--i.54598032.5857088290
shopee = Shopee()
print('shopee', shopee.query('54598032.5857088290'))
# shopee ('APPLE iPhone 12 Pro 128G 支援5G上網/A14 CPU 送門市現場保貼服務兌換券', 32940)
# https://tw.buy.yahoo.com/gdsale/-9205108.html
yahoo = Yahoo()
print('yahoo', yahoo.query('9205108'))
# yahoo ('Apple iPhone 12 PRO 128G 6.1吋智慧型手機', 32600)
支援 proxies 使用,避免大量查詢被封鎖。
proxies = {
'https':'https://your.proxy.link',
'http':'http://your.proxy.link',
}
# https://www.etmall.com.tw/i/2886010
etmall = Etmall(proxies)
print('etmall', etmall.query('2886010'))
# etmall ('Apple iPhone 12 Pro 128G 智慧型 5G 手機', 32388)
proxies 設定,以服務供應商 Bright Data 為例 ->連結
clientname = '你在Bright Data的用戶名' #注意:每種不同的Proxy服務,有不同的用戶名。
password = '對應的密碼'
proxy = f'https://{username}:{pwssword}@zproxy.lum-superproxy.io:22225'
proxies = {'https':proxy}
# https://www.etmall.com.tw/i/2886010
etmall = Etmall(proxies)
print('etmall', etmall.query('2886010'))
# etmall ('Apple iPhone 12 Pro 128G 智慧型 5G 手機', 32388)
未來版本
目前僅可透過已知的商品編號進行爬取,未來預計加入關鍵字搜尋功能,透過購物網站的搜尋功能進行爬取。
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
papaprice-0.0.6.tar.gz
(6.5 kB
view details)
Built Distribution
File details
Details for the file papaprice-0.0.6.tar.gz
.
File metadata
- Download URL: papaprice-0.0.6.tar.gz
- Upload date:
- Size: 6.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e0c88b6ee04c20399caaa1da8f9c58736218445f7c905fe47d1ee447df4f7f98 |
|
MD5 | 043c0b0905628ad37baf9eceee8a4f39 |
|
BLAKE2b-256 | e4bf9fe9ebe5d547796eda05be7aa74aa275e4dcdfe8fedaa70f864ee264ac60 |
File details
Details for the file papaprice-0.0.6-py3-none-any.whl
.
File metadata
- Download URL: papaprice-0.0.6-py3-none-any.whl
- Upload date:
- Size: 6.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6293b7af4d64ec032fbc30851aef2c85f248a63e5b2e8d2ea4d7bb6ba7bc5e49 |
|
MD5 | 6726af6dbcad407fcdeecb55a5740267 |
|
BLAKE2b-256 | 491b86a66f2fc2d5b029b33faf3be50ec372fa618f1bab15713d50a767b9f409 |