Skip to main content

Ecommerce scrapers library

Project description

Ecommerce Retailer Scraper

The purpose of this project is to create a library of spiders/scraper for major online ecommerce retailer shop such as Amazon, Sephora, Walmart, and many other brands. The app will return product information and review as well in JSON format.

The extracted data you can use for market research, product design, consumer buying impulse...The sky is the limit.

Instruction:

  1. How to install:
pip install ecom-scraper
  1. How to use:
from retail_scraper.spiders.sephora import Sephora

# Sephora spider take either product url or productid Sephora(url=url, productid=product_id)
url = 'https://www.sephora.com/product/huda-beauty-liquid-matte-ultra-comfort-transfer-proof-lipstick-P479843'
product_id = 'P479843'
sephora = Sephora(url=url)
# Or
sephora = Sephora(productid=product_id)
sephora.scrap_product_info()  # Instantiate the Scrap product function
info = sephora.product_info  # product info and its variants will be stored in product_info
sephora.scrap_product_reviews()  # Instantiate The Scrap product reviews function
reviews = sephora.product_reviews  # All product reviews will be stored in product_reviews

Supported Scrapers

Add a New Spider or Feature

If you want to add a spider/scraper to the app or even a new feature please use the link bellow or open it as an issue in this github repo. Most upvoted feature will be added to the app.

Add new spider or feature

Upcoming Scraper/Spiders

Contribution

You are most welcome to contribute to this project and create pull requests.

Credit

  • @diemonster for all his comments, feedbacks and instruction.
  • Everyone in the #python community in the libera IRC

Disclimar

This library is built for educational puposes ONLY, use at your own risk.

Project details


Release history Release notifications | RSS feed

This version

1.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

retail_scraper-1.1.tar.gz (5.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

retail_scraper-1.1-py3-none-any.whl (6.6 kB view details)

Uploaded Python 3

File details

Details for the file retail_scraper-1.1.tar.gz.

File metadata

  • Download URL: retail_scraper-1.1.tar.gz
  • Upload date:
  • Size: 5.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.9

File hashes

Hashes for retail_scraper-1.1.tar.gz
Algorithm Hash digest
SHA256 b11485ecf3a937e6f41b12043dfabeb97fcd37e2deee16aa2156c2435743713c
MD5 9f1ceddd8e048bbce4f70acf2ded191e
BLAKE2b-256 583cd9d3c0efcac8da5d2564074ac63401729d639c21e5fe43bc4c6951a46447

See more details on using hashes here.

File details

Details for the file retail_scraper-1.1-py3-none-any.whl.

File metadata

  • Download URL: retail_scraper-1.1-py3-none-any.whl
  • Upload date:
  • Size: 6.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.8.9

File hashes

Hashes for retail_scraper-1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 c3042bef764e0ddc54c9dfb206a652753c0799c207d7504abfeed0fa3077ddce
MD5 13f90d301013789abec18288bf4365df
BLAKE2b-256 427854980235a9fb208fcdad16a2b88b20db54b5ec0ef7bdd359482f5db783fa

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page