Skip to main content

Ecommerce scrapers library

Project description

Ecommerce Retailer Scraper

The purpose of this project is to create a library of spiders/scraper for major online ecommerce retailer shop such as Amazon, Sephora, Walmart, and many other brands. The app will return product information and review as well in JSON format.

The extracted data you can use for market research, product design, consumer buying impulse...The sky is the limit.

Instruction:

  1. How to install:
pip install ecom-scraper
  1. How to use:
from retail_scraper.spiders.sephora import Sephora

# Sephora spider take either product url or productid Sephora(url=url, productid=product_id)
url = 'https://www.sephora.com/product/huda-beauty-liquid-matte-ultra-comfort-transfer-proof-lipstick-P479843'
product_id = 'P479843'
sephora = Sephora(url=url)
# Or
sephora = Sephora(productid=product_id)
sephora.scrap_product_info()  # Instantiate the Scrap product function
info = sephora.product_info  # product info and its variants will be stored in product_info
sephora.scrap_product_reviews()  # Instantiate The Scrap product reviews function
reviews = sephora.product_reviews  # All product reviews will be stored in product_reviews

Supported Scrapers

Add a New Spider or Feature

If you want to add a spider/scraper to the app or even a new feature please use the link bellow or open it as an issue in this github repo. Most upvoted feature will be added to the app.

Add new spider or feature

Upcoming Scraper/Spiders

Contribution

You are most welcome to contribute to this project and create pull requests.

Credit

  • @diemonster for all his comments, feedbacks and instruction.
  • Everyone in the #python community in the libera IRC

Disclimar

This library is built for educational puposes ONLY, use at your own risk.

Project details


Release history Release notifications | RSS feed

This version

1.1

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

retail_scraper-1.1.tar.gz (5.5 kB view hashes)

Uploaded Source

Built Distribution

retail_scraper-1.1-py3-none-any.whl (6.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page