A Python library for web scraping

Web Scraping - Python Library

A Python library that lets you scrape product data from popular e-commerce websites using only basic product information.

Overview

Collecting product data is useful for monitoring price changes or for deciding which e-commerce site to buy from. However, writing a web scraper from scratch is cumbersome and time-consuming. This Python library aims to simplify that process: given basic inputs such as a product name and a store, it returns rich product information.

Installation

To install, run the following:

pip install web-scraper-python-library
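
To confirm that the install succeeded, one option is to ask Python's standard `importlib.metadata` module for the installed version. This is a minimal sketch, not part of the library; the helper name `installed_version` is hypothetical, and the distribution name is taken from the `pip install` command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name):
    """Return the installed version string, or None if the package is missing.

    Hypothetical helper for illustration; it is not provided by the library.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints a version string if the package is installed, otherwise None.
print(installed_version("web-scraper-python-library"))
```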

Usage

Product Search

The following code retrieves the product data for an iPhone 12 from Amazon and prints it as a JSON array of product records. The function takes two arguments:

product: a product name, as you would type it into the search box on a company's website

company: one of 'eBay', 'Walmart', or 'Amazon'

Code

from web_scraper_python_library import main as m

# Search Amazon for "iphone 12" and collect the matching listings
product_data = m.main(product='iphone 12', company='Amazon')
print(product_data)

Output

[
  {
    "name": "Apple iPhone 12, 64GB, Green - Fully Unlocked (Renewed)",
    "price": 390.0,
    "extraction_date": "2023-03-23 17:38:32"
  },
  ...
  {
    "name": "Apple iPhone 12 Mini, 64GB, Black - Unlocked (Renewed)",
    "price": 312.0,
    "extraction_date": "2023-03-23 17:38:32"
  }
]
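
Once the records are available, a common next step is to pick the cheapest listing. The sketch below works on the two sample records shown above rather than on a live call. It assumes each record has the `name`, `price`, and `extraction_date` keys from the sample output, and because the return type of `m.main` is not stated here, it accepts either a JSON string or a list of dictionaries. The helper names `as_records`, `cheapest`, and `extraction_time` are hypothetical and not part of the library.

```python
import json
from datetime import datetime

# Sample records copied from the output above; a real call would return more.
sample = [
    {
        "name": "Apple iPhone 12, 64GB, Green - Fully Unlocked (Renewed)",
        "price": 390.0,
        "extraction_date": "2023-03-23 17:38:32",
    },
    {
        "name": "Apple iPhone 12 Mini, 64GB, Black - Unlocked (Renewed)",
        "price": 312.0,
        "extraction_date": "2023-03-23 17:38:32",
    },
]


def as_records(product_data):
    """Return a list of dicts, whether given a JSON string or a list (assumption)."""
    if isinstance(product_data, str):
        return json.loads(product_data)
    return list(product_data)


def cheapest(product_data):
    """Return the record with the lowest price, or None if there are no priced records."""
    records = [r for r in as_records(product_data) if r.get("price") is not None]
    return min(records, key=lambda r: r["price"]) if records else None


def extraction_time(record):
    """Parse the extraction_date string using the format seen in the sample output."""
    return datetime.strptime(record["extraction_date"], "%Y-%m-%d %H:%M:%S")


best = cheapest(sample)
print(best["name"], best["price"])
print(extraction_time(best).date())
```

With a live result, the same helper could be called as `cheapest(product_data)`, provided the returned records follow the shape shown in the sample output.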
