Project description

WebScraperAPI

WebScraperAPI is a small Python package for extracting data from a web page given its URL. It works in three steps: fetch the page's HTML, parse it, and save the parsed data to a file such as a CSV. It installs with pip and is aimed at data scientists, researchers, and developers who need quick access to web data.

Installation

To install the package, use pip:

pip install WebScraperAPI

Usage

Import the Scraper, Parser, and DataStorage classes and use them in sequence: Scraper fetches the page HTML, Parser extracts the data, and DataStorage writes it to a file.

from WebScraperAPI.scraper import Scraper
from WebScraperAPI.parser import Parser
from WebScraperAPI.data_storage import DataStorage

url = 'https://example.com'

# Scrape the data from the website
scraper_obj = Scraper(url)
data = scraper_obj.get_html()

if data:
    # Parse the data
    parser_obj = Parser(data)
    parsed_data = parser_obj.parse_data()

    # Save the data to a CSV file
    file_name = 'example_output.csv'
    data_storage_obj = DataStorage(parsed_data, file_name)
    data_storage_obj.save_data()
else:
    print("An error occurred while scraping the website")
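
The fetch, parse, and save steps above can be illustrated without installing anything. The following is a minimal sketch, not WebScraperAPI's implementation: it assumes the data of interest is the set of links on a page, uses the standard library's html.parser in place of beautifulsoup4, and parses an inline HTML string instead of fetching a URL. The names LinkCollector and links_to_csv are hypothetical and exist only for this illustration.

```python
import csv
import io
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect a (text, href) pair for every <a> element (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._href = None   # href of the <a> currently open, or None
        self._text = []     # text fragments seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href") or ""
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.rows.append(("".join(self._text).strip(), self._href))
            self._href = None


def links_to_csv(html):
    """Parse an HTML string and return CSV text with one row per link."""
    collector = LinkCollector()
    collector.feed(html)
    collector.close()
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(["text", "href"])
    writer.writerows(collector.rows)
    return buffer.getvalue()


# Stand-in for the HTML a scraper would have fetched.
sample_html = '<p><a href="/docs">Docs</a> and <a href="/faq">FAQ</a></p>'
csv_text = links_to_csv(sample_html)
print(csv_text)
```

Writing to an in-memory buffer keeps the sketch free of side effects; to produce a file like example_output.csv, write the returned text to disk.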

Dependencies

beautifulsoup4
requests
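
If an import error appears at runtime, it can help to check that both dependencies are importable. The sketch below uses only the standard library; missing_modules is a hypothetical helper name. Note that the beautifulsoup4 distribution is imported under the module name bs4.

```python
from importlib.util import find_spec


def missing_modules(module_names):
    """Return the top-level module names that cannot be found for import."""
    return [name for name in module_names if find_spec(name) is None]


# beautifulsoup4 is imported as "bs4"; requests is imported as "requests".
missing = missing_modules(["bs4", "requests"])
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All dependencies are importable")
```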

Contributing

To contribute to this project, fork the repository and open a pull request.

License

This project is licensed under the MIT License.

Download files

Source Distribution

WebScraperAPI-1.0.3.tar.gz (3.5 kB)

Uploaded Source

File details

Details for the file WebScraperAPI-1.0.3.tar.gz.

File metadata

  • Download URL: WebScraperAPI-1.0.3.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for WebScraperAPI-1.0.3.tar.gz

Algorithm     Hash digest
SHA256        5e18e6fc504235fef46c25111cd1565a7c649e42e49ba7da913f12376d546ad1
MD5           98a3a4ddf1564194a8eb37154e25332a
BLAKE2b-256   9ca4fb4bafce6d7b4d9196ae3e510c06d81bda3239c0b04a6a414739eaed1d32
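
A downloaded archive can be checked against the SHA256 digest listed above before installing it. The following is a minimal sketch using the standard library's hashlib; the function names sha256_of_file and matches_expected are hypothetical, and the sketch assumes the archive has already been downloaded to a local path.

```python
import hashlib

# SHA256 digest listed above for WebScraperAPI-1.0.3.tar.gz.
EXPECTED_SHA256 = "5e18e6fc504235fef46c25111cd1565a7c649e42e49ba7da913f12376d546ad1"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of_file(path) == expected.lower()
```

For example, matches_expected("WebScraperAPI-1.0.3.tar.gz") should return True for an intact copy of the archive in the current directory. A False result means the file differs from the one these hashes describe.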
