Skip to main content

A Python application that provides web scraping capabilities, including fetching Google and Yelp reviews.

Project description

pydatascraper

pydatascraper is a Python application that provides web scraping capabilities, including fetching Google and Yelp reviews. The application has a user-friendly graphical user interface (GUI) for easy interaction.

Features

  • Web Scraping: Extract information from web pages based on user-provided URLs.
  • Google Reviews: Fetch reviews for a given business or location using Google Maps API.
  • Yelp Reviews: Retrieve reviews for a business using the Yelp API.
  • OpenStreetMap Data: Extract latitude, longitude, and additional information from OpenStreetMap.

Requirements

  • Python 3.x
  • Required Python packages (install using pip install -r requirements.txt):
    • requests
    • beautifulsoup4
    • pandas
    • openpyxl
    • nltk (for text processing)
    • tkinter (GUI toolkit)

Usage

  1. Clone the repository:

    git clone https://github.com/arjunlimat/pydatascraper.git
    
  2. Install the package directly:


pip install pydatascraper

  1. import the webscraper model:
from pydatascraper.pyscraper import main
  1. Run the application:
main()

The GUI will appear, allowing you to choose different services and perform web scraping tasks.

Services

Web Scraping

Enter a URL and click "Search" to explore available data types.

Choose the desired data type, enter a file name, and click "Download" to save the data.

Google Reviews

Select "Google reviews" from the services dropdown.

Enter the business or location name and address. Provide a file name and click "Download" to fetch and save Google reviews.

Yelp Reviews

Select "Yelp reviews" from the services dropdown. Enter the business name and address. Provide a file name and click "Download" to fetch and save Yelp reviews.

OpenStreetMap

Select "Open Street Map" from the services dropdown. Enter the map URL, provide a file name, and click "Download" to extract map data.

Contributing

Contributions are welcome! If you encounter issues or have ideas for improvement, please open an issue or submit a pull request.

License:

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydatascraper-1.0.4.tar.gz (13.8 kB view details)

Uploaded Source

Built Distribution

pydatascraper-1.0.4-py3-none-any.whl (13.1 kB view details)

Uploaded Python 3

File details

Details for the file pydatascraper-1.0.4.tar.gz.

File metadata

  • Download URL: pydatascraper-1.0.4.tar.gz
  • Upload date:
  • Size: 13.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.8.4

File hashes

Hashes for pydatascraper-1.0.4.tar.gz
Algorithm Hash digest
SHA256 7663a55e8ce8a7c813e578df6ad149400bd7efa970c8ec8941bdd47d7c33b4de
MD5 4f5606652284b43873e76ab00d4300a7
BLAKE2b-256 2795662e1b4fa0a166b6be5a8f9d99e821d7e79c6948439bda3872a77b170767

See more details on using hashes here.

File details

Details for the file pydatascraper-1.0.4-py3-none-any.whl.

File metadata

File hashes

Hashes for pydatascraper-1.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 84d58546ec4ff21348defd169e21ebd6a56e51a12dea824ff5926d0999bfb358
MD5 c138566a03d39e72d36516d639fd7b30
BLAKE2b-256 75408527657b8fbf587ccf018df29fe20737c38524cc207bb1374b419f5452a3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page