A Python application that provides web scraping capabilities, including fetching Google and Yelp reviews.
Project description
pydatascraper
pydatascraper is a Python application that provides web scraping capabilities, including fetching Google and Yelp reviews. The application has a user-friendly graphical user interface (GUI) for easy interaction.
Features
- Web Scraping: Extract information from web pages based on user-provided URLs.
- Google Reviews: Fetch reviews for a given business or location using Google Maps API.
- Yelp Reviews: Retrieve reviews for a business using the Yelp API.
- OpenStreetMap Data: Extract latitude, longitude, and additional information from OpenStreetMap.
Requirements
- Python 3.x
- Required Python packages (install using
pip install -r requirements.txt
):requests
beautifulsoup4
pandas
openpyxl
nltk
(for text processing)tkinter
(GUI toolkit)
Usage
-
Clone the repository:
git clone https://github.com/arjunlimat/pydatascraper.git
-
Install the package directly:
pip install pydatascraper
- import the webscraper model:
from pydatascraper.pyscraper import main
- Run the application:
main()
The GUI will appear, allowing you to choose different services and perform web scraping tasks.
Services
Web Scraping
Enter a URL and click "Search" to explore available data types.
Choose the desired data type, enter a file name, and click "Download" to save the data.
Google Reviews
Select "Google reviews" from the services dropdown.
Enter the business or location name and address. Provide a file name and click "Download" to fetch and save Google reviews.
Yelp Reviews
Select "Yelp reviews" from the services dropdown. Enter the business name and address. Provide a file name and click "Download" to fetch and save Yelp reviews.
OpenStreetMap
Select "Open Street Map" from the services dropdown. Enter the map URL, provide a file name, and click "Download" to extract map data.
Contributing
Contributions are welcome! If you encounter issues or have ideas for improvement, please open an issue or submit a pull request.
License:
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pydatascraper-1.0.4.tar.gz
.
File metadata
- Download URL: pydatascraper-1.0.4.tar.gz
- Upload date:
- Size: 13.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.8.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7663a55e8ce8a7c813e578df6ad149400bd7efa970c8ec8941bdd47d7c33b4de |
|
MD5 | 4f5606652284b43873e76ab00d4300a7 |
|
BLAKE2b-256 | 2795662e1b4fa0a166b6be5a8f9d99e821d7e79c6948439bda3872a77b170767 |
File details
Details for the file pydatascraper-1.0.4-py3-none-any.whl
.
File metadata
- Download URL: pydatascraper-1.0.4-py3-none-any.whl
- Upload date:
- Size: 13.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.8.4
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 84d58546ec4ff21348defd169e21ebd6a56e51a12dea824ff5926d0999bfb358 |
|
MD5 | c138566a03d39e72d36516d639fd7b30 |
|
BLAKE2b-256 | 75408527657b8fbf587ccf018df29fe20737c38524cc207bb1374b419f5452a3 |