A Python application that provides web scraping capabilities, including fetching Google and Yelp reviews.
Project description
pydatascraper
pydatascraper is a Python application that provides web scraping capabilities, including fetching Google and Yelp reviews. The application has a user-friendly graphical user interface (GUI) for easy interaction.
Features
- Web Scraping: Extract information from web pages based on user-provided URLs.
- Google Reviews: Fetch reviews for a given business or location using Google Maps API.
- Yelp Reviews: Retrieve reviews for a business using the Yelp API.
- OpenStreetMap Data: Extract latitude, longitude, and additional information from OpenStreetMap.
Requirements
- Python 3.x
- Required Python packages (install using
pip install -r requirements.txt):requestsbeautifulsoup4pandasopenpyxlnltk(for text processing)tkinter(GUI toolkit)
Usage
-
Clone the repository:
git clone https://github.com/arjunlimat/pydatascraper.git
-
Install the package directly:
pip install pydatascraper
- import the webscraper model:
from pydatascraper.pyscraper import main
- Run the application:
main()
The GUI will appear, allowing you to choose different services and perform web scraping tasks.
Services
Web Scraping
Enter a URL and click "Search" to explore available data types.
Choose the desired data type, enter a file name, and click "Download" to save the data.
Google Reviews
Select "Google reviews" from the services dropdown.
Enter the business or location name and address. Provide a file name and click "Download" to fetch and save Google reviews.
Yelp Reviews
Select "Yelp reviews" from the services dropdown. Enter the business name and address. Provide a file name and click "Download" to fetch and save Yelp reviews.
OpenStreetMap
Select "Open Street Map" from the services dropdown. Enter the map URL, provide a file name, and click "Download" to extract map data.
Contributing
Contributions are welcome! If you encounter issues or have ideas for improvement, please open an issue or submit a pull request.
License:
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pydatascraper-1.0.4.tar.gz.
File metadata
- Download URL: pydatascraper-1.0.4.tar.gz
- Upload date:
- Size: 13.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.8.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7663a55e8ce8a7c813e578df6ad149400bd7efa970c8ec8941bdd47d7c33b4de
|
|
| MD5 |
4f5606652284b43873e76ab00d4300a7
|
|
| BLAKE2b-256 |
2795662e1b4fa0a166b6be5a8f9d99e821d7e79c6948439bda3872a77b170767
|
File details
Details for the file pydatascraper-1.0.4-py3-none-any.whl.
File metadata
- Download URL: pydatascraper-1.0.4-py3-none-any.whl
- Upload date:
- Size: 13.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.8.4
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
84d58546ec4ff21348defd169e21ebd6a56e51a12dea824ff5926d0999bfb358
|
|
| MD5 |
c138566a03d39e72d36516d639fd7b30
|
|
| BLAKE2b-256 |
75408527657b8fbf587ccf018df29fe20737c38524cc207bb1374b419f5452a3
|