This is a Python web scraper built to extract car information from Turbo.az, one of the most popular car purchase websites in Azerbaijan.
Project description
Turbo.az Scraper
This is a Python web scraper built to extract car information from Turbo.az, one of the most popular car purchase websites in Azerbaijan.
Features
- Extracts car details including make, model, year, price, and location.
- Data is saved in CSV format.
- Selenium and Beautiful Soup libraries are used for web scraping.
- Code may need to be updated due to possible changes in website HTML tags.
Installation
- Clone this repository.
- Install the required dependencies by running
pip install -r requirements.txt
in your terminal.
Installation
You can install turbo2csv via pip:
pip install turbo2csv
Usage
from turbo2csv.scraper import TurboScraper
scraper = TurboScraper(geckodriver_path='path/to/geckodriver', headless=True)
scraper.scrape(output_file='turbo.csv', start=1)
Testing
turbo2csv includes comprehensive test coverage to ensure reliability and accuracy. To run the tests, you can use pytest:
pip install pytest
pytest
Disclaimer
This application is intended for educational purposes only and should not be used for commercial purposes. The author is not responsible for any legal issues that may arise from the misuse of this tool.
License
This project is licensed under the MIT License - see the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file turbo2csv-1.2.0.tar.gz
.
File metadata
- Download URL: turbo2csv-1.2.0.tar.gz
- Upload date:
- Size: 4.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b9b3cb7305ea1d0c893963a16b32047b1cc30f8ec64938ab014b8451e0a8dfb2 |
|
MD5 | fbfaaa9cc6fceee95a7eaad6c1828d5f |
|
BLAKE2b-256 | 20c5b94737bfc0ec7d15e00989869f33f78fc7a8f5ec19b69c68fce815e45f9a |
File details
Details for the file turbo2csv-1.2.0-py3-none-any.whl
.
File metadata
- Download URL: turbo2csv-1.2.0-py3-none-any.whl
- Upload date:
- Size: 4.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 149c6f43d37757218e56d1ec55d4eb53c9da7bc2b29bf62f50698a4b97f6813e |
|
MD5 | ffdaef339e3a77a9e62466ef69a9156d |
|
BLAKE2b-256 | 367ed11cbd2626e82dc54b51f251d39c956793fec5f644c48e98ea30863f3cd6 |