Aruodas.lt website scraper
The Aruodas Web-scraper
Description
This web scraper collects the following information for apartments listed on the Aruodas website (aruodas.lt):
- City
- Sub-district
- Description
- Link
- Building number
- Flat number
- Area
- Price per month
- Build year
- Building type
- Heating system
- Energy class
- Nearest kindergarten
- Nearest educational institution
- Nearest stop
- Nearest public transport stop
The scrape method loops through the result pages of the aruodas.lt website and scrapes the data listed above. The method has 4 parameters:
- no_room: this parameter is used to search for a single apartment size only, for example only 2-room apartments.
- room_min: this parameter is used to specify the minimum number of rooms to be included in the search results.
- room_max: this parameter is used to specify the maximum number of rooms to be included in the search results.
- num_houses: this parameter is used to indicate the number of apartments to be scraped.
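How the three room parameters interact can be sketched as follows. This is a minimal illustration, not the package's actual code: `resolve_room_range` is a hypothetical helper, and both the precedence of no_room over room_min/room_max and the treatment of an omitted bound as open-ended are assumptions inferred from the parameter descriptions above.

```python
from typing import Optional, Tuple


def resolve_room_range(
    no_room: Optional[int] = None,
    room_min: Optional[int] = None,
    room_max: Optional[int] = None,
) -> Tuple[Optional[int], Optional[int]]:
    """Return a (lowest, highest) room-count filter; None means no bound.

    Hypothetical helper: assumes no_room, when given, overrides the range.
    """
    if no_room is not None:
        # A single room count collapses the range to one value.
        return no_room, no_room
    if room_min is not None and room_max is not None and room_min > room_max:
        raise ValueError("room_min must not exceed room_max")
    return room_min, room_max


print(resolve_room_range(no_room=2))               # (2, 2)
print(resolve_room_range(room_min=1, room_max=4))  # (1, 4)
print(resolve_room_range(room_min=3))              # (3, None)
```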
Usage
To use the scraper, install the package with pip, then import it and call scrape:
pip install vilnius-aruodas-scraper
from aruodas_scraper import AruodasScraper
data = AruodasScraper()
# to scrape data for 100 apartments with 1 - 4 rooms
df = data.scrape(num_houses=100, room_min=1, room_max=4)
# to scrape data for 20 apartments with a minimum of 3 rooms
df = data.scrape(num_houses=20, room_min=3)
# to scrape data for 20 apartments with a maximum of 3 rooms
df = data.scrape(num_houses=20, room_max=3)
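A common next step after scraping is to derive a comparable figure such as rent per square metre. The sketch below works on plain dictionaries so that it runs without the package or network access. The keys "Area" and "Price per month" are assumed from the field list above and may not match the actual column names, and the two sample rows are made-up placeholders. If scrape returns a pandas DataFrame (suggested by the name df, but not confirmed here), `df.to_dict("records")` would produce rows of this shape.

```python
from typing import Dict, List


def add_price_per_sqm(rows: List[Dict[str, float]]) -> List[Dict[str, float]]:
    """Return copies of the rows with an added 'Price per sqm' value."""
    enriched = []
    for row in rows:
        area = row["Area"]
        if area <= 0:
            continue  # skip rows with a missing or invalid area
        new_row = dict(row)  # copy so the input rows stay unchanged
        new_row["Price per sqm"] = round(row["Price per month"] / area, 2)
        enriched.append(new_row)
    return enriched


# Placeholder rows for illustration only (not real listings).
sample = [
    {"Area": 50.0, "Price per month": 600.0},
    {"Area": 40.0, "Price per month": 500.0},
]
for listing in add_price_per_sqm(sample):
    print(listing["Price per sqm"])  # 12.0, then 12.5
```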
License
The MIT License - Copyright (c) 2021 - Blessing Ehizojie-Philips
Source Distribution
File details
Details for the file vilnius-aruodas-scraper-0.0.5.tar.gz.
File metadata
- Download URL: vilnius-aruodas-scraper-0.0.5.tar.gz
- Upload date:
- Size: 4.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.6.0 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.9
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | a2e25a03eb2cebda03e6f52d99200525a9ea51c5f8f22e2ecb46f0d19917f0f3 |
| MD5 | 0b1639ef3789cc6edd25ebb71e77c0da |
| BLAKE2b-256 | cd630a630154309978a6d644fca553d9c05e8e43525d02789f0bcf78b55beb94 |