Skip to main content

Aruodas.lt website scraper

Project description

The Aruodas Web-scraper

Description

This web_scraper is designed to collect the following information for apartments listed on the Aruodas website.

  • City
  • Sub-district
  • Description
  • Link
  • Building number
  • Flat number
  • Area
  • Price per month
  • Build year
  • Building type
  • Heating system
  • Energy class
  • Nearest kindergarten
  • Nearest educational institution
  • Nearest stop
  • Nearest public transport stop

The scraper has 3 methods:

  • scrape - Loops through webpages and scrapes data off the aruodas.lt website.
  • to_csv - Used to save the dataframe to csv
  • scrape_to_csv - Used to scrape and save the data to csv

Usage

To use the scaper, pip install the package.

pip install vilnius-aruodas-scraper

from aruodas_scraper import AruodasScraper()

one_four_rooms = AruodasScraper()

# to scrape and data
df = one_four_rooms.scrape(num_houses=100, room_min=1, room_max=3)

# to save scraped data to csv
one_four_rooms.to_csv(df)

# to scrape and save data to csv
one_four_rooms.scrape_to_csv(num_houses=100, room_min=1, room_max=3)

License

The MIT License - Copyright (c) 2021 - Blessing Ehizojie-Philips

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vilnius-aruodas-scraper-0.0.1.tar.gz (4.3 kB view details)

Uploaded Source

File details

Details for the file vilnius-aruodas-scraper-0.0.1.tar.gz.

File metadata

  • Download URL: vilnius-aruodas-scraper-0.0.1.tar.gz
  • Upload date:
  • Size: 4.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.5.0 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.7

File hashes

Hashes for vilnius-aruodas-scraper-0.0.1.tar.gz
Algorithm Hash digest
SHA256 c897b223cc8f3ac76cea99cc23136877f7d642912016c0fe9faf8502e1f22b43
MD5 0693246aa719e9f43b4ce26a50c93cc8
BLAKE2b-256 691f77d318dfe8f6062b6b89e20e83937ef0950654e6b9a22e5699918a7f2536

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page