Aruodas.lt website scraper

The Aruodas Web-scraper

Description

This web scraper collects the following information for apartments listed for rent on the Aruodas website (aruodas.lt):

  • City
  • Sub-district
  • Description
  • Link
  • Building number
  • Flat number
  • Area
  • Price per month
  • Build year
  • Building type
  • Heating system
  • Energy class
  • Nearest kindergarten
  • Nearest educational institution
  • Nearest stop
  • Nearest public transport stop
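
One way to picture a single scraped row is as a typed record. The sketch below is only an illustration: the class name `ApartmentListing`, the snake_case field names, the types, and the units noted in comments are all assumptions, not the package's actual column names or schema.

```python
from dataclasses import dataclass, asdict, fields
from typing import Optional


@dataclass
class ApartmentListing:
    """Hypothetical record for one listing; names and types are assumed."""
    city: str
    sub_district: str
    description: str
    link: str
    building_number: Optional[str] = None
    flat_number: Optional[str] = None
    area: Optional[float] = None             # assumed to be square metres
    price_per_month: Optional[float] = None  # assumed to be euros
    build_year: Optional[int] = None
    building_type: Optional[str] = None
    heating_system: Optional[str] = None
    energy_class: Optional[str] = None
    nearest_kindergarten: Optional[str] = None
    nearest_educational_institution: Optional[str] = None
    nearest_stop: Optional[str] = None
    nearest_public_transport_stop: Optional[str] = None


# Placeholder values only; this is not real listing data.
example = ApartmentListing(
    city="Vilnius",
    sub_district="Antakalnis",
    description="(placeholder)",
    link="https://example.com/listing",
)
print(asdict(example))
```

Fields that a listing may omit default to `None`, so a partially filled listing can still be represented.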

The scraper has 3 methods:

  • scrape - Loops through the result pages on aruodas.lt and scrapes the listing data.
  • to_csv - Saves the scraped dataframe to a CSV file.
  • scrape_to_csv - Scrapes the data and saves it to a CSV file in one call.
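
The relationship between the three methods can be sketched as follows. This is a hypothetical stand-in, not the package's implementation: the class `SketchScraper`, the `path` parameter, and the placeholder rows are assumptions, and plain dictionaries stand in for the dataframe so that the example needs only the standard library.

```python
import csv
from typing import Dict, List


class SketchScraper:
    """Hypothetical stand-in that mirrors the three documented methods."""

    def scrape(self, num_houses: int, room_min: int, room_max: int) -> List[Dict[str, object]]:
        # A real scraper would fetch listing pages here; this stub only
        # returns placeholder rows so the control flow can be shown.
        return [{"listing": i, "rooms": room_min} for i in range(num_houses)]

    def to_csv(self, rows: List[Dict[str, object]], path: str = "listings.csv") -> None:
        # `path` is an assumed parameter; nothing is written for empty input.
        if not rows:
            return
        with open(path, "w", newline="", encoding="utf-8") as f:
            writer = csv.DictWriter(f, fieldnames=list(rows[0].keys()))
            writer.writeheader()
            writer.writerows(rows)

    def scrape_to_csv(self, num_houses: int, room_min: int, room_max: int,
                      path: str = "listings.csv") -> None:
        # Convenience wrapper: scrape first, then save the result.
        rows = self.scrape(num_houses, room_min, room_max)
        self.to_csv(rows, path)
```

Under these assumptions, `scrape_to_csv` is just `scrape` followed by `to_csv`, which is why the same keyword arguments appear in both calls.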

Usage

To use the scraper, install the package with pip and import the AruodasScraper class.

pip install vilnius-aruodas-scraper

from aruodas_scraper import AruodasScraper

one_four_rooms = AruodasScraper()

# to scrape data
df = one_four_rooms.scrape(num_houses=100, room_min=1, room_max=3)

# to save scraped data to csv
one_four_rooms.to_csv(df)

# to scrape and save data to csv
one_four_rooms.scrape_to_csv(num_houses=100, room_min=1, room_max=3)
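
Once the data is saved, it can be post-processed with the standard library. The sketch below computes the monthly price per unit of area for each row. The column names `City`, `Area`, and `Price per month` are assumed from the field list above, and the sample rows are made up for illustration; check the header of the CSV the scraper actually writes before relying on them.

```python
import csv
import io

# Made-up sample standing in for the saved CSV; column names are assumed.
SAMPLE_CSV = """City,Area,Price per month
Vilnius,50,600
Vilnius,40,
Kaunas,30,300
"""


def price_per_area(csv_text):
    """Return (city, price / area) pairs, skipping incomplete rows."""
    results = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        try:
            area = float(row["Area"])
            price = float(row["Price per month"])
        except (KeyError, TypeError, ValueError):
            continue  # missing column, short row, or non-numeric value
        if area > 0:
            results.append((row.get("City", ""), price / area))
    return results


ratios = price_per_area(SAMPLE_CSV)
for city, ratio in ratios:
    print(city, round(ratio, 2))
```

Rows with a missing or non-numeric area or price are skipped rather than raising, since scraped listings often leave fields blank.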

License

The MIT License - Copyright (c) 2021 - Blessing Ehizojie-Philips
