Skip to main content

Aruodas.lt website scraper

Project description

The Aruodas Web-scraper

Description

This web_scraper is designed to collect the following information for apartments listed on the Aruodas website.

  • City
  • Sub-district
  • Description
  • Link
  • Building number
  • Flat number
  • Area
  • Price per month
  • Build year
  • Building type
  • Heating system
  • Energy class
  • Nearest kindergarten
  • Nearest educational institution
  • Nearest stop
  • Nearest public transport stop

The scraper has 3 methods:

  • scrape - Loops through webpages and scrapes data off the aruodas.lt website.
  • to_csv - Used to save the dataframe to csv
  • scrape_to_csv - Used to scrape and save the data to csv

Usage

To use the scaper, pip install the package.

pip install vilnius-aruodas-scraper

from aruodas_scraper import AruodasScraper()

one_four_rooms = AruodasScraper()

# to scrape and data
df = one_four_rooms.scrape(num_houses=100, room_min=1, room_max=3)

# to save scraped data to csv
one_four_rooms.to_csv(df)

# to scrape and save data to csv
one_four_rooms.scrape_to_csv(num_houses=100, room_min=1, room_max=3)

License

The MIT License - Copyright (c) 2021 - Blessing Ehizojie-Philips

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vilnius-aruodas-scraper-0.0.2.tar.gz (4.4 kB view details)

Uploaded Source

File details

Details for the file vilnius-aruodas-scraper-0.0.2.tar.gz.

File metadata

  • Download URL: vilnius-aruodas-scraper-0.0.2.tar.gz
  • Upload date:
  • Size: 4.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.5.0 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.9.7

File hashes

Hashes for vilnius-aruodas-scraper-0.0.2.tar.gz
Algorithm Hash digest
SHA256 41415356282225bda3c3a8b39aa4391cd2f70b33da850bda9a09e0e8c23c238f
MD5 22c3a84ba1202ab67cd9d8b0e16f1ed9
BLAKE2b-256 5da18220c6778cfeea2d5002235bda67825736ed9ec1484a234beaa707036576

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page