A webscraper for extracting information of schools from http://schulverzeichnis.eu
Project description
SVWebScraper
SVWebScraper is a tool to parse information of schools listed on http://schulverzeichnis.eu - a database of all austrian schools. As this site does not provide means for exporting the data to csv files this webscraper was implemented to do so.
How to use it?
$ svscrape --base-url https://schulverzeichnis.eu/typ/ --type neue-mittelschule --query ?bundesland=wien --csv schools.csv
Start scrapping schools from base url ...
20 schools scrapped from "http://www.schulverzeichnis.eu/typ/neue-mittelschule"
40 schools scrapped from "http://www.schulverzeichnis.eu/typ/neue-mittelschule"
...
$ cat ./schools.csv
NAME;ADRESSE;PLZ;ORT;TEL_NR
Freie Waldorfschule Wien-West ;Seuttergasse 29;1130;Wien;01/1234
Mittelschule des Schulvereins der Dominikanerinnen Wien;Schlossberggasse 17;1130;Wien;01/1234
Neue Mittelschule Wien;Neubaugasse 42;1070;Wien;01/1234
...
Documentation
$ svscrape --help
Usage: svscrape [OPTIONS]
Options:
--csv FILENAME
--baseurl TEXT
--type [neue-mittelschule|ahs-mit-nms|hauptschule|sonderschule]
--query TEXT
--help Show this message and exit.
Installing the Latest Version
python -m pip install svscrape
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
svscrape-2022.4.tar.gz
(9.6 kB
view details)
Built Distribution
svscrape-2022.4-py3-none-any.whl
(11.7 kB
view details)
File details
Details for the file svscrape-2022.4.tar.gz
.
File metadata
- Download URL: svscrape-2022.4.tar.gz
- Upload date:
- Size: 9.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 81e9807e85109b9d17781058d2f9ead31976f2576ccf2e13648aa9e99d7b0b76 |
|
MD5 | 23a7dda91f6ada313c05498b72e0ca64 |
|
BLAKE2b-256 | 76c31a1f8460f616a3d5662c98fbfa91ea2e86670b32e9419a0eec46a255582f |
File details
Details for the file svscrape-2022.4-py3-none-any.whl
.
File metadata
- Download URL: svscrape-2022.4-py3-none-any.whl
- Upload date:
- Size: 11.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e92728bd0d65d23b9bbfa1e076a9c5b8d3720be02205edd191b6ae0afc8af1b2 |
|
MD5 | eba437806c970bcde18d384d817810c1 |
|
BLAKE2b-256 | e43067fcd352cc64c11e0009021441592c6446f2d69e9beb09b5d21f19553555 |