A web scraper for extracting information about schools from http://schulverzeichnis.eu
SVWebScraper
SVWebScraper is a tool that parses information about the schools listed on http://schulverzeichnis.eu, a database of all Austrian schools. Because the site offers no way to export its data to CSV files, this web scraper was implemented to do so.
How to use it?
$ svscrape --baseurl https://schulverzeichnis.eu/typ/ --type neue-mittelschule --query ?bundesland=wien --csv schools.csv
Start scrapping schools from base url ...
20 schools scrapped from "http://www.schulverzeichnis.eu/typ/neue-mittelschule"
40 schools scrapped from "http://www.schulverzeichnis.eu/typ/neue-mittelschule"
...
$ cat ./schools.csv
NAME;ADRESSE;PLZ;ORT;TEL_NR
Freie Waldorfschule Wien-West ;Seuttergasse 29;1130;Wien;01/1234
Mittelschule des Schulvereins der Dominikanerinnen Wien;Schlossberggasse 17;1130;Wien;01/1234
Neue Mittelschule Wien;Neubaugasse 42;1070;Wien;01/1234
...
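The exported file is semicolon-delimited, so it can be loaded with Python's standard `csv` module. The following is a minimal sketch, not part of svscrape itself: the helper name `read_schools` is hypothetical, the sample rows are copied from the output above, and it assumes every row has the same five columns as the header.

```python
import csv
import io

# Sample rows copied from the example output above.
# In practice, open the real schools.csv instead of using this string.
SAMPLE = """NAME;ADRESSE;PLZ;ORT;TEL_NR
Freie Waldorfschule Wien-West ;Seuttergasse 29;1130;Wien;01/1234
Neue Mittelschule Wien;Neubaugasse 42;1070;Wien;01/1234
"""


def read_schools(stream):
    """Parse semicolon-delimited rows into dicts (hypothetical helper).

    Values are stripped because the export may contain trailing spaces,
    as in the first sample row.
    """
    reader = csv.DictReader(stream, delimiter=";")
    return [
        {key: (value or "").strip() for key, value in row.items()}
        for row in reader
    ]


schools = read_schools(io.StringIO(SAMPLE))
for school in schools:
    print(school["NAME"], school["PLZ"])
```

To read the real export, pass a file object instead, for example `open("schools.csv", newline="", encoding="utf-8")`. The UTF-8 encoding is an assumption and may need adjusting.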
Documentation
$ svscrape --help
Usage: svscrape [OPTIONS]
Options:
--csv FILENAME
--baseurl TEXT
--type [neue-mittelschule|ahs-mit-nms|hauptschule|sonderschule]
--query TEXT
--help Show this message and exit.
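Judging from the usage example, the base URL, school type, and query string appear to be combined into a single listing URL. The sketch below illustrates that idea only. The function `build_listing_url` is hypothetical and is not taken from the svscrape source, so the tool's actual behavior may differ.

```python
def build_listing_url(base_url: str, school_type: str, query: str = "") -> str:
    """Join base URL, school type, and optional query string.

    Hypothetical helper for illustration; not svscrape's own code.
    """
    # Avoid doubled or missing slashes between the two path parts.
    url = base_url.rstrip("/") + "/" + school_type.strip("/")
    if query:
        # Accept the query with or without its leading "?".
        url += "?" + query.lstrip("?")
    return url


listing_url = build_listing_url(
    "https://schulverzeichnis.eu/typ/",
    "neue-mittelschule",
    "?bundesland=wien",
)
print(listing_url)
```

Normalizing the slashes and the leading `?` means the same call works whether or not the arguments include those separators.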
Installing the Latest Version
python -m pip install svscrape