A scraper of statistical data from the Siris database of Skolverket, built on top of Statscraper.
This is a scraper for statistical data from Skolverket's SIRIS database (http://siris.skolverket.se/siris), built on top of the Statscraper package <https://github.com/jplusplus/statscraper>.
The scraper is limited to the data available through http://siris.skolverket.se/siris/ris.export_stat.form.
Install
pip install siris_scraper
Example usage
from siris.scraper import SirisScraper
# Init scraper
scraper = SirisScraper()
# List all school types (verksamhetsformer)
verksamhetsformer = scraper.items
# [<Verksamhetsform: 15 (Fritidshem)>, <Verksamhetsform: 10 (Förskola)>, <Verksamhetsform: 14 (Förskoleklass)>,... ]
# Select a school type
verksamhetsform = verksamhetsformer.get_by_label(u"Öppen förskola")
# List all available datasets
datasets = verksamhetsform.items
# [<SirisDataset: 40 (Kostnader per kommun)>...]
# Select a dataset
dataset = datasets.get_by_label("Kostnader per kommun")
# Make a query
res = dataset.fetch() # Get latest available data
#res = dataset.fetch({"period": "2015"}) # Get data for a given period
#res = dataset.fetch({"period": "*"}) # Get data for all periods
# List all available periods
print(dataset.periods)
# Use the result
# ...in Python Pandas for example
dataframe = res.pandas
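When several specific periods are needed, the per-period query shown above can be wrapped in a small helper. The following is only a sketch: `fetch_by_period` is a hypothetical name that is not part of siris_scraper, and it assumes that `dataset.fetch` accepts a `{"period": ...}` query exactly as in the commented-out example, and that the period values are given as strings such as "2015".

```python
def fetch_by_period(dataset, periods):
    """Fetch one result per period and return them keyed by period.

    Hypothetical helper, not part of siris_scraper. `dataset` is assumed
    to be any object with a fetch(query) method that accepts
    {"period": <period>}, as in the usage example above.
    """
    results = {}
    for period in periods:
        # One request per period, using the same query shape as
        # dataset.fetch({"period": "2015"})
        results[period] = dataset.fetch({"period": period})
    return results


# Possible usage with a dataset selected as above (requires network access):
# results = fetch_by_period(dataset, ["2014", "2015"])
# dataframe_2015 = results["2015"].pandas
```

Keeping the results in a dict keyed by period makes it easy to pick out one year later. If all periods are wanted in one go, the `{"period": "*"}` query above is probably the simpler choice.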
Develop
Set up:
pip install -r requirements.txt
Run tests:
make tests
Deploy
To PyPI:
python3 deploy_to_pypi.py
Todo
The scraper does not handle “uttag” at the moment; it fetches the latest one by default.