A scraper of statistical data from the Siris database of Skolverket, built on top of Statscraper.
Project description
This is an unofficial scraper for statistical data from the SIRIS database (https://www.skolverket.se/skolutveckling/statistik/) of Skolverket, the Swedish National Agency for Education. It is built on top of the Statscraper package <https://github.com/jplusplus/statscraper> and lets you interact with the statistics export interface in a Pythonic way.
Install
pip install siris_scraper
Example usage
from siris.scraper import SirisScraper
# Init scraper
scraper = SirisScraper()
# List all school types ("skolformer")
skolformer = scraper.items
# Select a dataset by walking down the labels:
# compulsory school > municipality level > staff > staff statistics
dataset = skolformer.get_by_label(u"Grundskolan")\
    .items.get_by_label("Kommunnivå")\
    .items.get_by_label("Personal")\
    .items.get_by_label("Personalstatistik")
# Make a query
res = dataset.fetch() # Get latest available data
# res = dataset.fetch({"period": "2015"}) # Get data for a given period
# res = dataset.fetch({"period": "*"}) # Get data for all periods
# List all available periods
print(dataset.periods)
# Use the result
# ...in Python Pandas for example
dataframe = res.pandas
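The chained get_by_label calls in the example can get long for deeply nested datasets. As a minimal sketch, the helper below walks the same path from a list of labels. The name select_by_labels is hypothetical and not part of siris_scraper or Statscraper; it only assumes what the example shows, namely that a collection has get_by_label() and that each returned item exposes .items.

```python
def select_by_labels(top_level_items, labels):
    """Walk down the item tree one label at a time.

    Hypothetical helper (not part of siris_scraper). It does the same thing
    as chaining .get_by_label(...).items.get_by_label(...) by hand.
    """
    if not labels:
        raise ValueError("labels must contain at least one label")
    # The first label is looked up in the top-level collection itself.
    node = top_level_items.get_by_label(labels[0])
    # Every following label is looked up among the current node's children.
    for label in labels[1:]:
        node = node.items.get_by_label(label)
    return node


# Usage, assuming the package is installed and SIRIS is reachable:
# from siris.scraper import SirisScraper
# dataset = select_by_labels(
#     SirisScraper().items,
#     ["Grundskolan", "Kommunnivå", "Personal", "Personalstatistik"],
# )
```

Keeping the path as a plain list makes it easy to store several dataset paths in a config file and loop over them.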
Develop
Set up:
pip install -r requirements.txt
Run tests:
make tests
Deploy
To PyPI:
python3 deploy_to_pypi.py
Todo
Handle filtering in export (“Begränsa träfflista”, i.e. “Limit result list”)