Access the Riksdagen corpus

Project description

Swedish parliamentary proceedings - Riksdagens protokoll 1921-2021 v0.3.0

Westac Project, 2020-2021

The full data set consists of multiple parts:

  • Riksdagens protokoll from 1921 until today in the Parla-CLARIN format
  • Comprehensive list of MPs and cabinet members during this period
  • Traceable logs of all curation and segmentation as a git history
  • Documentation of the corpus and the curation process
  • A Google Colab notebook that demonstrates how the dataset can be used with Python

Basic use

The full dataset is available under this download link. It has the following structure:

  • Annual protocol files in the corpus/ folder
  • List of MPs: corpus/members_of_parliament.csv
  • List of ministers: corpus/ministers.csv
  • List of speakers of the house: corpus/talman.csv
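
As a rough illustration of the layout above, the CSV listings can be read with Python's standard library alone. This is a minimal sketch, not the project's own tooling: the helper names load_listing and list_corpus_files are hypothetical, the files are assumed to be UTF-8 encoded with a header row, and no particular column names are assumed.

```python
import csv
from pathlib import Path


def load_listing(path):
    """Read one CSV listing into a list of dicts keyed by its header row.

    Assumption: the file is UTF-8 encoded and its first row holds column names.
    """
    with Path(path).open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def list_corpus_files(corpus_dir):
    """Return the sorted, POSIX-style relative paths of all files below corpus_dir."""
    root = Path(corpus_dir)
    return sorted(p.relative_to(root).as_posix() for p in root.rglob("*") if p.is_file())


if __name__ == "__main__":
    # Paths follow the structure listed above; adjust to where the download was unpacked.
    members_path = Path("corpus/members_of_parliament.csv")
    if members_path.exists():
        members = load_listing(members_path)
        print(len(members), "rows; columns:", list(members[0].keys()) if members else [])
```

Returning plain dicts keeps the sketch independent of the actual column names, which are best inspected directly in the downloaded files.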

The workflow to use the data is demonstrated in this Google Colab notebook.
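
For readers who want to inspect a protocol file without the notebook, the sketch below walks a Parla-CLARIN document with Python's standard XML parser. It rests on assumptions that should be checked against the actual files: that speech is marked up as u elements in the TEI namespace and that the speaker reference, if present, is stored in a who attribute. The function name iter_utterances is hypothetical and not part of pyriksdagen.

```python
import xml.etree.ElementTree as ET

# TEI namespace; assumed to be the default namespace of the protocol files.
TEI_NS = "http://www.tei-c.org/ns/1.0"


def iter_utterances(root):
    """Yield (who, text) pairs for every <u> element below root.

    who is the value of the "who" attribute, or None if it is absent.
    text is the utterance's text content with whitespace normalised.
    """
    for utterance in root.iter(f"{{{TEI_NS}}}u"):
        # Join the text of all child elements, then collapse runs of whitespace.
        text = " ".join(" ".join(utterance.itertext()).split())
        yield utterance.get("who"), text
```

A file on disk could then be processed with iter_utterances(ET.parse(path).getroot()), where path points to one of the annual protocol files in the corpus/ folder.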

Participate in the curation process

The corpora are large and have been curated and segmented automatically, so they may contain errors. If you find any, you can submit corrections. The process is documented in the project wiki.

Download files

Source Distribution

pyriksdagen-0.2.0.tar.gz (18.2 kB)

Built Distribution

pyriksdagen-0.2.0-py3-none-any.whl (20.2 kB, Python 3)