Skip to main content

Welfare State Analytics

Project description

The Welfare State Analytics Text Analysis Repository

This repository contains various Jupyter Notebooks for exploring the curated corpus of Riksdagens Protocol.

About the Project

Welfare State Analytics. Text Mining and Modeling Swedish Politics, Media & Culture, 1945-1989 (WeStAc) is a digital humanities research project with five co-operatings partners: Umeå University, Uppsala University, Aalto University (Finland) and the National Library of Sweden.

The project will digitise literature, curate already digitised collections, and perform research via probabilistic methods and text mining models. WeStAc will both digitise and curate three massive textual datasets—in all, Big Data of almost four billion tokens—from the domains of Swedish politics, news media and literary culture during the second half of the 20th century.

Installation

JupyterHub installation

The westac_hub repository contains a ready-to-use Docker setup (Dockerfile and docker-compose.yml) for a Jupyter Hub using nginx as reverse-proxy. The default setup uses DockerSpawner that spawns containers as specified in westac_lab, and Github for autorization (OAuth2). See the Makefile on how to build the project.

Single Docker container

You can also run the westac_lab container as a single Docker container if you have Docker installed on your computer.

HOWTO Prepare a new version om Riksdagens Protokoll Corpus

Prerequisites

Create a default DTM (document-term-matrix)

The command make default-riksprot-dtm will create a DTM based on the settings found in 'opts/dtm_riksprot.yml`.

Run make

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

humlab_westac-0.5.40.tar.gz (83.1 kB view hashes)

Uploaded Source

Built Distribution

humlab_westac-0.5.40-py3-none-any.whl (158.7 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page