Welfare State Analytics
Project description
The Welfare State Analytics Text Analysis Repository
This repository contains various Jupyter Notebooks for exploring the curated corpus of Riksdagens Protocol.
About the Project
Welfare State Analytics. Text Mining and Modeling Swedish Politics, Media & Culture, 1945-1989 (WeStAc) is a digital humanities research project with five co-operatings partners: Umeå University, Uppsala University, Aalto University (Finland) and the National Library of Sweden.
The project will digitise literature, curate already digitised collections, and perform research via probabilistic methods and text mining models. WeStAc will both digitise and curate three massive textual datasets—in all, Big Data of almost four billion tokens—from the domains of Swedish politics, news media and literary culture during the second half of the 20th century.
Installation
JupyterHub installation
The westac_hub
repository contains a ready-to-use Docker setup (Dockerfile
and docker-compose.yml
) for a Jupyter Hub using nginx
as reverse-proxy. The default setup uses DockerSpawner
that spawns containers as specified in westac_lab
, and Github for autorization (OAuth2). See the Makefile on how to build the project.
Single Docker container
You can also run the westac_lab
container as a single Docker container if you have Docker installed on your computer.
HOWTO Prepare a new version om Riksdagens Protokoll Corpus
Prerequisites
- The Riksdagen Protokoll corpus XML files have been tagged using welfare-state-analytics/pyriksprot_tagger
- The welfare-state-analytics/pyriksprot has been used to process corpus metadata and speech corpus. (
make full
) - Important! The configuration .env files has been updated with target corpus version.
Create a default DTM (document-term-matrix)
The command make default-riksprot-dtm
will create a DTM based on the settings found in 'opts/dtm_riksprot.yml`.
Run make
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file humlab_westac-0.5.40.tar.gz
.
File metadata
- Download URL: humlab_westac-0.5.40.tar.gz
- Upload date:
- Size: 83.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.11.3 Linux/5.4.0-172-generic
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | b2cb970a5297b22543e189f1b1e653c34308bf2c0f4b342aa17015d0fc7479d1 |
|
MD5 | bc95fdb59564d05ffd8d130ce622fb54 |
|
BLAKE2b-256 | 5256d34ad5a2fe26f4f4a249cb381534154a088aadd6e6fd8179ba00ec3f02a4 |
File details
Details for the file humlab_westac-0.5.40-py3-none-any.whl
.
File metadata
- Download URL: humlab_westac-0.5.40-py3-none-any.whl
- Upload date:
- Size: 158.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.2 CPython/3.11.3 Linux/5.4.0-172-generic
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8be36963b4479cb47aca21b04b71c5154949a39201a0bee886b604380766f7bd |
|
MD5 | edbd497028a391cd11e046ff615506c3 |
|
BLAKE2b-256 | df1ac83e95ba205131d5735a4702dd16ba3172f8cf0118157feb962cfba18a40 |