Python qualitative analysis toolkit with utilities and simplified wrappers for common algorithms
Project description
DACT Qualitative Analysis Toolkit (qualkit)
This project is a collection of utilities for conducting qualitative analysis.
It currently consists of the following modules:
- clean: a utility for cleaning up text prior to use with other tools
- sentiment: a wrapper around SciKit's SentimentIntensityAnalyzer
- anchored_topic_model: creates topic models using the Corex algorithm (Gallagher et. al., 2017) with user-supplied anchors to 'steer' the model using domain knowledge
- stopwords: a standard set of stopwords
- topics: a wrapper around SciKit's LatentDirichletAllocation
- keywords: a wrapper around NLTK's RAKE (Rapid Keyword Extraction) algorithm for finding keywords in text.
For more details on each module, see the 'docs' folder.
Installing the toolkit and its requirements
Install using:
pip install qualkit
Or add 'qualkit' to your requirements.txt file, or add as a dependency in project properties in PyCharm.
User Control
A user has control over the following aspects when using this toolkit which will influence outputs.
- Anchoring strategies
- Anchor Strength
- Number of topics
- Labelling True/False for each topic instead of dichotomising
- How data is preprocessed before topic modelling, redaction, tfidr vectoriser etec
References
Gallagher, R. J., Reing, K., Kale, D., and Ver Steeg, G. "Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge." Transactions of the Association for Computational Linguistics (TACL), 2017.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file qualkit-0.1.5.tar.gz
.
File metadata
- Download URL: qualkit-0.1.5.tar.gz
- Upload date:
- Size: 14.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 531aee7c2a663a8970b557b984503ab78a488c8ffd54090196d106af4197c54c |
|
MD5 | 3b9535e687515a0d7e526ab17aa901de |
|
BLAKE2b-256 | 7d1525445e7365c405c542e6d00be42844336f391d6ae60e1e1795bab3a96c5b |
File details
Details for the file qualkit-0.1.5-py2.py3-none-any.whl
.
File metadata
- Download URL: qualkit-0.1.5-py2.py3-none-any.whl
- Upload date:
- Size: 13.3 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2f39a7f95c5e4f784bf92294a0a2fa34e208cbc7cfd24289fd2e1d7a87fa8128 |
|
MD5 | af991134632da947f132b8676653bf5e |
|
BLAKE2b-256 | 44dd3cda26053b8afe1a3f3a3f144663d10e440c6f025827f5c14f3ff7019de3 |