Skip to main content

Python qualitative analysis toolkit with utilities and simplified wrappers for common algorithms

Project description

DACT Qualitative Analysis Toolkit (qualkit)

This project is a collection of utilities for conducting qualitative analysis.

It currently consists of the following modules:

  • clean: a utility for cleaning up text prior to use with other tools
  • sentiment: a wrapper around SciKit's SentimentIntensityAnalyzer
  • anchored_topic_model: creates topic models using the Corex algorithm (Gallagher et. al., 2017) with user-supplied anchors to 'steer' the model using domain knowledge
  • stopwords: a standard set of stopwords
  • topics: a wrapper around SciKit's LatentDirichletAllocation
  • keywords: a wrapper around NLTK's RAKE (Rapid Keyword Extraction) algorithm for finding keywords in text.

For more details on each module, see the 'docs' folder.

Installing the toolkit and its requirements

Install using:

pip install qualkit

Or add 'qualkit' to your requirements.txt file, or add as a dependency in project properties in PyCharm.

User Control

A user has control over the following aspects when using this toolkit which will influence outputs.

  • Anchoring strategies
  • Anchor Strength
  • Number of topics
  • Labelling True/False for each topic instead of dichotomising
  • How data is preprocessed before topic modelling, redaction, tfidr vectoriser etec

References

Gallagher, R. J., Reing, K., Kale, D., and Ver Steeg, G. "Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge." Transactions of the Association for Computational Linguistics (TACL), 2017.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

qualkit-0.1.5.tar.gz (14.2 kB view details)

Uploaded Source

Built Distribution

qualkit-0.1.5-py2.py3-none-any.whl (13.3 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file qualkit-0.1.5.tar.gz.

File metadata

  • Download URL: qualkit-0.1.5.tar.gz
  • Upload date:
  • Size: 14.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for qualkit-0.1.5.tar.gz
Algorithm Hash digest
SHA256 531aee7c2a663a8970b557b984503ab78a488c8ffd54090196d106af4197c54c
MD5 3b9535e687515a0d7e526ab17aa901de
BLAKE2b-256 7d1525445e7365c405c542e6d00be42844336f391d6ae60e1e1795bab3a96c5b

See more details on using hashes here.

File details

Details for the file qualkit-0.1.5-py2.py3-none-any.whl.

File metadata

  • Download URL: qualkit-0.1.5-py2.py3-none-any.whl
  • Upload date:
  • Size: 13.3 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.1 CPython/3.9.13

File hashes

Hashes for qualkit-0.1.5-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 2f39a7f95c5e4f784bf92294a0a2fa34e208cbc7cfd24289fd2e1d7a87fa8128
MD5 af991134632da947f132b8676653bf5e
BLAKE2b-256 44dd3cda26053b8afe1a3f3a3f144663d10e440c6f025827f5c14f3ff7019de3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page