Skip to main content

Python qualitative analysis toolkit with utilities and simplified wrappers for common algorithms

Project description

DACT Qualitative Analyis Toolkit (qualkit)

This project is a collection of utilities for conducting qualitative analysis.

It currently consists of the following modules:

  • clean: a utility for cleaning up text prior to use with other tools
  • sentiment: a wrapper around SciKit's SentimentIntensityAnalyzer
  • anchored_topic_model: creates topic models using the Corex algorithm (Gallagher et. al., 2017) with user-supplied anchors to 'steer' the model using domain knowledge
  • stopwords: a standard set of stopwords
  • topics: a wrapper around SciKit's LatentDirichletAllocation
  • keywords: a wrapper around NLTK's RAKE (Rapid Keyword Extraction) algorithm for finding keywords in text.

For more details on each module, see the 'docs' folder.

Installing the toolkit and its requirements

Install using:

pip install qualkit

Or add 'dactqal' to your requirements.txt file, or add as a dependency in project properties in PyCharm.

References

Gallagher, R. J., Reing, K., Kale, D., and Ver Steeg, G. "Anchored Correlation Explanation: Topic Modeling with Minimal Domain Knowledge." Transactions of the Association for Computational Linguistics (TACL), 2017.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

qualkit-0.0.2.tar.gz (2.9 kB view details)

Uploaded Source

File details

Details for the file qualkit-0.0.2.tar.gz.

File metadata

  • Download URL: qualkit-0.0.2.tar.gz
  • Upload date:
  • Size: 2.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.2 importlib_metadata/4.6.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.61.2 CPython/3.6.5

File hashes

Hashes for qualkit-0.0.2.tar.gz
Algorithm Hash digest
SHA256 0f211e20b4583f816481ba66b64b9173da13b2af2e85a33516442c4753231172
MD5 6f3fc025175b0c22bce6fffd54bc5a0f
BLAKE2b-256 c85ff845b3356ac1170ddc5270bbd256546f4429afd6430d1f8f46eecd2d71d7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page