Skip to main content

Topic modeling with text networks

Project description

Topic Network - topic modeling with text networks

This package builds a text network out of a list of strings, picks the most important words on the basis of betweenness centrality measure, and performs community detection to group the words into topic networks and return the groups. The methods used are language-agnostic, meaning that you can perform the topic modeling on any text in any language. The package uses NetworkX to build the network and perform the necessary calculations.

Usage

To install the package, use::

pip install topicnetwork

To find the topics, simply use::

import topicnetwork
topics = topicnetwork.find_topics(list_of_strings)

For best results, use a text without punctuation and stopwords, and words converted to lowercase. You can perform the cleaning on your English texts with:

text = topicnetwork.clean(list_of_strings)

Package written and maintained by Michal Pikusa (pikusa.michal@gmail.com)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

topicnetwork-0.1.3.tar.gz (2.4 kB view hashes)

Uploaded Source

Built Distribution

topicnetwork-0.1.3-py3-none-any.whl (2.6 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page