Skip to main content

easily bult LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/TXT

Project description

PyPI version

easyLDA is a library that easily build LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/TXT)

github: https://github.com/shichaoji/easyLDA

  • If you have a collection of documents, and what to explore the relationship & topics of the docs, easyLDA is a very handy library to use. Simply run the commend and you’ll get a trained LDA model with results visualized

The library pipeline text preprocessing, such as tf-idf, n-grams from Gensim library

Credit to:

https://radimrehurek.com/gensim/

http://pyldavis.readthedocs.io/en/latest/readme.html

installation

$ pip install easyLDA

usage example

simple need a text file (.csv) with each row represents a document (a post, comment, short article etc.), with only one column which is the text

text file (csv) sample view

Demo 1

easy to use, just in a shell window, type: easyLDA, then specify the location of the text document

1. then choose how many topics you want the model to fit

2. choose the topic contains only single word (1) or can be phases (2/3) as well

the program will be starting to train

  • in shell $ easyLDA

Demo 2

model result

models folder created by program contains the trained model

xx.html file is the interactive visulization of the model result

Demo 3

visualization live example

http://shichaoji.com/2016/02/04/easylda-live-example/

static pic result

Demo 4

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

easyLDA-0.2.8.6.tar.gz (6.5 kB view hashes)

Uploaded Source

Built Distributions

easyLDA-0.2.8.6-py2.7.egg (13.6 kB view hashes)

Uploaded Source

easyLDA-0.2.8.6-py2-none-any.whl (8.9 kB view hashes)

Uploaded Python 2

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page