
Easily build LDA topic models from just a list of documents (e.g. a list of Twitter posts in CSV/TXT)

Project description


easyLDA is a library that easily builds LDA topic models from just a list of documents (e.g. a list of Twitter posts in CSV/TXT).

github: https://github.com/shichaoji/easyLDA

  • If you have a collection of documents and want to explore the relationships and topics among them, easyLDA is a very handy library to use. Simply run the command and you’ll get a trained LDA model with the results visualized.

The library pipelines text preprocessing steps, such as tf-idf and n-grams, from the Gensim library.
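To make the two preprocessing steps named above concrete, here is a minimal plain-Python sketch of what n-gram extraction and tf-idf weighting compute. It is an illustration only, not easyLDA's or Gensim's code: the function names `ngrams` and `tfidf` and the sample documents are made up for this example, every contiguous n-gram is kept (Gensim's phrase detection is more selective), and the weights are not normalized per document.

```python
import math
from collections import Counter


def ngrams(tokens, n):
    """Return the contiguous n-grams of a token list, joined with underscores."""
    return ["_".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def tfidf(docs):
    """Compute tf-idf weights for a list of tokenized documents.

    Uses raw term counts and idf = log2(N / df). No per-document
    normalization is applied (an assumption made to keep the sketch short).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        tf = Counter(doc)
        weights.append({t: tf[t] * math.log2(n_docs / df[t]) for t in tf})
    return weights


# Hypothetical sample documents, already lowercased and tokenized.
docs = [
    "topic models find themes".split(),
    "topic models need many documents".split(),
    "cats sleep all day".split(),
]
# Add bigrams to each document's unigrams, then weight everything.
features = [doc + ngrams(doc, 2) for doc in docs]
weights = tfidf(features)
```

In this sample, a term shared by two of the three documents (such as topic_models) gets a lower weight than a term that appears in only one document (such as cats).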

Credit to:

https://radimrehurek.com/gensim/

http://pyldavis.readthedocs.io/en/latest/readme.html

installation

$ pip install easyLDA

usage example

All you need is a text file (.csv) in which each row represents one document (a post, a comment, a short article, etc.) and the only column is the text.
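If your documents live in a Python list rather than a file, a one-column CSV like the one described above could be produced roughly as follows. This is a sketch under assumptions: the file name docs.csv, the helper `write_docs_csv`, and the sample posts are hypothetical, no header row is written, and whether easyLDA handles quoted fields the same way as Python's csv module is not confirmed here.

```python
import csv

# Hypothetical sample documents; replace with your own posts or comments.
posts = [
    "Just landed in Tokyo, the food here is amazing",
    "New phone, loving the camera, battery not so much",
    "Reading about topic models\nover the weekend",
]


def write_docs_csv(path, docs):
    """Write one document per row in a single text column."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        for text in docs:
            # Collapse internal newlines and repeated spaces so that
            # each document stays on exactly one row.
            writer.writerow([" ".join(text.split())])


write_docs_csv("docs.csv", posts)
```

The csv module quotes any row whose text contains a comma, so such a document still reads back as a single column.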

text file (csv) sample view

Demo 1

Easy to use: in a shell window, type easyLDA, then specify the location of the text document.

1. Then choose how many topics you want the model to fit.

2. Choose whether a topic contains only single words (1) or can contain phrases (2/3) as well.

The program then starts training.

  • in shell $ easyLDA

Demo 2

model result

The models folder created by the program contains the trained model.

The xx.html file is the interactive visualization of the model result.
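After a run, the visualization can be opened by hand, or located with a few lines of Python. The sketch below is an assumption-laden convenience, not part of easyLDA: it searches a directory tree for any .html file, because where exactly the program writes the file is not stated here, and the helper names `find_html_reports` and `open_latest_report` are made up for this example.

```python
import webbrowser
from pathlib import Path


def find_html_reports(root="."):
    """Return all .html files under root, newest first."""
    files = Path(root).rglob("*.html")
    return sorted(files, key=lambda p: p.stat().st_mtime, reverse=True)


def open_latest_report(root="."):
    """Open the most recently written .html file in the default browser."""
    reports = find_html_reports(root)
    if not reports:
        raise FileNotFoundError(f"no .html files found under {root!r}")
    webbrowser.open(reports[0].resolve().as_uri())
    return reports[0]
```

Calling open_latest_report() from the directory where easyLDA was run should bring up the newest visualization, provided no other .html file there is more recent.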

Demo 3

visualization live example

http://shichaoji.com/2016/02/04/easylda-live-example/

static image of the result

Demo 4

Download files


Source Distribution

easyLDA-0.2.8.6.tar.gz (6.5 kB)

Uploaded Source

Built Distributions

easyLDA-0.2.8.6-py2.7.egg (13.6 kB)

Uploaded Source

easyLDA-0.2.8.6-py2-none-any.whl (8.9 kB)

Uploaded Python 2
