easily bult LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/TXT
Project description
easyLDA
-------
|PyPI version|
easyLDA is a library that easily build LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/TXT)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
github: https://github.com/shichaoji/easyLDA
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- If you have a collection of documents, and what to explore the
relationship & topics of the docs, easyLDA is a very handy library to
use. Simply run the commend and you'll get a trained LDA model with
results visualized
The library pipeline text preprocessing, such as tf-idf, n-grams from Gensim library
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Credit to:
https://radimrehurek.com/gensim/
http://pyldavis.readthedocs.io/en/latest/readme.html
.. |PyPI version| image:: https://badge.fury.io/py/easyLDA.svg
:target: https://badge.fury.io/py/easyLDA
installation
~~~~~~~~~~~~
``$ pip install easyLDA``
usage example
~~~~~~~~~~~~~
simple need a text file (.csv/.txt) with each row represents a document (a post, comment, short article etc.), with only one column which is the text
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
text file (csv) sample view
^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. image:: https://user-images.githubusercontent.com/20619704/35779561-dba715a0-099c-11e8-8519-09d6164e63ae.jpg
:width: 60%
:alt: Demo 1
:align: left
easy to use, just in a shell window, type: easyLDA, then specify the location of the text document
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1. then choose how many topics you want the model to fit
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2. choose the topic contains only single word (1) or can be phases (2/3) as well
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
the program will be starting to train
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- in shell $ easyLDA
.. image:: https://user-images.githubusercontent.com/20619704/35779521-49237200-099c-11e8-8cb2-ed916040a526.jpg
:width: 70%
:alt: Demo 2
:align: left
model result
~~~~~~~~~~~~
models folder created by program contains the trained model
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xx.html file is the interactive visulization of the model result
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. image:: https://user-images.githubusercontent.com/20619704/35779593-cfe800c0-099d-11e8-8db5-d3431f155496.jpg
:width: 60%
:alt: Demo 3
:align: left
visualization live example
~~~~~~~~~~~~~~~~~~~~~~~~~~
http://shichaoji.com/2016/02/04/easylda-live-example/
-------
|PyPI version|
easyLDA is a library that easily build LDA Topic Models with just a list of docs (e.g. a list of twitter posts in CSV/TXT)
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
github: https://github.com/shichaoji/easyLDA
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- If you have a collection of documents, and what to explore the
relationship & topics of the docs, easyLDA is a very handy library to
use. Simply run the commend and you'll get a trained LDA model with
results visualized
The library pipeline text preprocessing, such as tf-idf, n-grams from Gensim library
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Credit to:
https://radimrehurek.com/gensim/
http://pyldavis.readthedocs.io/en/latest/readme.html
.. |PyPI version| image:: https://badge.fury.io/py/easyLDA.svg
:target: https://badge.fury.io/py/easyLDA
installation
~~~~~~~~~~~~
``$ pip install easyLDA``
usage example
~~~~~~~~~~~~~
simple need a text file (.csv/.txt) with each row represents a document (a post, comment, short article etc.), with only one column which is the text
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
text file (csv) sample view
^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. image:: https://user-images.githubusercontent.com/20619704/35779561-dba715a0-099c-11e8-8519-09d6164e63ae.jpg
:width: 60%
:alt: Demo 1
:align: left
easy to use, just in a shell window, type: easyLDA, then specify the location of the text document
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
1. then choose how many topics you want the model to fit
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
2. choose the topic contains only single word (1) or can be phases (2/3) as well
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
the program will be starting to train
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- in shell $ easyLDA
.. image:: https://user-images.githubusercontent.com/20619704/35779521-49237200-099c-11e8-8cb2-ed916040a526.jpg
:width: 70%
:alt: Demo 2
:align: left
model result
~~~~~~~~~~~~
models folder created by program contains the trained model
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
xx.html file is the interactive visulization of the model result
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
.. image:: https://user-images.githubusercontent.com/20619704/35779593-cfe800c0-099d-11e8-8db5-d3431f155496.jpg
:width: 60%
:alt: Demo 3
:align: left
visualization live example
~~~~~~~~~~~~~~~~~~~~~~~~~~
http://shichaoji.com/2016/02/04/easylda-live-example/
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
easyLDA-0.2.7.7.tar.gz
(6.4 kB
view hashes)
Built Distributions
easyLDA-0.2.7.7-py2.7.egg
(13.4 kB
view hashes)
Close
Hashes for easyLDA-0.2.7.7-py2-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 302ef17e6cae77cea82bca878217fbc051325208090db37badca8b4563b0eea7 |
|
MD5 | 67a63c39d104ac2146866730dba22027 |
|
BLAKE2b-256 | 258b120fa88b5ca1764980c07aa24a8e897bf94aac90b4c23ea0fed515caa3fd |