Interactive topic model visualization. Port of the R package.

Project description

Python library for interactive topic model visualization. This is a port of the fabulous LDAvis R package by Carson Sievert and Kenny Shirley.

pyLDAvis is designed to help users interpret the topics in a topic model that has been fit to a corpus of text data. The package extracts information from a fitted LDA topic model to inform an interactive web-based visualization.
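As a rough illustration of the information involved, the sketch below assembles the five inputs that the general `pyLDAvis.prepare` function takes: topic-term distributions, document-topic distributions, document lengths, the vocabulary, and corpus-wide term frequencies. The toy corpus and counts are invented, and `normalize` and `build_prepare_inputs` are hypothetical helpers, not part of pyLDAvis.

```python
from collections import Counter


def normalize(row):
    """Scale a row of non-negative counts so that it sums to 1."""
    total = float(sum(row))
    return [value / total for value in row]


def build_prepare_inputs(tokenized_docs, topic_term_counts, doc_topic_counts):
    """Hypothetical helper: derive the five inputs of pyLDAvis.prepare.

    topic_term_counts has one row per topic and one column per vocabulary
    term (in sorted order); doc_topic_counts has one row per document and
    one column per topic.
    """
    counts = Counter(tok for doc in tokenized_docs for tok in doc)
    vocab = sorted(counts)
    return {
        "topic_term_dists": [normalize(r) for r in topic_term_counts],
        "doc_topic_dists": [normalize(r) for r in doc_topic_counts],
        "doc_lengths": [len(doc) for doc in tokenized_docs],
        "vocab": vocab,
        "term_frequency": [counts[term] for term in vocab],
    }


# Toy data, invented for illustration only.
docs = [
    ["cat", "dog", "cat"],
    ["stock", "bond", "stock", "market"],
    ["dog", "market"],
]
# Columns follow the sorted vocabulary: bond, cat, dog, market, stock.
topic_term_counts = [
    [1, 8, 6, 1, 1],
    [6, 1, 1, 4, 8],
    [2, 2, 3, 6, 2],
]
doc_topic_counts = [
    [8, 1, 1],
    [1, 8, 1],
    [3, 2, 5],
]

inputs = build_prepare_inputs(docs, topic_term_counts, doc_topic_counts)
print(inputs["vocab"])
print(inputs["term_frequency"])
```

The dictionary keys match the parameter names of `pyLDAvis.prepare`, so with the library installed the inputs could be passed as `pyLDAvis.prepare(**inputs)`. A corpus this small is only meant to show the expected shapes.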

The visualization is intended to be used within an IPython notebook but can also be saved to a stand-alone HTML file for easy sharing.
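The two output routes can be sketched as follows. `pyLDAvis.display` and `pyLDAvis.save_html` are library functions; the wrappers `show_in_notebook` and `save_standalone` and the default file name are hypothetical, and `prepared` is assumed to be the object returned by one of the prepare functions.

```python
def show_in_notebook(prepared):
    """Return an object that an IPython notebook renders inline."""
    import pyLDAvis  # imported lazily; requires pyLDAvis to be installed

    return pyLDAvis.display(prepared)


def save_standalone(prepared, path="lda_vis.html"):
    """Write the visualization to a stand-alone HTML file and return its path."""
    import pyLDAvis  # imported lazily; requires pyLDAvis to be installed

    pyLDAvis.save_html(prepared, path)
    return path
```

In a notebook, calling `pyLDAvis.enable_notebook()` once should make prepared data render automatically when it is the last expression in a cell.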

Installation

  • Stable version using pip:

pip install pyldavis

  • Development version on GitHub:

Clone the repository and run python setup.py install

Usage

The best way to learn how to use pyLDAvis is to see it in action. Check out this notebook for an overview. Refer to the documentation for details.

For a concise explanation of the visualization see this vignette from the LDAvis R package.
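For a model fitted with gensim, the call might look like the sketch below. It assumes an already fitted `LdaModel`, its bag-of-words corpus, and its `Dictionary`; the wrapper name `prepare_from_gensim` is hypothetical. The helper module is `pyLDAvis.gensim` in releases of this era and `pyLDAvis.gensim_models` in later ones, so the sketch tries both.

```python
def prepare_from_gensim(lda_model, corpus, dictionary, **kwargs):
    """Build visualization data from a fitted gensim LDA model.

    Extra keyword arguments are forwarded to the helper's prepare function.
    """
    try:
        import pyLDAvis.gensim_models as gensim_vis  # newer releases
    except ImportError:
        import pyLDAvis.gensim as gensim_vis  # older releases such as 1.3.x

    return gensim_vis.prepare(lda_model, corpus, dictionary, **kwargs)
```

A typical call would be `prepare_from_gensim(lda, corpus, dictionary)`, and the result can then be displayed in a notebook or saved to HTML.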

Video demos

Ben Mabey walked through the visualization in this short talk using a Hacker News corpus:

Carson Sievert created a video demoing the R package. The visualization is the same and so it applies equally to pyLDAvis:

More documentation

To read about the methodology behind pyLDAvis, see the original paper, which was presented at the 2014 ACL Workshop on Interactive Language Learning, Visualization, and Interfaces in Baltimore on June 27, 2014.

History

1.3.2 (2015-11-09)

  • Gensim prepare 25% speed increase, thanks @mattilyra!

  • Pandas deprecation warnings are now gone.

  • Pandas v0.17 is now being used.

1.3.1 (2015-11-02)

  • Updates gensim and other logic to be Python 3 compatible.

1.3.0 (2015-08-20)

  • Fixes gensim logic and makes it more robust.

  • Faster GraphLab processing.

  • kwargs for gensim and GraphLab are passed down to the underlying prepare function.

  • Requires recent version of pandas to avoid problems with our use of the newer DataFrame.to_dict API.

1.2.0 (2015-06-13)

  • Updates gensim logic to be clearer and work with Python 3.x.

1.1.0 (2015-06-02)

  • Fixes bug with GraphLab function that was producing bogus visualizations.

1.0.0 (2015-05-29)

  • First release on PyPI. Faithful port of R version with IPython support and helper functions for GraphLab & gensim.

