TopicBlob is a package to perform quick and easy topic modeling on text.
Project description
TopicBlob: Simplified Topic Modeling
TopicBlob
is a Python 3 library for processing textual data. It provides a simple API for diving into common natural language processing (NLP) taks around topic modeling such as finding similar documents and provide a list of topics givne input text.
Here is a live demo of TopicBlob working on wikipedia pages
https://share.streamlit.io/banjtheman/topicblob/main/topicblob_st.py
TopicBlob leverages NLTK
, pandas
, and gensim
, for the heavy lifting
Features
- Topic Extraction
- Similarity Search
- BM25 search ( word ranking search)
- Topic Search
Get it now
#TODO push to pip
$ git clone https://github.com/banjtheman/TopicBlob/
$ pip install --editable .
Requirements
- Python >= 3.5
Docker Setup
- Ensure you have (docker)[https://www.docker.com/] installed locally.
- Build local Docker Image
docker build -t topicblob:local .
- Run App (Simply runs the
example.py
module)docker run topicblob:local
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
topicblob-0.0.2.tar.gz
(6.8 kB
view hashes)
Built Distribution
topicblob-0.0.2-py3-none-any.whl
(10.9 kB
view hashes)
Close
Hashes for topicblob-0.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 780f83e54e1dc18b0262f1551ad295674257ea8b8b881ecfbd37f129f0635780 |
|
MD5 | f656cf6f9761d4d26f845266194cbbc9 |
|
BLAKE2b-256 | 059de7aba10f979da83296e27e6adc7548deb773c9bf546b37eff90f46adc352 |