A new transformer-based topic modeling library.
Project description
LeetTopic builds upon Top2Vec, BerTopic and other transformer-based topic modeling Python libraries. Unlike BerTopic and Top2Vec, LeetTopic allows users to control the degree to which outliers are resolved into neighboring topics.
It also lets you turn any DataFrame into a Bokeh application for exploring your documents and topics.
Installation
pip install leet-topic
Usage
import pandas as pd
from leet_topic import leet_topic
df = pd.read_json("data/vol7.json")
new_df, topic_data = leet_topic.LeetTopic(df,
document_field="descriptions",
html_filename="demo.html",
extra_fields=["names", "hdbscan_labels"],
max_distance=.5)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
leet_topic-0.0.4.tar.gz
(5.7 kB
view hashes)
Built Distribution
Close
Hashes for leet_topic-0.0.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 40538c5045cbe29c2b52a9ba49c11e2fc78f4b2885446a3742f0c288b4ac12c3 |
|
MD5 | 3d83ac8eddd109e0d3b91485c9812b7c |
|
BLAKE2b-256 | a561aed4ce4a3cdfdb8583c00a401b64ed2f9c1288ce74f59d53f673d46942af |