A utility for labeling clusters of text data.
Project description
CLabel
CLabel is a terminal-based cluster labeling tool that allows you to explore text data interactively and label clusters based on reviewing that data.
Install & Quickstart
pip install clabel
Type clabel
to run. Everything should happen in the terminal from there.
Currently clabel
can only import CSV files. It expects two columns to be in your csv: a column of text (string
) and a column of cluster labels (int
). You'll identify these the first time you import a dataset.
The workflow is:
- Pick a cluster to view examples. You'll view this through a pager so you can page through examples.
- Come up with a name for that cluster (
Declare Name
) - Repeat 1 & 2 until all your clusters have names.
You can persist any cluster labels to a json
file when you exit, so you don't have to complete labeling in one session. Then, you can load those labels in the next time you start clabel
by selecting that json
file and continue labeling.
Screenshots
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file clabel-0.1.2.tar.gz
.
File metadata
- Download URL: clabel-0.1.2.tar.gz
- Upload date:
- Size: 6.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.8 CPython/3.8.11 Darwin/19.6.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a0ab347fc9c1f2297064b6cddae846f760a55f650e83ceb4affa53c8f66bc045 |
|
MD5 | 771f05271496b18c17fbf05b1566d9e6 |
|
BLAKE2b-256 | fa52dc2aa0649924ba86be544a9073d1073c476e39df70e1ce816f9273799b39 |
File details
Details for the file clabel-0.1.2-py3-none-any.whl
.
File metadata
- Download URL: clabel-0.1.2-py3-none-any.whl
- Upload date:
- Size: 7.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.8 CPython/3.8.11 Darwin/19.6.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0046fb86ca86349d96a3420563f61ce2659044de8afcb4d03473ca92dccde65b |
|
MD5 | c0fd621240c37dba8911f71c2a51ff7d |
|
BLAKE2b-256 | 09df0832d68d356faf2ff93ce905dff18e2d4c4be55393f27c7ae1c57bc01268 |