Corpus-Show makes it easier and faster to visualize corpus through sentence embedding of corpus.
Project description
Corpus-Show
Corpus-Show helps to understand the corpus data distribution through various values generated from Sentence Transformer.
Installation
$ pip install corpusshow
Tutorial
- Main-tutorials: https://github.com/DSDanielPark/corpus-show/blob/main/tutorials/corpusshow_tutorial.ipynb
- Sub-tutorial-folder:
Main Feature
Use Case
[1] Korean-news-topic-classification-using-KO-BERT: all plots were created through Corpus-Show and Quick-Show.
References
[1] Scikit-Learn https://scikit-learn.org
[2] Matplotlib https://matplotlib.org/
[3] Huggingface Sentence Transformer https://huggingface.co/sentence-transformers
Contacts
Project Owner(P.O): Daniel Park, South Korea
e-mail parkminwoo1991@gmail.com
Maintainers: Daniel Park, South Korea
e-mail parkminwoo1991@gmail.com
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for corpusshow-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d912323c063ad8f7620d1741df3de1278ab04e3e5c77ce43fd2c8858c6a21de4 |
|
MD5 | 4e082bf220f4c053c2c5636944e01e1f |
|
BLAKE2b-256 | eb5478e9b658eb11781b814f951d20685d771d23a7fba4543a4758b0b9070b7a |