No project description provided
Project description
canica
canica is an interactive tool to visualize embeddings. Its main current goal is to explore text datasets, representing the input embeddings in a 2D tSNE plot.
How to install
Just
pip install canica
And start using the CanicaTSNE
and CanicaUMAP
class in your notebooks.
How to use
canica is designed to work mainly as a data exploration tool embedded as a widget inside of a jupyter notebook. These are the instructions to explore a dataset in a notebook (the tutorial notebook provides more information).
In a notebook, load a pandas DataFrame and make sure that at least one column contains the embeddings you want to plot. In a cell, run:
from canica.widget import CanicaTSNE
CanicaTSNE(df, embedding_col="embedding_col", text_col="text_col", hue_col="some_score")
Where df
is the pandas DataFrame, "embedding_col"
is a column in df
containing embeddings and hue_var
is another column that will be represented using colours (currently it has to be a numerical column with values between 0 and 1).
You can also use CanicaUMAP
instead of CanicaTSNE
to use UMAP.
This will show the canica embedding explorer and will enable interactive exploration of your dataset. Have a look at the tutorial notebook to see it working.
How to contribute
We welcome contributions of all kinds. For more information on how to do it, we refer you to the CONTRIBUTING.md file.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file canica-0.0.19.tar.gz
.
File metadata
- Download URL: canica-0.0.19.tar.gz
- Upload date:
- Size: 2.0 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.25.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0e7a1a669b86d444252a20753c5ed46491344c6b9676a555e63324c9a1536152 |
|
MD5 | d16fafdccc9cd549691132493fce018d |
|
BLAKE2b-256 | 7fb21ab75ca75e03c451d9e309b15d66d7989717faefd4c89a4ec4c4624982b6 |
File details
Details for the file canica-0.0.19-py3-none-any.whl
.
File metadata
- Download URL: canica-0.0.19-py3-none-any.whl
- Upload date:
- Size: 308.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.25.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ff9a0455cf5f5af5a400889391255733e4feb00dfb7a0dd72bce75f8a0794759 |
|
MD5 | b52429ef41cbb0728f9124e9c7f0563a |
|
BLAKE2b-256 | 8409bfd3b98287106ab13a366bc218b309b9032308fefb422bfd541d9138d75c |