Skip to main content

No project description provided

Project description

canica

canica is an interactive tool to visualize embeddings. Its main current goal is to explore text datasets, representing the input embeddings in a 2D tSNE plot.

canica gif

How to install

Just

pip install canica

And start using the CanicaTSNE and CanicaUMAP class in your notebooks.

How to use

canica is designed to work mainly as a data exploration tool embedded as a widget inside of a jupyter notebook. These are the instructions to explore a dataset in a notebook (the tutorial notebook provides more information).

In a notebook, load a pandas DataFrame and make sure that at least one column contains the embeddings you want to plot. In a cell, run:

from canica.widget import CanicaTSNE
CanicaTSNE(df, embedding_col="embedding_col", text_col="text_col", hue_col="some_score")

Where df is the pandas DataFrame, "embedding_col" is a column in df containing embeddings and hue_var is another column that will be represented using colours (currently it has to be a numerical column with values between 0 and 1). You can also use CanicaUMAP instead of CanicaTSNE to use UMAP.

This will show the canica embedding explorer and will enable interactive exploration of your dataset. Have a look at the tutorial notebook to see it working.

How to contribute

We welcome contributions of all kinds. For more information on how to do it, we refer you to the CONTRIBUTING.md file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

canica-0.0.19.tar.gz (2.0 MB view details)

Uploaded Source

Built Distribution

canica-0.0.19-py3-none-any.whl (308.8 kB view details)

Uploaded Python 3

File details

Details for the file canica-0.0.19.tar.gz.

File metadata

  • Download URL: canica-0.0.19.tar.gz
  • Upload date:
  • Size: 2.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.25.0

File hashes

Hashes for canica-0.0.19.tar.gz
Algorithm Hash digest
SHA256 0e7a1a669b86d444252a20753c5ed46491344c6b9676a555e63324c9a1536152
MD5 d16fafdccc9cd549691132493fce018d
BLAKE2b-256 7fb21ab75ca75e03c451d9e309b15d66d7989717faefd4c89a4ec4c4624982b6

See more details on using hashes here.

File details

Details for the file canica-0.0.19-py3-none-any.whl.

File metadata

  • Download URL: canica-0.0.19-py3-none-any.whl
  • Upload date:
  • Size: 308.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-httpx/0.25.0

File hashes

Hashes for canica-0.0.19-py3-none-any.whl
Algorithm Hash digest
SHA256 ff9a0455cf5f5af5a400889391255733e4feb00dfb7a0dd72bce75f8a0794759
MD5 b52429ef41cbb0728f9124e9c7f0563a
BLAKE2b-256 8409bfd3b98287106ab13a366bc218b309b9032308fefb422bfd541d9138d75c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page