Skip to main content

A browser-based embedding visualization and analysis tool

Project description

Embeddoor

A browser-based tool for embedding visualization and analysis.

Features

  • Dual-panel interface: 2D/3D plots on the left, custom visualizations (tables, images, word clouds) on the right
  • Interactive data exploration: Load CSV files, visualize tabular data, and plot 2-3 numerical columns
  • Advanced plot controls: Configure hue, size, and shape based on data columns
  • Lasso selection: Select data points interactively and store selections in the dataframe
  • Correlation analysis: Visualize pairwise correlations with Pearson, Spearman, or Kendall methods
  • Heatmap visualizations: View data as heatmaps from embeddings or numeric columns
  • Modular embedding framework: Create embeddings using HuggingFace, OpenAI, Gemini, and custom models
  • Dimensionality reduction: Apply PCA, t-SNE, and UMAP to high-dimensional embeddings
  • Data persistence: Save and load data in Parquet format

Installation

Development Installation

git clone https://github.com/haesleinhuepf/embeddoor.git
cd embeddoor
pip install -e .[dev,embeddings]

Quick Start

Launch the application:

embeddoor

This will start the server and open your default browser to http://localhost:5000.

Workflow

  1. Load Data: Use File → Open to load a CSV file
  2. Visualize: View tabular data in the right panel, plot numerical columns in the left panel
  3. Customize Plot: Select hue, size, and shape attributes for data points
  4. Select Points: Use the lasso tool to select data points (stored as a new column)
  5. Create Embeddings: Embedding → Create Embedding to generate embeddings from text/image columns
  6. Reduce Dimensions: Dimensionality Reduction → Apply PCA/t-SNE/UMAP to embeddings
  7. Save: File → Save to export data as Parquet

License

MIT License

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

embeddoor-0.1.0.tar.gz (59.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

embeddoor-0.1.0-py3-none-any.whl (63.1 kB view details)

Uploaded Python 3

File details

Details for the file embeddoor-0.1.0.tar.gz.

File metadata

  • Download URL: embeddoor-0.1.0.tar.gz
  • Upload date:
  • Size: 59.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for embeddoor-0.1.0.tar.gz
Algorithm Hash digest
SHA256 06e14cf8bd10bc4a3973af1e6872e705c31b38ab0c9de7ac78a753a25e610d40
MD5 f3dc54ee53b64e0ffabc9c5ce443c813
BLAKE2b-256 9a8cb1507d59846b408dd63f7576c32a614774b723eabdd7bec420d3631d8233

See more details on using hashes here.

File details

Details for the file embeddoor-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: embeddoor-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 63.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.13

File hashes

Hashes for embeddoor-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 d508f13a275a5b7cf0c536cfcf4793d28584cd62ba369d2ccf4fa8a8098d7944
MD5 ab97de22bd7f534f5beb7856f4368db2
BLAKE2b-256 6d97551523d6ae0189f18b37343bcf759988c9df16f970f09f24de7cec5e8a37

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page