Skip to main content

Interactive Dimensionality Reduction, Clustering, and Visualization

Project description

InterDim

Docs and Tests Python Versions License Docs

Interactive Dimensionality Reduction, Clustering, and Visualization

InterDim is a Python package for interactive exploration of latent data dimensions. It wraps existing tools for dimensionality reduction, clustering, and data visualization in a streamlined interface, allowing for quick and intuitive analysis of high-dimensional data.

Features

  • Easy-to-use pipeline for dimensionality reduction, clustering, and visualization
  • Interactive 3D scatter plots for exploring reduced data
  • Support for various dimensionality reduction techniques (PCA, t-SNE, UMAP, etc.)
  • Multiple clustering algorithms (K-means, DBSCAN, etc.)
  • Customizable point visualizations for detailed data exploration

Installation

You can install from PyPI via pip (recommended):

pip install interdim

Or from source:

git clone https://github.com/MShinkle/interdim.git
cd interdim
pip install .

Quick Start

Here's a basic example using the Iris dataset:

from sklearn.datasets import load_iris
from interdim import InterDimAnalysis

iris = load_iris()
analysis = InterDimAnalysis(iris.data, true_labels=iris.target)
analysis.reduce(method='tsne', n_components=3)
analysis.cluster(method='kmeans', n_clusters=3)
analysis.show(n_components=3, point_visualization='bar')

3D Scatter Plot with Interactive Bar Charts

This will reduce the Iris dataset to 3 dimensions using t-SNE, clusters the data using K-means, and displays an interactive 3D scatter plot with bar charts for each data point as you hover over them.

Demo Notebooks

For more in-depth examples and use cases, check out our demo notebooks:

  1. Iris Species Analysis: Basic usage with the classic Iris dataset.
  2. DNN Latent Space Exploration: Visualizing deep neural network activations.
  3. LLM Token Analysis: Exploring language model token embeddings and layer activations.

Documentation

For detailed API documentation and advanced usage, visit our GitHub Pages.

Contributing

We welcome discussion and contributions!

License

InterDim is released under the BSD 3-Clause License. See the LICENSE file for details.

Contact

For questions and feedback, please open an issue on GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

interdim-0.4.0.tar.gz (13.4 kB view details)

Uploaded Source

Built Distribution

interdim-0.4.0-py3-none-any.whl (12.4 kB view details)

Uploaded Python 3

File details

Details for the file interdim-0.4.0.tar.gz.

File metadata

  • Download URL: interdim-0.4.0.tar.gz
  • Upload date:
  • Size: 13.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for interdim-0.4.0.tar.gz
Algorithm Hash digest
SHA256 a13f210c475d929c9633a3338fcc9dc99327cb68c0a24fea19213f3d124861f4
MD5 7cccd1fedc1379fe7e211c920da4a48f
BLAKE2b-256 9c9087dffa3836fefd2ab3805dad5a92d90c72dc1088eb9f341aac2c05c51b00

See more details on using hashes here.

File details

Details for the file interdim-0.4.0-py3-none-any.whl.

File metadata

  • Download URL: interdim-0.4.0-py3-none-any.whl
  • Upload date:
  • Size: 12.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for interdim-0.4.0-py3-none-any.whl
Algorithm Hash digest
SHA256 1440f20e917a29a0e9d08412e8ca3cd18f49ee3d1f43d266e7880d3270f34e37
MD5 1405bc2510c15a4dd561a087a1065ec1
BLAKE2b-256 680121c770fb26efbc704c965d60f7506f3e417351f711ce754e09f06491d1ce

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page