Skip to main content

Interactive Dimensionality Reduction, Clustering, and Visualization

Project description

InterDim

Docs and Tests Python Versions License Docs

Interactive Dimensionality Reduction, Clustering, and Visualization

InterDim is a Python package for interactive exploration of latent data dimensions. It wraps existing tools for dimensionality reduction, clustering, and data visualization in a streamlined interface, allowing for quick and intuitive analysis of high-dimensional data.

Features

  • Easy-to-use pipeline for dimensionality reduction, clustering, and visualization
  • Interactive 3D scatter plots for exploring reduced data
  • Support for various dimensionality reduction techniques (PCA, t-SNE, UMAP, etc.)
  • Multiple clustering algorithms (K-means, DBSCAN, etc.)
  • Customizable point visualizations for detailed data exploration

Installation

You can install from PyPI via pip (recommended):

pip install interdim

Or from source:

git clone https://github.com/MShinkle/interdim.git
cd interdim
pip install .

Quick Start

Here's a basic example using the Iris dataset:

from sklearn.datasets import load_iris
from interdim import InterDimAnalysis

iris = load_iris()
analysis = InterDimAnalysis(iris.data, true_labels=iris.target)
analysis.reduce(method='tsne', n_components=3)
analysis.cluster(method='kmeans', n_clusters=3)
analysis.show(n_components=3, point_visualization='bar')

3D Scatter Plot with Interactive Bar Charts

This will reduce the Iris dataset to 3 dimensions using t-SNE, clusters the data using K-means, and displays an interactive 3D scatter plot with bar charts for each data point as you hover over them.

Demo Notebooks

For more in-depth examples and use cases, check out our demo notebooks:

  1. Iris Species Analysis: Basic usage with the classic Iris dataset.
  2. DNN Latent Space Exploration: Visualizing deep neural network activations.
  3. LLM Token Analysis: Exploring language model token embeddings and layer activations.

Documentation

For detailed API documentation and advanced usage, visit our GitHub Pages.

Contributing

We welcome discussion and contributions!

License

InterDim is released under the BSD 3-Clause License. See the LICENSE file for details.

Contact

For questions and feedback, please open an issue on GitHub.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

interdim-1.0.0.tar.gz (13.4 kB view details)

Uploaded Source

Built Distribution

interdim-1.0.0-py3-none-any.whl (12.4 kB view details)

Uploaded Python 3

File details

Details for the file interdim-1.0.0.tar.gz.

File metadata

  • Download URL: interdim-1.0.0.tar.gz
  • Upload date:
  • Size: 13.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for interdim-1.0.0.tar.gz
Algorithm Hash digest
SHA256 548d33dd96aeef43c01cef1bdb3fd9144529d6f49a938bc244ed339b368b0e43
MD5 0608c3079774a010fb59af6dff428dab
BLAKE2b-256 968fdc57951d5f31d63abca2fe6b8886f99c8f6f02841645e895782bf3635532

See more details on using hashes here.

File details

Details for the file interdim-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: interdim-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 12.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.3

File hashes

Hashes for interdim-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 324f0d32ce3d2a48ab2afed7102290f516817363528b51fc4fc443d6a51aa8a5
MD5 66fb09ffe19338ddc78ab8f46e70172a
BLAKE2b-256 10f24e2dcad4a7e577c55433e8d23c332548833f3f5e56d200c063b7ca9438c7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page