Contrastive neighbor embeddings (CNE) for dimensionality reduction and clustering
Project description
CNE is a probabilistic self-supervised deep learning model for compressing high-dimensional data to a low-dimensional embedding. CNE is a general-purpose algorithm that works with multiple types of data including images, time series, and tabular data. It uses the InfoNCE objective, a variational bound on mutual information, to improve local structure preservation in the compressed latent space and simultaneously learns a cluster distribution (a prior over the latent embedding) during optimization. Overlapping clusters are automatically combined by optimizing a variational upper bound on entropy, so the number of clusters does not have to be specified manually — provided the number of initial clusters is large enough. CNE produces embeddings with similar quality to existing dimensionality reduction methods; can detect outliers; scales to large, out-of-core datasets; and can easily add new data to an existing embedding/clustering.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file cne-learn-0.0.dev0.tar.gz
.
File metadata
- Download URL: cne-learn-0.0.dev0.tar.gz
- Upload date:
- Size: 2.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2 requests-toolbelt/0.9.1 tqdm/4.33.0 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 57abcab9a0d9f24ed247bf36b08cc409f8827474e0401b9661b7e6e9d80b7584 |
|
MD5 | dcbc1a877aadaecf8d6f970152beaefd |
|
BLAKE2b-256 | 192b8810823478a435cc182c6557039bddbd9de2b16be3cec7292e23c5aeed01 |