Project description

Learning topology-preserving data representations (ICLR 2023)

This repository contains the code for the paper, which introduces a novel method for global-structure-preserving dimensionality reduction based on Topological Data Analysis (TDA), specifically Representation Topology Divergence (RTD).

Description

The proposed method aims to make the latent representation topologically similar to the data manifold by enforcing similarity in topological features (clusters, loops, 2D voids, etc.) and their localization. The core of the method is the minimization of the Representation Topology Divergence (RTD) between the original high-dimensional data and its low-dimensional representation in the latent space. RTD minimization provides closeness in topological features with strong theoretical guarantees. We develop a scheme for RTD differentiation and apply it as a loss term for the autoencoder. The proposed method “RTD-AE” better preserves the global structure and topology of the data manifold than state-of-the-art competitors, as measured by linear correlation, triplet distance ranking accuracy, and Wasserstein distance between persistence barcodes.
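To make two of the evaluation measures named above concrete, here is a minimal NumPy sketch of linear correlation and triplet distance ranking accuracy computed on pairwise distances. It is an illustration under assumptions, not the code shipped in this repository: the function names, the Euclidean metric, and the random triplet sampling are all hypothetical choices, and the definitions used in the paper's experiments may differ in detail. The Wasserstein distance between persistence barcodes is not covered, since it needs a persistent homology library.

```python
import numpy as np


def pairwise_distances(x):
    """Euclidean distance matrix for the rows of x (assumed metric)."""
    diff = x[:, None, :] - x[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))


def linear_correlation(x, z):
    """Pearson correlation between pairwise distances in x and in z."""
    iu = np.triu_indices(len(x), k=1)  # count each unordered pair once
    dx = pairwise_distances(x)[iu]
    dz = pairwise_distances(z)[iu]
    return float(np.corrcoef(dx, dz)[0, 1])


def triplet_accuracy(x, z, n_triplets=10_000, seed=0):
    """Share of sampled triplets (i, j, k) whose distance ordering agrees.

    A triplet counts as preserved when d(i, j) < d(i, k) holds in both
    spaces or in neither. Triplets with repeated indices are discarded.
    """
    rng = np.random.default_rng(seed)
    dx, dz = pairwise_distances(x), pairwise_distances(z)
    i, j, k = rng.integers(0, len(x), size=(3, n_triplets))
    keep = (i != j) & (i != k) & (j != k)
    i, j, k = i[keep], j[keep], k[keep]
    same = (dx[i, j] < dx[i, k]) == (dz[i, j] < dz[i, k])
    return float(same.mean())


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    x = rng.normal(size=(200, 10))      # stand-in for the original data
    z = x[:, :2]                        # stand-in for a 2D latent code
    print("linear correlation:", linear_correlation(x, z))
    print("triplet accuracy:  ", triplet_accuracy(x, z))
```

Both scores lie at most at 1, and higher values indicate that the latent representation keeps more of the global distance structure of the data. A uniformly rescaled copy of the data reaches the maximum on both, which is a handy sanity check.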

Dependencies

The code requires Python 3.8.x and the packages listed in requirements.txt.

Usage

To reproduce the experiments and train RTD-AE, see the AE training.ipynb notebook. To train benchmark models such as UMAP and t-SNE, see the TSNE and UMAP.ipynb notebook. For visualization, see the Visualization and metrics.ipynb notebook.

Cite us

@inproceedings{trofimov2023learning,
title={Learning topology-preserving data representations},
author={Ilya Trofimov and Daniil Cherniavskii and Eduard Tulchinskii and Nikita Balabin and Serguei Barannikov and Evgeny Burnaev},
booktitle={International Conference on Learning Representations},
year={2023},
url={https://openreview.net/forum?id=lIu-ixf-Tzf}
}
