Skip to main content

A Python package for graph processing and feature extraction

Project description

TopER: Topological Embeddings in Graph Representation Learning

This repository contains the implementation of TopER, a novel graph embedding method introduced in the paper titled "TopER: Topological Embeddings in Graph Representation Learning." Graph embeddings are crucial for graph representation learning, enabling the exploration of graphs using machine learning methods. However, existing deep learning techniques often rely on black-box, high-dimensional graph embeddings. There is a growing need for interpretable, low-dimensional embeddings to facilitate efficient data visualization for graph datasets, thus providing practical tools for effective dataset analysis.

Overview

TopER simplifies a fundamental concept of topological data analysis, specifically filtration, to compute the evolution rate of graph structures induced by a user-defined function on nodes or edges. This approach results in a low-dimensional, interpretable graph embedding that offers advantages in data visualization and graph classification tasks.

Repository Structure

  • Datasets:
    • Contains the benchmark datasets considered in the paper.
  • functions_to_calculate_[a,b]:
    • Contains the main algorithms to calculate vectors [a,b] for each graph in an entered benchmark dataset.
    • Main.py:
      • Python script to calculate the vectors X and Y for each filtration function and save them in .csv files.
    • features_main.py:
      • Python script to calculate [a,b] vectors and save them in a .csv file using the .csv files calculated previously.
  • classification_codes:
    • Contains codes for performing graph classification using the final .csv file generated by features_main.py.

Abstract of the Paper

Graph embeddings play a pivotal role in graph representation learning, allowing machine learning methods to explore and understand graph data. In our paper, "TopER: Topological Embeddings in Graph Representation Learning," we introduce TopER as a novel graph embedding method. TopER leverages the concept of filtration from topological data analysis to compute the evolution rate of graph structures induced by user-defined functions on nodes or edges.

Our experiments demonstrate that TopER outperforms or matches the performance of state-of-the-art deep learning models in graph classification tasks. Additionally, we provide theoretical stability guarantees for TopER. Notably, with just two parameters, TopER serves as a low-dimensional topological graph embedding, providing one of the first effective methods for visualizing graph datasets.

Usage

  1. Clone the repository:
    git clone https://github.com/your_username/TopER.git
    cd TopER
    

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

toper-1.0.3.tar.gz (9.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

toper-1.0.3-py3-none-any.whl (9.6 kB view details)

Uploaded Python 3

File details

Details for the file toper-1.0.3.tar.gz.

File metadata

  • Download URL: toper-1.0.3.tar.gz
  • Upload date:
  • Size: 9.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.2

File hashes

Hashes for toper-1.0.3.tar.gz
Algorithm Hash digest
SHA256 0eb3304d19ecfdbb12cdbf050a4036cdb2812c2d4e5a4e70535425b696466000
MD5 1f209e660058a64180d8a44427c55ad6
BLAKE2b-256 e06651402b6aef091c2a6a3be37eb72ea79a164dbec29f611e90ab39185179d2

See more details on using hashes here.

File details

Details for the file toper-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: toper-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 9.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.2

File hashes

Hashes for toper-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 3881cf0eb4e990e528b84fd838193ee921f330bde55d84398924cbc8d78e70ff
MD5 23dce1541a8aeaea0b8c7bc2261e9484
BLAKE2b-256 a6cebb0400f2b5dc17c74a3fb4ba61985f2333d59d42e46bfcdc458681603f29

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page