Skip to main content

Add a short description here!

Project description

dortmund2array

Tool to convert datasets from Benchmark Data Sets for Graph Kernels (K. Kersting et al., 2016) into a format suitable for deep learning research in graph classification.


Installation

Simply run pip install dortmund2array to install the command-line interface. The only dependencies are numpy networkx pandas.

Output

Given any benchmark dataset, this tool will create a file DATASET.pickle that contains a pickled list. At index i the list has a dictionary with the adjacency matrix, the graph signal (also known as graph feature matrix) and the corresponding label for the ith graph.

{
    "adjacency":    ...  # as numpy array. Shape: (nodes, nodes)
    "graph_signal": ...  # as numpy array. Shape: (nodes, features)
    "label":        ...  # usually a scalar.
}

The graph signal is an array of shape (nodes, features) where the features are either attributes given by the dataset or if no attributes are available, we simply take the node labels as attributes.

How to use

Simply get the dortmund2array command line tool via pip install dortmund2array.

usage: dortmund2array [-h] [--version] [--output OUTPUT_FOLDER]
                      [--input INPUT_FOLDER]

Tool to convert datasets from 'Benchmark Data Sets for Graph Kernels' (K.
Kersting et al., 2016)

optional arguments:
  -h, --help            show this help message and exit
  --version             show program's version number and exit
  --output OUTPUT_FOLDER, -o OUTPUT_FOLDER
                        Output folder.
  --input INPUT_FOLDER, -i INPUT_FOLDER
                        Input folder containing the dataset of the same name.
  -e                    Output edge list instead of adjacency for each
                        graph.

Thus, download and unzip a dataset. Make sure the folder-name agrees with the dataset-name on the files inside of it. If you for instance download MUTAG and the corresponding folder is .\MUTAG\ and you want the array data saved in .\MUTAG_array\ then you need to simply run:

dortmund2array -i ./MUTAG/ -o ./MUTAG_array/

Requirements

Make sure you meet all the dependencies inside the requirements.txt.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dortmund2array-0.2.tar.gz (15.5 kB view details)

Uploaded Source

File details

Details for the file dortmund2array-0.2.tar.gz.

File metadata

  • Download URL: dortmund2array-0.2.tar.gz
  • Upload date:
  • Size: 15.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.2.post20191203 requests-toolbelt/0.9.1 tqdm/4.40.2 CPython/3.7.3

File hashes

Hashes for dortmund2array-0.2.tar.gz
Algorithm Hash digest
SHA256 67c0065e15dd367055a4b9acdd36db20d9734f96ea224b25fa012af052a6533c
MD5 5644a575f219984f5110a4e66f580933
BLAKE2b-256 48fad7a75eda7336bd50a600e51424e78efd248899f7f3771cc34b48f09643cf

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page