Recommendation algorithms for large graphs on networkx

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

pygrank

Recommendation algorithms for large graphs.

Installation

pip install pygrank

Usage

Run a PageRank algorithm with seed oversampling

import networkx as nx
from pygrank.algorithms.pagerank import PageRank as Ranker
from pygrank.algorithms.oversampling import SeedOversampling as Oversampler

G = nx.Graph()
seeds = list()
... # insert graph nodes and select some of them as seeds (e.g. see tests.py)

algorithm = Oversampler(Ranker(alpha=0.85, tol=1.E-6, max_iters=100)) # these are the default values
ranks = algorithm.rank(G, {v: 1 for v in seeds})

Run a PageRank algorithm and make it converge to a robust node order

import networkx as nx
from pygrank.algorithms.pagerank import PageRank as Ranker
from pygrank.algorithms.utils import RankOrderConvergenceManager

G = nx.Graph()
seeds = list()
... # insert graph nodes and select some of them as seeds (e.g. see tests.py)
alpha = 0.85

algorithm = Ranker(alpha=alpha, convergence=RankOrderConvergenceManager(alpha))
ranks = algorithm.rank(G, {v: 1 for v in seeds})

Hash the outcome of graph normalization to speed up multiple calls to the same graph

import networkx as nx
from pygrank.algorithms.pagerank import PageRank as Ranker
from pygrank.algorithms.utils import preprocessor

G = nx.Graph()
seeds1 = list()
seeds2 = list()
... # insert graph nodes and select some of them as seeds (e.g. see tests.py)

algorithm = Ranker(alpha=0.8, to_scipy=preprocessor(normalization="col", assume_immutability=True))
ranks = algorithm.rank(G, {v: 1 for v in seeds1})
ranks = algorithm.rank(G, {v: 1 for v in seeds2}) # does not re-compute the normalization

How to evaluate with an unsupervised metric

from pygrank.algorithms.postprocess import Normalize
from pygrank.metrics.unsupervised import Conductance

G, ranks = ... # calculate as per the first example
normalized_ranks = Normalize().rank(ranks)

metric = Conductance(G)
print(metric.evaluate(normalized_ranks))

How to evaluate with a supervised metric

from pygrank.metrics.supervised import AUC
import pygrank.metrics.utils

G, seeds, algorithm = ... # as per the first example
seeds, ground_truth = pygrank.metrics.utils.split_groups(seeds, fraction_of_training=0.5)

pygrank.metrics.utils.remove_group_edges_from_graph(G, ground_truth)
ranks = algorithm.rank(G, {v: 1 for v in seeds})

metric = AUC({v: 1 for v in ground_truth})
print(metric.evaluate(ranks))

How to evaluate multiple ranks

import networkx as nx
from pygrank.algorithms.pagerank import PageRank as Ranker
from pygrank.algorithms.postprocess import Normalize as Normalizer
from pygrank.algorithms.oversampling import BoostedSeedOversampling as Oversampler
from pygrank.metrics.unsupervised import Conductance
from pygrank.metrics.supervised import AUC
from pygrank.metrics.multigroup import MultiUnsupervised, MultiSupervised, LinkAUC
import pygrank.metrics.utils

# Construct data
G = nx.Graph()
groups = {}
groups["group1"] = list()
... 

# Split to training and test data
training_groups, test_groups = pygrank.metrics.utils.split_groups(groups)
pygrank.metrics.utils.remove_group_edges_from_graph(G, test_groups)

# Calculate ranks and put them in a map
algorithm = Normalizer(Oversampler(Ranker(alpha=0.99)))
ranks = {group_id: algorithm.rank(G, {v: 1 for v in group}) 
        for group_id, group in training_groups.items()}


# Evaluation with Conductance
conductance = MultiUnsupervised(Conductance, G)
print(conductance.evaluate(ranks))

# Evaluation with LinkAUC
link_AUC = LinkAUC(G, pygrank.metrics.utils.to_nodes(test_groups))
print(link_AUC.evaluate(ranks))

# Evaluation with AUC
auc = MultiSupervised(AUC, pygrank.metrics.utils.to_seeds(test_groups))
print(auc.evaluate(ranks))

References

@article{krasanakis2019boosted,
  title={Boosted seed oversampling for local community ranking},
  author={Krasanakis, Emmanouil and Schinas, Emmanouil and Papadopoulos, Symeon and Kompatsiaris, Yiannis and Symeonidis, Andreas},
  journal={Information Processing \& Management},
  pages={102053},
  year={2019},
  publisher={Elsevier}
}

@inproceedings{krasanakis2019linkauc,
  title={LinkAUC: Unsupervised Evaluation of Multiple Network Node Ranks Using Link Prediction},
  author={Krasanakis, Emmanouil and Papadopoulos, Symeon and Kompatsiaris, Yiannis},
  booktitle={International Conference on Complex Networks and Their Applications},
  pages={3--14},
  year={2019},
  organization={Springer}
}

@unpublished{krasanakis2020stopping,
  title={Stopping Personalized PageRank without an Error Tolerance Parameter},
  author={Krasanakis, Emmanouil and Papadopoulos, Symeon and Kompatsiaris, Ioannis},
  year={2020},
  note = {unpublished}
}

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.2.14

Jun 11, 2024

0.2.13

Jun 6, 2024

0.2.12

Feb 27, 2023

0.2.11

Jan 2, 2023

0.2.10

Oct 17, 2022

0.2.9

Aug 20, 2022

0.2.8.4

Aug 13, 2022

0.2.8.3

Jul 15, 2022

0.2.8.2

Jul 15, 2022

0.2.8.1

Jul 14, 2022

0.2.8

Jul 13, 2022

0.2.7

Feb 6, 2022

0.2.6

Jan 26, 2022

0.2.5

Dec 7, 2021

0.2.4

Sep 18, 2021

0.2.3

Aug 23, 2021

0.2.2

Aug 8, 2021

0.2.1

Aug 8, 2021

0.1.17

Mar 24, 2021

0.1.16

Aug 21, 2020

This version

0.1.15

Apr 21, 2020

0.1.14

Apr 21, 2020

0.1.13

Jan 24, 2020

0.1.12

Dec 11, 2019

0.1.11

Dec 11, 2019

0.1.10

Dec 11, 2019

0.1.9

Nov 28, 2019

0.1.8

Oct 17, 2019

0.1.7

Oct 2, 2019

0.1.5

Oct 1, 2019

0.1.4

Sep 25, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pygrank-0.1.15-py3-none-any.whl (18.9 kB view details)

Uploaded Apr 21, 2020 Python 3

File details

Details for the file pygrank-0.1.15-py3-none-any.whl.

File metadata

Download URL: pygrank-0.1.15-py3-none-any.whl
Upload date: Apr 21, 2020
Size: 18.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/2.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.36.1 CPython/3.6.2

File hashes

Hashes for pygrank-0.1.15-py3-none-any.whl
Algorithm	Hash digest
SHA256	`40caf9d9ec2d4688da4d55d6e67b4b56b53dffb007483142273f267bf0e51838`
MD5	`45c2544851300d5d78a710c13d48aa99`
BLAKE2b-256	`0ec1a91557046eff9edff51cb6c66cdd7fb2e39c4b690f159282a24608784032`

See more details on using hashes here.

pygrank 0.1.15

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

pygrank

Installation

Usage

Run a PageRank algorithm with seed oversampling

Run a PageRank algorithm and make it converge to a robust node order

Hash the outcome of graph normalization to speed up multiple calls to the same graph

How to evaluate with an unsupervised metric

How to evaluate with a supervised metric

How to evaluate multiple ranks

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes