Skip to main content

Custom graph/network/multi-weighted network class based on storing list of neighbors for each nodes (as opposed to edge list) for scalable sampling and searching algorithms

Project description

logo

NNetwork

PyPI Version Supported Python Versions

NNetwork is a Custom graph/network/multi-weighted network class optimized for scalable subgraph sampling and searching algorithms. NNetwork stores a dictionary that maps each node to a list of its neighbors to allow for O(1) access for finding neighbors.

The efficiency of neighbor access is import for sampling algorithm such as random walks and Markov chain Monte Carlo motif sampling on graphs, which rely on accessing neighborhood information at every iteration of sampling. In comparison, many packages rely on calculations involving powers of adjacency matrices to calculate random walks of length k.

The default class of NNetwork encodes a network with weighted edges, which can also have list-valued edge weights as its 'color'.

Update for 0.1.0:

Built-in functions contain sampling algorithms for mesoscale network patches using various MCMC motif sampling algorithms [1]. At stationary distribution, it computes a uniformly chosen k-walk in the graph, which can optionally enforced to be non-backtraking, and the induced adjacency pattern is returned as a k x k matrix. Algorithimically, a given k-walk is randomly updated using a suitable MCMC algorithm. The so-computed k x k mesoscale patches are basis of subgraph analysis and network dictionary learning in [2].

By Josh Vendrow and Hanbaek Lyu


Installation

To install NNetwork, run this command in your terminal:

$ pip install -U NNetwork

This is the preferred method to install NNetwork, as it will always install the most recent stable release.

If you don't have pip installed, these installation instructions can guide you through the process.

Usage

Undirected Graphs

Create an undirected (weighted) graph from an edgelist:

>>> from NNetwork import NNetwork
>>> edgelist = [[1,2],[2,3],[3,4]]
>>> G = NNetwork()
>>> G.add_edges(edgelist)
>>> G.has_edge(2,3)
True
>>> G.get_edge_weight(2,3)
1

Get the neighbors of a node:

>>> G.neighbors(3)
[2,4]

Find the intersection of edges with another network:

>>> edgelist2 = [[2,3],[3,4],[5,7]]
>>> G2 = NNetwork()
>>> G2.add_edges(edgelist2)
>>> G.intersection(G2)
[[2,3],[3,4]]

Weighted Graphs

Create a weighted graph from an edgelist:

>>> from NNetwork import NNetwork
>>> edgelist = [[1,2,0.5],[2,3,0.8]]]
>>> G = NNetwork()
>>> G.add_wtd_edges(edgelist)
>>> G.get_edge_weight([2,3])
0.8

Convert weighted graph to an unweighed graph by thresholding

>>> G_simple = G.threshold2simple(0.7)
>>> G_simple.edges()
[[2,3]]

Mesoscale patch computation

>>> edgelist = [[1,2],[2,3],[1,3],[1,4],[1,5]]
>>> G = nn.NNetwork()
>>> G.add_edges(edgelist)
>>> print(G.vertices)
['1', '2', '3', '4', '5']
>>> print(G.edges)
{"['1', '2']": 1, "['2', '1']": 1, "['2', '3']": 1, "['3', '2']": 1, "['1', '3']": 1, "['3', '1']": 1, "['1', '4']": 1, "['4', '1']": 1, "['1', '5']": 1, "['5', '1']": 1}
>>> X, embs = G.get_patches(k=3, sample_size=4, skip_folded_hom=False)
>>> print(X) # each column is a vectorizaiton of k x k induced adjacency matrix
array([[0., 0., 0., 0.],
       [1., 1., 1., 1.],
       [0., 1., 0., 0.],
       [1., 1., 1., 1.],
       [0., 0., 0., 0.],
       [1., 1., 1., 1.],
       [0., 1., 0., 0.],
       [1., 1., 1., 1.],
       [0., 0., 0., 0.]])
>>> print(embs) # four consecutive 3-walks in G
[array(['2', '3', '2'], dtype='<U32'),
 array(['1', '2', '3'], dtype='<U32'),
 array(['4', '1', '4'], dtype='<U32'),
 array(['4', '1', '4'], dtype='<U32')]

Citing

If you use our work in an academic setting, please cite our papers:

[1] Hanbaek Lyu, Facundo Memoli, and David Sivakoff, “Sampling random graph homomorphisms and applications to network data analysis.” https://arxiv.org/abs/1910.09483 (2019)

[2] Hanbaek Lyu, Yacoub Kureh, Joshua Vendrow, and Mason A. Porter, “Learning low-rank mesoscale structures of networks” https://arxiv.org/abs/2102.06984 (2021)

Development

See CONTRIBUTING.md for information related to developing the code.

Suggested Git Branch Strategy

  1. master is for the most up-to-date development, very rarely should you directly commit to this branch. Your day-to-day work should exist on branches separate from master. It is recommended to commit to development branches and make pull requests to master.4. It is recommended to use "Squash and Merge" commits when committing PR's. It makes each set of changes to master atomic and as a side effect naturally encourages small well defined PR's.

Additional Optional Setup Steps:

  • Create an initial release to test.PyPI and PyPI.

    • Follow This PyPA tutorial, starting from the "Generating distribution archives" section.
  • Create a blank github repository (without a README or .gitignore) and push the code to it.

  • Delete these setup instructions from README.md when you are finished with them.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

NNetwork-0.2.0.tar.gz (15.2 kB view details)

Uploaded Source

Built Distribution

NNetwork-0.2.0-py2.py3-none-any.whl (12.5 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file NNetwork-0.2.0.tar.gz.

File metadata

  • Download URL: NNetwork-0.2.0.tar.gz
  • Upload date:
  • Size: 15.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.13

File hashes

Hashes for NNetwork-0.2.0.tar.gz
Algorithm Hash digest
SHA256 5e9c52fbc2a7f2675e3bb8d5210faad480670979897c250a4d8994fe35d331e7
MD5 ddc69b09c9231bbd4e84ebda109c8513
BLAKE2b-256 e9ed38f383b7ecad7b622aebd07512947679b38e263e8773bd60a6519fdc9d25

See more details on using hashes here.

File details

Details for the file NNetwork-0.2.0-py2.py3-none-any.whl.

File metadata

  • Download URL: NNetwork-0.2.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 12.5 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.13

File hashes

Hashes for NNetwork-0.2.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 f651f8d71c512919b065837ff814710bf2ca56960d793687541f45233ed791e6
MD5 9061c03e0ca5b3c11d367ff32aace60d
BLAKE2b-256 3dac55e67cd54b1aebe5ce63a1d949bd146bed2cf704410ea8ade927d1cf35fc

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page