Custom graph/network/multi-weighted network class based on storing list of neighbors for each nodes (as opposed to edge list) for scalable sampling and searching algorithms
Project description
NNetwork
NNetwork
is a Custom graph/network/multi-weighted network class optimized for scalable subgraph sampling and searching algorithms. NNetwork stores a dictionary that maps each node to a list of its neighbors to allow for O(1) access for finding neighbors.
The efficiency of neighbor access is import for sampling algorithm such as random walks and Markov chain Monte Carlo motif sampling on graphs, which rely on accessing neighborhood information at every iteration of sampling. In comparison, many packages rely on calculations involving powers of adjacency matrices to calculate random walks of length k.
The default class of NNetwork
encodes a network with weighted edges, which can also have list-valued edge weights as its 'color'.
Update for 0.1.0:
Built-in functions contain sampling algorithms for mesoscale network patches using various MCMC motif sampling algorithms [1]. At stationary distribution, it computes a uniformly chosen k-walk in the graph, which can optionally enforced to be non-backtraking, and the induced adjacency pattern is returned as a k x k matrix. Algorithimically, a given k-walk is randomly updated using a suitable MCMC algorithm. The so-computed k x k mesoscale patches
are basis of subgraph analysis and network dictionary learning in [2].
By Josh Vendrow and Hanbaek Lyu
Installation
To install NNetwork, run this command in your terminal:
$ pip install -U NNetwork
This is the preferred method to install NNetwork, as it will always install the most recent stable release.
If you don't have pip installed, these installation instructions can guide you through the process.
Usage
Undirected Graphs
Create an undirected (weighted) graph from an edgelist:
>>> from NNetwork import NNetwork
>>> edgelist = [[1,2],[2,3],[3,4]]
>>> G = NNetwork()
>>> G.add_edges(edgelist)
>>> G.has_edge(2,3)
True
>>> G.get_edge_weight(2,3)
1
Get the neighbors of a node:
>>> G.neighbors(3)
[2,4]
Find the intersection of edges with another network:
>>> edgelist2 = [[2,3],[3,4],[5,7]]
>>> G2 = NNetwork()
>>> G2.add_edges(edgelist2)
>>> G.intersection(G2)
[[2,3],[3,4]]
Weighted Graphs
Create a weighted graph from an edgelist:
>>> from NNetwork import NNetwork
>>> edgelist = [[1,2,0.5],[2,3,0.8]]]
>>> G = NNetwork()
>>> G.add_wtd_edges(edgelist)
>>> G.get_edge_weight([2,3])
0.8
Convert weighted graph to an unweighed graph by thresholding
>>> G_simple = G.threshold2simple(0.7)
>>> G_simple.edges()
[[2,3]]
Mesoscale patch computation
>>> edgelist = [[1,2],[2,3],[1,3],[1,4],[1,5]]
>>> G = nn.NNetwork()
>>> G.add_edges(edgelist)
>>> print(G.vertices)
['1', '2', '3', '4', '5']
>>> print(G.edges)
{"['1', '2']": 1, "['2', '1']": 1, "['2', '3']": 1, "['3', '2']": 1, "['1', '3']": 1, "['3', '1']": 1, "['1', '4']": 1, "['4', '1']": 1, "['1', '5']": 1, "['5', '1']": 1}
>>> X, embs = G.get_patches(k=3, sample_size=4, skip_folded_hom=False)
>>> print(X) # each column is a vectorizaiton of k x k induced adjacency matrix
array([[0., 0., 0., 0.],
[1., 1., 1., 1.],
[0., 1., 0., 0.],
[1., 1., 1., 1.],
[0., 0., 0., 0.],
[1., 1., 1., 1.],
[0., 1., 0., 0.],
[1., 1., 1., 1.],
[0., 0., 0., 0.]])
>>> print(embs) # four consecutive 3-walks in G
[array(['2', '3', '2'], dtype='<U32'),
array(['1', '2', '3'], dtype='<U32'),
array(['4', '1', '4'], dtype='<U32'),
array(['4', '1', '4'], dtype='<U32')]
Citing
If you use our work in an academic setting, please cite our papers:
[1] Hanbaek Lyu, Facundo Memoli, and David Sivakoff, “Sampling random graph homomorphisms and applications to network data analysis.” https://arxiv.org/abs/1910.09483 (2019)
[2] Hanbaek Lyu, Yacoub Kureh, Joshua Vendrow, and Mason A. Porter, “Learning low-rank mesoscale structures of networks” https://arxiv.org/abs/2102.06984 (2021)
Development
See CONTRIBUTING.md for information related to developing the code.
Suggested Git Branch Strategy
master
is for the most up-to-date development, very rarely should you directly commit to this branch. Your day-to-day work should exist on branches separate frommaster
. It is recommended to commit to development branches and make pull requests to master.4. It is recommended to use "Squash and Merge" commits when committing PR's. It makes each set of changes tomaster
atomic and as a side effect naturally encourages small well defined PR's.
Additional Optional Setup Steps:
-
Create an initial release to test.PyPI and PyPI.
- Follow This PyPA tutorial, starting from the "Generating distribution archives" section.
-
Create a blank github repository (without a README or .gitignore) and push the code to it.
-
Delete these setup instructions from
README.md
when you are finished with them.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file NNetwork-0.2.0.tar.gz
.
File metadata
- Download URL: NNetwork-0.2.0.tar.gz
- Upload date:
- Size: 15.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5e9c52fbc2a7f2675e3bb8d5210faad480670979897c250a4d8994fe35d331e7 |
|
MD5 | ddc69b09c9231bbd4e84ebda109c8513 |
|
BLAKE2b-256 | e9ed38f383b7ecad7b622aebd07512947679b38e263e8773bd60a6519fdc9d25 |
File details
Details for the file NNetwork-0.2.0-py2.py3-none-any.whl
.
File metadata
- Download URL: NNetwork-0.2.0-py2.py3-none-any.whl
- Upload date:
- Size: 12.5 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f651f8d71c512919b065837ff814710bf2ca56960d793687541f45233ed791e6 |
|
MD5 | 9061c03e0ca5b3c11d367ff32aace60d |
|
BLAKE2b-256 | 3dac55e67cd54b1aebe5ce63a1d949bd146bed2cf704410ea8ade927d1cf35fc |