Skip to main content

NuMeTris is a package for Maximum-Entropy models for triadic pattern detection. It contains known models such as DBCM and RBCM for binary motif analysis, and contain mixture models such as DBCM+CReMa and RBCM+CRWCM for weighted triadic motif analysis. The models are solved and routine are present for numeric ensemble generation and the computation of the triadic z-scores for triadic sub-graph occurrence, average flux and intensity.

Project description

PyPI License:GPLv3 Python Version arXiv

NuMeTriS: NUll ModEls for TRIadic Structures

NuMeTriS is a package developed on python3 for Pattern Detection of the 13 triadic connected sub-graphs using maximum-entropy models using direected and reciprocated constraints on network data.

NuMeTriS provides solvers for the binary models DBCM and RBCM, and the conditional weighted models CReMa and CRWCM. All of these models are explained in [1](forthcoming on ArXiv). The use of DBCM and CReMa enable the user to explicitly constrain network properties based on direction, such as out-degree, in-degree (binary) and out-strength and in-strength (weighted). In contrast, the use of RBCM and CRWCM enable the user to constrain network properties based on both direction and reciprocity, such as the reciprocated and non-reciprocated degrees and the reciprocated and non-reciprocated strengths.

Moreover, after solving the models it is possible to generate the related ensembles and compute triadic occurrences, the arithmetic mean of the weights on triadic structures (average fluxes) and their geometric mean (intensities). While triadic occurrences and fluxes are explained in [1](forthcoming on ArXiv), the triadic intensity is the geometric mean of the weights in triadic structures, as explained in 2 but here defined for all 13 sub-graphs.

To explore Maximum-Entropy modeling on networks, checkout Maximum Entropy Hub.

When using the module for your scientific research please consider citing:

    @misc{divece2023commodityspecific,
      title={Commodity-specific triads in the Dutch inter-industry production network}, 
      author={Marzio Di Vece and Frank P. Pijpers and Diego Garlaschelli},
      year={2023},
      eprint={2305.12179},
      archivePrefix={arXiv},
      primaryClass={physics.soc-ph}
}

Contents

Currently Available Models

NuMeTriS contains models for triadic motif detection on network data for continuous-valued semi-definite positive weights. The available models are:

  • DBCM Directed Binary Configuration Model
  • RBCM Reciprocated Binary Configuration Model
  • DBCM+CReMa Mixture model of Directed Binary Configuration Model and Conditional Reconstruction Method A
  • RBCM+CRWCM Mixture model of Reciprocated Binary Configuration Model and Conditionally Reciprocal Configuration Model

Please refer to the paper for further details.

Installation

DyGyS can be installed via pip. You can do it from your terminal

    $ pip install NuMeTriS

If you already installed the package and want to upgrade it, you can type from your terminal:

    $ pip install NuMeTriS --upgrade

Dependencies

NuMeTris uses the following dependencies:

  • scipy for optimization and root solving of some of the models;
  • numba for fast computation of network statistics and criterion functions.
  • matplotlib in order to plot z-score profiles and significance profiles.

They can be easily installed via pip typing

$ pip install scipy
$ pip install numba
$ pip install matplotlib

How-to Guidelines

The module containes the class Graph, initiated with a 2D weighted adjacency matrix, explaining the interactions present in the network data.

Class Instance and Empirical Network Statistics

To inizialize a Graph instance you can type:

G = Graph(adjacency=Wij)

where Wij is the weighted adjacency matrix in 2-D numpy array format.

After initializing you can already explore core network statistics such as (out-)degree, in-degree, reciprocated degrees, non-reciprocated out-degrees and non-reciprocated in-degrees, available using the respective codewords:

G.dseq_out, G.dseq_in, G.dseq_right, G.dseq_left, G.dseq_rec

or weighted core network statistics such as (out-)strength, in-strength, reciprocated strengths, non-reciprocated strengths, available using the respective codewords:

G.stseq_out, G.stseq_in, G.stseq_right, G.stseq_left, G.stseq_rec_out, G.stseq_rec_in

Also triadic network statistics are computed, namely Occurrences, Intensities and Fluxes, available using the respective codewords:

G.Nm_emp, G.Im_emp, G.Fm_emp

Solving the models

You can explore the currently available models using

G.implemented_models

use their names as described in this list not to incur in error messages.

When ready you can choose one of the aforementioned models and solve for their parameters using

G.solver(model= <chosen model>)

Once you solved the model various other attributes become visible and measures dependent solely on criterion functions are computed. These include Loglikelihood, Jacobian, Infinite Jacobian Norm, relative error and relative infinite norm, available using the codewords:

G.ll, G.jacobian, G.norm, G.relative_error, G.norm_rel

For further details on the .solve functions please see the documentation.

Numerically Computing z-scores and significance profiles

Computing z-scores and significance profiles is very easy. Generating the network ensemble is very easy. It's enough to type:

G.numerical_triadic_zscores(n_ensemble=<wanted number of graphs>,percentiles=<wanted percentiles for z-score CIs.>)

This routine generate "n_ensemble" graphs and computes z-scores for triadic occurrences (if a binary model is chosen) or triadic occurrences, fluxes and intensities (if a mixture model is chosen). Moreover it estimates a confidence interval for the z-score to understand the range of the values assumed in the ensemble. Percentiles are defined as tuple types and a 95% confidence interval can be estimated by the tuple (z_{2.5},z_{97.5}), for percentiles = (2.5,97.5).

This method returns expected triadic occurrences, triadic fluxes and triadic intensities

G.avg_Nm, G.avg_Fm, G.avg_Im,

corresponding z-scores

G.zscores_Nm, G.zscores_Fm, G.zscores_Im,

inferior and superior percentiles

G.zscores_down_Nm, G.zscores_down_Fm, G.zscores_down_Im, G.zscores_up_Nm, G.zscores_up_Fm, G.zscores_up_Im

and significance scores

G.normalized_z_Nm, G.normalized_z_Fm, G.normalized_z_Im.

Plot z-scores and significance profiles

Routine to plot z-scores and significance profiles. It's enough to type:

G.plot_zscores(type='z-scores')
G.plot_zscores(type='significance')
G.plot_zscores(type='both')

It plots triadic occurrences (if a binary model is used) or occurrences, fluxes and intensities (if a mixture model is used). It is possible to plot z-scores using the argument type='z-scores', significance profiles using argument type='significance' or both, using type='both'.

Documentation

You can find the complete documentation of the DyGyS library in documentation

Credits

Author:

Marzio Di Vece (a.k.a. MarsMDK)

Acknowledgments: The module was developed under the supervision of Diego Garlaschelli and Frank P. Pijpers. It was developed at IMT School for Advanced Studies Lucca and Statistics Netherlands and supported by the Italian ‘Programma di Attività Integrata’ (PAI) project ‘Prosociality, Cognition and Peer Effects’ (Pro.Co.P.E.), funded by IMT School for Advanced Studies.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

NuMeTriS-0.0.5.tar.gz (34.0 kB view hashes)

Uploaded Source

Built Distribution

NuMeTriS-0.0.5-py3-none-any.whl (30.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page