Library for the problem of Influence Maximization in Social Networks

These details have not been verified by PyPI

Project links

Project description

NetMax - Influence Maximization in Social Networks

NetMax is a Python library that provides the implementation of several algorithms for the problem of Influence Maximization in Social Networks, originally formulated in "Maximizing the Spread of Influence through a Social Network" (Kempe, Kleinberg and Tardos, 2003). NetMax is built upon NetworkX, a popular python library for working with graphs. It also addresses the problem of Competitive Influence Maximization, as an extensive-form strategic game setting in which multiple entities try to maximize their own influence across the network while minimizing the others'. It works with both signed and unsigned networks, implementing progressive, semi-progressive and non-progressive diffusion models.

Table of Contents:

NetMax - Influence Maximization in Social Networks

Installation

To install the library you can run the following command:

pip install netmax

Requirements

NetMax was developed with Python 3.12 and requires the installation of the following libraries:

networkx (version 3.3)
numpy
scipy
tqdm
heapdict

If you have already installed the library using pip, you don't need to install the requirements, otherwise you can simply run the following command:

pip install -r requirements.txt

Overview

This framework wants to be a useful tool for all those people who study the problem of Influence Maximization. The users instantiate the InfluenceMaximization class, setting some basic parameters:

input_graph: a directed graph representing the network (of type networkx.DiGraph)
agents: a dictionary where the key is the agent name (str) and the value is his budget (int)
alg: the algorithm to use for influence maximization (see Algorithms)
diff_model: the diffusion model to use (see Diffusion models)
inf_prob: the probability distribution used to generate (if needed) the probabilities of influence between nodes. The framework implements different influence probabilities, default is None (see Influence probabilities)
endorsement_policy: the policy that nodes use to choose which agent to endorse when they have been contacted by more than one agent. The framework implements different endorsement policies, default is 'random' (see Endorsement policies)
insert_opinion: True if the nodes do not contain any information about their opinion on the agents, False otherwise (or if the opinion is not used)
inv_edges: a bool indicating whether to invert the edges of the graph
first_random_seed: a bool indicating whether to insert a first node (chosen randomly) in the seed set of every agent
r: number of simulations to execute (default is 100)
verbose: if True sets the logging level to INFO, otherwise displays only the minimal information

Important: alg, diff_model, inf_prob and endorsement_policy can be either str or class parameters:

If they are str parameters, they represent the name attribute of the corresponding class already present in the framework. This was done in order to prevent the user from directly importing and instantiating all the specific classes, which could have not been user-friendly. To view all the keywords for these parameters, see the corresponding section
Otherwise, they must extend the corresponding superclass depending on the parameters (Algorithm for alg, DiffusionModel for diff_model, InfluenceProbability for inf_prob, EndorsementPolicy for endorsement_policy). This way, the user can define his own custom classes

After creating the InfluenceMaximization object, the user may call its run() method, which returns:

seed: a dictionary where the key is the agent name and the value is the seed set found
spread: a dictionary where the key is the agent name and the value is the expected spread
execution_time: the total execution time (in seconds)

All these values are also available in the result attribute (which is a dictionary) of the InfluenceMaximization object.

Algorithms

NetMax provides the implementation of many state-of-the-art algorithms.

Simulation-based

Monte-Carlo Greedy: implemented by the class MCGreedy (keyword: mcgreedy)
CELF: implemented by the class CELF (keyword: celf)
CELF++: implemented by the class CELF_PP (keyword: celfpp)

Proxy-based

Highest Out-Degree Heuristic: implemented by the class HighestOutDegree (keyword: outdeg)
Degree Discount: implemented by the class DegDis (keyword: degdis)
Group PageRank: implemented by the class Group_PR (keyword: group_pr)

Sketch-based

StaticGreedy: implemented by the class StaticGreedy (keyword: static_greedy)
RIS: implemented by the class RIS (keyword: ris)
TIM: implemented by the class TIM (keyword: tim)
TIM+: implemented by the class TIMp (keyword: tim_p)

Diffusion models

The supported diffusion models are:

Independent Cascade: implemented by the class IndependentCascade (keyword: ic)
Linear Threshold: implemented by the class LinearThreshold (keyword: lt)
Triggering Model: implemented by the class Triggering (keyword: tr)
Decreasing Cascade: implemented by the class DecreasingCascade (keyword: dc)
Semi-Progressive Friend-Foe Dynamic Linear Threshold: implemented by the class SemiProgressiveFriendFoeDynamicLinearThreshold (keyword: sp_f2dlt)
Non-Progressive Friend-Foe Dynamic Linear Threshold: implemented by the class NonProgressiveFriendFoeDynamicLinearThreshold (keyword: np_f2dlt)

Influence probabilities

The influence probabilities are used to label the edges between the network nodes if they are not already labeled. The user can choose between:

A constant value, set by default at 0.1 (keyword: constant)
A uniform distribution between 0.01 and 0.1 (keyword: uniform)
A distribution based on similarity between nodes computed with SimRank algorithm (keyword: similarity)
A ratio model which distributes the probability uniformly based on the in-degree of the target node (keyword: ratio)
A hybrid approach based on the average degree of the graph (keyword: hybrid)
An opinion-based approach (keyword: opinion) which assigns to each node a vector of opinions (namely, values between 0 and 1) and computes the influence probability comparing the opinions of the two nodes with cosine similarity and taking into account also their SimRank similarity, with the formula:

$p(u,v)=b+k*\left(\frac{1}{outdeg(u)}*similarity(u,v)+cossim(opinion(u),opinion(v))\right)$

Endorsement policies

In the competitive setting it is possible that, in the same time step, multiple agents contact the same node. Therefore, it is necessary an endorsement policy that dictates which agent the node chooses to endorse. Several endorsement policies are implemented:

A random policy, which chooses randomly between the agents that contacted the node in that specific time step (keyword: random)
A voting-based policy, which chooses the most occurring agent between the already activated neighbors of the node (keyword: voting)
A community-based approach, which applies the voting strategy to the community the node belongs to instead of its neighbors (keyword: community)
A similarity-based policy, which essentially is a weighted voting strategy based on the SimRank similarity between the node and its neighbors (keyword: sim_endorsement)

Useful papers

Here is a non-exhaustive list of useful papers which have been studied thoroughly to develop this framework:

Bharathi et al. - Competitive Influence Maximization in Social Networks
Borgs et al. - Maximizing Social Influence in Nearly Optimal Time
Borodin et al. - Threshold Models for Competitive Influence in Social Networks
Budak et al. - Limiting the Spread of Misinformation in Social Networks
Carnes et al. - Maximizing influence in a competitive social network
Chen et al. - Efficient Influence Maximization in Social Networks
Chen et al. - StaticGreedy Solving the Scalability-Accuracy Dilemma in Influence Maximization
Goyal et al. - CELF++ Optimizing the Greedy Algorithm for Influence Maximization in Social Networks
Goyal et al. - Learning Influence Probabilities In Social Networks
Goyal et al. - SimPath An Efficient Algorithm for Influence Maximization under the Linear Threshold Model
Gursoy et al. - Influence Maximization in Social Networks Under Deterministic Linear Threshold Model
Huang et al. - Competitive and complementary influence maximization in social network A follower's perspective
Kempe et al. - Influential Nodes in a Diffusion Model for Social Networks
Kempe et al. - Maximizing the Spread of Influence through a Social Network
Kong et al. - Online Influence Maximization under Decreasing Cascade Model
Leskovec et al. - Cost-Effective Outbreak Detection in Networks
Li et al. - GetReal Towards Realistic Selection of Influence Maximization Strategies in Competitive Networks
Lin et al. - A Learning-based Framework to Handle Multi-round Multi-party Influence Maximization on Social Networks
Liu - Influence maximization in social networks An ising-model-based approach
Liu et al. - Influence Maximization over Large-Scale Social Networks A Bounded Linear Approach
Lu et al. - From Competition to Complementarity Comparative Influence Diffusion and Maximization
Calio, Tagarelli - Complex influence propagation based on trust-aware dynamic linear threshold models
Tang et al. - Influence Maximization in Near-Linear Time A Martingale Approach
Tang et al. - Influence Maximization Near Optimal Time Complexity Meets Practical Efficiency
Wang et al. - Community-based Greedy Algorithm for Mining Top-K Influential Nodes in Mobile Social Networks
Zhou et al. - UBLF An Upper Bound Based Approach to Discover Influential Nodes in Social Networks
Zhu et al. - Minimum cost seed set for competitive social influence

Example

The following is a short example of how to create a simple influence maximization game with two agents. The data is read from a txt file with a function defined in the utils.py file, but alternatively the user can directly input a networkx.DiGraph object with the methods provided by networkx.

import utils
import influence_maximization as im

g = utils.read_adjacency_matrix("../data/network.txt")
# Dictionary <agent_name: agent_budget>
agents = {
    'Agent_0': 10
    'Agent_1': 10
}
im_instance = im.InfluenceMaximization(input_graph=g, agents=agents, alg='tim_p', 
                                        diff_model='ic', inf_prob='random', r=1000,
                                        insert_opinion=False, endorsement_policy='random', verbose=True)
seed, spread, execution_time = im_instance.run()
print(f"Seed sets found: {seed}")
print(f"Spreads: {spread}")
print(f"Total execution time: {execution_time}")

Credits

The creators of NetMax are Lorenzo Bloise and Carmelo Gugliotta.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.0

Dec 7, 2024

0.1.0

Nov 27, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

netmax-1.0.0.tar.gz (46.9 kB view details)

Uploaded Dec 7, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

netmax-1.0.0-py3-none-any.whl (66.7 kB view details)

Uploaded Dec 7, 2024 Python 3

File details

Details for the file netmax-1.0.0.tar.gz.

File metadata

Download URL: netmax-1.0.0.tar.gz
Upload date: Dec 7, 2024
Size: 46.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for netmax-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`9f13a6681e59e1a739026d9002424c5aba1245e7538a0113076289d55be3ec05`
MD5	`5dbab31f418adece4271bbbcc19c9de7`
BLAKE2b-256	`60684bb7781d53686b1043b99f1c5e52c946b612557e8d5c385e6ac0afc42c5f`

See more details on using hashes here.

File details

Details for the file netmax-1.0.0-py3-none-any.whl.

File metadata

Download URL: netmax-1.0.0-py3-none-any.whl
Upload date: Dec 7, 2024
Size: 66.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for netmax-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e95f54e85d7ca4740eb8d7dc748e2acf9c33bbf7cb698392b3e09008a1c2abb9`
MD5	`a236161328884f35f65976e7da8f8905`
BLAKE2b-256	`cd1430f264fce45c39ed27dddeec0df249d8522f94b524c5e552e2ad96c42b06`

See more details on using hashes here.

netmax 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

NetMax - Influence Maximization in Social Networks

Installation

Requirements

Overview

Algorithms

Simulation-based

Proxy-based

Sketch-based

Diffusion models

Influence probabilities

Endorsement policies

Useful papers

Example

Credits

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes