Word-of-Mouth cascades Generator

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

WoMG: Word of Mouth Generator

WoMG is a Python library for Word-of-Mouth Cascades Generation.

We propose a model for the synthetic generation of information cascades in social media. In our model the information “memes” propagating in the social network are characterized by a probability distribution in a topic space, accompanied by a textual description, i.e., a bag of keywords coherent with the topic distribution. Similarly, every person is described by a vector of interests defined over the same topic space. Information cascades are governed by the topic of the meme, its level of virality, the interests of each person, community pressure, and social influence.

This repository provides a reference implementation of WoMG as described in:

Generating realistic interest-driven information cascades.
Federico Cinus, Francesco Bonchi, André Panisson, Corrado Monti.

WoMG generates synthetic datasets of documents cascades on network. It starts with any (un)directed, (un)weighted graph and a collection of documents and it outputs the propagation DAGs of the docs through the network.

Installation

Install using pip:

$ pip install womg-core

You can also download or clone the GitHub repository:

$ git clone https://github.com/FedericoCinus/WoMG.git

Quickstart

The WoMG package provides a Python module and a command-line method. To run WoMG-core on a demo mode, execute the following command from Terminal:

$ womgc

It loads 50 documents and their topic distributions located in /womgdata and it spreads them over the default network (Les Miserables http://konect.uni-koblenz.de/networks/moreno_lesmis).

Options

You can check out the other options available to use with WoMG using:

$ womg --help

Input

[Network] The supported input format is an edgelist (txt extension):

	node1_id_int node2_id_int <weight_float, optional>

You can specify the edgelist path using the graph argument:

$ womg --graph /this/is/an/example/path/Graph_Folder/edgelist.txt

If no path is given the default network is Les Miserables network.

Output (default)

[Propagations] The output format is:
```
 time; item; node
```
[Items descriptions] :
```
 item; [topic-dim vector]
```

[Topic descriptions] :

 (topic_index, linear combination of words)

You can specify the output folder path:

$ womg --output /this/is/an/example/path/Output_Folder

WoMG extended (TBD)

WoMG is an open source reasearch project. More details of the software are reported below:

Input

[Network] The supported input format is an edgelist (txt extension):
```
 node1_id_int node2_id_int <weight_float, optional>
```

The graph is assumed to be undirected and unweighted by default. These options can be changed by setting the appropriate flags. You can specify the edgelist path using the graph argument):

python womg --graph /this/is/an/example/path/Graph_Folder/edgelist.txt

If no path is given the default network is Les Miserables network.

[Documents] The supported input format for documents collection (corpus) is txt. You have to specify the folder path containing them using the docs_folder argument:

$ womg --docs_folder /this/is/an/example/path/Corpus_Folder

If no documents folder path is given, WoMG will be set to generative mode.

Output

There are outputs for each class (or model)

[Diffusion] file could be in two formats:

list (default):

time doc activating_node

dict :

{ time: { doc: [activating nodes] } }

[Network] files: [info] dict:

{'type': 'Graph', 'numb_nodes': '77', 'numb_edges': '254', 'aver_degree': '6.5974', 'directed': 'False'}

[graph] dict:

  {(u, v): [1.3, 0.2, 0.8, ... , 0.91], ...}

Key: link-tuple. Value: weight vector

[interests and influence vectors] dict:

{(node, 'int'): [interest vector], (node, 'inlf'): [influence vector]}

[Topic] files:

[topic distributions] dict:

  {doc: [topic distribution]}

[viralities] dict:

  {doc: virality}

One can modify the outputs formats extension with the format argument:

python womg --format pickle python womg --format txt

and specify the output folder path:

python womg --output /this/is/an/example/path/Output_Folder

Options

topics number of topics to be considered in the topic distributions of documents and nodes interests; it has to be less than number of dimensions of the nodes' space provided by node2vec

Graph

homophily H degree of homophily. Node2vec is used as baseline for generating interests vectors of the nodes starting from the given graph. Parameters p and q can achieve different decoded degree of homophily and structural equivalence (see paper). The best mix of them can be achieved only by a deep analysis of the network and a grid searh on the parameters. In order to pursuit generality in the input graph we use three degree of mixing: structural equivalence predominant, deepWalk (p=1, q=1), homophily predominant (which are not the best for representing the graph!). 1-H is the degree of social influence between nodes; which is the percentage of the avg interests vecs norms to be assigned to the influence vectors.

Documents

docs number of documents TO BE GENERATED by lda, giving this parameter lda will be directly set to generative mode
virality virality of the doc; if virality is high, exponent of the power law is high and threshold for activation is low.

Diffusion

steps steps of the diffusion simulation
actives percentage of active nodes with respect to the total number of nodes in the intial configuration (before diffusion) for each doc.

Node2Vec

dimensions Number of dimensions for node2vec. Default 128
walk-length length of walk per source. Default 80
num-walks number of walks per source. Default 10
window-size context size for optimization. Default 10
iter number of epochs in SGD
workers number of parallel workers. Default 8
p manually set BFS parameter; else: it is set by H
q manually set DFS parameter; else: it is set by H

Input and Output

graph Input path of the graph edgelist
weighted boolean specifying (un)weighted. Default unweighted
unweighted
directed graph is (un)directed. Default undirected
undirected
docs-folder Input path of the documents folder
output Outputs path
format Outputs format
seed Seed (int) for random distribution extraction

Citing

@inproceedings{,
author = {},
 title = {},
 booktitle = {Proceedings},
 year = {2019}
}

Miscellaneous

Please feel free ..

Note: This is only a reference implementation analysis and more details are provided by the thesis.

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

This version

1.0.4

Mar 10, 2021

1.0.3

Mar 5, 2021

1.0.2

Mar 3, 2021

1.0.1

May 7, 2020

1.0.0

May 7, 2020

0.9.60

May 7, 2020

0.9.59

May 7, 2020

0.9.58

May 6, 2020

0.9.55

Apr 13, 2020

0.9.54

Apr 12, 2020

0.9.53

Apr 12, 2020

0.9.52

Jan 12, 2020

0.9.51

Jan 12, 2020

0.9.50

Jan 12, 2020

0.9.49

Jan 11, 2020

0.9.48

Jan 11, 2020

0.9.47

Jan 11, 2020

0.9.46

Jan 10, 2020

0.9.45

Jan 10, 2020

0.9.44

Jan 9, 2020

0.9.43

Jan 9, 2020

0.9.42

Jan 9, 2020

0.9.41

Jan 9, 2020

0.9.40

Jan 9, 2020

0.9.39

Jan 9, 2020

0.9.38

Jan 9, 2020

0.9.37

Jan 9, 2020

0.9.36

Jan 9, 2020

0.9.35

Jan 9, 2020

0.9.34

Jan 9, 2020

0.9.33

Jan 9, 2020

0.9.32

Jan 9, 2020

0.9.31

Jan 9, 2020

0.9.30

Jan 9, 2020

0.9.29

Jan 9, 2020

0.9.28

Jan 8, 2020

0.9.27

Jan 8, 2020

0.9.26

Jan 8, 2020

0.9.25

Jan 8, 2020

0.9.24

Jan 8, 2020

0.9.23

Jan 8, 2020

0.9.22

Jan 8, 2020

0.9.21

Jan 8, 2020

0.9.20

Jan 4, 2020

0.9.19

Dec 20, 2019

0.9.18

Dec 18, 2019

0.9.17

Dec 18, 2019

0.9.16

Nov 28, 2019

0.9.15

Nov 28, 2019

0.9.14

Nov 28, 2019

0.9.13

Nov 27, 2019

0.9.12

Nov 27, 2019

0.9.11

Nov 27, 2019

0.9.10

Nov 25, 2019

0.9.9

Nov 25, 2019

0.9.8

Nov 25, 2019

0.9.7

Nov 25, 2019

0.9.6

Nov 25, 2019

0.9.5

Nov 25, 2019

0.9.4

Nov 22, 2019

0.9.3

Nov 22, 2019

0.9.2

Nov 22, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

womg-core-1.0.4.tar.gz (18.2 MB view details)

Uploaded Mar 10, 2021 Source

File details

Details for the file womg-core-1.0.4.tar.gz.

File metadata

Download URL: womg-core-1.0.4.tar.gz
Upload date: Mar 10, 2021
Size: 18.2 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.6.0 requests-toolbelt/0.9.1 tqdm/4.38.0 CPython/3.7.4

File hashes

Hashes for womg-core-1.0.4.tar.gz
Algorithm	Hash digest
SHA256	`52c09896d0ba2a876a8bf9f10822797557b0691d25d936f3b50ce2db11e68823`
MD5	`dd99167468b9fa86fa2e890350a77299`
BLAKE2b-256	`00552eea1003b7b5b274bcf9781a8c54dcf920109d1077d529ad7117d60f0397`

See more details on using hashes here.

womg-core 1.0.4

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

WoMG: Word of Mouth Generator

Installation

Quickstart

Options

Input

Output (default)

WoMG extended (TBD)

Input

Output

Options

Graph

Documents

Diffusion

Node2Vec

Input and Output

Citing

Miscellaneous

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes