23
Scalable Influence Maximization for Prevalent Viral Marketing in Large- Scale Social Network Wei Chen*, Chi Wang $ , and Yajun Wang* *Microsoft Research Asia, $ UIUC

Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

  • Upload
    others

  • View
    6

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Scalable Influence Maximization for Prevalent Viral Marketing in Large-

Scale Social Network

Wei Chen*, Chi Wang$, and Yajun Wang**Microsoft Research Asia, $UIUC

Page 2: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Problem Definition-I

• Influence maximization, defined by Kempe, Kleinberg, and Tardos (2003), is the problem of finding a small set of seed nodes in a social network that maximizes the spread of influence under certain influence cascade models.

• Proven to be NP-hard

Presenter
Presentation Notes
A small company develops a cool online application and wants to market it through an online social network. It has a limited budget such that it can only select a small number of initial users in the network to use it (by giving them gifts or payments). The company wishes that these initial users would love the application and start influencing their friends on the social network to use it, and their friends would influence their friends’ friends and so on, and thus through the word-of-mouth effect a large population in the social network would adopt the application. The problem is whom to select as the initial users so that they eventually influence the largest number of people in the network.
Page 3: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Problem Definition-II

• Given– Graph G(v,e)– Threshold k– Information Cascade model

• Find– k nodes in G (v, e) such that under the IC model,

the expected number of nodes activated by the k seeds is the largest.

Page 4: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Algorithms

• Classic: Independent IC model and greedy algorithm

• Proposed: MIA model and greedy algorithm• Proposed: MIA model and more efficient

greedy algorithm• Proposed: Prefix excluding MIA model and

greedy algorithm (PMIA)

Page 5: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Independent IC model and Greedy algorithm

• Independent cascade (IC) model– Each edge (u; v) in the graph is associated with a

propagation probability pp(u; v)– pp(u, v) = p(v is activated by u at step t+1| u is

activated at step t)

Page 6: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Independent IC model and Greedy algorithm

• The algorithm iteratively selects new seed u that maximizes the incremental change of f into the seed set S until k seeds are selected.

Page 7: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Proposed: MIA model and greedy algorithm

Page 8: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Proposed: MIA model and greedy algorithm

Presenter
Presentation Notes
Intuitively, MIIA(v; µ) and MIOA(v; µ) give the local influence regions of v, and different values of theta controls the size of these local influence regions.
Page 9: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Proposed: MIA model and greedy algorithm

• Let the activation probability of any node u in MIIA(v; \theta), denoted as ap(u; S; MIIA(v; \theta)), be the probability that u is activated when the seed set is S and influence is propagated in MIIA(v; \theta).

Page 10: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Proposed: MIA model and greedy algorithm

Page 11: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Proposed: MIA model and more efficient greedy algorithm

• A batch update scheme– Take advantage of influence linearity to add

multiple nodes to set S in each iteration

Presenter
Presentation Notes
The only important step in the greedy algorithm is to select the next seed that gives the largest incremental influence spread.
Page 12: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Proposed: Prefix excluding MIA model and greedy algorithm (PMIA)

Page 13: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Proposed: Prefix excluding MIA model and greedy algorithm (PMIA)

Page 14: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

General summary

• First compute maximum influence paths (MIP) between every pair of nodes in the network via a Dijkstrashortest-path algorithm, and ignore MIPs with probability smaller than an influence threshold \theta, effectively restricting influence to a local region.

• Then union the MIPs starting or ending at each node into the tree structures, which represent the local influence regions of each node.

• Only consider influence propagated through these local tree, and we refer to this model as the maximum influence arborescence (MIA) model

Page 15: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Experiments

Page 16: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Evaluation

Presenter
Presentation Notes
let v be a neighbor of vertex u. If u has been selected as a seed, then when considering selecting v as a new seed based on its degree, we should not count the edge vu towards its degree. Thus we should discount v’s degree by one due to the presence of u in the seed set, and we do the same discount on v’s degree for every neighbor of v that is already in the seed set.
Page 17: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Evaluation

Page 18: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Evaluation

Page 19: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Evaluation

Page 20: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Evaluation

Page 21: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Evaluation

Page 22: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Evaluation

Page 23: Scalable Influence Maximization for Prevalent Viral Marketing in …swl.slis.indiana.edu/clubFiles/apr28_he.pdf · Prevalent Viral Marketing in Large-Scale Social Network Wei Chen*,

Contribution

• Heuristic-based– The heuristic is the main contribution– Construct local tree structure for calculating

influence spread• Scalable to millions of nodes and edges• Much faster than existing greedy algorithms• A simple tunable parameter for users to

control the balance between the running time and the influence spread of the algorithm