5
M375T/M396C: Topics in Complex Networks Spring 2013 Lecture 01 — 01/15/13 Lecturer: Ravi Srinivasan Scribe: N/A 1.1 Introduction Welcome to M375T/M396C. In this course we will consider mathematical concepts that underlie the study of complex networks and, in particular, social networks. A list of topics to be covered, the class syllabus, and any course material (lecture notes, problem sets, links to articles, etc.) is available on the course website: http://www.ma.utexas.edu/users/rav/ComplexNetworks/ 1.2 Social networks and small-world property In order to study mathematical models that have some basis in reality, we need to have a sense of the properties that real-world networks share. One particularly important one is the small-world property. Most of us have had the experience that one of our friends knows—or is friends with someone who knows—a person we believed to be socially distant, like a celebrity or a famous politician. This property has garnered fame under the notion of “six degrees of separation,” which hypothesizes that we are each connected to any other person on the planet by only six links, on average. The small-world effect is necessary for many important features of a social network to take hold: fast and widespread propagation of information, tipping point phenomena where small changes have large effects, and network robustness and resiliency. 1.2.1 Milgram’s experiment The first evidence of a small-world property was discovered in a series of groundbreaking experiments by the sociologist Stanley Milgram in the 1960’s. At the time there was no notion of what a social graph looked like—remember, Facebook didn’t exist yet! To infer the underlying network structure, Milgram picked a “target” in Boston and sent folders with a description of the target to an arbitrary sample of people across the country. These participants were asked to add their name to the folder and send it to a friend or acquaintance who would be most likely to know the target or someone who does (i.e., greedy routing ). 01-1

M375T/M396C: Topics in Complex Networks Spring 2013 ... · M375T/M396C Lecture 01 | 01/15/13 Spring 2013 Networks: Lecture 1 Introduction Visual ExamplesÑ1 Figure:The network structure

  • Upload
    others

  • View
    0

  • Download
    0

Embed Size (px)

Citation preview

Page 1: M375T/M396C: Topics in Complex Networks Spring 2013 ... · M375T/M396C Lecture 01 | 01/15/13 Spring 2013 Networks: Lecture 1 Introduction Visual ExamplesÑ1 Figure:The network structure

M375T/M396C: Topics in Complex Networks Spring 2013

Lecture 01 — 01/15/13Lecturer: Ravi Srinivasan Scribe: N/A

1.1 Introduction

Welcome to M375T/M396C. In this course we will consider mathematical concepts thatunderlie the study of complex networks and, in particular, social networks. A list of topicsto be covered, the class syllabus, and any course material (lecture notes, problem sets, linksto articles, etc.) is available on the course website:

http://www.ma.utexas.edu/users/rav/ComplexNetworks/

1.2 Social networks and small-world property

In order to study mathematical models that have some basis in reality, we need to have asense of the properties that real-world networks share. One particularly important one isthe small-world property.

Most of us have had the experience that one of our friends knows—or is friends withsomeone who knows—a person we believed to be socially distant, like a celebrity or a famouspolitician. This property has garnered fame under the notion of “six degrees of separation,”which hypothesizes that we are each connected to any other person on the planet by onlysix links, on average. The small-world effect is necessary for many important features of asocial network to take hold: fast and widespread propagation of information, tipping pointphenomena where small changes have large effects, and network robustness and resiliency.

1.2.1 Milgram’s experiment

The first evidence of a small-world property was discovered in a series of groundbreakingexperiments by the sociologist Stanley Milgram in the 1960’s. At the time there was nonotion of what a social graph looked like—remember, Facebook didn’t exist yet! To inferthe underlying network structure, Milgram picked a “target” in Boston and sent folderswith a description of the target to an arbitrary sample of people across the country. Theseparticipants were asked to add their name to the folder and send it to a friend or acquaintancewho would be most likely to know the target or someone who does (i.e., greedy routing).

01-1

Page 2: M375T/M396C: Topics in Complex Networks Spring 2013 ... · M375T/M396C Lecture 01 | 01/15/13 Spring 2013 Networks: Lecture 1 Introduction Visual ExamplesÑ1 Figure:The network structure

M375T/M396C Lecture 01 — 01/15/13 Spring 2013

 !"#$%&#'()*+$,)-#.$(+$/01$2))$)%2(,(+2(34$

o 5#0+)6$789$(2$0++*,#+$03:*0(62063#$+#2$0&#$-(+;)(62$  /"#&#0+$2"#1$0&#$&#.02#-$0+$%0&2$)<$0$!"#$%&'()%*+'  #=%06+()6$2"&)*>"$2"#$>&0%"$,01$?#$.(,(2#-@'

A$

B,0..$/)&.-9$!"#$+C#%2(3+$

20.2. STRUCTURE AND RANDOMNESS 613

you

your friends

friends of your friends

(a) Pure exponential growth produces a small world

you

your friends

friends of your friends

(b) Triadic closure reduces the growth rate

Figure 20.1: Social networks expand to reach many people in only a few steps.

people brings us to more than 100 · 100 · 100 = 1, 000, 000 people who in principle could be

three steps away. In other words, the numbers are growing by powers of 100 with each step,

bringing us to 100 million after four steps, and 10 billion after five steps.

There’s nothing mathematically wrong with this reasoning, but it’s not clear how much

it tells us about real social networks. The difficulty already manifests itself with the second

step, where we conclude that there may be more than 10, 000 people within two steps of you.

As we’ve seen, social networks abound in triangles — sets of three people who mutually

know each other — and in particular, many of your 100 friends will know each other. As a

result, when we think about the nodes you can reach by following edges from your friends,

many of these edges go from one friend to another, not to the rest of world, as illustrated

schematically in Figure 20.1(b). The number 10, 000 came from assuming that each of your

100 friends was linked to 100 new people; and without this, the number of friends you could

reach in two steps could be much smaller.

So the effect of triadic closure in social networks works to limit the number of people

you can reach by following short paths, as shown by the contrast between Figures 20.1(a)

20.2. STRUCTURE AND RANDOMNESS 613

you

your friends

friends of your friends

(a) Pure exponential growth produces a small world

you

your friends

friends of your friends

(b) Triadic closure reduces the growth rate

Figure 20.1: Social networks expand to reach many people in only a few steps.

people brings us to more than 100 · 100 · 100 = 1, 000, 000 people who in principle could be

three steps away. In other words, the numbers are growing by powers of 100 with each step,

bringing us to 100 million after four steps, and 10 billion after five steps.

There’s nothing mathematically wrong with this reasoning, but it’s not clear how much

it tells us about real social networks. The difficulty already manifests itself with the second

step, where we conclude that there may be more than 10, 000 people within two steps of you.

As we’ve seen, social networks abound in triangles — sets of three people who mutually

know each other — and in particular, many of your 100 friends will know each other. As a

result, when we think about the nodes you can reach by following edges from your friends,

many of these edges go from one friend to another, not to the rest of world, as illustrated

schematically in Figure 20.1(b). The number 10, 000 came from assuming that each of your

100 friends was linked to 100 new people; and without this, the number of friends you could

reach in two steps could be much smaller.

So the effect of triadic closure in social networks works to limit the number of people

you can reach by following short paths, as shown by the contrast between Figures 20.1(a)

Figure 1.1. Which one looks more realistic?

Two experiments showed that of the ∼ 25% of folders that were actually received by thetarget, their paths had an average length of 6.2 links—hence, “six degrees of separation.”It was noted that these paths were not random, with local links correlated to gender andfriends/family and global links typically related to hometown or work. Furthermore, severalintermediaries received multiple folders, ensuring that they somehow play a central role inthis routing network.

Milgram’s observations have been confirmed in modern settings, most notably in anonline experiment by Dodds and Watts using e-mail forwarding. The small-world propertyis also evident in networks where the social graph is known, like when computing one’s Erdosnumber (distance to Paul Erdos in a co-authorship network) or Bacon number (distance toKevin Bacon in an acting collaboration network). In an analysis of the largest social graphto date, a 2011 study of Facebook’s 721 million users found an average number of 4.74 linksbetween users. These experiments provide good evidence that it is, in fact, “a small world,after all.”

1.2.2 A first guess at a mathematical model

To build a nave model of a social graph, consider the following reasoning. Suppose you have500 direct acquaintances, each of which has 500 direct acquaintances, and so forth. If eachperson’s set of acquaintances is disjoint, then the resulting graph is a tree. The result isthat 500 people (0.00017% of US population) are one link away from you, 250K (0.083% ofUS population) are two links away, and 125 million (42% of US population) are three linksaway. Notice that we have a small-world effect in this model.

More generally, suppose each person has λ acquaintances. The total number of people dlinks away equals n = λd. Then d = lnn/ lnλ, so that the diameter of this simple networkscales logarithmically with the number of people. This is a tell-tale sign of a small-worldeffect which we will observe in a variety of settings throughout the course. Is this modelsufficient to understand a real-world network?

The answer, of course, is no. The unrealistic assumption of disjoint sets of direct contactsrules out the phenomenon of clustering. In any social context, it is reasonable to expect thatthat your friends are likely to be friends with each other. This leads to cross-connections

01-2

Page 3: M375T/M396C: Topics in Complex Networks Spring 2013 ... · M375T/M396C Lecture 01 | 01/15/13 Spring 2013 Networks: Lecture 1 Introduction Visual ExamplesÑ1 Figure:The network structure

M375T/M396C Lecture 01 — 01/15/13 Spring 2013

Networks: Lecture 1 Introduction

Visual Examples—1

Figure: The network structure of political blogs prior to the 2004 U.S. Presidentialelection reveals two natural and well-separated clusters (Adamic and Glance, 2005)

4

Figure 1.2. A typical real-world network. Note the existence of two dense clusters that are well-separated,and that some nodes may have many more connections than others.

across levels of the simple tree model, i.e., triads (see Figure 1.1). A quick calculation demon-strates that if each acquaintance only has a fraction m ∈ (1/λ, 1] of “new” acquaintancesthen the maximum distance from you is d = (lnm+lnn)/(lnm+lnλ) ≈ lnn/(lnm+lnλ) forlarge n. Thus, it is possible to drop the disjointness assumption and still have the diameterstill grow logarithmically, albeit at a faster rate.

Another reason that the simple model is incorrect is that social acquaintances are biased,and often lead to people who are significantly more well-connected than others. The appear-ance of power-law distributions in the number of contacts per person, etc., is something wewill look at in greater detail later.

To summarize, there are several criterion that we would like our models to satisfy inorder to be in accordance with reality:

• Small-world effect : small diameter of social graph

• Clustering : prevalence of triads

• Algorithmic small-world : efficiency of greedy routing

• Power-laws : some people are significantly better-connected than others

1.3 Landau symbols

We will make frequent use of Landau notation (i.e., big-O notation) throughout this course.Landau symbols are used to describe the asymptotic behavior of a function relative to anotherfunction, as we define precisely below.

Let f : N→ R and g : N→ R. We say that

f = O(g) ⇐⇒ lim supn→∞

∣∣∣∣f(n)

g(n)

∣∣∣∣ < +∞.

01-3

Page 4: M375T/M396C: Topics in Complex Networks Spring 2013 ... · M375T/M396C Lecture 01 | 01/15/13 Spring 2013 Networks: Lecture 1 Introduction Visual ExamplesÑ1 Figure:The network structure

M375T/M396C Lecture 01 — 01/15/13 Spring 2013

Equivalently, f = O(g) if and only if there exist constants C and N such that |f(n)| ≤C|g(x)| for all n > N . This implies that f grows no faster (or decays no slower) than g asn→∞.

Next, we say that

f = o(g) ⇐⇒ limn→∞

∣∣∣∣f(n)

g(n)

∣∣∣∣ = 0.

In this case, f grows strictly slower (or decays strictly faster) than g as n→∞.We can use big-O and little-O notation to define all the other Landau symbols. In

particular,

f = Ω(g) ⇐⇒ g = O(f)

f = ω(g) ⇐⇒ g = o(f)

f = Θ(g) ⇐⇒ f = O(g) and g = O(f).

A handy table of analogies to inequalities is given below:

Notation Analogy

f = O(g) f ≤ gf = o(g) f < gf = Ω(g) f ≥ gf = ω(g) f > gf = Θ(g) f = g

References

1. Acemoglu, D., & Ozdaglar A. (2009). 6.207/14.15, Networks, Fall 2009 [Lecture notes].Retrieved from:http://economics.mit.edu/faculty/acemoglu/courses/

2. Backstrom, L., Boldi, P., Rosa, M., Ugander, J., & Vigna, S. (2011). Four degrees ofseparation. arXiv preprint arXiv:1111.4570.

3. Chaintreau, A. (2012). COMS 4995-1, Introduction to Social Networks, Fall 2012[Lecture notes]. Retrieved from:http://www.cs.columbia.edu/~augustin/

4. Dodds, P. S., Muhamad, R., & Watts, D. J. (2003). An experimental study of searchin global social networks. Science, 301(5634), 827-829.

5. Easley, D., & Kleinberg, J. (2010). Networks, crowds, and markets. Cambridge Uni-versity Press.

01-4

Page 5: M375T/M396C: Topics in Complex Networks Spring 2013 ... · M375T/M396C Lecture 01 | 01/15/13 Spring 2013 Networks: Lecture 1 Introduction Visual ExamplesÑ1 Figure:The network structure

M375T/M396C Lecture 01 — 01/15/13 Spring 2013

6. Jackson, M. O. (2010). Social and economic networks. Princeton University Press.

7. Milgram, S. (1967). The small world problem. Psychology today, 2(1), 60-67.

8. Travers, J., & Milgram, S. (1969). An experimental study of the small world problem.Sociometry, 425-443.

9. Ugander, J., Karrer, B., Backstrom, L., & Marlow, C. (2011). The anatomy of theFacebook social graph. arXiv preprint arXiv:1111.4503.

Last edited: March 21, 2013

01-5