7
Research Article A Comparison of Online Social Networks and Real-Life Social Networks: A Study of Sina Microblogging Dayong Zhang and Guang Guo Department of New Media and Arts, Harbin Institute of Technology, Harbin 150001, China Correspondence should be addressed to Dayong Zhang; [email protected] Received 10 November 2013; Revised 17 February 2014; Accepted 21 March 2014; Published 22 April 2014 Academic Editor: Guoqiang Hu Copyright © 2014 D. Zhang and G. Guo. is is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Online social networks appear to enrich our social life, which raises the question whether they remove cognitive constraints on human communication and improve human social capabilities. In this paper, we analyze the users’ following and followed relationships based on the data of Sina Microblogging and reveal several structural properties of Sina Microblogging. Compared with real-life social networks, our results confirm some similar features. However, Sina Microblogging also shows its own specialties, such as hierarchical structure and degree disassortativity, which all mark a deviation from real-life social networks. e low cost of the online network forms a broader perspective, and the one-way link relationships make it easy to spread information, but the online social network does not make too much difference in the creation of strong interpersonal relationships. Finally, we describe the mechanisms for the formation of these characteristics and discuss the implications of these structural properties for the real-life social networks. 1. Introduction In the past decade, the online social network has made new opportunities for communication and has revolutionized many aspects of our lives. As a new kind of online social networks, the Microblogging, such as Twitter and Sina Microblogging, has been developing rapidly in recent years. e online social network provides people with a public network platform, meets needs of computer-mediated com- munications, and rebuilds social connections. e structure and evolution of online social network have attracted the attention of researchers in different disciplines, including sociology, physics, and computer information technology [1, 2]. But large-scale and complex structure of online social network brings many difficulties in complete topolog- ical analysis. However, the emergence of complex network research [35] provides us with an effective method to study the online social network topology and information dissemination characteristics [611]. As for the mapping and expansion of the actual social net- works on the Internet, online social networks have broken the spatiotemporal limitations and decreased communication cost. Will the online social networks change the rules of actual social networks? e following studies reach fundamentally different conclusions: Backstrom and Boldi [12] studied the largest online social network that has ever been created (721 million active Facebook users and their 69 billion friendship links). e analyses of data proved that the average distance of Facebook has shortened from 5.28 in 2008 to 4.74 in 2011. When the search scale was narrowed to a country, it showed that most of the people were in fact only four apart. e result presented the “shrinking diameter” phenomena [13]. It suggested that the online social network made the relationship among people come closer. Six degrees of sep- aration is inapplicable to the online social network. However, the statistics of Facebook’s official website showed that the average number of Facebook active users’ friends was 130 (http://www.facebook.com/press/info.php?statistics), which agrees well with Dunbar’s number. Similarly, Gonc ¸alves et al. [14] analyzed a dataset of Twitter conversations collected across six months involving 1.7 million individuals and found that users kept stable relationships with 100–200 users. e data is in agreement with Dunbar’s result too. us, the Hindawi Publishing Corporation Mathematical Problems in Engineering Volume 2014, Article ID 578713, 6 pages http://dx.doi.org/10.1155/2014/578713

Research Article A Comparison of Online Social Networks ...downloads.hindawi.com/journals/mpe/2014/578713.pdf · the random network (the clustering coe cient of random network is

  • Upload
    others

  • View
    8

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Research Article A Comparison of Online Social Networks ...downloads.hindawi.com/journals/mpe/2014/578713.pdf · the random network (the clustering coe cient of random network is

Research ArticleA Comparison of Online Social Networks and Real-Life SocialNetworks A Study of Sina Microblogging

Dayong Zhang and Guang Guo

Department of New Media and Arts Harbin Institute of Technology Harbin 150001 China

Correspondence should be addressed to Dayong Zhang zhdyhiteducn

Received 10 November 2013 Revised 17 February 2014 Accepted 21 March 2014 Published 22 April 2014

Academic Editor Guoqiang Hu

Copyright copy 2014 D Zhang and G Guo This is an open access article distributed under the Creative Commons AttributionLicense which permits unrestricted use distribution and reproduction in any medium provided the original work is properlycited

Online social networks appear to enrich our social life which raises the question whether they remove cognitive constraintson human communication and improve human social capabilities In this paper we analyze the usersrsquo following and followedrelationships based on the data of Sina Microblogging and reveal several structural properties of Sina Microblogging Comparedwith real-life social networks our results confirm some similar featuresHowever SinaMicroblogging also shows its own specialtiessuch as hierarchical structure and degree disassortativity which all mark a deviation from real-life social networks The low costof the online network forms a broader perspective and the one-way link relationships make it easy to spread information but theonline social network does not make too much difference in the creation of strong interpersonal relationships Finally we describethemechanisms for the formation of these characteristics and discuss the implications of these structural properties for the real-lifesocial networks

1 Introduction

In the past decade the online social network has made newopportunities for communication and has revolutionizedmany aspects of our lives As a new kind of online socialnetworks the Microblogging such as Twitter and SinaMicroblogging has been developing rapidly in recent yearsThe online social network provides people with a publicnetwork platform meets needs of computer-mediated com-munications and rebuilds social connections The structureand evolution of online social network have attracted theattention of researchers in different disciplines includingsociology physics and computer information technology[1 2] But large-scale and complex structure of onlinesocial network brings many difficulties in complete topolog-ical analysis However the emergence of complex networkresearch [3ndash5] provides us with an effective method tostudy the online social network topology and informationdissemination characteristics [6ndash11]

As for themapping and expansion of the actual social net-works on the Internet online social networks have broken thespatiotemporal limitations and decreased communication

costWill the online social networks change the rules of actualsocial networks The following studies reach fundamentallydifferent conclusions Backstrom and Boldi [12] studied thelargest online social network that has ever been created (721million active Facebook users and their 69 billion friendshiplinks) The analyses of data proved that the average distanceof Facebook has shortened from 528 in 2008 to 474 in2011 When the search scale was narrowed to a country itshowed that most of the people were in fact only four apartThe result presented the ldquoshrinking diameterrdquo phenomena[13] It suggested that the online social network made therelationship among people come closer Six degrees of sep-aration is inapplicable to the online social network Howeverthe statistics of Facebookrsquos official website showed that theaverage number of Facebook active usersrsquo friends was 130(httpwwwfacebookcompressinfophpstatistics) whichagrees well with Dunbarrsquos number Similarly Goncalves etal [14] analyzed a dataset of Twitter conversations collectedacross six months involving 17 million individuals and foundthat users kept stable relationships with 100ndash200 users Thedata is in agreement with Dunbarrsquos result too Thus the

Hindawi Publishing CorporationMathematical Problems in EngineeringVolume 2014 Article ID 578713 6 pageshttpdxdoiorg1011552014578713

2 Mathematical Problems in Engineering

Table 1 Summary of degree distribution

119896 Number of nodes Proportion Extremum1 le 119896 lt 10 783 933 119896min = 1

10 le 119896 lt 100 50 600119896 ge 100 6 007 119896max = 327

ldquoeconomy of attentionrdquo is limited in the online world bycognitive and biological constraints as predicted by Dunbarrsquostheory [15]

Has online social network changed our real-life social pat-tern Why do such discrepancies exist between the studiesThe differences are due to the different analytical content andanalysis approachThe low cost of the online network forms abroader perspective and the one-way link relationshipsmakeit easy to spread information but the online social networksdo not make too much difference in the creation of stronginterpersonal relationshipsThese online social relationshipswhich may reflect the real-life social networks create anunprecedented field to understand the characteristics ofhuman networks [16 17]

In this paper we study the topological characteristicsof Sina Microblogging and try to explain its characteristicsformation mechanism by comparing it with the real-lifesocial networks This paper is organized as follows Section 2describes our data collection on Sina Microblogging Thenwe conduct topological analysis of the Sina Microbloggingnetwork and analyze the mechanisms for the formation ofthe structure in Section 3 In Section 4 we study the mixingpatterns on the Sina Microblogging network In Section 5we focus on the analysis of node importance by betweennesscentrality In Section 6 we conclude

2 Sina Microblogging Data Collection

In August 2009 Sina Microblogging began its trial serviceand became the most popular Microblogging service inChina with more than 500 million users as of 2013 Sina blogand Sina news provide good capital bases and natural advan-tages for the success of Sina Microblogging Due to theseadvantages SinaMicroblogging developed the characteristicsof We Media that spread news faster than any other mediaIn order to study the structure of Sina Microblogging wedevelop a spider with Python for Sina Microblogging Thesnowball [18] crawl algorithm has been applied in the spiderWe collected profiles of 3441 users on Sina Microblogginguntil November 11 2011 Isolated points are excluded sothat filter leaves only 839 nodes In order to protect privacywe only keep the usersrsquo ID number in the original datainformation about the users Based on the ldquofollowingrdquo andldquofollowedrdquo we construct an undirected network which has2112 links and analyze its basic characteristics

3 Analysis of Network Structure

We begin our analysis of Sina Microblogging from thefollowing aspects In Section 31 first we calculate the degree

100

10minus1

10minus2

10minus3

103

102

101

100

Rate

Deg

Figure 1 Degree distribution for nodes in Sina Microblogging

distribution Then we evaluate the average path lengthand clustering coefficient and study the reason why SinaMicroblogging possesses obviously small-world property inSection 32

31 Power-Law Node Degrees In Table 1 we summarize thedegree distribution of Sina Microblogging We find that onlya few nodes of the network have a high degree The degreedistribution is very unevenly distributed

We begin the analysis of Sina Microblogging topology bylooking at its degree distributions Networks of a power-lawdegree distribution (119896) prop 119896minus120574 where 119896 is the node degreeand 120574 le 3 attest to the existence of a relatively small numberof nodes with a very large number of links Many socialnetworks satisfy power-law degree distribution [19 20 23]and a few networks obey stretched exponential distribution[24] which is defined as 119901(119896) = 119887119890minus(1198961198960)

119888

The degree distribution of Sina Microblogging is shown

as in Figure 1 The 119910-axis represents the frequency of degreeIt satisfies a power-law distribution with an exponent of 175the goodness of fit is 0712

The networks which meet a power-law distribution havescale-free property and such networks are called scale-free networks Therefore Sina Microblogging is a scale-freenetwork The real-life social networks are usually scale-freenetworks The scale-free network is caused by the growingand preferential mechanisms that the new nodes trend toconnect with hub nodes and this phenomenon is called theMatthew effect

32 The Small-World Property The study of ldquosmall-worldrdquonetworks has become a key to understanding the societalstructure ever since Stanley Milgramrsquos famous ldquosix degreesof separationrdquo experiment In his work he reports thatany two people could be connected on average within sixhops from each other Watts and Strogatz [3] have revealedtwo important characteristics of ldquosmall-worldrdquo networks

Mathematical Problems in Engineering 3

Table 2 Small-World property compared with different socialnetworks

Social network Average pathlength

Clusteringcoefficient

Sina Microblogging 2794 0374Random network 4162 0006Cyworld [19] 32 016Renren [20] 348 02Mixi [21] 553 01215Co-author network [22] 46 0066

(1) the ldquosmall-worldrdquo networks have small characteristic pathlengths like random graphs (2) the ldquosmall-worldrdquo networkscan be highly clustered like regular lattices The networkmeasurement studies have shown that the real networks aremostly small-world especially social networks [13 21]

The concept of average path length for undirected net-works is well known the measures are related by

119871 =1

(12)119873 (119873 + 1)sum

119894ge119895

119889119894119895

(1)

where 119894 119895 are two nodes 119873 is the number of nodes in thenetwork and 119889

119894119895

denote the shortest distance between 119894 and119895

The average path length is closely connected to thecharacteristics of the network such as connectivity reacha-bility and transferring latency A short average path lengthfacilitates the quick transfer of information and reduces costs

Another property of ldquosmall-worldrdquo networks is the clus-tering coefficient It is themeasure of the extent to which onersquosfriends are also friends of each other The local clusteringcoefficient for a node is then given by the proportion oflinks between the nodes within its neighborhood divided bythe number of links that could possibly exist between themThe local clustering coefficient for undirected graphs can bedefined as

119862119894

=2119864119894

119896119894

(119896119894

minus 1) (2)

where 119896119894

is the degree of node 119894 and 119864119894

is the number of edgesbetween neighbors of node 119894

The clustering coefficient for the whole network is givenbyWatts and Strogatz [3] as the average of the local clusteringcoefficients of all the nodes the measures are related by

119862 =1

119873

119873

sum

119894

119862119894

(3)

The networks with the largest possible average clusteringcoefficient are found to have a modular structure

The results concerning average path length and clusteringcoefficient are displayed in Table 2 Compared with thesame scale random network the average path length ofSina Microblogging is shorter than random network (theaverage path length of random network is defined as (119871 prop

ln119873 ln 119896) and the clustering coefficient ismuch greater thanthe random network (the clustering coefficient of randomnetwork is defined as = ⟨119896⟩119873) Thus we confirm that SinaMicroblogging has the small-world property

Renren Cyworld and Mixi are the largest and oldestonline social networking services in China South Koreaand Japan respectively Coauthor network is a kind of real-life social network By contrast the average path lengthof Sina Microblogging has a shorter average path lengthand a greater clustering coefficient It is revealed that SinaMicrobloggingrsquos small-world property is the most obviousThe striking small-world phenomenon indicates that thereare rich local connections in Sina Microblogging networkthe nodes are linked closely and information disseminationis more efficient

We analyze the mechanisms for the formation of small-world property from the following two aspects First is theuserrsquos behavior aspect In 2011 for many users the mainreasons they use social network out there are the latestdevelopments of friends (734) keeping in touch withold friends (731) and documentation of life and feeling(675) and the main purposes of Microblogging users aregetting information (581) following celebrities (576)discussing the hot topics and personal experience (523)according to the data of ldquothe user behavior research of SNSand microblogging in Chinardquo The Microblogging users tendto use the Microblogging to record personal feelings sharethe news and find groups with similar interests and thetraditional SNS users show themselves contact with friendsin real life and expand social circle on SNS Consequently theformer are oriented to information exchange but the latterstress interpersonal communication

Second is the aspect of usersrsquo link mode The maindifference between the Renren Cyworld and Mixi networksand Microblogging is the directed nature of Microbloggingrelationship In Renren Cyworld and Mixi a link repre-sents a mutual agreement of a relationship while on SinaMicroblogging a user is not obligated to reciprocate followersby following them Thus a path from a user to another mayfollowdifferent hops or not exist in the reverse direction Boththe internal links (strong ties) and one-way following links(weak ties) are in Microblogging And the Microbloggingrsquosdistinction betweendifferent types of interactions allows us togetmore information personal interactions aremore likely tooccur on internal links (strong ties) and events transmittingnew information rely more on one-way following links (weakties) [25]

In conclusion Microblogging is easier to form the usercentricWeMedia than traditional SNS therefore it possessessmall-world property that can facilitate the flow of informa-tion

In order to gauge the correlation between clusteringcoefficient and the degree we plot the clustering coefficientof nodes (119910) against the degree of nodes (119909) in Figure 2 andbin the clustering coefficient in log scale If the correlationbetween clustering coefficient and the degree accords with119862(119896) = 119896

minus119886 we can consider that the network is of apparenthierarchical structure We observe that the clustering coeffi-cient varies inversely as the degree It satisfies a power-law

4 Mathematical Problems in Engineering

100

10minus1

10minus2

103

102

101

100

Clus

terin

g co

effici

ent

Deg

Figure 2 Correlations between clustering coefficients and degrees

Deg10000 20000 30000 4000

5000

000

000

10000

15000

20000

25000

Exce

ss av

erag

e deg

ree

Figure 3 Correlations between excess average degrees and degrees

distribution with an exponent of 0733 and the goodnessof fit is 0712 Thus the Sina Microblogging has apparenthierarchical structure where vertices divide into groups thatfurther subdivide into groups of groups and so forth overmultiple scales

The structure is caused by Sina Microbloggingrsquos fansmechanism Basing on the user interest and the celebrityeffect Sina Microblogging builds many kinds of fans groupsCelebrities and professionals in a certain area who have moreresources and influence will get more attention and theseusers only keep internal linkswith peers or slightly lower levelusers and so forth until the hierarchical structure is formed

4 The Mixing Patterns

Mixing patterns refer to systematic tendencies of one typeof nodes in a network to connect to another type Thereare three types of mixing patterns assortative networkdisassortative network and neutral network Similar verticestend to connect to each other in assortativity network andnodes of low degree are more likely to connect with nodesof high degree in disassortative network A lot of empiricalstudies [26 27] had revealed that the real-life social networkstrend to assortativity opposite of the online social networksIn real-life social networks the ordinary people want to getalong with the celebrity while the celebrity tends to make theacquaintance of peers therefore the ordinary people have lessopportunity to integrate into the circles of the celebrity Incontrast the ordinary people can easily get connected withthe celebrity the celebrities are also willing to show theirinfluence by the number of fans in the online social networkThus the online social networks trend to be disassortativenetworks

We cannot understand the mixing pattern intuitively sowe introduce the concept of excess average degreeThe excessdegree is the number of edges leaving the vertex other thanthe one we arrived along This number is one less than thedegrees of the vertices themselves The excess average degreeis defined as

⟨119896119899119899

⟩ (119896) =1

119902119896

119896max

sum

119896=119896min

1198961015840

119890119896119896

(4)

The excess average degree is positive in assortative net-work and negative otherwise Figure 3 plots the curve of theexcess average degree (119910) against the degree (119909) as the redline We see significant negative correlation Hence the SinaMicroblogging is a disassortative network

To better understand the meaning of disassortative net-work we illustrate the network connections of the node ID975 in Figure 4 In connected component different colorsrepresent different communities We find that the node ID975 connects with three hubs of community and showsdisassortativity

5 Analysis of Node Importance

Social networks are discrete systems with a large amountof heterogeneity among nodes Measures of centrality directat a quantification of nodesrsquo importance for structure andfunction The most direct measure of centrality is the degreecentrality that is the node with greater degree is the mostimportant one In addition betweenness centrality is also ameasure of nodersquos centrality in a network and can be usedto measure the influence a node has over the spread ofinformation through the network It is equal to the numberof the shortest paths from all vertices to all others that passthrough that node

In Figure 5 we plot the correlation between node bet-weenness centralities and degrees Then we compute thatthe average of node betweenness centralities is 3362 andthe correlation coefficient is 096 indicating that there are

Mathematical Problems in Engineering 5

Figure 4 The network connections of the node ID 975

Betw

eenn

ess c

entr

ality

100

102

104

106

103

102

101

100

Deg

Figure 5 Correlations between node betweenness centralities anddegrees

strong and positive correlations between node betweennesscentralities and degrees However the special case discussedhere is the one in which the node connecting to severalgroups has high betweenness centrality We rank users bybetweenness centrality and find three nodes both in the top5 list and with a degree less than 10 They connect withall communities in the network and act as bridges betweenthe different communities The result is consistent well withthe structural holes theory that was advanced by sociologistRonald Burt in real-life social network study

6 Conclusions

In this paper we have studied the structural properties ofMicroblogging ever created (839 active Sina Microbloggingusers and their 2112 social relations) from several viewpoints

First of all we have found a power-law distribution ashort average length and a high clustering coefficient in

its topology analysis which are all compatible with knowncharacteristics of other online social networks and real-lifesocial networks In order to illuminate the mechanisms forthe formation of small-world property we have studied thedifference between SinaMicroblogging and traditional socialnetworks from the aspect of usersrsquo behavior and the usersrsquolinkmode and found that SinaMicroblogging can easily formthe user centric We Media Therefore it possesses small-world property that can facilitate the flow of informationThen we calculated the correlation between clustering coef-ficients and degrees and showed that Sina Microblogging hasapparent hierarchical structure and we have found that SinaMicroblogging trends to be disassortative network which allmark a deviation from real-life social networksMoreover weanalyzed the betweenness centralities of intermediary nodesand confirmed that the intermediary nodes can control thespread of information Last but not least our work is only thefirst step towards exploring the difference between the onlineand real-life social networks Much work still remains

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was partly supported by the National ScienceFoundation of China under Grant no 70903016 and theNational Science amp Technology Support Program underGrant no 2012BAH81F03

References

[1] R Corten ldquoComposition and structure of a large online socialnetwork in the netherlandsrdquo PLoS ONE vol 7 no 4 Article IDe34760 2012

6 Mathematical Problems in Engineering

[2] R E Wilson and S D Gosling ldquoA review of Facebook researchin the social sciencesrdquo Perspectives on Psychological Science vol7 no 3 pp 203ndash220 2012

[3] D J Watts and S H Strogatz ldquoCollective dynamics of ldquosmall-worldrdquo networksrdquoNature vol 393 no 6684 pp 440ndash442 1998

[4] R Albert H Jeong andA-L Barabasi ldquoDiameter of the world-wide webrdquo Nature vol 401 no 6749 pp 130ndash131 1999

[5] A-L Barabasi and R Albert ldquoEmergence of scaling in randomnetworksrdquo Science vol 286 no 5439 pp 509ndash512 1999

[6] M Barthelemy ldquoSpatial networksrdquo Physics Reports vol 499 no1 pp 1ndash101 2011

[7] D Centola ldquoThe spread of behavior in an online social networkexperimentrdquo Science vol 329 no 5996 pp 1194ndash1197 2010

[8] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[9] H Kwak C Lee H Park and S Moon ldquoWhat is Twitter asocial network or a news mediardquo in Proceedings of the19thInternationalWorldWideWeb Conference (WWW rsquo10) pp 591ndash600 April 2010

[10] Q Yan L R Wu and L Zheng ldquoSocial network basedmicroblog user behavior analysisrdquo Physica A StatisticalMechanics and Its Applications vol 392 no 7 pp 1712ndash17232013

[11] W G Yuan Y Liu J J Cheng and F Xiong ldquoEmpiricalanalysis of microblog centrality and spread influence based onBi-directional connectionrdquo Acta Physica Sinica vol 62 no 3Article ID 038901 2013

[12] L Backstrom and P Boldi ldquoFour degrees of separationrdquo inProceedings of the 3rd Annual ACMWeb Science Conference pp33ndash42 2012

[13] J Leskovec J Kleinberg and C Faloutsos ldquoGraph evolutiondensification and shrinking diametersrdquo ACM Transactions onKnowledge Discovery from Data vol 1 no 1 Article ID 12173012007

[14] B Goncalves N Perra and A Vespignani ldquoModeling usersrsquoactivity on twitter networks validation of Dunbarrsquos numberrdquoPloS ONE vol 6 no 8 2011

[15] R I M Dunbar ldquoSocial cognition on the Internet testingconstraints on social network sizerdquo Philosophical Transactionsof the Royal Society B Biological Sciences vol 367 no 1599 pp2192ndash2201 2012

[16] GMiller ldquoSocial scientists wade into the tweet streamrdquo Sciencevol 333 no 6051 pp 1814ndash1815 2011

[17] R Lex and B Kovacs ldquoA comparison of email networks and off-line social networks a study of a medium-sized bankrdquo SocialNetworks vol 34 no 4 pp 462ndash469 2011

[18] S H Lee P-J Kim and H Jeong ldquoStatistical properties ofsampled networksrdquoPhysical ReviewE Statistical Nonlinear andSoft Matter Physics vol 73 no 1 Article ID 016102 2006

[19] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[20] F Fu X Chen L Liu and L Wang ldquoSocial dilemmas in anonline social network the structure and evolution of coopera-tionrdquo Physics Letters A General Atomic and Solid State Physicsvol 371 no 1-2 pp 58ndash64 2007

[21] K Yuta and N Ono ldquogap in thecommunity-size distributionof a large-scale social networking siterdquo Physics and Societyhttparxivorgabsphysics0701168

[22] M Tomassini and L Luthi ldquoEmpirical analysis of the evolutionof a scientific collaboration networkrdquo Physica A StatisticalMechanics and Its Applications vol 385 no 2 pp 750ndash764 2007

[23] K-I Goh Y-H Eom H Jeong B Kahng and D Kim ldquoStruc-ture and evolution of online social relationships heterogeneityin unrestricted discussionsrdquo Physical Review E Statistical Non-linear and Soft Matter Physics vol 73 no 6 Article ID 0661232006

[24] HHuDHan andXWang ldquoIndividual popularity and activityin online social systemsrdquo Physica A StatisticalMechanics and ItsApplications vol 389 no 5 pp 1065ndash1070 2010

[25] P A Grabowicz J J Ramasco E Moro J M Pujol and VM Eguiluz ldquoSocial features of online networks the strength ofintermediary ties in online social mediardquo PLoS ONE vol 7 no1 Article ID e29358 2012

[26] T A B Snijders G G van de Bunt and C E G SteglichldquoIntroduction to stochastic actor-based models for networkdynamicsrdquo Social Networks vol 32 no 1 pp 44ndash60 2010

[27] J Ugander and B Karrer ldquoThe anatomy of the facebooksocial graphrdquo Social and Information Networks httparxivorgabs11114503

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 2: Research Article A Comparison of Online Social Networks ...downloads.hindawi.com/journals/mpe/2014/578713.pdf · the random network (the clustering coe cient of random network is

2 Mathematical Problems in Engineering

Table 1 Summary of degree distribution

119896 Number of nodes Proportion Extremum1 le 119896 lt 10 783 933 119896min = 1

10 le 119896 lt 100 50 600119896 ge 100 6 007 119896max = 327

ldquoeconomy of attentionrdquo is limited in the online world bycognitive and biological constraints as predicted by Dunbarrsquostheory [15]

Has online social network changed our real-life social pat-tern Why do such discrepancies exist between the studiesThe differences are due to the different analytical content andanalysis approachThe low cost of the online network forms abroader perspective and the one-way link relationshipsmakeit easy to spread information but the online social networksdo not make too much difference in the creation of stronginterpersonal relationshipsThese online social relationshipswhich may reflect the real-life social networks create anunprecedented field to understand the characteristics ofhuman networks [16 17]

In this paper we study the topological characteristicsof Sina Microblogging and try to explain its characteristicsformation mechanism by comparing it with the real-lifesocial networks This paper is organized as follows Section 2describes our data collection on Sina Microblogging Thenwe conduct topological analysis of the Sina Microbloggingnetwork and analyze the mechanisms for the formation ofthe structure in Section 3 In Section 4 we study the mixingpatterns on the Sina Microblogging network In Section 5we focus on the analysis of node importance by betweennesscentrality In Section 6 we conclude

2 Sina Microblogging Data Collection

In August 2009 Sina Microblogging began its trial serviceand became the most popular Microblogging service inChina with more than 500 million users as of 2013 Sina blogand Sina news provide good capital bases and natural advan-tages for the success of Sina Microblogging Due to theseadvantages SinaMicroblogging developed the characteristicsof We Media that spread news faster than any other mediaIn order to study the structure of Sina Microblogging wedevelop a spider with Python for Sina Microblogging Thesnowball [18] crawl algorithm has been applied in the spiderWe collected profiles of 3441 users on Sina Microblogginguntil November 11 2011 Isolated points are excluded sothat filter leaves only 839 nodes In order to protect privacywe only keep the usersrsquo ID number in the original datainformation about the users Based on the ldquofollowingrdquo andldquofollowedrdquo we construct an undirected network which has2112 links and analyze its basic characteristics

3 Analysis of Network Structure

We begin our analysis of Sina Microblogging from thefollowing aspects In Section 31 first we calculate the degree

100

10minus1

10minus2

10minus3

103

102

101

100

Rate

Deg

Figure 1 Degree distribution for nodes in Sina Microblogging

distribution Then we evaluate the average path lengthand clustering coefficient and study the reason why SinaMicroblogging possesses obviously small-world property inSection 32

31 Power-Law Node Degrees In Table 1 we summarize thedegree distribution of Sina Microblogging We find that onlya few nodes of the network have a high degree The degreedistribution is very unevenly distributed

We begin the analysis of Sina Microblogging topology bylooking at its degree distributions Networks of a power-lawdegree distribution (119896) prop 119896minus120574 where 119896 is the node degreeand 120574 le 3 attest to the existence of a relatively small numberof nodes with a very large number of links Many socialnetworks satisfy power-law degree distribution [19 20 23]and a few networks obey stretched exponential distribution[24] which is defined as 119901(119896) = 119887119890minus(1198961198960)

119888

The degree distribution of Sina Microblogging is shown

as in Figure 1 The 119910-axis represents the frequency of degreeIt satisfies a power-law distribution with an exponent of 175the goodness of fit is 0712

The networks which meet a power-law distribution havescale-free property and such networks are called scale-free networks Therefore Sina Microblogging is a scale-freenetwork The real-life social networks are usually scale-freenetworks The scale-free network is caused by the growingand preferential mechanisms that the new nodes trend toconnect with hub nodes and this phenomenon is called theMatthew effect

32 The Small-World Property The study of ldquosmall-worldrdquonetworks has become a key to understanding the societalstructure ever since Stanley Milgramrsquos famous ldquosix degreesof separationrdquo experiment In his work he reports thatany two people could be connected on average within sixhops from each other Watts and Strogatz [3] have revealedtwo important characteristics of ldquosmall-worldrdquo networks

Mathematical Problems in Engineering 3

Table 2 Small-World property compared with different socialnetworks

Social network Average pathlength

Clusteringcoefficient

Sina Microblogging 2794 0374Random network 4162 0006Cyworld [19] 32 016Renren [20] 348 02Mixi [21] 553 01215Co-author network [22] 46 0066

(1) the ldquosmall-worldrdquo networks have small characteristic pathlengths like random graphs (2) the ldquosmall-worldrdquo networkscan be highly clustered like regular lattices The networkmeasurement studies have shown that the real networks aremostly small-world especially social networks [13 21]

The concept of average path length for undirected net-works is well known the measures are related by

119871 =1

(12)119873 (119873 + 1)sum

119894ge119895

119889119894119895

(1)

where 119894 119895 are two nodes 119873 is the number of nodes in thenetwork and 119889

119894119895

denote the shortest distance between 119894 and119895

The average path length is closely connected to thecharacteristics of the network such as connectivity reacha-bility and transferring latency A short average path lengthfacilitates the quick transfer of information and reduces costs

Another property of ldquosmall-worldrdquo networks is the clus-tering coefficient It is themeasure of the extent to which onersquosfriends are also friends of each other The local clusteringcoefficient for a node is then given by the proportion oflinks between the nodes within its neighborhood divided bythe number of links that could possibly exist between themThe local clustering coefficient for undirected graphs can bedefined as

119862119894

=2119864119894

119896119894

(119896119894

minus 1) (2)

where 119896119894

is the degree of node 119894 and 119864119894

is the number of edgesbetween neighbors of node 119894

The clustering coefficient for the whole network is givenbyWatts and Strogatz [3] as the average of the local clusteringcoefficients of all the nodes the measures are related by

119862 =1

119873

119873

sum

119894

119862119894

(3)

The networks with the largest possible average clusteringcoefficient are found to have a modular structure

The results concerning average path length and clusteringcoefficient are displayed in Table 2 Compared with thesame scale random network the average path length ofSina Microblogging is shorter than random network (theaverage path length of random network is defined as (119871 prop

ln119873 ln 119896) and the clustering coefficient ismuch greater thanthe random network (the clustering coefficient of randomnetwork is defined as = ⟨119896⟩119873) Thus we confirm that SinaMicroblogging has the small-world property

Renren Cyworld and Mixi are the largest and oldestonline social networking services in China South Koreaand Japan respectively Coauthor network is a kind of real-life social network By contrast the average path lengthof Sina Microblogging has a shorter average path lengthand a greater clustering coefficient It is revealed that SinaMicrobloggingrsquos small-world property is the most obviousThe striking small-world phenomenon indicates that thereare rich local connections in Sina Microblogging networkthe nodes are linked closely and information disseminationis more efficient

We analyze the mechanisms for the formation of small-world property from the following two aspects First is theuserrsquos behavior aspect In 2011 for many users the mainreasons they use social network out there are the latestdevelopments of friends (734) keeping in touch withold friends (731) and documentation of life and feeling(675) and the main purposes of Microblogging users aregetting information (581) following celebrities (576)discussing the hot topics and personal experience (523)according to the data of ldquothe user behavior research of SNSand microblogging in Chinardquo The Microblogging users tendto use the Microblogging to record personal feelings sharethe news and find groups with similar interests and thetraditional SNS users show themselves contact with friendsin real life and expand social circle on SNS Consequently theformer are oriented to information exchange but the latterstress interpersonal communication

Second is the aspect of usersrsquo link mode The maindifference between the Renren Cyworld and Mixi networksand Microblogging is the directed nature of Microbloggingrelationship In Renren Cyworld and Mixi a link repre-sents a mutual agreement of a relationship while on SinaMicroblogging a user is not obligated to reciprocate followersby following them Thus a path from a user to another mayfollowdifferent hops or not exist in the reverse direction Boththe internal links (strong ties) and one-way following links(weak ties) are in Microblogging And the Microbloggingrsquosdistinction betweendifferent types of interactions allows us togetmore information personal interactions aremore likely tooccur on internal links (strong ties) and events transmittingnew information rely more on one-way following links (weakties) [25]

In conclusion Microblogging is easier to form the usercentricWeMedia than traditional SNS therefore it possessessmall-world property that can facilitate the flow of informa-tion

In order to gauge the correlation between clusteringcoefficient and the degree we plot the clustering coefficientof nodes (119910) against the degree of nodes (119909) in Figure 2 andbin the clustering coefficient in log scale If the correlationbetween clustering coefficient and the degree accords with119862(119896) = 119896

minus119886 we can consider that the network is of apparenthierarchical structure We observe that the clustering coeffi-cient varies inversely as the degree It satisfies a power-law

4 Mathematical Problems in Engineering

100

10minus1

10minus2

103

102

101

100

Clus

terin

g co

effici

ent

Deg

Figure 2 Correlations between clustering coefficients and degrees

Deg10000 20000 30000 4000

5000

000

000

10000

15000

20000

25000

Exce

ss av

erag

e deg

ree

Figure 3 Correlations between excess average degrees and degrees

distribution with an exponent of 0733 and the goodnessof fit is 0712 Thus the Sina Microblogging has apparenthierarchical structure where vertices divide into groups thatfurther subdivide into groups of groups and so forth overmultiple scales

The structure is caused by Sina Microbloggingrsquos fansmechanism Basing on the user interest and the celebrityeffect Sina Microblogging builds many kinds of fans groupsCelebrities and professionals in a certain area who have moreresources and influence will get more attention and theseusers only keep internal linkswith peers or slightly lower levelusers and so forth until the hierarchical structure is formed

4 The Mixing Patterns

Mixing patterns refer to systematic tendencies of one typeof nodes in a network to connect to another type Thereare three types of mixing patterns assortative networkdisassortative network and neutral network Similar verticestend to connect to each other in assortativity network andnodes of low degree are more likely to connect with nodesof high degree in disassortative network A lot of empiricalstudies [26 27] had revealed that the real-life social networkstrend to assortativity opposite of the online social networksIn real-life social networks the ordinary people want to getalong with the celebrity while the celebrity tends to make theacquaintance of peers therefore the ordinary people have lessopportunity to integrate into the circles of the celebrity Incontrast the ordinary people can easily get connected withthe celebrity the celebrities are also willing to show theirinfluence by the number of fans in the online social networkThus the online social networks trend to be disassortativenetworks

We cannot understand the mixing pattern intuitively sowe introduce the concept of excess average degreeThe excessdegree is the number of edges leaving the vertex other thanthe one we arrived along This number is one less than thedegrees of the vertices themselves The excess average degreeis defined as

⟨119896119899119899

⟩ (119896) =1

119902119896

119896max

sum

119896=119896min

1198961015840

119890119896119896

(4)

The excess average degree is positive in assortative net-work and negative otherwise Figure 3 plots the curve of theexcess average degree (119910) against the degree (119909) as the redline We see significant negative correlation Hence the SinaMicroblogging is a disassortative network

To better understand the meaning of disassortative net-work we illustrate the network connections of the node ID975 in Figure 4 In connected component different colorsrepresent different communities We find that the node ID975 connects with three hubs of community and showsdisassortativity

5 Analysis of Node Importance

Social networks are discrete systems with a large amountof heterogeneity among nodes Measures of centrality directat a quantification of nodesrsquo importance for structure andfunction The most direct measure of centrality is the degreecentrality that is the node with greater degree is the mostimportant one In addition betweenness centrality is also ameasure of nodersquos centrality in a network and can be usedto measure the influence a node has over the spread ofinformation through the network It is equal to the numberof the shortest paths from all vertices to all others that passthrough that node

In Figure 5 we plot the correlation between node bet-weenness centralities and degrees Then we compute thatthe average of node betweenness centralities is 3362 andthe correlation coefficient is 096 indicating that there are

Mathematical Problems in Engineering 5

Figure 4 The network connections of the node ID 975

Betw

eenn

ess c

entr

ality

100

102

104

106

103

102

101

100

Deg

Figure 5 Correlations between node betweenness centralities anddegrees

strong and positive correlations between node betweennesscentralities and degrees However the special case discussedhere is the one in which the node connecting to severalgroups has high betweenness centrality We rank users bybetweenness centrality and find three nodes both in the top5 list and with a degree less than 10 They connect withall communities in the network and act as bridges betweenthe different communities The result is consistent well withthe structural holes theory that was advanced by sociologistRonald Burt in real-life social network study

6 Conclusions

In this paper we have studied the structural properties ofMicroblogging ever created (839 active Sina Microbloggingusers and their 2112 social relations) from several viewpoints

First of all we have found a power-law distribution ashort average length and a high clustering coefficient in

its topology analysis which are all compatible with knowncharacteristics of other online social networks and real-lifesocial networks In order to illuminate the mechanisms forthe formation of small-world property we have studied thedifference between SinaMicroblogging and traditional socialnetworks from the aspect of usersrsquo behavior and the usersrsquolinkmode and found that SinaMicroblogging can easily formthe user centric We Media Therefore it possesses small-world property that can facilitate the flow of informationThen we calculated the correlation between clustering coef-ficients and degrees and showed that Sina Microblogging hasapparent hierarchical structure and we have found that SinaMicroblogging trends to be disassortative network which allmark a deviation from real-life social networksMoreover weanalyzed the betweenness centralities of intermediary nodesand confirmed that the intermediary nodes can control thespread of information Last but not least our work is only thefirst step towards exploring the difference between the onlineand real-life social networks Much work still remains

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was partly supported by the National ScienceFoundation of China under Grant no 70903016 and theNational Science amp Technology Support Program underGrant no 2012BAH81F03

References

[1] R Corten ldquoComposition and structure of a large online socialnetwork in the netherlandsrdquo PLoS ONE vol 7 no 4 Article IDe34760 2012

6 Mathematical Problems in Engineering

[2] R E Wilson and S D Gosling ldquoA review of Facebook researchin the social sciencesrdquo Perspectives on Psychological Science vol7 no 3 pp 203ndash220 2012

[3] D J Watts and S H Strogatz ldquoCollective dynamics of ldquosmall-worldrdquo networksrdquoNature vol 393 no 6684 pp 440ndash442 1998

[4] R Albert H Jeong andA-L Barabasi ldquoDiameter of the world-wide webrdquo Nature vol 401 no 6749 pp 130ndash131 1999

[5] A-L Barabasi and R Albert ldquoEmergence of scaling in randomnetworksrdquo Science vol 286 no 5439 pp 509ndash512 1999

[6] M Barthelemy ldquoSpatial networksrdquo Physics Reports vol 499 no1 pp 1ndash101 2011

[7] D Centola ldquoThe spread of behavior in an online social networkexperimentrdquo Science vol 329 no 5996 pp 1194ndash1197 2010

[8] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[9] H Kwak C Lee H Park and S Moon ldquoWhat is Twitter asocial network or a news mediardquo in Proceedings of the19thInternationalWorldWideWeb Conference (WWW rsquo10) pp 591ndash600 April 2010

[10] Q Yan L R Wu and L Zheng ldquoSocial network basedmicroblog user behavior analysisrdquo Physica A StatisticalMechanics and Its Applications vol 392 no 7 pp 1712ndash17232013

[11] W G Yuan Y Liu J J Cheng and F Xiong ldquoEmpiricalanalysis of microblog centrality and spread influence based onBi-directional connectionrdquo Acta Physica Sinica vol 62 no 3Article ID 038901 2013

[12] L Backstrom and P Boldi ldquoFour degrees of separationrdquo inProceedings of the 3rd Annual ACMWeb Science Conference pp33ndash42 2012

[13] J Leskovec J Kleinberg and C Faloutsos ldquoGraph evolutiondensification and shrinking diametersrdquo ACM Transactions onKnowledge Discovery from Data vol 1 no 1 Article ID 12173012007

[14] B Goncalves N Perra and A Vespignani ldquoModeling usersrsquoactivity on twitter networks validation of Dunbarrsquos numberrdquoPloS ONE vol 6 no 8 2011

[15] R I M Dunbar ldquoSocial cognition on the Internet testingconstraints on social network sizerdquo Philosophical Transactionsof the Royal Society B Biological Sciences vol 367 no 1599 pp2192ndash2201 2012

[16] GMiller ldquoSocial scientists wade into the tweet streamrdquo Sciencevol 333 no 6051 pp 1814ndash1815 2011

[17] R Lex and B Kovacs ldquoA comparison of email networks and off-line social networks a study of a medium-sized bankrdquo SocialNetworks vol 34 no 4 pp 462ndash469 2011

[18] S H Lee P-J Kim and H Jeong ldquoStatistical properties ofsampled networksrdquoPhysical ReviewE Statistical Nonlinear andSoft Matter Physics vol 73 no 1 Article ID 016102 2006

[19] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[20] F Fu X Chen L Liu and L Wang ldquoSocial dilemmas in anonline social network the structure and evolution of coopera-tionrdquo Physics Letters A General Atomic and Solid State Physicsvol 371 no 1-2 pp 58ndash64 2007

[21] K Yuta and N Ono ldquogap in thecommunity-size distributionof a large-scale social networking siterdquo Physics and Societyhttparxivorgabsphysics0701168

[22] M Tomassini and L Luthi ldquoEmpirical analysis of the evolutionof a scientific collaboration networkrdquo Physica A StatisticalMechanics and Its Applications vol 385 no 2 pp 750ndash764 2007

[23] K-I Goh Y-H Eom H Jeong B Kahng and D Kim ldquoStruc-ture and evolution of online social relationships heterogeneityin unrestricted discussionsrdquo Physical Review E Statistical Non-linear and Soft Matter Physics vol 73 no 6 Article ID 0661232006

[24] HHuDHan andXWang ldquoIndividual popularity and activityin online social systemsrdquo Physica A StatisticalMechanics and ItsApplications vol 389 no 5 pp 1065ndash1070 2010

[25] P A Grabowicz J J Ramasco E Moro J M Pujol and VM Eguiluz ldquoSocial features of online networks the strength ofintermediary ties in online social mediardquo PLoS ONE vol 7 no1 Article ID e29358 2012

[26] T A B Snijders G G van de Bunt and C E G SteglichldquoIntroduction to stochastic actor-based models for networkdynamicsrdquo Social Networks vol 32 no 1 pp 44ndash60 2010

[27] J Ugander and B Karrer ldquoThe anatomy of the facebooksocial graphrdquo Social and Information Networks httparxivorgabs11114503

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 3: Research Article A Comparison of Online Social Networks ...downloads.hindawi.com/journals/mpe/2014/578713.pdf · the random network (the clustering coe cient of random network is

Mathematical Problems in Engineering 3

Table 2 Small-World property compared with different socialnetworks

Social network Average pathlength

Clusteringcoefficient

Sina Microblogging 2794 0374Random network 4162 0006Cyworld [19] 32 016Renren [20] 348 02Mixi [21] 553 01215Co-author network [22] 46 0066

(1) the ldquosmall-worldrdquo networks have small characteristic pathlengths like random graphs (2) the ldquosmall-worldrdquo networkscan be highly clustered like regular lattices The networkmeasurement studies have shown that the real networks aremostly small-world especially social networks [13 21]

The concept of average path length for undirected net-works is well known the measures are related by

119871 =1

(12)119873 (119873 + 1)sum

119894ge119895

119889119894119895

(1)

where 119894 119895 are two nodes 119873 is the number of nodes in thenetwork and 119889

119894119895

denote the shortest distance between 119894 and119895

The average path length is closely connected to thecharacteristics of the network such as connectivity reacha-bility and transferring latency A short average path lengthfacilitates the quick transfer of information and reduces costs

Another property of ldquosmall-worldrdquo networks is the clus-tering coefficient It is themeasure of the extent to which onersquosfriends are also friends of each other The local clusteringcoefficient for a node is then given by the proportion oflinks between the nodes within its neighborhood divided bythe number of links that could possibly exist between themThe local clustering coefficient for undirected graphs can bedefined as

119862119894

=2119864119894

119896119894

(119896119894

minus 1) (2)

where 119896119894

is the degree of node 119894 and 119864119894

is the number of edgesbetween neighbors of node 119894

The clustering coefficient for the whole network is givenbyWatts and Strogatz [3] as the average of the local clusteringcoefficients of all the nodes the measures are related by

119862 =1

119873

119873

sum

119894

119862119894

(3)

The networks with the largest possible average clusteringcoefficient are found to have a modular structure

The results concerning average path length and clusteringcoefficient are displayed in Table 2 Compared with thesame scale random network the average path length ofSina Microblogging is shorter than random network (theaverage path length of random network is defined as (119871 prop

ln119873 ln 119896) and the clustering coefficient ismuch greater thanthe random network (the clustering coefficient of randomnetwork is defined as = ⟨119896⟩119873) Thus we confirm that SinaMicroblogging has the small-world property

Renren Cyworld and Mixi are the largest and oldestonline social networking services in China South Koreaand Japan respectively Coauthor network is a kind of real-life social network By contrast the average path lengthof Sina Microblogging has a shorter average path lengthand a greater clustering coefficient It is revealed that SinaMicrobloggingrsquos small-world property is the most obviousThe striking small-world phenomenon indicates that thereare rich local connections in Sina Microblogging networkthe nodes are linked closely and information disseminationis more efficient

We analyze the mechanisms for the formation of small-world property from the following two aspects First is theuserrsquos behavior aspect In 2011 for many users the mainreasons they use social network out there are the latestdevelopments of friends (734) keeping in touch withold friends (731) and documentation of life and feeling(675) and the main purposes of Microblogging users aregetting information (581) following celebrities (576)discussing the hot topics and personal experience (523)according to the data of ldquothe user behavior research of SNSand microblogging in Chinardquo The Microblogging users tendto use the Microblogging to record personal feelings sharethe news and find groups with similar interests and thetraditional SNS users show themselves contact with friendsin real life and expand social circle on SNS Consequently theformer are oriented to information exchange but the latterstress interpersonal communication

Second is the aspect of usersrsquo link mode The maindifference between the Renren Cyworld and Mixi networksand Microblogging is the directed nature of Microbloggingrelationship In Renren Cyworld and Mixi a link repre-sents a mutual agreement of a relationship while on SinaMicroblogging a user is not obligated to reciprocate followersby following them Thus a path from a user to another mayfollowdifferent hops or not exist in the reverse direction Boththe internal links (strong ties) and one-way following links(weak ties) are in Microblogging And the Microbloggingrsquosdistinction betweendifferent types of interactions allows us togetmore information personal interactions aremore likely tooccur on internal links (strong ties) and events transmittingnew information rely more on one-way following links (weakties) [25]

In conclusion Microblogging is easier to form the usercentricWeMedia than traditional SNS therefore it possessessmall-world property that can facilitate the flow of informa-tion

In order to gauge the correlation between clusteringcoefficient and the degree we plot the clustering coefficientof nodes (119910) against the degree of nodes (119909) in Figure 2 andbin the clustering coefficient in log scale If the correlationbetween clustering coefficient and the degree accords with119862(119896) = 119896

minus119886 we can consider that the network is of apparenthierarchical structure We observe that the clustering coeffi-cient varies inversely as the degree It satisfies a power-law

4 Mathematical Problems in Engineering

100

10minus1

10minus2

103

102

101

100

Clus

terin

g co

effici

ent

Deg

Figure 2 Correlations between clustering coefficients and degrees

Deg10000 20000 30000 4000

5000

000

000

10000

15000

20000

25000

Exce

ss av

erag

e deg

ree

Figure 3 Correlations between excess average degrees and degrees

distribution with an exponent of 0733 and the goodnessof fit is 0712 Thus the Sina Microblogging has apparenthierarchical structure where vertices divide into groups thatfurther subdivide into groups of groups and so forth overmultiple scales

The structure is caused by Sina Microbloggingrsquos fansmechanism Basing on the user interest and the celebrityeffect Sina Microblogging builds many kinds of fans groupsCelebrities and professionals in a certain area who have moreresources and influence will get more attention and theseusers only keep internal linkswith peers or slightly lower levelusers and so forth until the hierarchical structure is formed

4 The Mixing Patterns

Mixing patterns refer to systematic tendencies of one typeof nodes in a network to connect to another type Thereare three types of mixing patterns assortative networkdisassortative network and neutral network Similar verticestend to connect to each other in assortativity network andnodes of low degree are more likely to connect with nodesof high degree in disassortative network A lot of empiricalstudies [26 27] had revealed that the real-life social networkstrend to assortativity opposite of the online social networksIn real-life social networks the ordinary people want to getalong with the celebrity while the celebrity tends to make theacquaintance of peers therefore the ordinary people have lessopportunity to integrate into the circles of the celebrity Incontrast the ordinary people can easily get connected withthe celebrity the celebrities are also willing to show theirinfluence by the number of fans in the online social networkThus the online social networks trend to be disassortativenetworks

We cannot understand the mixing pattern intuitively sowe introduce the concept of excess average degreeThe excessdegree is the number of edges leaving the vertex other thanthe one we arrived along This number is one less than thedegrees of the vertices themselves The excess average degreeis defined as

⟨119896119899119899

⟩ (119896) =1

119902119896

119896max

sum

119896=119896min

1198961015840

119890119896119896

(4)

The excess average degree is positive in assortative net-work and negative otherwise Figure 3 plots the curve of theexcess average degree (119910) against the degree (119909) as the redline We see significant negative correlation Hence the SinaMicroblogging is a disassortative network

To better understand the meaning of disassortative net-work we illustrate the network connections of the node ID975 in Figure 4 In connected component different colorsrepresent different communities We find that the node ID975 connects with three hubs of community and showsdisassortativity

5 Analysis of Node Importance

Social networks are discrete systems with a large amountof heterogeneity among nodes Measures of centrality directat a quantification of nodesrsquo importance for structure andfunction The most direct measure of centrality is the degreecentrality that is the node with greater degree is the mostimportant one In addition betweenness centrality is also ameasure of nodersquos centrality in a network and can be usedto measure the influence a node has over the spread ofinformation through the network It is equal to the numberof the shortest paths from all vertices to all others that passthrough that node

In Figure 5 we plot the correlation between node bet-weenness centralities and degrees Then we compute thatthe average of node betweenness centralities is 3362 andthe correlation coefficient is 096 indicating that there are

Mathematical Problems in Engineering 5

Figure 4 The network connections of the node ID 975

Betw

eenn

ess c

entr

ality

100

102

104

106

103

102

101

100

Deg

Figure 5 Correlations between node betweenness centralities anddegrees

strong and positive correlations between node betweennesscentralities and degrees However the special case discussedhere is the one in which the node connecting to severalgroups has high betweenness centrality We rank users bybetweenness centrality and find three nodes both in the top5 list and with a degree less than 10 They connect withall communities in the network and act as bridges betweenthe different communities The result is consistent well withthe structural holes theory that was advanced by sociologistRonald Burt in real-life social network study

6 Conclusions

In this paper we have studied the structural properties ofMicroblogging ever created (839 active Sina Microbloggingusers and their 2112 social relations) from several viewpoints

First of all we have found a power-law distribution ashort average length and a high clustering coefficient in

its topology analysis which are all compatible with knowncharacteristics of other online social networks and real-lifesocial networks In order to illuminate the mechanisms forthe formation of small-world property we have studied thedifference between SinaMicroblogging and traditional socialnetworks from the aspect of usersrsquo behavior and the usersrsquolinkmode and found that SinaMicroblogging can easily formthe user centric We Media Therefore it possesses small-world property that can facilitate the flow of informationThen we calculated the correlation between clustering coef-ficients and degrees and showed that Sina Microblogging hasapparent hierarchical structure and we have found that SinaMicroblogging trends to be disassortative network which allmark a deviation from real-life social networksMoreover weanalyzed the betweenness centralities of intermediary nodesand confirmed that the intermediary nodes can control thespread of information Last but not least our work is only thefirst step towards exploring the difference between the onlineand real-life social networks Much work still remains

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was partly supported by the National ScienceFoundation of China under Grant no 70903016 and theNational Science amp Technology Support Program underGrant no 2012BAH81F03

References

[1] R Corten ldquoComposition and structure of a large online socialnetwork in the netherlandsrdquo PLoS ONE vol 7 no 4 Article IDe34760 2012

6 Mathematical Problems in Engineering

[2] R E Wilson and S D Gosling ldquoA review of Facebook researchin the social sciencesrdquo Perspectives on Psychological Science vol7 no 3 pp 203ndash220 2012

[3] D J Watts and S H Strogatz ldquoCollective dynamics of ldquosmall-worldrdquo networksrdquoNature vol 393 no 6684 pp 440ndash442 1998

[4] R Albert H Jeong andA-L Barabasi ldquoDiameter of the world-wide webrdquo Nature vol 401 no 6749 pp 130ndash131 1999

[5] A-L Barabasi and R Albert ldquoEmergence of scaling in randomnetworksrdquo Science vol 286 no 5439 pp 509ndash512 1999

[6] M Barthelemy ldquoSpatial networksrdquo Physics Reports vol 499 no1 pp 1ndash101 2011

[7] D Centola ldquoThe spread of behavior in an online social networkexperimentrdquo Science vol 329 no 5996 pp 1194ndash1197 2010

[8] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[9] H Kwak C Lee H Park and S Moon ldquoWhat is Twitter asocial network or a news mediardquo in Proceedings of the19thInternationalWorldWideWeb Conference (WWW rsquo10) pp 591ndash600 April 2010

[10] Q Yan L R Wu and L Zheng ldquoSocial network basedmicroblog user behavior analysisrdquo Physica A StatisticalMechanics and Its Applications vol 392 no 7 pp 1712ndash17232013

[11] W G Yuan Y Liu J J Cheng and F Xiong ldquoEmpiricalanalysis of microblog centrality and spread influence based onBi-directional connectionrdquo Acta Physica Sinica vol 62 no 3Article ID 038901 2013

[12] L Backstrom and P Boldi ldquoFour degrees of separationrdquo inProceedings of the 3rd Annual ACMWeb Science Conference pp33ndash42 2012

[13] J Leskovec J Kleinberg and C Faloutsos ldquoGraph evolutiondensification and shrinking diametersrdquo ACM Transactions onKnowledge Discovery from Data vol 1 no 1 Article ID 12173012007

[14] B Goncalves N Perra and A Vespignani ldquoModeling usersrsquoactivity on twitter networks validation of Dunbarrsquos numberrdquoPloS ONE vol 6 no 8 2011

[15] R I M Dunbar ldquoSocial cognition on the Internet testingconstraints on social network sizerdquo Philosophical Transactionsof the Royal Society B Biological Sciences vol 367 no 1599 pp2192ndash2201 2012

[16] GMiller ldquoSocial scientists wade into the tweet streamrdquo Sciencevol 333 no 6051 pp 1814ndash1815 2011

[17] R Lex and B Kovacs ldquoA comparison of email networks and off-line social networks a study of a medium-sized bankrdquo SocialNetworks vol 34 no 4 pp 462ndash469 2011

[18] S H Lee P-J Kim and H Jeong ldquoStatistical properties ofsampled networksrdquoPhysical ReviewE Statistical Nonlinear andSoft Matter Physics vol 73 no 1 Article ID 016102 2006

[19] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[20] F Fu X Chen L Liu and L Wang ldquoSocial dilemmas in anonline social network the structure and evolution of coopera-tionrdquo Physics Letters A General Atomic and Solid State Physicsvol 371 no 1-2 pp 58ndash64 2007

[21] K Yuta and N Ono ldquogap in thecommunity-size distributionof a large-scale social networking siterdquo Physics and Societyhttparxivorgabsphysics0701168

[22] M Tomassini and L Luthi ldquoEmpirical analysis of the evolutionof a scientific collaboration networkrdquo Physica A StatisticalMechanics and Its Applications vol 385 no 2 pp 750ndash764 2007

[23] K-I Goh Y-H Eom H Jeong B Kahng and D Kim ldquoStruc-ture and evolution of online social relationships heterogeneityin unrestricted discussionsrdquo Physical Review E Statistical Non-linear and Soft Matter Physics vol 73 no 6 Article ID 0661232006

[24] HHuDHan andXWang ldquoIndividual popularity and activityin online social systemsrdquo Physica A StatisticalMechanics and ItsApplications vol 389 no 5 pp 1065ndash1070 2010

[25] P A Grabowicz J J Ramasco E Moro J M Pujol and VM Eguiluz ldquoSocial features of online networks the strength ofintermediary ties in online social mediardquo PLoS ONE vol 7 no1 Article ID e29358 2012

[26] T A B Snijders G G van de Bunt and C E G SteglichldquoIntroduction to stochastic actor-based models for networkdynamicsrdquo Social Networks vol 32 no 1 pp 44ndash60 2010

[27] J Ugander and B Karrer ldquoThe anatomy of the facebooksocial graphrdquo Social and Information Networks httparxivorgabs11114503

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 4: Research Article A Comparison of Online Social Networks ...downloads.hindawi.com/journals/mpe/2014/578713.pdf · the random network (the clustering coe cient of random network is

4 Mathematical Problems in Engineering

100

10minus1

10minus2

103

102

101

100

Clus

terin

g co

effici

ent

Deg

Figure 2 Correlations between clustering coefficients and degrees

Deg10000 20000 30000 4000

5000

000

000

10000

15000

20000

25000

Exce

ss av

erag

e deg

ree

Figure 3 Correlations between excess average degrees and degrees

distribution with an exponent of 0733 and the goodnessof fit is 0712 Thus the Sina Microblogging has apparenthierarchical structure where vertices divide into groups thatfurther subdivide into groups of groups and so forth overmultiple scales

The structure is caused by Sina Microbloggingrsquos fansmechanism Basing on the user interest and the celebrityeffect Sina Microblogging builds many kinds of fans groupsCelebrities and professionals in a certain area who have moreresources and influence will get more attention and theseusers only keep internal linkswith peers or slightly lower levelusers and so forth until the hierarchical structure is formed

4 The Mixing Patterns

Mixing patterns refer to systematic tendencies of one typeof nodes in a network to connect to another type Thereare three types of mixing patterns assortative networkdisassortative network and neutral network Similar verticestend to connect to each other in assortativity network andnodes of low degree are more likely to connect with nodesof high degree in disassortative network A lot of empiricalstudies [26 27] had revealed that the real-life social networkstrend to assortativity opposite of the online social networksIn real-life social networks the ordinary people want to getalong with the celebrity while the celebrity tends to make theacquaintance of peers therefore the ordinary people have lessopportunity to integrate into the circles of the celebrity Incontrast the ordinary people can easily get connected withthe celebrity the celebrities are also willing to show theirinfluence by the number of fans in the online social networkThus the online social networks trend to be disassortativenetworks

We cannot understand the mixing pattern intuitively sowe introduce the concept of excess average degreeThe excessdegree is the number of edges leaving the vertex other thanthe one we arrived along This number is one less than thedegrees of the vertices themselves The excess average degreeis defined as

⟨119896119899119899

⟩ (119896) =1

119902119896

119896max

sum

119896=119896min

1198961015840

119890119896119896

(4)

The excess average degree is positive in assortative net-work and negative otherwise Figure 3 plots the curve of theexcess average degree (119910) against the degree (119909) as the redline We see significant negative correlation Hence the SinaMicroblogging is a disassortative network

To better understand the meaning of disassortative net-work we illustrate the network connections of the node ID975 in Figure 4 In connected component different colorsrepresent different communities We find that the node ID975 connects with three hubs of community and showsdisassortativity

5 Analysis of Node Importance

Social networks are discrete systems with a large amountof heterogeneity among nodes Measures of centrality directat a quantification of nodesrsquo importance for structure andfunction The most direct measure of centrality is the degreecentrality that is the node with greater degree is the mostimportant one In addition betweenness centrality is also ameasure of nodersquos centrality in a network and can be usedto measure the influence a node has over the spread ofinformation through the network It is equal to the numberof the shortest paths from all vertices to all others that passthrough that node

In Figure 5 we plot the correlation between node bet-weenness centralities and degrees Then we compute thatthe average of node betweenness centralities is 3362 andthe correlation coefficient is 096 indicating that there are

Mathematical Problems in Engineering 5

Figure 4 The network connections of the node ID 975

Betw

eenn

ess c

entr

ality

100

102

104

106

103

102

101

100

Deg

Figure 5 Correlations between node betweenness centralities anddegrees

strong and positive correlations between node betweennesscentralities and degrees However the special case discussedhere is the one in which the node connecting to severalgroups has high betweenness centrality We rank users bybetweenness centrality and find three nodes both in the top5 list and with a degree less than 10 They connect withall communities in the network and act as bridges betweenthe different communities The result is consistent well withthe structural holes theory that was advanced by sociologistRonald Burt in real-life social network study

6 Conclusions

In this paper we have studied the structural properties ofMicroblogging ever created (839 active Sina Microbloggingusers and their 2112 social relations) from several viewpoints

First of all we have found a power-law distribution ashort average length and a high clustering coefficient in

its topology analysis which are all compatible with knowncharacteristics of other online social networks and real-lifesocial networks In order to illuminate the mechanisms forthe formation of small-world property we have studied thedifference between SinaMicroblogging and traditional socialnetworks from the aspect of usersrsquo behavior and the usersrsquolinkmode and found that SinaMicroblogging can easily formthe user centric We Media Therefore it possesses small-world property that can facilitate the flow of informationThen we calculated the correlation between clustering coef-ficients and degrees and showed that Sina Microblogging hasapparent hierarchical structure and we have found that SinaMicroblogging trends to be disassortative network which allmark a deviation from real-life social networksMoreover weanalyzed the betweenness centralities of intermediary nodesand confirmed that the intermediary nodes can control thespread of information Last but not least our work is only thefirst step towards exploring the difference between the onlineand real-life social networks Much work still remains

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was partly supported by the National ScienceFoundation of China under Grant no 70903016 and theNational Science amp Technology Support Program underGrant no 2012BAH81F03

References

[1] R Corten ldquoComposition and structure of a large online socialnetwork in the netherlandsrdquo PLoS ONE vol 7 no 4 Article IDe34760 2012

6 Mathematical Problems in Engineering

[2] R E Wilson and S D Gosling ldquoA review of Facebook researchin the social sciencesrdquo Perspectives on Psychological Science vol7 no 3 pp 203ndash220 2012

[3] D J Watts and S H Strogatz ldquoCollective dynamics of ldquosmall-worldrdquo networksrdquoNature vol 393 no 6684 pp 440ndash442 1998

[4] R Albert H Jeong andA-L Barabasi ldquoDiameter of the world-wide webrdquo Nature vol 401 no 6749 pp 130ndash131 1999

[5] A-L Barabasi and R Albert ldquoEmergence of scaling in randomnetworksrdquo Science vol 286 no 5439 pp 509ndash512 1999

[6] M Barthelemy ldquoSpatial networksrdquo Physics Reports vol 499 no1 pp 1ndash101 2011

[7] D Centola ldquoThe spread of behavior in an online social networkexperimentrdquo Science vol 329 no 5996 pp 1194ndash1197 2010

[8] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[9] H Kwak C Lee H Park and S Moon ldquoWhat is Twitter asocial network or a news mediardquo in Proceedings of the19thInternationalWorldWideWeb Conference (WWW rsquo10) pp 591ndash600 April 2010

[10] Q Yan L R Wu and L Zheng ldquoSocial network basedmicroblog user behavior analysisrdquo Physica A StatisticalMechanics and Its Applications vol 392 no 7 pp 1712ndash17232013

[11] W G Yuan Y Liu J J Cheng and F Xiong ldquoEmpiricalanalysis of microblog centrality and spread influence based onBi-directional connectionrdquo Acta Physica Sinica vol 62 no 3Article ID 038901 2013

[12] L Backstrom and P Boldi ldquoFour degrees of separationrdquo inProceedings of the 3rd Annual ACMWeb Science Conference pp33ndash42 2012

[13] J Leskovec J Kleinberg and C Faloutsos ldquoGraph evolutiondensification and shrinking diametersrdquo ACM Transactions onKnowledge Discovery from Data vol 1 no 1 Article ID 12173012007

[14] B Goncalves N Perra and A Vespignani ldquoModeling usersrsquoactivity on twitter networks validation of Dunbarrsquos numberrdquoPloS ONE vol 6 no 8 2011

[15] R I M Dunbar ldquoSocial cognition on the Internet testingconstraints on social network sizerdquo Philosophical Transactionsof the Royal Society B Biological Sciences vol 367 no 1599 pp2192ndash2201 2012

[16] GMiller ldquoSocial scientists wade into the tweet streamrdquo Sciencevol 333 no 6051 pp 1814ndash1815 2011

[17] R Lex and B Kovacs ldquoA comparison of email networks and off-line social networks a study of a medium-sized bankrdquo SocialNetworks vol 34 no 4 pp 462ndash469 2011

[18] S H Lee P-J Kim and H Jeong ldquoStatistical properties ofsampled networksrdquoPhysical ReviewE Statistical Nonlinear andSoft Matter Physics vol 73 no 1 Article ID 016102 2006

[19] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[20] F Fu X Chen L Liu and L Wang ldquoSocial dilemmas in anonline social network the structure and evolution of coopera-tionrdquo Physics Letters A General Atomic and Solid State Physicsvol 371 no 1-2 pp 58ndash64 2007

[21] K Yuta and N Ono ldquogap in thecommunity-size distributionof a large-scale social networking siterdquo Physics and Societyhttparxivorgabsphysics0701168

[22] M Tomassini and L Luthi ldquoEmpirical analysis of the evolutionof a scientific collaboration networkrdquo Physica A StatisticalMechanics and Its Applications vol 385 no 2 pp 750ndash764 2007

[23] K-I Goh Y-H Eom H Jeong B Kahng and D Kim ldquoStruc-ture and evolution of online social relationships heterogeneityin unrestricted discussionsrdquo Physical Review E Statistical Non-linear and Soft Matter Physics vol 73 no 6 Article ID 0661232006

[24] HHuDHan andXWang ldquoIndividual popularity and activityin online social systemsrdquo Physica A StatisticalMechanics and ItsApplications vol 389 no 5 pp 1065ndash1070 2010

[25] P A Grabowicz J J Ramasco E Moro J M Pujol and VM Eguiluz ldquoSocial features of online networks the strength ofintermediary ties in online social mediardquo PLoS ONE vol 7 no1 Article ID e29358 2012

[26] T A B Snijders G G van de Bunt and C E G SteglichldquoIntroduction to stochastic actor-based models for networkdynamicsrdquo Social Networks vol 32 no 1 pp 44ndash60 2010

[27] J Ugander and B Karrer ldquoThe anatomy of the facebooksocial graphrdquo Social and Information Networks httparxivorgabs11114503

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 5: Research Article A Comparison of Online Social Networks ...downloads.hindawi.com/journals/mpe/2014/578713.pdf · the random network (the clustering coe cient of random network is

Mathematical Problems in Engineering 5

Figure 4 The network connections of the node ID 975

Betw

eenn

ess c

entr

ality

100

102

104

106

103

102

101

100

Deg

Figure 5 Correlations between node betweenness centralities anddegrees

strong and positive correlations between node betweennesscentralities and degrees However the special case discussedhere is the one in which the node connecting to severalgroups has high betweenness centrality We rank users bybetweenness centrality and find three nodes both in the top5 list and with a degree less than 10 They connect withall communities in the network and act as bridges betweenthe different communities The result is consistent well withthe structural holes theory that was advanced by sociologistRonald Burt in real-life social network study

6 Conclusions

In this paper we have studied the structural properties ofMicroblogging ever created (839 active Sina Microbloggingusers and their 2112 social relations) from several viewpoints

First of all we have found a power-law distribution ashort average length and a high clustering coefficient in

its topology analysis which are all compatible with knowncharacteristics of other online social networks and real-lifesocial networks In order to illuminate the mechanisms forthe formation of small-world property we have studied thedifference between SinaMicroblogging and traditional socialnetworks from the aspect of usersrsquo behavior and the usersrsquolinkmode and found that SinaMicroblogging can easily formthe user centric We Media Therefore it possesses small-world property that can facilitate the flow of informationThen we calculated the correlation between clustering coef-ficients and degrees and showed that Sina Microblogging hasapparent hierarchical structure and we have found that SinaMicroblogging trends to be disassortative network which allmark a deviation from real-life social networksMoreover weanalyzed the betweenness centralities of intermediary nodesand confirmed that the intermediary nodes can control thespread of information Last but not least our work is only thefirst step towards exploring the difference between the onlineand real-life social networks Much work still remains

Conflict of Interests

The authors declare that there is no conflict of interestsregarding the publication of this paper

Acknowledgments

This work was partly supported by the National ScienceFoundation of China under Grant no 70903016 and theNational Science amp Technology Support Program underGrant no 2012BAH81F03

References

[1] R Corten ldquoComposition and structure of a large online socialnetwork in the netherlandsrdquo PLoS ONE vol 7 no 4 Article IDe34760 2012

6 Mathematical Problems in Engineering

[2] R E Wilson and S D Gosling ldquoA review of Facebook researchin the social sciencesrdquo Perspectives on Psychological Science vol7 no 3 pp 203ndash220 2012

[3] D J Watts and S H Strogatz ldquoCollective dynamics of ldquosmall-worldrdquo networksrdquoNature vol 393 no 6684 pp 440ndash442 1998

[4] R Albert H Jeong andA-L Barabasi ldquoDiameter of the world-wide webrdquo Nature vol 401 no 6749 pp 130ndash131 1999

[5] A-L Barabasi and R Albert ldquoEmergence of scaling in randomnetworksrdquo Science vol 286 no 5439 pp 509ndash512 1999

[6] M Barthelemy ldquoSpatial networksrdquo Physics Reports vol 499 no1 pp 1ndash101 2011

[7] D Centola ldquoThe spread of behavior in an online social networkexperimentrdquo Science vol 329 no 5996 pp 1194ndash1197 2010

[8] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[9] H Kwak C Lee H Park and S Moon ldquoWhat is Twitter asocial network or a news mediardquo in Proceedings of the19thInternationalWorldWideWeb Conference (WWW rsquo10) pp 591ndash600 April 2010

[10] Q Yan L R Wu and L Zheng ldquoSocial network basedmicroblog user behavior analysisrdquo Physica A StatisticalMechanics and Its Applications vol 392 no 7 pp 1712ndash17232013

[11] W G Yuan Y Liu J J Cheng and F Xiong ldquoEmpiricalanalysis of microblog centrality and spread influence based onBi-directional connectionrdquo Acta Physica Sinica vol 62 no 3Article ID 038901 2013

[12] L Backstrom and P Boldi ldquoFour degrees of separationrdquo inProceedings of the 3rd Annual ACMWeb Science Conference pp33ndash42 2012

[13] J Leskovec J Kleinberg and C Faloutsos ldquoGraph evolutiondensification and shrinking diametersrdquo ACM Transactions onKnowledge Discovery from Data vol 1 no 1 Article ID 12173012007

[14] B Goncalves N Perra and A Vespignani ldquoModeling usersrsquoactivity on twitter networks validation of Dunbarrsquos numberrdquoPloS ONE vol 6 no 8 2011

[15] R I M Dunbar ldquoSocial cognition on the Internet testingconstraints on social network sizerdquo Philosophical Transactionsof the Royal Society B Biological Sciences vol 367 no 1599 pp2192ndash2201 2012

[16] GMiller ldquoSocial scientists wade into the tweet streamrdquo Sciencevol 333 no 6051 pp 1814ndash1815 2011

[17] R Lex and B Kovacs ldquoA comparison of email networks and off-line social networks a study of a medium-sized bankrdquo SocialNetworks vol 34 no 4 pp 462ndash469 2011

[18] S H Lee P-J Kim and H Jeong ldquoStatistical properties ofsampled networksrdquoPhysical ReviewE Statistical Nonlinear andSoft Matter Physics vol 73 no 1 Article ID 016102 2006

[19] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[20] F Fu X Chen L Liu and L Wang ldquoSocial dilemmas in anonline social network the structure and evolution of coopera-tionrdquo Physics Letters A General Atomic and Solid State Physicsvol 371 no 1-2 pp 58ndash64 2007

[21] K Yuta and N Ono ldquogap in thecommunity-size distributionof a large-scale social networking siterdquo Physics and Societyhttparxivorgabsphysics0701168

[22] M Tomassini and L Luthi ldquoEmpirical analysis of the evolutionof a scientific collaboration networkrdquo Physica A StatisticalMechanics and Its Applications vol 385 no 2 pp 750ndash764 2007

[23] K-I Goh Y-H Eom H Jeong B Kahng and D Kim ldquoStruc-ture and evolution of online social relationships heterogeneityin unrestricted discussionsrdquo Physical Review E Statistical Non-linear and Soft Matter Physics vol 73 no 6 Article ID 0661232006

[24] HHuDHan andXWang ldquoIndividual popularity and activityin online social systemsrdquo Physica A StatisticalMechanics and ItsApplications vol 389 no 5 pp 1065ndash1070 2010

[25] P A Grabowicz J J Ramasco E Moro J M Pujol and VM Eguiluz ldquoSocial features of online networks the strength ofintermediary ties in online social mediardquo PLoS ONE vol 7 no1 Article ID e29358 2012

[26] T A B Snijders G G van de Bunt and C E G SteglichldquoIntroduction to stochastic actor-based models for networkdynamicsrdquo Social Networks vol 32 no 1 pp 44ndash60 2010

[27] J Ugander and B Karrer ldquoThe anatomy of the facebooksocial graphrdquo Social and Information Networks httparxivorgabs11114503

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 6: Research Article A Comparison of Online Social Networks ...downloads.hindawi.com/journals/mpe/2014/578713.pdf · the random network (the clustering coe cient of random network is

6 Mathematical Problems in Engineering

[2] R E Wilson and S D Gosling ldquoA review of Facebook researchin the social sciencesrdquo Perspectives on Psychological Science vol7 no 3 pp 203ndash220 2012

[3] D J Watts and S H Strogatz ldquoCollective dynamics of ldquosmall-worldrdquo networksrdquoNature vol 393 no 6684 pp 440ndash442 1998

[4] R Albert H Jeong andA-L Barabasi ldquoDiameter of the world-wide webrdquo Nature vol 401 no 6749 pp 130ndash131 1999

[5] A-L Barabasi and R Albert ldquoEmergence of scaling in randomnetworksrdquo Science vol 286 no 5439 pp 509ndash512 1999

[6] M Barthelemy ldquoSpatial networksrdquo Physics Reports vol 499 no1 pp 1ndash101 2011

[7] D Centola ldquoThe spread of behavior in an online social networkexperimentrdquo Science vol 329 no 5996 pp 1194ndash1197 2010

[8] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[9] H Kwak C Lee H Park and S Moon ldquoWhat is Twitter asocial network or a news mediardquo in Proceedings of the19thInternationalWorldWideWeb Conference (WWW rsquo10) pp 591ndash600 April 2010

[10] Q Yan L R Wu and L Zheng ldquoSocial network basedmicroblog user behavior analysisrdquo Physica A StatisticalMechanics and Its Applications vol 392 no 7 pp 1712ndash17232013

[11] W G Yuan Y Liu J J Cheng and F Xiong ldquoEmpiricalanalysis of microblog centrality and spread influence based onBi-directional connectionrdquo Acta Physica Sinica vol 62 no 3Article ID 038901 2013

[12] L Backstrom and P Boldi ldquoFour degrees of separationrdquo inProceedings of the 3rd Annual ACMWeb Science Conference pp33ndash42 2012

[13] J Leskovec J Kleinberg and C Faloutsos ldquoGraph evolutiondensification and shrinking diametersrdquo ACM Transactions onKnowledge Discovery from Data vol 1 no 1 Article ID 12173012007

[14] B Goncalves N Perra and A Vespignani ldquoModeling usersrsquoactivity on twitter networks validation of Dunbarrsquos numberrdquoPloS ONE vol 6 no 8 2011

[15] R I M Dunbar ldquoSocial cognition on the Internet testingconstraints on social network sizerdquo Philosophical Transactionsof the Royal Society B Biological Sciences vol 367 no 1599 pp2192ndash2201 2012

[16] GMiller ldquoSocial scientists wade into the tweet streamrdquo Sciencevol 333 no 6051 pp 1814ndash1815 2011

[17] R Lex and B Kovacs ldquoA comparison of email networks and off-line social networks a study of a medium-sized bankrdquo SocialNetworks vol 34 no 4 pp 462ndash469 2011

[18] S H Lee P-J Kim and H Jeong ldquoStatistical properties ofsampled networksrdquoPhysical ReviewE Statistical Nonlinear andSoft Matter Physics vol 73 no 1 Article ID 016102 2006

[19] Y-Y Ahn S Han H Kwak S Moon and H Jeong ldquoAnalysisof topological characteristics of huge online social networkingservicesrdquo in Proceedings of the 16th International World WideWeb Conference (WWW rsquo07) pp 835ndash844 May 2007

[20] F Fu X Chen L Liu and L Wang ldquoSocial dilemmas in anonline social network the structure and evolution of coopera-tionrdquo Physics Letters A General Atomic and Solid State Physicsvol 371 no 1-2 pp 58ndash64 2007

[21] K Yuta and N Ono ldquogap in thecommunity-size distributionof a large-scale social networking siterdquo Physics and Societyhttparxivorgabsphysics0701168

[22] M Tomassini and L Luthi ldquoEmpirical analysis of the evolutionof a scientific collaboration networkrdquo Physica A StatisticalMechanics and Its Applications vol 385 no 2 pp 750ndash764 2007

[23] K-I Goh Y-H Eom H Jeong B Kahng and D Kim ldquoStruc-ture and evolution of online social relationships heterogeneityin unrestricted discussionsrdquo Physical Review E Statistical Non-linear and Soft Matter Physics vol 73 no 6 Article ID 0661232006

[24] HHuDHan andXWang ldquoIndividual popularity and activityin online social systemsrdquo Physica A StatisticalMechanics and ItsApplications vol 389 no 5 pp 1065ndash1070 2010

[25] P A Grabowicz J J Ramasco E Moro J M Pujol and VM Eguiluz ldquoSocial features of online networks the strength ofintermediary ties in online social mediardquo PLoS ONE vol 7 no1 Article ID e29358 2012

[26] T A B Snijders G G van de Bunt and C E G SteglichldquoIntroduction to stochastic actor-based models for networkdynamicsrdquo Social Networks vol 32 no 1 pp 44ndash60 2010

[27] J Ugander and B Karrer ldquoThe anatomy of the facebooksocial graphrdquo Social and Information Networks httparxivorgabs11114503

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of

Page 7: Research Article A Comparison of Online Social Networks ...downloads.hindawi.com/journals/mpe/2014/578713.pdf · the random network (the clustering coe cient of random network is

Submit your manuscripts athttpwwwhindawicom

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical Problems in Engineering

Hindawi Publishing Corporationhttpwwwhindawicom

Differential EquationsInternational Journal of

Volume 2014

Applied MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Probability and StatisticsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Mathematical PhysicsAdvances in

Complex AnalysisJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

OptimizationJournal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

CombinatoricsHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Operations ResearchAdvances in

Journal of

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Function Spaces

Abstract and Applied AnalysisHindawi Publishing Corporationhttpwwwhindawicom Volume 2014

International Journal of Mathematics and Mathematical Sciences

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

The Scientific World JournalHindawi Publishing Corporation httpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Algebra

Discrete Dynamics in Nature and Society

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Decision SciencesAdvances in

Discrete MathematicsJournal of

Hindawi Publishing Corporationhttpwwwhindawicom

Volume 2014 Hindawi Publishing Corporationhttpwwwhindawicom Volume 2014

Stochastic AnalysisInternational Journal of