10
A Survey on Social Network Search Ranking

A Survey on Social Network Search Ranking

  • Upload
    aron

  • View
    28

  • Download
    0

Embed Size (px)

DESCRIPTION

A Survey on Social Network Search Ranking. Web vs. Social Networks. Limitations of web (hyperlink-based) search It underestimates recently published content It has a bias in favor of large community (e.g., Michael Jordan, the basketball player or the computer scientist?). - PowerPoint PPT Presentation

Citation preview

Page 1: A Survey on Social Network Search Ranking

A Survey on Social Network Search Ranking

Page 2: A Survey on Social Network Search Ranking

Web vs. Social NetworksWeb Social Network

Publishing •Place documents on server •Post contents on social network sites

Locating •Via search engine •Navigate through the social network•Browse contents recommended by other users

• Limitations of web (hyperlink-based) search– It underestimates recently published content– It has a bias in favor of large community (e.g.,

Michael Jordan, the basketball player or the computer scientist?)

Page 3: A Survey on Social Network Search Ranking

Roadmap for the following

• PeerSpective 1.0 (HotNet ‘06)– Demonstrate why social network search matters

• Network-Aware Searching (VLDB ‘08)– Query + Importance of user (relative to the query

user)

• Efficient Search Ranking in Social Networks (CIKM ’07)– Propose some challenges of social (network-

aware) searching

Page 4: A Survey on Social Network Search Ranking

PeerSpective 1.0 (HotNet ‘06)

• An experiment uses social nets to search the Web

• Idea: users can query their friends’ viewed pages

• Results from friends appear alongside Google results

• Ranking:

usereach for

core]PageRank_SreLucene_Sco)before page thereaduser ([I

Page 5: A Survey on Social Network Search Ranking

PeerSpective Experimental Results

• Run PeerSpective with 10 users for 1 month– 51,410 distinct URLs viewed– 1,730 Google searches

• Google contains only 62.5% URLs• 30.4% of URLs previously viewed by someone in

network• 13.3% of URLs previously viewed but not in Google• 7.7% of (top 10) result clicks are on PeerSpective-

only results

Page 6: A Survey on Social Network Search Ranking

Network-Aware Searching (VLDB ‘08)

• The query content + Importance of users (relative to the query user)– Overlap-based similarity– Indirectly connected users

– Add a uniform background– Social frequency

• tfu(d,t) is typically 0 or 1

|)'(||)(|

|)'(|2),(

utagsetutagset

uutagsetuuO

1

01

...path

),()( max0

k

iii

uuuuu uuOuP

k

1( ) (1 ) ( )

| |u uF u P uU

( , ) ( ) ( , )u u uu U

sf d t F u tf d t

Page 7: A Survey on Social Network Search Ranking

Network-Aware Searching Example

• O(A,A)=1, O(A,B)=2/4, O(B,C)=2/4, O(C,D)=2/5, O(A,E)=2/4, O(E,D)=0/5

• PA(D)=max(1/10,0)=1/10

• FA(D)=0.1*1/5+(1-0.1)*1/10=0.11

• Similarly, FA(A)=0.92, FA(B)=0.47, FA(C)=0.245, FA(E)=0.47

• sfA(z,a)=0.92*1 + 0.47*1 + 0.245*0 + 0.11*1 + 0.47*0 = 1.5

A

E

D

CB

a,b

a,c c,d

a,d,e

b,c

Tags of document z by user A, B, C, D, and E:

Page 8: A Survey on Social Network Search Ranking

Efficient Search Ranking in Social Networks (CIKM ’07)

• Consider usernames as query terms only• Idea: search ranking is based on the path

length• Challenge: large size of SN prevents efficient

computation of shortest path at query time– Orkut: 40 million– Facebook: more than 200 million active users

Page 9: A Survey on Social Network Search Ranking

Efficient Search Ranking in Social Networks: Approaches

• Pre-compute all distance b2n any pair– Trivial– Non-scalable: 40 million users 40,000,0002=16*1014

• On-the-fly ranking– BFS in real-time– Each user has 100 friends, distance 3 1,000,000 users

• Co-friend ranking– Mixture of above two– Store “friends of friends” for each user and search from

the list

Page 10: A Survey on Social Network Search Ranking

Conclusion

• Network aware search is not a big problem

• However, how to search “in real time”?– Search limited number of hops– Approximated shortest path– Pre-compute (partial) data