58
Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

  • View
    222

  • Download
    2

Embed Size (px)

Citation preview

Page 1: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Interactive Visualization of Large Graphs and

NetworksTamara Munzner

Stanford UniversityComputer Science Department

Page 2: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Contributions

• analysis of three software systems– relating intended tasks to spatial

layout, visual encoding choices

• two novel layout/drawing algorithms– scalable– targeted

Page 3: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Three Visualization Systems

generalgraph drawing infovis

domain specific

H3 PM Const

H3• web hyperlinks • quasi-hierarchical

Planet Multicast• MBone tunnels• find poorly placed

Constellation• parsed dictionaries• refine algorithms

Page 4: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Talk Outline• graph drawing, information

visualization background• software systems

– goal– previous work– video– discussion– evaluation

• general discussion• conclusion

Page 5: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Graph Drawing• automatic layout and drawing of node-link

graphs

Hofstadter. Godel, Escher, Bach. Gansner and North. Improved force-directed layouts.

Page 6: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Goal: help humans understand

• aesthetic criteria– minimize crossings– expose structure: hierarchy,

symmetry, circular

Tom Sawyer Software. Hierarchical Toolkit Tom Sawyer Software. Symmetric Toolkit Tom Sawyer Software. Circular Toolkit

Page 7: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

System Scalability, Data Set Size

node count, log scale

1000 10M1K 100M1M100K10K10 1B

Web(pages)

manualdictionary

GEB figure

most GD systems

exceptional GD systems(dot, Gem3D)

H3

Planet Multicast

Constellation

MBone (tunnels)

Net(hosts)

Net(routers)

mid-size web sites

my site

Stanford graphics site

data sets

my systems

previous systems

Page 8: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Fundamental Idea

• extend reach of graph drawing with information visualization approach

• techniques– interactivity– incorporate domain-specific

information

Page 9: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Information Visualization

• external visual representation of data, exploits perceptual system to reduce human cognitive load

• find appropriate visual metaphor for data that is not implicitly spatial

Page 10: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Interactivity

• mimic reality– 2D paper: pan, zoom– 3D object: rotate, translate, scale

• beyond– semantics impossible in real world– distortion, multi-scale

Page 11: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Domain/Task Focus

• user-centered design, ethnography• understand high level goals

– maintain web site

• break down into lower level tasks– minimize user navigation to important

pages– find and fix broken links

• design visual encoding• evaluate effectiveness

Page 12: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Evaluating Visualization Systems

• quantitative algorithmic improvements

• conceptual framework analysis• impact/adoption• user studies• anecdotal evidence

Page 13: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

System 1: H3

• time: 1996-8

• data: web hyperlinks– quasi-hierarchical

graphs: can find reasonable spanning tree using domain-specific information

• goal: scalability

• method: 3D hyperbolic

Page 14: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Background: Hyperbolic Space

• Focus+Context distortion– project from infinite hyperbolic to

finite euclidean– pick best model for useful distortion

conformal:geodesics warped

projective:angles warped

4x4 matrix

Page 15: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Background: Hyperbolic Space

• exponential room in space• exponential number of tree nodes

hyperbolic hemisphere area

exponential:2 sinh r

2

euclidean hemisphere areageometric:

2r2

Thurston and Weeks, The Mathematics of Three Dimensional Manifolds, Scientific American

2D hyperbolic plane

Page 16: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Previous Work: Hierarchies

Cone Trees[Robertson, Mackinlay, Card

91]

Tree Maps[Johnson, Shneiderman

91]

• hierarchies

• distortion: [Furnas, Brown, Carpendale, Keahey]

Robertson, Mackinlay and Card. "Cone Trees: Animated 3D visualizations of hierarchical information. Johnson and Shneiderman. Treemaps: A Space-filling Approach to the Visualization of Hierarchical Information

Page 17: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Previous Work: Distortion & Hierarchy

• 2D Hyperbolic Tree– [Lamping, Rao, Pirolli 94,95]– scalability analysis later

• Fractal [Koike, Yoshihara 93]

• SHriMP [Storey, Muller 95]– don’t scale

• taxonomy [Noik 94]

Lamping, Rao, and Pirolli. A Focus+Content Technique Based on Hyperbolic Geometry for Viewing Large Hierarchies.

Page 18: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Concurrent Work: Nicheworks

• Nicheworks [Wills 97]– layout scales to 1M nodes– linked views– multiple layout approaches

• very different visual metaphor

Wills et al. Nicheworks.

Page 19: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Concurrent Work: Skeletonization

• Skeletonization [Herman 98]– abstractions for tree structure

Herman et al. Skeletonization

Page 20: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department
Page 21: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

H3 Layout• novel layout algorithm detailed in thesis

• hemisphere surface instead of linear circumference

• bottom-up pass: compute hemisphere sizes

• top-down pass: place child on parent surface

Page 22: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Information Density: Scale

center peripheral fringeH3 dozens hundreds thousands

2DPARC

dozens - hundreds

Lamping, Rao, and Pirolli. A Focus+Content Technique Based on Hyperbolic Geometry for Viewing Large Hierarchies.

Page 23: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Information Density: Codimension

dimspace

dim- structure = codim

webviz 3 1circle

2

H3 3 2hemisphere

1

(Carpendale) 3 3cubic grid

0

• want balance between clutter and void• topological approach to describing density• difference between structure and surrounding space

dense

sparse

Carpendale, Cowperthwaite, and Fracchia. Extending Distortion Viewing from 3D to 2D.

Page 24: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Evaluation: Scalability• drawing: constant

– incremental• exception: precision

• layout: linear in |E| – 110,000 edges in 12 seconds given DFS input

• limits:– computational: global layout in main memory– cognitive: disorientation past ~100K nodes

• large neighborhood not global overview• future: landmarks, LOD, abstraction

Page 25: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Evaluation: Impact• product from SGI

– Site Manager aimed at web content creators– bundled starting with Irix 6.3

• research use of library– interface for Skitter Internet tomography

data– analysis of Autonomous System data

• viewer use– 6 researchers converted data to use viewer

• image use– 6 reprint requests

Page 26: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Evaluation: User Study

• [Risden, Czerwinski, Munzner, Cook 00]

• compared 3 browsers for adding content to collection of web pages

snap portal(Yahoo style)

collapsible tree XML3D: H3 + lists

Page 27: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

User Study Results• reliably faster for existing category task• no decline in quality for new category task

differencesstatistically significant

differencesstatistically insignificant

Page 28: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

System 2: Planet Multicast

• time: 1996• joint work:

– Hoffman, Claffy, Fenner

• data: MBone tunnels • task: find badly

placed tunnels• goal: simple baseline• method: 3D

geographic

Page 29: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Previous Work: Geographic Network

• SeeNet3D– [Cox, Eick 95]– arcs on globe layout

• SeeNet– [Becker, Eick, Wilks 95]

• NSFNet– [Cox, Patterson 92]

Becker, Eick, and Wilks. Visualizing Network Data

Cox and Eick. 3D Displays of Network Traffic.

Cox and Patterson. Visualization Study of the NSFNet.

Page 30: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department
Page 31: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Geographic Layout

• distance as stand-in for resource usage– partially correlated

• geographical determination arduous– major scalability problem

• immediate comprehension– evocative, many image reprints

• Wired, National Geographic

– still picture captures much of function

Page 32: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Evaluation: Anecdotal Insights

…> pen-mbone-1.sprintlink.net(204.213.238.11) dc-mbone-1.sprintlink.net(206.229.87.99)

[1/64/tunnel]> elm.can.net(199.246.170.7) dc-mbone-1.sprintlink.net(206.229.87.99) [1/64/tunnel]> boston.terra.net(199.103.128.254) dc-mbone-1.sprintlink.net(206.229.87.99) [1/0/tunnel/querier]> NS.FLSIG.ORG(192.153.117.162) dc-mbone-1.sprintlink.net(206.229.87.99) [1/64/tunnel]> ace.mid.net(198.247.225.251) dc-mbone-1.sprintlink.net(206.229.87.99) [1/64/tunnel]> fw-mbone-1.sprintlink.net(206.61.106.99) dc-mbone-1.sprintlink.net(206.229.87.99)

[1/16/tunnel]> gateway10.crawford.com(198.69.210.2) dc-mbone-1.sprintlink.net(206.229.87.99) [1/32/tunnel]> csce-2--rngm-nb-f-1.net.tamu.edu(128.194.1.11) dc-mbone-1.sprintlink.net(206.229.87.99)

[1/64/tunnel]...

Page 33: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

System 3: Constellation

• time: 1998-9• joint work:

– Guimbretière

• data: MindNet query results

• task: plausibility checking for linguists

• method: 2D custom• goal: targeted

Page 34: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Definition Graph• dictionary entry sentence• nodes: word senses• links: relation types

Page 35: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Semantic Network

• definition graphs as building blocks• unify shared words• large network

– millions of nodes– grammar checking now, translation

future– global structure known: dense

• probes return local info

Page 36: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Path Query• best N paths between two words• words on path itself

• definition graphs used in computation

Page 37: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Task: Plausibility Checking

• paths ordered by computed plausibility

• researcher hand-checks results– high-ranking paths believable?– believable paths high-ranked?– stop words

Page 38: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Top 10 Paths: kangaroo - tail

Page 39: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Goal

• create unified view of relationships between paths and definition graphs– shared words are key– thousands of words (not millions)

• special-purpose algorithm debugging tool – not understand the structure of English

Page 40: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Previous Work: Semantic Networks

• Visual Thesaurus– [Thinkmap applet]– casual browsing,

constant motion– < 20 nodes

• SemNet– [Fairchild, Poltrock,

Furnas 88]– multiple 3D layouts

Fairchild, Poltrock, and Furnas. SemNet: Three-Dimensional Graphic Representations of Large Knowledge Bases.

Thinkmap applet. www.thinkmap.com cited 3/09/00.

Page 41: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department
Page 42: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Traditional Layout

• avoid crossings– reason: avoid false attachments

A

C

B

ambiguity

A B

C D

artifact salience

Page 43: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Information Visualization Approach

• spatial position is strongest perceptual cue– encode domain specific attribute– plausibility gradient

Page 44: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Constellation Semantic Layout

• novel layout algorithm detailed in thesis– paths as backbone, definition graphs attached – curvilinear grid– iterative design for maximum semantics with reasonable information density

• allow crossings for long-distance proxy links

Page 45: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Selective Emphasis• highlight sets of boxes and edges

– interaction– additional perceptual channels

• avoid perception of false attachments

Page 46: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Evaluation: Layout Effectivness

Page 47: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Evaluation: Layout Comparison

dot H3

Page 48: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Talk Outline• graph drawing background• software systems

– goal– previous work– video– discussion– evaluation

• general discussion• conclusion

Page 49: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Visual Salience

• Planet Multicast– long-distance tunnels

• H3– distant points of possible interest

• fringe: aggregate information

• Constellation– selective emphasis– word size tied to importance

Page 50: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Canonical Word Size

Page 51: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Hidden State• Constellation avoids hidden state

– change salience instead of toggle drawing

• closed world assumption– if not visible, doesn’t exist– easy to forget previous actions– false negative conclusions

• H3, PM do have hidden state– non-tree links sometimes drawn – intra-city tunnels never drawn

Page 52: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Graph Functions

• structure discovery– pure spatial layout– implicit in traditional graph drawing

• contextual backdrop• linked view

Page 53: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Graph Functions

• structure discovery• contextual backdrop

– additional visual encoding• color, linewidth, shape, enclosure• combination more than sum of parts

• linked view

Page 54: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Contextual Backdrop

Page 55: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Graph Functions

• structure discovery• contextual backdrop• linked view

– brushing [Becker and Cleveland 88]– invoke other software components

Page 56: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Linked View

Page 57: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department

Contributions• detailed analysis of three software

systems– interactive, range of domain specificity– relate intended tasks to spatial layout,

visual encoding

• two novel layout/drawing algorithms– Constellation

• targeted design

– H3: • scales 100x beyond previous work • product, user study

Page 58: Interactive Visualization of Large Graphs and Networks Tamara Munzner Stanford University Computer Science Department