Northwestern data visualization - why why why

Northwestern Data Visualization | @gabegaster | May 2015

what is data science? who is a data scientist?

review of literature

a data scientist is “a scientist who can code”:

• lower barrier to attack new problems
• repeatable analysis
• freedom to think about problems in new ways

what is data science?

using emerging technologies to approach problems scientifically which were difficult to answer before

computing has progressed

cost of new analysis:
• 1950: years
• today: hours

the same person thinking about the problem can conduct experiments to answer it

open-source code:
• standing on shoulders of giants
• reinventing the wheel

what is data science?

using emerging technologies to approach problems scientifically which were difficult to answer before

• knowing what is possible
• doing something useful

HOW: using new, good, the right tools
WHY: asking why

what is data science? why why why

science is about asking why

start there

an anecdote

an example from the real world

what data do you have? how can we make a network?

goal: save money

task: find needle in the haystack (without poking yourself)

documents: about patent / not about patent
decision: turn over to plaintiff / don’t turn over to plaintiff
risks: adverse inference, give away trade secrets

goal: save money

prototype: design for lawyers

Sexier. Less nerdy. Tailored.

design for transparency

http://www.daegis.com/judicial-acceptance-of-tar/

another example: contests

task: classify schizophrenia with MRI

why? improve understanding of disease

how? … outside contest purview

kaggle

getting data & making usable

WHY

timeline of contest

Accuracy of Classification (AUC)

what is AUC? Area Under Curve

what curve? Receiver Operating Characteristic

balances: True Positive Rate, False Positive Rate

why? …

upshot: choice of metric matters a LOT in practice
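As a rough illustration of the metric on this slide, here is a minimal sketch in plain Python. It computes the area under the ROC curve through its rank interpretation: the fraction of (positive, negative) pairs in which the positive example gets the higher score, with ties counted as half. The function name `roc_auc` and the toy labels and scores are assumptions made for illustration; they are not from the contest.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    labels: sequence of 0/1 (1 = positive class)
    scores: classifier scores, higher means "more likely positive"
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")

    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(positives) * len(negatives))


# Hypothetical toy data, for illustration only
labels = [1, 1, 0, 1, 0, 0]
scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.2]
print(roc_auc(labels, scores))       # ranking quality of the toy scores
print(roc_auc(labels, [0.5] * 6))    # a constant score carries no ranking information
```

A scorer that gives every example the same score lands at 0.5, which is the "random guess" level, and a perfect ranking gives 1.0. Because only the ordering of scores matters, AUC can rank submissions differently than plain accuracy at a fixed threshold would.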

timeline of contest

Accuracy of Classification (AUC)

• random guess
• basic SVM
• me

goal: depends on why

turned out to place 9th, because of overfitting

very common problem

worth it?

an example from the industrial internet

me: Why? Why do you need to set up a hadoop architecture to do clustering? What will this help you achieve?

client: We need to reduce the costs of Service Requests. They are too expensive.

me: How do you handle Service Requests?

client: Thousands of engineers around the world, 24-7, read through emails and hardware log files to determine the cause of failure of a server. This is an expensive process. We've tried to automate it. We can now automatically resolve 7% of new Service Requests. But we want more. That's why we bought a few supercomputers with TBs of memory.

tools are not everything

but it is important to know the right tool for the job

don’t start with hadoop unless you have to.

probably you don’t have to.

me: How did you automate resolving Service Requests?

client: A group of senior engineers thought about different use cases and came up with a list of conditions that, if any are met, lead to predetermined solutions. Took a year to create. We’ve been keeping track of every solved request for several years now.

    # fit a naive Bayes classifier on past requests and the decisions made for them
    from sklearn import naive_bayes as nb
    nb.GaussianNB().fit(historical_requests,
                        historical_decisions)
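For contrast with the learned model, the hand-built approach the client describes (a list of conditions that, if any are met, lead to predetermined solutions) might look roughly like the sketch below. The rule contents, the request fields `log` and `temperature_c`, and the function name `resolve` are all invented for illustration; nothing here comes from the client's actual system.

```python
# Hypothetical rules: each pairs a condition on a request with a canned solution.
RULES = [
    (lambda req: "fan failure" in req["log"], "replace fan"),
    (lambda req: req["temperature_c"] > 90, "check cooling"),
]


def resolve(request, rules=RULES):
    """Return the first predetermined solution whose condition matches, else None."""
    for condition, solution in rules:
        if condition(request):
            return solution
    return None  # no rule fired: the request goes to a human engineer


# Hypothetical request, for illustration only
print(resolve({"log": "fan failure detected", "temperature_c": 40}))
```

Every rule has to be thought up and maintained by hand, whereas a classifier fitted on the history of solved requests picks up its patterns from data the client already has.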

client: This works really well! But we can’t use it.

me: Oh. Why is that?

client: Engineers don’t trust the predictions.

an example, just for fun

(a bit more depth this time)

a typical trip for me

Bus transit times = a LIE

Chicago is a grid city

Public Transit on the grid + Diagonals = Difficult

2+ buses = FAIL

Adding bikes to public transit = win

viz Goal:

• show how Divvy has changed where people can go
• show where people actually go

How to show where people go from here?

one way is with an arrow: A → B

• good for abstract relationships
• hard to decipher

lines between pts? (@flowingdata)

• emphasizes traffic (the lines superimpose)
• beautiful map, but not suited for this goal

how to represent stations? on a map

but how?

• can use gradient to show gradual differences between stations (London transit map, @mySociety)
• or use natural borders? (@mbostock)

what regions?

each point is related to the closest station → Voronoi

huh? http://alexbeutel.com/webgl/voronoi.html

Find the closest station: that’s my region!
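The "closest station" rule can be written down directly. The sketch below is a minimal illustration in plain Python, assuming stations sit on a flat (x, y) plane and that straight-line distance is good enough. The station names, coordinates, and the helper names `closest_station` and `voronoi_labels` are made up for the example; real latitude/longitude data would need a suitable projection or distance.

```python
import math

# Hypothetical station coordinates on a flat plane; not real Divvy data.
stations = {
    "A": (0.0, 0.0),
    "B": (4.0, 0.0),
    "C": (0.0, 3.0),
}


def closest_station(point, stations):
    """Return the name of the station nearest to `point` (Euclidean distance).

    All points that map to the same station form that station's Voronoi cell.
    """
    return min(stations, key=lambda name: math.dist(point, stations[name]))


def voronoi_labels(stations, width, height):
    """Label each cell of a width x height integer grid with its closest station."""
    return [
        [closest_station((x, y), stations) for x in range(width)]
        for y in range(height)
    ]


# Print a coarse text picture of the three regions
for row in voronoi_labels(stations, 5, 4):
    print("".join(row))
```

Labeling a grid like this is the brute-force view of a Voronoi diagram. For drawing actual tile outlines, a library routine that returns the cell polygons is the more usual choice.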

Czech beer

Starbucks

@gabegaster | http://bit.ly/1pdP2Tb

recap: stations are voronoi tiles

@gabegaster | http://bitly.com/bundles/gabegaster/1

bubbles

too many edges to show…

what about using color?

how to use color?

colors v colors (@mySociety, @mbostock)

binned v gradient

gradient:
• hard to read
• differences subtle

binned:
• two colors, not many
• binned, not gradient
• transparent empty bin
• iterate
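The color choices listed here (a two-color scale, bins rather than a gradient, and a transparent empty bin) can be sketched in a few lines of plain Python. The endpoint colors, the bin edges, and the function names below are assumptions picked for illustration; the actual values used in the visualization are not given in the slides.

```python
from bisect import bisect_right

# Assumed endpoint colors (RGB) and bin edges; purely illustrative.
LOW = (255, 237, 160)      # light yellow
HIGH = (189, 0, 38)        # dark red
EDGES = [1, 10, 50, 200]   # bins: 1-9, 10-49, 50-199, 200+
TRANSPARENT = (0, 0, 0, 0)


def bin_colors(low, high, n_bins):
    """Build n_bins discrete RGBA colors stepping from `low` to `high`."""
    colors = []
    for i in range(n_bins):
        t = i / (n_bins - 1) if n_bins > 1 else 0.0
        rgb = tuple(round(lo + t * (hi - lo)) for lo, hi in zip(low, high))
        colors.append(rgb + (255,))
    return colors


def binned_color(count, edges=EDGES, low=LOW, high=HIGH):
    """Map a trip count to a discrete color; counts below the first edge are transparent."""
    index = bisect_right(edges, count)   # number of edges <= count
    if index == 0:
        return TRANSPARENT               # the empty bin lets the map show through
    return bin_colors(low, high, len(edges))[index - 1]


for count in (0, 5, 10, 75, 1000):
    print(count, binned_color(count))
```

With bins, every tile takes one of a handful of clearly distinct colors, which addresses the "hard to read, differences subtle" problem noted for gradients. Choosing the edges is where the "iterate" step comes in.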

with these choices, the viz emphasizes:

• dispersion (where people can / do go)
• exploration

what can we learn from this?

• urban vs lake front
• commute vs party
• vs train
• hipster vs yuppie

there’s me, too.

Great! This looks great!

But it’s too much at once.

make it interactive: divvy.datasco.pe

new question: How are stations different?

• who uses it
• when is the station used
• how it is used

use the time signature of a station

Time Signature of a station: http://divvy.datasco.pe/multiline/
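The slides do not define "time signature" precisely. One plausible reading, assumed here, is the share of a station's trips that start in each hour of the day. The sketch below uses made-up trip records and a hypothetical helper `time_signature`; the real Divvy data has its own column names and formats.

```python
from collections import Counter
from datetime import datetime

# Hypothetical trip records: (station name, trip start time). Not real Divvy data.
trips = [
    ("Station X", "2014-06-02 08:05"),
    ("Station X", "2014-06-02 08:40"),
    ("Station X", "2014-06-02 17:15"),
    ("Station Y", "2014-06-07 13:30"),
    ("Station Y", "2014-06-07 14:10"),
]


def time_signature(trips, station):
    """Return a 24-element list: the fraction of `station`'s trips starting in each hour."""
    hours = Counter(
        datetime.strptime(start, "%Y-%m-%d %H:%M").hour
        for name, start in trips
        if name == station
    )
    total = sum(hours.values())
    if total == 0:
        return [0.0] * 24   # station has no trips in the data
    return [hours[h] / total for h in range(24)]


signature = time_signature(trips, "Station X")
print([round(share, 2) for share in signature])
```

Normalizing by the station's total makes busy and quiet stations comparable by the shape of their day, for example a morning and evening commute peak versus a weekend afternoon bump.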

questions?

in conclusion

thanks! @gabegaster