2
ThemeDelta: Dynamic Segmentations over Temporal Topic Models Paper By: Samah Gad, Waqas Javed, Sohaib Ghani, Niklas Elmqvist, Tom Ewing, Keith N. Hampton, and Naren Ramakrishnan Published: IEEE Transactions on Visualization and Computer Graphics 21(5) 2015 Presentation By: Yasha Pushak What: Text Dataset What: Text Dataset with Timestamps “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” Time 2 Why: Identify Scatter/Gather Relationships What: Derived Bag of word representation “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” “Canada extends it’s condolences to France.” Canada: Condolences: France: Syria: 1 1 1 0 5 3 But what if we have lots of data? What: Derived Bag of word representation “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” “Shouldn’t we help our homeless before refugees?” “Canada stands with Paris.” “I can’t believe I have so many racist friends…” “My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.” “Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.” “Canada extends it’s condolences to France.” “These are the faces of the Syrian refugees. Men, women, and children who’s homes were destroyed and were forced to flee.” “Terror in Paris…” “Canada is full. Say No to terrorists.” “Canada extends it’s condolences to France.” Canada: Condolences: France: Syria: 1 1 1 0 4 Processing the bags of words Latent Dirichlet Allocation (LDA) Input: Bag of Words over time Output: Topics (Groups of keywords at a specific point in time) Timeline Segmentation Input: Topics Output: Optimal time intervals containing groups of topics 5 What? Why? What: Data Timestamped text dataset What: Derived Bag of Words over time Topics (Groups of keywords at a specific point in time) Time intervals containing groups of topics Why: Tasks Identify changes in topics over time Identify scatter/gather relationships 6 How? What? Why? How? How: Encode Parallel axes for time segments Spatially partition topics along a segment Label keywords within topics Linked keywords across time intervals Segment labels for dates and duration How: Encode (Free Channels) Size of labels for quantitative data Width of links for quantitative data Link colour for categorical or ordered data 7 What? Why? How? How: Encode (Free Channels) Size of labels for quantitative data Width of links for quantitative data Link colour for categorical or ordered data How: Manipulate Navigate: geometric zooming and panning Select: highlight keywords Search: Select keywords by searching How: Reduce Filter: by selected keywords and resort 8 Filtering on “Energy” 9 Example: US Presidential Election 2012 - Mitt Romney 10 Sep 09 – Oct 09 “German”, “mask” – advisories from the first world war to wear a mask Oct 10 – Dec 05 “home”, “family”, “son”, “daughter” – men from the army were allowed to return home Dec 06 – Dec 13 “German” disappears – The war was won on November 11. 11 Spanish Flu in the News Expert User Study on Spanish Flu Data Changed-Focused Questions How did the newspapers describe the spread of influenza? How does the description of the pandemic change over time? Are there different times when the influenza pandemic becomes less important? What are those time periods? Connection-Focused Questions What are the categories that appear to be associated with influenza in different newspapers? Was there a specific feeling that surrounded the influenza reporting in the newspapers? 12 Scalability Limits 13 Thank You References ThemeDelta: Dynamic Segmentations over Temporal Topic Models, by Samah Gad, Waqas Javed, Sohaib Ghani, Niklas Elmqvist, Tom Ewing, Keith N. Hampton, Naren Ramakrishnan, in IEEE Transactions on Visualization and Computer Graphics 21(5) 2015. Visualization Analysis and Design, by Tamara Munzner, A K Peters Visualization Series, CRC Press, 2014. 14 What? Why? How? How: Encode (Free Channels) Size of labels for quantitative data Width of links for quantitative data Link colour for categorical or ordered data How: Manipulate Navigate: geometric zooming and panning Select: highlight keywords Search: Select keywords by searching How: Reduce Filter: by selected keywords and resort What: Data Timestamped text dataset What: Derived Bag of Words over time Topics (Groups of words) Time intervals containing groups of topics Why: Tasks Identify changes in topics over time Identify scatter/gather relationships How: Encode Parallel axes for time segments Spatially partition topics along a segment Label keywords within topics Linked keywords across time intervals Segment labels for dates and duration

ThemeDelta : Dynamic Segmentations over Temporal Topic Modelstmm/courses/547-15/slides/yasha... · 2015. 12. 8. · ThemeDelta : Dynamic Segmentations over Temporal Topic Models Paper

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: ThemeDelta : Dynamic Segmentations over Temporal Topic Modelstmm/courses/547-15/slides/yasha... · 2015. 12. 8. · ThemeDelta : Dynamic Segmentations over Temporal Topic Models Paper

ThemeDelta: Dynamic Segmentations over Temporal

Topic ModelsPaper By: Samah Gad, Waqas Javed, Sohaib Ghani, Niklas Elmqvist, Tom

Ewing, Keith N. Hampton, and Naren Ramakrishnan

Published: IEEE Transactions on Visualization and Computer Graphics 21(5) 2015

Presentation By: Yasha Pushak

What: Text DatasetWhat: Text Dataset with Timestamps

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”

“Canada is full. SayNo to terrorists.”

Time2

Why: Identify Scatter/Gather Relationships

What: DerivedBag of word representation

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”

“Canada is full. SayNo to terrorists.”

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”“Canada is full. SayNo to terrorists.”

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”

“Canada is full. SayNo to terrorists.”

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”“Canada is full. SayNo to terrorists.”

“Canada extends it’s condolences to France.”

Canada: Condolences: France: Syria:

11

1 0

53

But what if we have lots of data?

What: DerivedBag of word representation

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”

“Canada is full. SayNo to terrorists.”

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”“Canada is full. SayNo to terrorists.”

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”

“Canada is full. SayNo to terrorists.”

“Shouldn’t we help ourhomeless before refugees?”

“Canada stands with Paris.”

“I can’t believe I have somany racist friends…”

“My great grand parents immigrated from Russia to escape violence… No one told them ‘We’re full’.”

“Diversity is our strength. We strongly condemn the acts aimed at certain Canadians after the Paris attacks.”

“Canada extends it’s condolences to France.”

“These are the faces of the Syrian refugees.Men, women, and children who’s homes

were destroyed and were forced to flee.”

“Terror in Paris…”“Canada is full. SayNo to terrorists.”

“Canada extends it’s condolences to France.”

Canada: Condolences: France: Syria:

11

1 0

4

Processing the bags of words

Latent Dirichlet Allocation (LDA)Input: Bag of Words over time

Output: Topics (Groups of keywords at a specific point in time)

Timeline SegmentationInput: Topics

Output: Optimal time intervals containing groups of topics

5

What? Why?

What: Data Timestamped text dataset

What: DerivedBag of Words over time

Topics (Groups of keywords at a specific point in time)

Time intervals containing groups of topics

Why: TasksIdentify changes in topics over time

Identify scatter/gather relationships

6

How? What? Why? How?

How: EncodeParallel axes for time segments

Spatially partition topics along a segment

Label keywords within topics

Linked keywords across time intervals

Segment labels for dates and duration

How: Encode (Free Channels)Size of labels for quantitative data

Width of links for quantitative data

Link colour for categorical or ordered data

7

What? Why? How?

How: Encode (Free Channels)Size of labels for quantitative data

Width of links for quantitative data

Link colour for categorical or ordered data

How: ManipulateNavigate: geometric zooming and panning

Select: highlight keywords

Search: Select keywords by searching

How: ReduceFilter: by selected keywords and resort

8

Filtering on “Energy”

9

Example: US Presidential Election 2012 - Mitt Romney

10

Sep 09 – Oct 09“German”, “mask” – advisories from the first world war to wear a mask

Oct 10 – Dec 05“home”, “family”, “son”, “daughter” – men from the army were allowed to return home

Dec 06 – Dec 13 “German” disappears – The war was won on November 11.

11

Spanish Flu in the News Expert User Study on Spanish Flu Data

Changed-Focused QuestionsHow did the newspapers describe the spread of influenza?

How does the description of the pandemic change over time?

Are there different times when the influenza pandemic becomes less important? What are those time periods?

Connection-Focused QuestionsWhat are the categories that appear to be associated with influenza in different newspapers?

Was there a specific feeling that surrounded the influenza reporting in the newspapers?

12

Scalability Limits

13

Thank You

ReferencesThemeDelta: Dynamic Segmentations over Temporal Topic Models, by Samah Gad,

Waqas Javed, Sohaib Ghani, Niklas Elmqvist, Tom Ewing, Keith N. Hampton, NarenRamakrishnan, in IEEE Transactions on Visualization and Computer Graphics 21(5) 2015.

Visualization Analysis and Design, by Tamara Munzner, A K Peters Visualization Series, CRC Press, 2014.

14

What? Why? How?

How: Encode (Free Channels)

Size of labels for quantitative data

Width of links for quantitative data

Link colour for categorical or ordered data

How: Manipulate

Navigate: geometric zooming and panning

Select: highlight keywords

Search: Select keywords by searching

How: Reduce

Filter: by selected keywords and resort

What: Data Timestamped text dataset

What: DerivedBag of Words over timeTopics (Groups of words)Time intervals containing groups of topics

Why: TasksIdentify changes in topics over timeIdentify scatter/gather relationships

How: EncodeParallel axes for time segmentsSpatially partition topics along a segmentLabel keywords within topicsLinked keywords across time intervalsSegment labels for dates and duration

Page 2: ThemeDelta : Dynamic Segmentations over Temporal Topic Modelstmm/courses/547-15/slides/yasha... · 2015. 12. 8. · ThemeDelta : Dynamic Segmentations over Temporal Topic Models Paper

Low vs High Income Neighbourhood DiscussionsLow Income High Income Layout Sorting