Can Social Media tell us something about our lives?

Embed Size (px)

Citation preview

  • 7/28/2019 Can Social Media tell us something about our lives?

    1/43

    Can Social Media tell us something

    about our lives?

    Vasileios Lampos

    Computer Science DepartmentUniversity of Sheffield

    March, 2013

    V. Lampos [email protected] Can Social Media tell us something about our lives? 1/43

    1/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    2/43

    Outline

    Motivation, Aims [Facts, Questions] Data

    Nowcasting Events

    Extracting Mood Patterns

    TrendMiner Extracting Political Opinion

    |= Conclusions

    V. Lampos [email protected] Can Social Media tell us something about our lives? 2/43

    2/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    3/43

    Facts

    We started to work on those ideas back in 2008, when...

    Web contained 1 trillion unique pages (Google)

    Social Networks were rising, e.g. Facebook: 100m (2008) >1 billion active users (October, 2012) Twitter: 6m (2008) 500m active users (July, 2012)

    User behaviour was changing

    Socialising via the Web Giving up privacy (Debatin et al., 2009)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 3/43

    3/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    4/43

    Some general questions

    Does user generated text posted on Social Web platforms include

    useful information?

    How can we extract this useful information...... automatically? Therefore, not we, but a machine.

    Practical / real-life applications?

    Can those large samples of human input assist studies in otherscientific fields?Social Sciences, Psychology, Epidemiology

    V. Lampos [email protected] Can Social Media tell us something about our lives? 4/43

    4/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    5/43

    The Data (1/3)

    Why Twitter?

    Has a lot of content that is publicly accessible Provides a well-documented API for several types of data collection

    Opinions and personal statements on various domains

    Connection with current affairs (usually in real-time) Some content is geo-located

    Option for personalised modelling ... and we got good results from the very first, simple experiment!

    V. Lampos [email protected] Can Social Media tell us something about our lives? 5/43

    5/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    6/43

    The Data (2/3)What does a @tweet look like?

    Figure 1 : Some biased and anonymised examples of tweets (limit of 140characters/tweet, # denotes a topic)

    (a) (user will remain anonymous) (b) they live around us

    (c) citizen journalism (d) flu attitude

    V. Lampos [email protected] Can Social Media tell us something about our lives? 6/43

    6/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    7/43

    The Data (3/3)

    Data Collection & Preprocessing

    The easiest part of the process... not true! Storage space, crawler implementation, parallel dataprocessing, new technologies (e.g., Map-Reduce) (Preotiuc et al., 2012)

    Data collected via Twitters Search API:

    collective sampling tweets geo-located in 54 urban centres in the UK periodical crawling (every 3 or 5 minutes per urban centre)

    Data collected via Twitters REST API:

    user-centric sampling

    preprocessing to approximate users location (city & country) ... or manual user selection from domain experts get their latest tweets (3,000 or more)

    Several forms of ground truth (flu/rainfall rates, polls)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 7/43

    7/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    8/43

    Nowcasting Events from the

    Social Web

    V. Lampos [email protected] Can Social Media tell us something about our lives? 8/43

    8/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    9/43

    Nowcasting?

    We do not predict the future, but infer the present i.e. the very recent past

    Figure 2 : Nowcasting the magnitude of an event () emerging in the real worldfrom Web information

    Our case studies: nowcasting (a) flu rates & (b) rainfall rates (?!)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 9/43

    9/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    10/43

    What do we get in the end?

    This is a regression problem (text regression in NLP)i.e. time interval i we aim to infer yi R using text input xxxi Rn

    0 5 10 15 20 25 300

    2

    4

    6

    8

    10

    12

    14

    16

    Days

    R

    ainfallr

    ate(m

    m)

    Bristol

    Actual

    Inferred

    Figure 3 : Inferred rainfall rates for Bristol, UK (October, 2009)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 10/4310/43

    M h d l (1/5) T i V S

  • 7/28/2019 Can Social Media tell us something about our lives?

    11/43

    Methodology (1/5) Text in Vector Space

    Candidate features (n-grams): C = {ci}Set of Twitter posts for a time interval u:

    P(u) =

    {pj

    }Frequency of ci in pj:

    g(ci, pj) =

    if ci pj,0 otherwise.

    g Boolean, maximum value for is 1

    Score of ci in P(u):

    s

    ci, P(u)

    =

    |P(u)

    |j=1

    g(ci, pj)

    |P(u)|

    V. Lampos [email protected] Can Social Media tell us something about our lives? 11/4311/43

    M h d l (2/5)

  • 7/28/2019 Can Social Media tell us something about our lives?

    12/43

    Methodology (2/5)

    Set of time intervals: U= {uk} 1 hour, 1 day, ...

    Time series of candidate features scores:

    X(U) =

    xxx(u1) ... xxx(u|U|)T

    ,

    wherexxx(ui) =

    s

    c1, P(ui)

    ... s

    c|C|, P(ui)T

    Target variable (event):

    yyy(U) =

    y1 ... y|U|T

    V. Lampos [email protected] Can Social Media tell us something about our lives? 12/4312/43

    M th d l (3/5) F t l ti

  • 7/28/2019 Can Social Media tell us something about our lives?

    13/43

    Methodology (3/5) Feature selection

    Solve the following optimisation problem:

    minw

    X(U)www

    yyy(U)

    22

    s.t. www1 t,t = wwwOLS1 , (0, 1].

    Least Absolute Shrinkage and Selection Operator (LASSO)argmin

    wwwX(U)www yyy(U)22 + www1

    (Tibshirani, 1996)

    Expect a sparse www (feature selection)

    Least Angle Regression (LARS) computes entire regularisationpath (wwws for different values of ) (Efron et al., 2004)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 13/4313/43

    M th d l (4/5)

  • 7/28/2019 Can Social Media tell us something about our lives?

    14/43

    Methodology (4/5)

    LASSO is model-inconsistent:

    inferred sparsity pattern may deviate from the true model, e.g.,when predictors are highly correlated (Zhao and Yu, 2006)

    bootstrap [?] LASSO (Bolasso) performs a more robust featureselection (Bach, 2008)?:

    in each bootstrap, input space is sampled with replacement apply LASSO (LARS) to select features select features with nonzero weights in all bootstraps

    better alternative soft-Bolasso: a less strict feature selection select features with nonzero weights in p% of bootstraps (learn p using a separate validation set)

    weights of selected features determined via OLS regression

    V. Lampos [email protected] Can Social Media tell us something about our lives? 14/4314/43

    Methodolog (5/5) Sim lified s mma

  • 7/28/2019 Can Social Media tell us something about our lives?

    15/43

    Methodology (5/5) Simplified summary

    Observations: X

    R

    mn (m time intervals, n features)

    Response variable: yyy Rm

    For i = 1 to number of bootstrapsForm Xi

    X by sampling X with replacement

    Solve LASSO for Xi and yyy, i.e. learn wwwi RnGet the k n features with nonzero weights

    End_For

    Select the v n features with nonzero weight in p% of the bootstrapsLearn their weights with OLS regression on X(v) Rmv and yyy

    V. Lampos [email protected] Can Social Media tell us something about our lives? 15/4315/43

    How do we form candidate features?

  • 7/28/2019 Can Social Media tell us something about our lives?

    16/43

    How do we form candidate features?

    Commonly formed by indexing the entire corpus(Manning, Raghavan and Schtze, 2008)

    We extract them from Wikipedia, Google Search results, PublicAuthority websites (e.g., NHS)

    Why? reduce dimensionality to bound the error of LASSO

    L(www) L(www) + Q, with Q min

    W21N

    +p

    N,

    W21N

    +W1

    N

    p candidate features, N samples, empirical loss

    L(www) and

    www1 W1 (Bartlett, Mendelson and Neeman, 2011)

    Harry Potter Effect!

    V. Lampos [email protected] Can Social Media tell us something about our lives? 16/4316/43

    The Harry Potter effect (1/2)

  • 7/28/2019 Can Social Media tell us something about our lives?

    17/43

    The Harry Potter effect (1/2)

    Figure 4 : Events co-occurring (correlated) with the inference target may affectfeature selection, especially when the sample size is small.

    180 200 220 240 260 280 300 320 3400

    50

    100

    150

    200

    250

    300

    Day Number (2009)

    Event

    Score

    Flu (England & Wales)

    Hypothetical Event I

    Hypothetical Event II

    (Lampos, 2012a)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 17/4317/43

    The Harry Potter effect (2/2)

  • 7/28/2019 Can Social Media tell us something about our lives?

    18/43

    The Harry Potter effect (2/2)Table 1 : Top 1-grams correlated with flu rates in England/Wales (0612/2009)

    1-gram Event Corr. Coef.latitud Latitude Festival 0.9367

    flu Flu epidemic 0.9344swine 0.9212harri Harry Potter Movie 0.9112

    slytherin 0.9094potter 0.8972

    benicassim Benicssim Festival 0.8966graduat Graduation (?) 0.8965

    dumbledor Harry Potter Movie 0.8870hogwart 0.8852quarantin Flu epidemic 0.8822gryffindor Harry Potter Movie 0.8813ravenclaw 0.8738

    princ 0.8635swineflu Flu epidemic 0.8633

    ginni Harry Potter Movie 0.8620weaslei 0.8581

    hermion 0.8540draco 0.8533

    Solution: ground truth with some degree of variability

    (Lampos, 2012a)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 18/4318/43

    About n grams

  • 7/28/2019 Can Social Media tell us something about our lives?

    19/43

    About n-grams

    1-grams

    decent (dense) representation in the Twitter corpus unclear semantic interpretation

    Example: I am not sick. But I dont feel great either!

    2-grams

    very sparse representation in tweets

    sometimes clearer semantic interpretation

    Experimental process indicated that...

    a hybrid combination of 1-grams and 2-grams

    delivers the best inference performance

    refer to (Lampos, 2012a)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 19/4319/43

    Flu rates Example of selected features

  • 7/28/2019 Can Social Media tell us something about our lives?

    20/43

    Flu rates Example of selected features

    Figure 5 : Font size is proportional to the weight of each feature; flipped n-gramsare negatively weighted. All words are stemmed (Porter, 1980).

    (Lampos and Cristianini, 2012)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 20/4320/43

    Rainfall rates Example of selected features

  • 7/28/2019 Can Social Media tell us something about our lives?

    21/43

    Rainfall rates Example of selected features

    Figure 6 : Font size is proportional to the weight of each feature; flipped n-gramsare negatively weighted. All words are stemmed (Porter, 1980).

    (Lampos and Cristianini, 2012)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 21/4321/43

    Examples of inferences

  • 7/28/2019 Can Social Media tell us something about our lives?

    22/43

    Examples of inferences

    0 5 10 15 20 25 300

    20

    40

    60

    80

    100

    120

    Days

    FluRate

    C.En

    gland&Wales

    Actual

    Inferred

    (a) Central England/Wales (flu)

    0 5 10 15 20 25 300

    20

    40

    60

    80

    100

    120

    Days

    FluRate

    S.England

    Actual

    Inferred

    (b) South England (flu)

    0 5 10 15 20 25 300

    2

    4

    6

    8

    10

    12

    14

    16

    Days

    R

    ainfallrate(mm)

    Bristol

    Actual

    Inferred

    (c) Bristol (rain)

    Figure 7 : Examples of flu and rainfall rates inferences from Twitter content

    (Lampos and Cristianini, 2012)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 22/4322/43

    Performance figures

  • 7/28/2019 Can Social Media tell us something about our lives?

    23/43

    Performance figures

    Table 2 : RMSE for flu rates inference (5-fold cross validation), 50m tweets,21/06/200919/04/2010

    Method 1-grams 2-grams Hybrid

    Baseline 12.442.37 13.813.29 11.621.58

    Bolasso 11.142.35 12.642.57 10.572.2

    CART ensemble 9.635.21 13.134.72 9.44.21

    Table 3 : RMSE (in mm) for rainfall rates inference (6-fold cross validation),8.5m tweets, 01/07/200930/06/2010

    Method 1-grams 2-grams Hybrid

    Baseline 2.910.6 3.10.57 4.392.99

    Bolasso 2.730.65 2.950.55 2.600.68CART ensemble 2.710.69 2.720.72 2.640.63

    As implemented in (Ginsberg et al., 2009) Classification and Regression Tree (Breiman et al., 1984) & (Sutton, 2005)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 23/4323/43

    Flu Detector

  • 7/28/2019 Can Social Media tell us something about our lives?

    24/43

    Flu Detector

    URL: http://geopatterns.enm.bris.ac.uk/epidemics

    Figure 8 : Flu Detector uses the content of Twitter to nowcast flu rates inseveral UK regions

    (Lampos, De Bie and Cristianini, 2010)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 24/4324/43

    http://geopatterns.enm.bris.ac.uk/epidemicshttp://geopatterns.enm.bris.ac.uk/epidemics
  • 7/28/2019 Can Social Media tell us something about our lives?

    25/43

    Computing a mood score

  • 7/28/2019 Can Social Media tell us something about our lives?

    26/43

    p gTable 4 : Mood terms from WordNet Affect

    Fear Sadness Joy Anger

    afraid depressed admire angryfearful discouraged cheerful despise

    frighten disheartened enjoy enviouslyhorrible dysphoria enthousiastic harassed

    panic gloomy exciting irritate... ... ... ...

    (92 terms) (115 terms) (224 terms) (146 terms)

    Mood score computation for a time interval d using n mood terms

    msd =1

    n

    ni=1

    c(td)iN(td)

    c(td)i : count of term i in the Twitter corpus of day d

    N(td): number of tweets for day dUsing the sample of d days, compute a standardised mood score:

    msstdd =msd ms

    ms

    V. Lampos [email protected] Can Social Media tell us something about our lives? 26/4326/43

    The mood of the nation (1/5)

  • 7/28/2019 Can Social Media tell us something about our lives?

    27/43

    ( / )

    Figure 9 : Daily time series (actual & their 14-point moving average) for themood of Joy based on Twitter content geo-located in the UK

    Jul 09 Jan 10 Jul 10 Jan 11 Jul 11 Jan 122

    0

    2

    4

    6

    8

    10

    933 Day Time Series for Joy in Twitter Content

    Date

    NormalisedEmotionalValence

    * RIOTS

    * CUTS

    * XMAS* XMAS

    * XMAS

    * roy.wed.

    * halloween

    * halloween

    * halloween

    * valentine* valentine

    * easter

    * easter

    raw joy signal

    14day smoothed joy

    (Lansdall, Lampos and Cristianini, 2012a&b)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 27/4327/43

    The mood of the nation (2/5)

  • 7/28/2019 Can Social Media tell us something about our lives?

    28/43

    ( / )

    Figure 10 : Daily time series (actual & their 14-point moving average) for themood of Anger based on Twitter content geo-located in the UK

    Jul 09 Jan 10 Jul 10 Jan 11 Jul 11 Jan 124

    3

    2

    1

    0

    1

    2

    3

    4

    5933 Day Time Series for Anger in Twitter Content

    Date

    NormalisedEmotionalValence

    * RIOTS

    * CUTS

    * XMAS

    * XMAS

    * XMAS

    * roy.wed.

    * halloween

    * halloween

    * halloween

    * valentine

    * valentine* easter

    * easter

    raw anger signal

    14day smoothed anger

    (Lansdall, Lampos and Cristianini, 2012a&b)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 28/4328/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    29/43

    The mood of the nation (4/5)

  • 7/28/2019 Can Social Media tell us something about our lives?

    30/43

    ( / )

    Figure 12 : Projections of 4-dimensional mood score signals (joy, sadness, anger andfear) on their top-2 principal components (PCA) Twitter content from 2011

    1.5 1 0.5 0 0.5 10.5

    0.4

    0.3

    0.2

    0.1

    0

    0.1

    0.2

    0.3

    0.4

    SaturdaySunday

    MondayTuesday

    Wednesday

    Thursday

    Friday

    1st Principal Component

    2ndPrincipalComponent

    Days of the Week

    (a) Days of the week (2011)

    8 6 4 2 0 2 4 6 82

    0

    2

    4

    6

    8

    10

    1

    2345

    6789

    10

    1112

    1314

    151617

    18192021

    22

    23

    24252627

    28

    2930

    3132

    333435 363738

    3940

    4142

    4344

    45

    46474849

    5051

    5253545556 57585960 6162

    636465

    66

    676869

    707172

    737475

    76

    7778

    7980818283

    84 858687888990

    91

    92

    93

    949596979899100

    101102103104105

    106107

    108109110111112 113

    114

    115

    116117118

    119

    120121

    122

    123124

    125126

    127128

    129

    130131132

    133 134135

    136137138139

    140141

    142143144

    145146147

    148149150151152153154

    155156157158159160161

    162163164165166167168

    169

    170

    171172173

    174175176177178179180

    181

    182183184185

    186187188189

    190191

    192193194195

    196 197

    198199

    200201202203

    204205

    206207

    208209210

    211212213214215216217

    218219

    220221222

    223224

    225226227228229230

    231

    232233234235236237238239240241

    242243244245

    246247248249250251252

    253 254

    255256257

    258259

    260261

    262263264

    265

    266267

    268269

    270271272

    273

    274275276277278279

    280281

    282

    283284285286

    287

    288289

    290291292293294

    295 296297

    298299300301

    302303304

    305306307308

    309310

    311312313314

    315

    316317

    318319320321322

    323324325326327

    328

    329

    330331

    332333

    334

    335336

    337338339340341342

    343

    344345346347348349

    350351352

    353354

    355356

    357

    358

    359

    360

    361362363

    364

    365

    1st Principal Component

    2ndPrincipalComponent

    Days in 2011

    (b) Days of the year (2011)

    Cluster INew Year (1), Valentines (45), Christmas Eve (358), New Years Eve (365)

    Cluster IIO.B. Ladens death (122), Winehouses death + Breivik (204), UK riots (221)

    (Lampos, 2012a)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 30/4330/43

    The mood of the nation (5/5)

  • 7/28/2019 Can Social Media tell us something about our lives?

    31/43

    URL: http://geopatterns.enm.bris.ac.uk/mood

    Figure 13 : Mood of the Nation uses the content of Twitter to nowcast moodrates in several UK regions

    (Lampos, 2012a)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 31/4331/43

    Circadian mood patterns (1/3)

    http://geopatterns.enm.bris.ac.uk/moodhttp://geopatterns.enm.bris.ac.uk/mood
  • 7/28/2019 Can Social Media tell us something about our lives?

    32/43

    Compute 24-h mood score patterns

    Mood score computation for a time interval u = 24hours using nmood terms (WordNet) and a sample of D days:

    Ms(u) = 1|D||D|

    j=1

    1n

    ni=1

    sf(tj,

    u)i

    sf(td,u)

    i =f(td,u)

    i fif

    i

    , i {1,..., n}.

    f(td,u)

    i : normalised frequency of a mood term i during time interval u in day dD

    V. Lampos [email protected] Can Social Media tell us something about our lives? 32/4332/43

    Circadian mood patterns (2/3)

  • 7/28/2019 Can Social Media tell us something about our lives?

    33/43

    Figure 14 : Circadian (24-hour) mood patterns based on UK Twitter content

    V. Lampos [email protected] Can Social Media tell us something about our lives? 33/4333/43

    Circadian mood patterns (3/3)

  • 7/28/2019 Can Social Media tell us something about our lives?

    34/43

    Figure 15 : Autocorrelation of circadian mood patterns based on hourly lagsrevealing daily and weekly periodicities

    1 12 24 36 48 60 72 84 96 108 120 132 144 156 168

    0

    0.2

    0.4

    Autocorr. Lags (Hours)

    Autocorr.

    (Fea

    r)

    Autocorr.

    Conf. Bound

    (a) Fear

    1 12 24 36 48 60 72 84 96 108 120 132 144 156 168

    0

    0.1

    0.2

    0.3

    0.4

    Autocorr. Lags (Hours)

    Autocorr.

    (Sadness)

    Autocorr.

    Conf. Bound

    (b) Sadness

    1 12 24 36 48 60 72 84 96 108 120 132 144 156 168

    0.2

    0

    0.2

    0.4

    Autocorr. Lags (Hours)

    Autocorr.

    (Joy)

    Autocorr.

    Conf. Bound

    (c) Joy

    1 12 24 36 48 60 72 84 96 108 120 132 144 156 168

    0

    0.1

    0.2

    0.3

    Autocorr. Lags (Hours)

    Autocorr.

    (Anger)

    Autocorr.

    Conf. Bound

    (d) Anger

    ... further analysis on those patterns (in collab. with domain experts) under submission

    V. Lampos [email protected] Can Social Media tell us something about our lives? 34/4334/43

  • 7/28/2019 Can Social Media tell us something about our lives?

    35/43

    TrendMiner Project

    Extracting political opinion fromSocial Media

    V. Lampos [email protected] Can Social Media tell us something about our lives? 35/4335/43

    A few words about the project...

  • 7/28/2019 Can Social Media tell us something about our lives?

    36/43

    TrendMiner is an EU-FP7 project

    Several participants incl. the Univ. of Sheffield & Southampton(UK) and DFKI (Germany)

    Aims to form methods for interpreting the vast stream of onlineinformation

    Our focus on analysis of Twitter content

    political opinion,

    financial indicators

    Work in progress and under submission process cannot go intomuch detail!

    V. Lampos [email protected] Can Social Media tell us something about our lives? 36/4336/43

    Some new challenges

  • 7/28/2019 Can Social Media tell us something about our lives?

    37/43

    Aim: model voting intention regression task multiple outputs

    Overcome limitations of previous methods use of sentiment analysis taxonomies language specific, restrictive combined modelling of word frequencies and the domain of users? multi-task learning exploit correlations in the feature space multi-task & multi-domain learning

    model political opinion + financial indicators jointly

    Proper evaluation k-fold cross-validation may sometimes be misleading can we actually predict future values?

    V. Lampos [email protected] Can Social Media tell us something about our lives? 37/4337/43

    A snapshot of the results

  • 7/28/2019 Can Social Media tell us something about our lives?

    38/43

    vi = uuuTXwww +

    (plus multi-task learning)

    Figure 16 : 50 voting intention polls (YouGov) and their respective inferred values forthe Conservative (RMSE: 1.781.781.78%), Labour (1.591.591.59%) and Liberal Democrat (1.051.051.05%) parties(Nov. 2011 to Feb. 2012)

    5 10 15 20 25 30 35 40 45

    0

    5

    10

    15

    20

    25

    30

    35

    40

    VotingIntention%

    Time

    CON

    LAB

    LIB

    (a) Voting intention polls

    5 10 15 20 25 30 35 40 45

    0

    5

    10

    15

    20

    25

    30

    35

    40

    VotingIntention%

    Time

    CON

    LAB

    LIB

    (b) Voting intention inferences

    V. Lampos [email protected] Can Social Media tell us something about our lives? 38/4338/43

    Qualitative evaluation is also essential...

  • 7/28/2019 Can Social Media tell us something about our lives?

    39/43

    Some domains may be represented by smooth trends (e.g.,political domain)

    Predictions could be easy in that context how do we know we are not overfitting?

    Perform qualitative analysis using the selected features (words,users and tweets) Do the selected words and users make some sense? Does their combination make sense? score single tweets

    Possibly better models when increasing the statistical evidence(multi-task learning)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 39/4339/43

    Conclusions Did they tell us anything?

  • 7/28/2019 Can Social Media tell us something about our lives?

    40/43

    Social Media hold valuable information

    We can develop methods to extract portions of this informationautomatically detect, quantify, nowcast events

    extract collective mood patterns

    model other domains (such as politics)

    User generated input + other features tell/reveal something about the users & their context

    Side effect: what about our privacy? ...

    V. Lampos [email protected] Can Social Media tell us something about our lives? 40/4340/43

    In collaboration with...

  • 7/28/2019 Can Social Media tell us something about our lives?

    41/43

    Prof. Nello Cristianini, University of Bristol (Ph.D. Advisor)

    Prof. Ricardo Araya, University of Bristol (Psychiatry)

    Dr. Tijl De Bie, University of Bristol

    Thomas Lansdall-Welfare, University of Bristol

    Dr. Trevor Cohn, University of Sheffield (TrendMiner)

    Daniel Preotiuc-Pietro, University of Sheffield (TrendMiner)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 41/4341/43

    Last Slide!

  • 7/28/2019 Can Social Media tell us something about our lives?

    42/43

    The end.Any questions?

    Download the slides fromhttp://www.lampos.net/research/presentations-and-posters

    V. Lampos [email protected] Can Social Media tell us something about our lives? 42/4342/43

    References1 B Debatin J P Lovejoy A M A Horn and B N Hughes Facebook and Online Privacy: Attitudes Behaviors

    http://www.lampos.net/research/presentations-and-postershttp://www.lampos.net/research/presentations-and-posters
  • 7/28/2019 Can Social Media tell us something about our lives?

    43/43

    1. B. Debatin, J.P. Lovejoy, A.M.A. Horn and B.N. Hughes. Facebook and Online Privacy: Attitudes, Behaviors,and Unintended Consequences. Journal of Computer-Mediated Communication 15, pp. 83108, 2009.

    2. D. Preotiuc-Pietro, S. Samangooei, T. Cohn, N. Gibbins and M. Niranjan. TrendMiner: An Architecture for RealTime Analysis of Social Media Text. Proceedings of ICWSM 12, pp. 3842, 2012.

    3. V. Lampos and N. Cristianini. Nowcasting Events from the Social Web with Statistical Learning . ACM TIST

    3(4), n. 72, 2012.4. R. Tibshirani. Regression Shrinkage and Selection via the LASSO. Journal of the Royal Statistical Society, series

    B, 58(1), pp. 267288, 1996.

    5. B. Efron, T. Hastie, I. Johnstone and R. Tibshirani. Least Angle Regression. The Annals of Statistics 32(2), pp.407499, 2004.

    6. C.D. Manning, P. Raghavan and H. Schtze. Introduction to Information Retrieval. Cambridge University Press,p. 544, 2008.

    7. P.L. Bartlett, S. Mendelson and J. Neeman. L1-regularized linear regression: persistence and oracle inequalities.Probability Theory and Related Fields, pp. 132, 2011.

    8. M.F. Porter. An algorithm for suffix stripping. Program 14(3), pp. 130137, 1980.

    9. V. Lampos and N. Cristianini. Tracking the flu pandemic by monitoring the Social Web. Proceedings of CIP 10,pp. 411416, 2010.

    10. V. Lampos, T. De Bie and N. Cristianini. Flu Detector Tracking Epidemics on Twitter. Proceedings of ECMLPKDD 10, pp. 599602, 2010.

    11. T. Lansdall-Welfare, V. Lampos and N. Cristianini. Effects of the Recession on Public Mood in the UK.

    Proceedings of WWW 12, pp. 12211226, 2012.(a)12. T. Lansdall-Welfare, V. Lampos and N. Cristianini. Nowcasting the mood of the nation. Significance 9(4), pp.

    2628, 2012.(b)

    13. V. Lampos. Detecting Events and Patterns in Large-Scale User Generated Textual Streams with StatisticalLearning Methods. PhD Thesis, University of Bristol, p. 243, 2012.(a)

    14. V. Lampos. On voting intentions inference from Twitter content: a case study on UK 2010 General Election .CoRR, 2012.(b)

    V. Lampos [email protected] Can Social Media tell us something about our lives? 43/4343/43