19
1 Paper to the Annual Conference of the UK Political Studies Association Cardiff Wales 25-27 March, 2013 Understanding the Dynamics of Internet-based Collective Action using Big Data: Analysing the Growth Rates of Internet-based Petitions Scott Hale, Helen Margetts, Taha Yasseri Oxford Internet Institute, University of Oxford {scott.hale,helen.margetts, taha.yasseri}@oii.ox.ac.uk Abstract Now that so much of collective action takes place on Internet-based platforms, it leaves a digital imprint which may be harvested to further understanding of the dynamics of mobilization. This 'big data' offers social science researchers the potential for new forms of analysis, using real-time transactional data based on entire populations, rather than sample-based surveys of what people think they did or might do. This paper uses a big data approach to track the growth of over 30,000 petitions to the UK Government on two platforms over two years, analysing the rate of growth and testing the hypothesis that the distribution of daily change will be leptokurtic (rather than normal) as previous research on agenda setting would suggest. This hypothesis is confirmed, suggesting that Internet-based mobilization is characterized by tipping points (or punctuations) and explaining some of the volatility in online collective action. We find that the vast majority of petitions fail to attain even minimal levels of success. Where there is success, the punctuations occur very early in the life of a petition. We propose a model of ‘collective attention’ to characterise this rapid growth and early saturation. These findings have implications for the strategies of those initiating petitions and the design of web sites with the aim of maximising citizen engagement with policy issues. Keywords Collective Action, Big Data, Petitions, Attention

Paper to the Annual Conference of the UK Political Studies ... · Paper to the Annual Conference of the UK Political Studies Association Cardiff Wales ... rather than sample-based

  • Upload
    vandiep

  • View
    214

  • Download
    0

Embed Size (px)

Citation preview

1

Paper to the Annual Conference of the UK Political Studies Association

Cardiff Wales

25-27 March, 2013

Understanding the Dynamics of Internet-based Collective Action using Big Data:

Analysing the Growth Rates of Internet-based Petitions

Scott Hale, Helen Margetts, Taha Yasseri

Oxford Internet Institute, University of Oxford

{scott.hale,helen.margetts, taha.yasseri}@oii.ox.ac.uk

Abstract

Now that so much of collective action takes place on Internet-based platforms, it leaves a digital

imprint which may be harvested to further understanding of the dynamics of mobilization. This 'big

data' offers social science researchers the potential for new forms of analysis, using real-time

transactional data based on entire populations, rather than sample-based surveys of what people

think they did or might do. This paper uses a big data approach to track the growth of over 30,000

petitions to the UK Government on two platforms over two years, analysing the rate of growth and

testing the hypothesis that the distribution of daily change will be leptokurtic (rather than normal)

as previous research on agenda setting would suggest. This hypothesis is confirmed, suggesting that

Internet-based mobilization is characterized by tipping points (or punctuations) and explaining some

of the volatility in online collective action. We find that the vast majority of petitions fail to attain

even minimal levels of success. Where there is success, the punctuations occur very early in the life of

a petition. We propose a model of ‘collective attention’ to characterise this rapid growth and early

saturation. These findings have implications for the strategies of those initiating petitions and the

design of web sites with the aim of maximising citizen engagement with policy issues.

Keywords Collective Action, Big Data, Petitions, Attention

2

Introduction

Now that so much of collective action takes place on Internet-based platforms, most

mobilizations include a digital element and some take place almost wholly online. All

this Internet-based activity leaves a digital footprint, which allows researchers to generate

‘big data’, real-time transactional data of political behaviour. This kind of data presents

new opportunities for social science research, allowing a move away from sample-based

survey data about what people think they did or will do, to transactional data about what

people actually did based on whole populations. Big data is receiving massive attention

and interest across the corporate world and scientific research communities (Mayer

Schoenberger and Cukier, 2013). Yet the potential of this kind of data for understanding

political behaviour remains under-explored. We are still at the start of exploiting the great

potential of ‘big data’ to explore the dynamics of Internet-based collective action and how

it may challenge existing theories and models of collective behaviour.

This paper analyses two ‘big data’ sets of Internet-based mobilizations, generated

from two electronic petition platforms; first, the e-petition part of the No.10 Downing

Street web site (from February 2009 to the point at which it closed in March 2011) and

second, the new e-petition site developed by the UK Cabinet Office for the incoming

Coalition Government in 2010 and launched in August 2011, which replaced it. E-

petitions were first launched in the UK in November 2006, and over the course of its

lifetime the No. 10 site received more than 8 million signatures from over 5 million

unique email addresses.1 Both sites have allowed anyone to view petitions, and any user

with a valid email address could create a new petition or sign an existing petition. There

are important differences between the sites however; for example, whereas the first site

showed the names of the 500 most recent petitioners, the new site shows only the name

of the petition creator. The sites have also provided alternative measures of ‘success’ of a

petition; for example, for the earlier site, the government promised an official response to

all petitions receiving at least 500 signatures, while the incoming Coalition government in

2010 promised that any petition on the new site attracting more than 100,000 signatures

would qualify for a parliamentary debate on the issue raised.

This paper analyses the growth of petitions and the distinctive characteristics of the

mobilization curves of successful and unsuccessful petitions on both platforms. First, we

provide some background on online collective action in general and e-petitions in

1 http://www.mysociety.org/projects/no10-petitions-website/

3

particular. We use previous work on political attention to develop a hypothesis regarding

the development of online mobilizations: that they will be characterised by long periods

of stasis and short periods of rapid change, leading to a leptokurtic distribution of daily

change. Second, we outline the methods used to test this hypothesis and third, provide

the results for two big data sets, harvested from two e-petition platforms provided by the

UK government, followed by a discussion of the policy and design implications of the

findings.

Background

The 21st century has seen a prominent role for the Internet in mobilization, from the

dramatic events in authoritarian states of the Arab Spring, to a series of protests,

demonstrations and social backlash against austerity driven cutbacks and state

retrenchment in liberal democracies facing the consequences of the financial crash of

2008. Researchers have turned their attention to the theoretical and conceptual

implications of online collective action (e.g. Bennett and Segerberg, 2012; Bimber, 2003;

Lupia and Sin, 2005) and some are using innovative methods, including experiments and

data-mining, to explore the viral spread of mobilizations across online social networks

(see, for example, Ackland, 2007; Etling et al, 2009; Hindman, 2008; Gonzalez Bailon et

al, 2011; Segerman and Bennett, 2011; Aral and Walker, 2010).

While online activity may be a minor element of some mobilizations, other

mobilizations occur almost entirely online. The trend for ‘e-petitioning’ represents one

such activity, where online petitions are created, disseminated, circulated, and presented

online, and although policy-makers may discuss responses in offline contexts, such

responses are generated and sent online. The UK government’s e-petition site was

created by the social enterprise MySociety on the No. 10 Downing Street website in

November 2006 and ran until March 2011, when it was closed by the incoming Coalition

government. Some of these petitions had high policy impact, notably one against the

Labour administration’s proposed road pricing policy, which policy-makers admitted off

the record played a role in getting the policy scrapped. A new site was launched in

August 2011 by the Cabinet Office, initially on the direct.gov portal, which transmuted

into the new portal www.gov.uk in the autumn of 2012, with a different format. Signing

petitions has long been among the more popular political activities, leading the field for

participatory acts outside voting and with other social benefits ascribed to it as well as

4

having the potential to bring about policy change; e-petitioning reinforces ‘civic

mindedness’ (Whyte et al. 2005) and is one of a growing portfolio of Internet-based

democratic innovations (Smith, 2009). The widespread use of e-petitions by both

governments and NGOs such as Avaaz and 38 Degrees has received accolades for their

democratic contribution (Escher, 2011; Chadwick, 2012) and the German e-petition

platforms have been analysed extensively (see Lindner and Riehm, 2011; Jungherr and

Jurgens, 2010), but the UK petition platforms have received rather less attention in recent

political science research, with the exception of Wright (2012).

E-petitions are interesting examples of mobilizations with a strong online imprint,

which will include the entire transaction history for both successful and unsuccessful

mobilizations. The data that can be harvested from the signing of electronic petitions is

an example of what is now commonly known as ‘big data’, representing a transactional

audit trail of what people actually did (as opposed to what people think they did) and an

entire population (without the need to take a representative sample). Data like this

represents a big shift for social science research into political behaviour, which has

traditionally rested on survey data, or, for elections, voting data. Big data also presents

challenges to social science research—it doesn’t come with handy demographics attached

and we do not know where people came from to any one interaction, nor where they are

going, so it is often difficult to match up online activities across different platforms, or to

identify the underlying factors influencing behaviour, such as age, income or gender.

This data however, makes it possible to look at the different patterns of growth in the

30,000 mobilization curves that we have and identify the distinctive characteristic of

those mobilizations that succeed and those that fail with our digital hindsight. Such an

analysis, using data that has rarely been available to political science researchers before

the current decade, may tell us something about the nature of collective action itself in a

digital world. Of the research noted above, Jungherr and Jurgens used a smaller dataset

to illustrate the viability of a big data (or computational social science) approach, but

other studies used surveys (Lindner and Riehm, 2011) or more qualitative approaches

(Wright, 2012).

So what would we hypothesise about these mobilizations? A possible hypothesis may

be derived from previous research on agenda setting in political systems. The most well

known model of how policy attention proceeds in a liberal democracy is that of

‘punctuated equilibrium’, developed by the US authors Baumgartner and Jones and their

‘Policy Agendas’ programme of research (see www.policyagendas.org). The theory

5

argues that policy attention to any issue will remain in long periods of stasis where little

change occurs. Where issues do hit the policy agenda, it will be because some event has

‘punctuated’ the equilibria, all eyes (including the media, public opinion, interest groups

and politicians concerned) turn to the issue, money is spent, institutions are created and

policy change occurs (John and Margetts, 2003; Baumgartner and Jones, 1993; Jones and

Baumgartner, 2005). The theory of punctuated equilibria is multi-faceted and has been

illustrated by a range of empirical data across policy areas and within different

dimensions of attention, such as public opinion, budgetary change and congressional

attention (Baumgartner and Jones, 2005) and in various countries, including the UK

(John and Margetts, 2003). Baumgartner and Jones do not discuss Internet-related

activity to any great degree; however, we might hypothesise that the pattern of

mobilizations around a petition would proceed in a similar way, thereby contributing to

the same sort of issue attention cycle that has been observed many times over in agenda

setting research. Such a model would predict that the distribution of daily changes in

attention would be ‘leptokurtic’, with a small number of large changes and a longer tail of

much smaller ones.

Such a finding could not show any causal effect, as only the activity of petitioners is

being analysed here (and tipping points could suggest a media effect, although it is

extremely unlikely that this could take place before substantive amounts of signatures

had amassed). It could, however, point towards a role for online mobilization in policy

change analogous to that of the media in the agenda setting analysis, which is ascribed a

lurching effect, due to the capacity of the media to parallel process only a small number

of issues; at the point at which a punctuation occurs, media attention will ‘tip over’ from

specialist outlets into the mainstream media. In addition, in lower level mobilizations

where the media is not paying attention, such a finding could suggest a role in policy

issues below the media radar.

Methods

The UK Government's petition website (petitions.number10.gov.uk) was accessed daily

from 2 February 2009 until March 2011, when the site closed, with an automated script.

Each day, the number of overall signatures to date on each active petition was recorded.

In addition, the name of the petition, the text of the petition, the launch date of the

petition, and the category of the petition were recorded. Overall, 8,326 unique petitions

6

were tracked from the earlier site, representing all publically available petitions active at

any point during the study. A first pass at this data after the site closed revealed the

importance of the first day in the future of a petition (see below and Hale and Margetts,

2012) and suggested that more frequent scraping of the data could deliver a more fine-

grained analysis. For this reason, when the new site was launched in August 2011, we set

the automatic script to scrape it every hour, recording the same details as for the previous

site. Our second dataset currently contains hourly data points for all the petitions

(19,789) submitted to the new site from 5th August 2011 to 22nd February 2013.

The two sites differed under different policies for how the government would

respond to petitions. For the No 10 Downing Street site, prospective petitioners were told

that if their petition achieved 500 signatures, they would receive an official response.

There were no other official measures of success, although one petition did succeed in

raising over one million signatures, which previous research has identified as a possible

‘tipping point’ for mobilizations; if potential participants know that more than one

million have participated, they are more likely to participate themselves (Margetts et al,

2011); the petition was widely regarded as influential in getting the policy reversed. For

the Cabinet Office site, the bar for an official response is unclear from the site, although

the majority of petitions that have over 10,000 signatures do receive a response with the

prefix ‘As this e-petition has received more than 10,000 signatures, the relevant

Government department have provided the following response’. More importantly, in the

early days of the Coalition Administration, David Cameron promised that signatures

obtaining more than 100,000 signatures would generate a parliamentary debate on the

issue raised by the petition. All these information cues will have acted as possible drivers

on individuals considering whether to sign a petition.

Petitions on the Number 10 website closed, by default, 12 months after they first

launched. To identify patterns in how petitions grow, the percentage change in new

signatures was calculated each day, for the earlier site. Most petitions had a long period

of inactivity prior to their deadline date. To consider just how petitions grow, data was

truncated after the last signature on a petition, removing any final period of zero

signature-per-day growth prior to the petition's deadline.

Leptokurtic distributions have a more acute peak close to the mean and larger tails.

There is no statistical test to specifically classify a distribution as leptokurtic. However,

several tests in combination help demonstrate a distribution is leptokurtic (see John and

Margetts, 2003). The most rigorous test is the Shapiro-Wilk test (1965), which checks

7

whether the points could possibly be drawn randomly from a normal distribution.

Leptokurtic distributions should reject the Shapiro-Wilk null hypothesis of normality.

The Kolmogorov-Smirnov test (Chakravarti et al. 1967, pp. 392-4) tests that a set of

frequencies is normal distributed by focusing on the skewedness and kurtosis of a

distribution, and this null hypothesis should be rejected if a distribution is leptokurtic and

hence non-normal. Visualizing the histogram and plotting a log-lot plot, which should be

nearly a straight line if changes are leptokurtic, give further evidence of a leptokurtic

distribution.

Results

First, we explored the data harvested from the first e-petitions site on the No. 10

Downing Street web site, which produced a set of 8,326 unique petitions, shown in

Figure 1. The most immediate finding of interest was that 94 per cent failed to obtain

even the modest 500 signatures required to elicit an official response, the only measurable

‘success’ indicator for the earlier site. Nearly all petitions that succeeded in obtaining 500

signatures did so quickly. Successful petitions took a mean time of 8.4 days to reach 500

signatures, but a median time of only two days. In fact, 230 of the 533 successful

petitions succeeded in obtaining 500 signatures on the day they were launched (day 1).

Only a few petitions take a much longer time to reach the 500 signature mark: 31

petitions (6 per cent) succeed after taking more than 30 days, and only five petitions in

our dataset reached the 500 signature mark after being active more than four months.

Next, we tested our hypothesis that the distribution of daily change in signatures

would be leptokurtic. Figure 2 shows the percentage change in new signatures adjusted

so that the mean growth of each petition lies at zero. While most daily change is small,

petitions’ growth is punctuated by a few large changes. The distribution of growth is

leptokurtic and strongly rejects the Shapiro-Wilk null hypothesis of normality with a w

statistic of 0.17 translating to a p-value less than 0.000001. The distribution has a kurtosis

score of 1,445 and a skewedness of 30.53, and rejects the Kolmogorov-Smirnov test for a

normal distribution (p<0.0001). When we applied the same tests to the population of

petitions that were successful in achieving 500 signatures (that is, excluding the

unsuccessful ones), we found a similar leptokurtic distribution (Shapiro-Wilk w statistic

of 0.10, p<0. 000001).

So having identified punctuations, what can we say about where they are? The

8

largest daily changes happened at the start in the life cycle of the petition. Looking at the

distribution of the day on which the punctuation occurred, we see that (for example) all

daily changes of more than 80 per cent occurred in the first five days, and greater than 40

per cent in the first 8 days 0 and even for all changes over 40 per cent, the median day is

1, and the mean is 2.2 and the third quartile is one.

So it seems that the early days of a petition are crucial, in particular the first day.

Running a logit regression revealed that the number of signatures a petition received on

its first day is the most important factor in explaining the petition’s success, and a linear

regression (shown in Table 1) showed that it was also the most important factor in

explaining the total number of signatures the petition receives during its lifetime. All

other factors tested—the topic category, the start day of the week, weekend vs weekday

launch—did not have a significant effect on the growth of a petition once controlling for

the number of signatures the petition received on its first day. Petitions tended to grow

shortly after launch and then stop growing. This active period of growth for petitions has

a mean length of 57 days and a median length of 27 days.

Figure 1. Petition Growth on No. 10 Downing Street e-petitions site, 2009 – 2010

Note: N = 8,326 petitions, all petitions created between September 2009 and May 2010

9

Table 1: Factors affecting growth. OLS predicting the total number of signatures

Estimate Std. Error t value Pr(>|t|)

(Intercept) 218.41 78.55 2.78 0.00544 **

Category -6.37 4.97 -1.28 0.19976

Day of week

petition started

-12.23 11.79 -1.04 0.29974

Signatures

collected on first

day

1.95 0.02 96.29 <0.00001

***

Number of

petitions started

on the same day

-1.55 2.96 -0.53 0.59960

Note: Adjusted R-squared: 0.5268

Figure 2: Log of daily percentage change in number of signatories (centred around

each petition's mean).

10

Note: Any final period in which petitions that gain no more signatures after a certain point before

closing date has been removed from the daily per cent change data (so the tallest bars do not include

these ‘zero change’ days.

The identification of the first day as a key ‘tipping point’ for successful petitions caused

us to re-examine our data collection techniques as we took the data harvesting and

analysis forward with the new site that opened in the summer of 2011. That is, the

mobilization on this first day was so rapid that daily scraping of the site could not

provide a fine-grained analysis of this or other potential tipping points. Therefore, as

noted above, when we collected data from the new site we did so every hour, providing

us with 19,789 petitions up to February 2013 (we continue to collect the data). The

analysis of this dataset in this draft paper, however, uses a randomly selected subset of

this new data. For this subset, Figure 3 shows (in different colours) those petitions that

attained the first level of ‘success’ (that is, the 10,000 signatures required for an official

response); and those that attained the second level, the 100,000 signatures required to

generate a parliamentary debate.

Figure 3: Petition growth on Cabinet Office e-petitions site, 2011 - 2013

Note: Graph shows a fraction of N=3813 petitions, all created between 5th August 2011 and 22nd

February 2013. Note also that y-axis uses a logarithmic scale.

Once again, it is immediately clear that the vast majority of petitions did not achieve any

11

measure of success. Only 5 per cent of petitions obtained the 500 signatures, which we

calculated to compare with the previous dataset, and only 4 per cent received 1,000.

Only 0.7 per cent attained the 10,000 signatures which seems to be the bar for receiving

some sort of official response, and only 0.1 per cent attained the 100,000 required for a

parliamentary debate. Again, the first day was crucial to achieving any kind of success.

Any petition receiving 100,000 signatures after three months, needed to have obtained

3,000 within the first 10 hours on average.

Discussion

The results from both data sets show just how few petitions actually attain success by any

measure. For the earlier data from the No.10 Downing Street site, the 500 signature mark

seems at first consideration a very low threshold that should easily be passed. However,

by far the majority of petitions (94 per cent in this time period) fail to attain even this

modest number of signatures, illustrating the point that in online environments, the low

costs of initiating a collective action mean that there are likely to be large numbers of

unsuccessful mobilizations. Petitions are most active when they are first launched and

most petitions (presumably in the lack of outside stimulus) become digital dust after a

couple of months despite typical deadlines of one year on the site. Our second data set

tells the same story, suggesting that this finding may be generalized to other

mobilizations rather than representing some characteristics of the No. 10 Downing Street

platform. The finding that most mobilizations of this kind fail to take off in any sense

chimes well with recent research into the spread or diffusion of initiatives across online

networks. Goel et al (2012), for example analyse the diffusion patterns arising from

online domains, ranging form networked games to microblogging services and find that

in all their seven cases, the vast majority of cascades are small, and are described by a

handful of simple tree structures that terminate within one degree of an initial adopting

‘seed’. Even for the few large cascades that they observed, the bulk of adoptions often

take place within one degree of a few dominant individuals. Although we have not yet

made an attempt to model the network activity behind the petitions studied here, it seems

likely that to do so would reveal a similar pattern.

We attempt to capture the characteristic of early rapid growth and decay that the data

reveals, with a model of ‘collective attention’ decay, drawing on Wu and Huberman

(2007). In their model, they calculate a ‘novelty’ parameter relating to the novelty of

news items on a news sharing platform, but in a more general framework, the decay in

12

attention could have other reasons, for example reaching the system size limits, or lack of

viral spread. In the model, N agents at the time t, bring Nµ new agents in the next step in

average, µ being a multiplication factor; in our case, this would mean that every signature

brings µ new signatures in the next hour, leading to an exponential growth of rate µ in

the number of signatures. This model works quite well at the beginning, but very soon

the spread rate decays and new signatures come at a much lower rate. Therefore we let

the multiplication factor decay by introducing a second factor r(t), which decays in a way

that is an intrinsic of the medium; each signature at time t, on average brings µr(t) new

signatures in the next hour. To correct for the early saturation observed in the empirical

results, we enter an ‘outreach’ parameter which can change over time and damp the fast

initial growth. To calculate that empirically, we average over the logarithm of the

number of signatures in hourly bins, starting from the launching time and then calculate

the hourly increment at time t and normalize it by the logarithm of the number of

signatures up to time t as follows:

In other words, the outreach factor measures the relative growth of the logarithm of

number of signatures within an hour, averaged over the whole sample. This parameter is

shown as a function of adjusted time in Figure 4. This shows that collective attention

decays very fast indeed, and that after 24 hours, a petition’s fate is virtually set.

13

Figure 4. Rate of Change of ‘Collective Attention’ paid to Petitions created on the

Cabinet Office site calculated based on Eq. 1.

Confirmation of our hypothesis regarding the leptokurtic distribution of changes to

the support for a petition suggests that in online environments, collective action could

play a role in a punctuated equilibrium model of policy change. That is, the general

pattern for policy attention is for issues to remain dormant or in stasis, with a generally

low level of attention. Some issues (by far the minority) that attract attention quickly gain

a ‘critical mass’ of activists and start to vie for policy attention, joining the range of other

institutional influences in helping to ‘punctuate’ the equilibrium. Such an argument

would not include the claim that the mechanism by which collective action acts to bring

about instability would be the same as the role played by the media, which plays a

distinctive ‘lurching’ role in Jones and Baumgartner’s analysis, based in part on the

tendency of the media to process a small number of issues in parallel. In the context

studied here, the mechanism would depend more on the ways in which a mobilization is

disseminated via online social networks, something that previous research mentioned in

the introduction has begun to investigate. If such activity tends to take place on the day

14

the petition is initiated, then these findings could indicate the importance of achieving

some kind of viral spread across the petitioner’s closest contacts right at the start, because

initial rapid growth will have a greater effects on subsequent participation decisions by

‘weaker tie’ contacts than a gradual growth over a prolonged period.

In other empirical studies using experimental methodologies, we have started to

uncover the mechanism behind such punctuations. That is, in addition to the close

contacts informed about the petition directly by the petitioner, other individuals

(including people unknown to the petitioner or more distant contacts) deciding whether

to participate will be influenced by the information that other people have already

participated. This influence will depend on the number of other participants (Margetts et

al, 2011), the personality of the individual deciding whether to participate (Margetts et al,

2013), and the closeness of the individual to the petitioner. The occurrence of a

punctuation will depend on the existence of ‘starters’ whose thresholds for participation

are low or whose closeness to the petitioner has in this instance reduced their threshold

for participation. These starters will act as a signal for people with higher thresholds and

weaker ties to the petitioner to ‘follow’ in signing the petition, thereby acting as a further

signal for people with even higher thresholds to join. At some point, if the petition is

successful, then the number of followers will reach ‘critical mass’ and attention to the

mobilization will become widespread, breaking out of the petitioner’s social network and

gain more general social media exposure.

So, there are two possible explanations for the importance of the first day in

achieving the critical mass: first, the way in which the petition is disseminated via online

networks and second, the dynamic of starters and followers and social information about

other participants in the burst of activity. If it is that visitors to the petition are heavily

influenced by the numbers of other participants that they observe, then the finding could

join those of other work in economics illustrating the importance of ‘first donations’ in

charitable giving (Bog et al, 2006), showing that early donors set the precedent for later

donors, or the wider literature on conditional co-operation, showing that social

information about the contribution of others influences an individual participant’s

decision to contribute (Croson and Shang, 2009; Frey and Meier, 2004). This study

complements this previous work by focusing on numbers of participants, rather than

contribution amount (as everyone’s contribution, at least that we are able to measure, is

the same) and a more explicitly political context, rather than that of charitable giving.

The strong effect of petitions tending to succeed quickly or not at all will be

15

influenced by the design of the petition website during the period of study; the ‘outreach’

factor will vary across platforms. For users starting at the homepage of the earlier No.10

Downing St site, it was possible to view petitions overall or within a specific category and

to sort petitions by the number of signatures or the date added. It was therefore easiest to

look at petitions with the largest or smallest number of signatures and the oldest or

newest petitions. On the newer Cabinet Office site, petitions can be sorted by signature or

closing date, or viewed by government department, but not by topic. In addition, we can

expect different behaviour from users of either site who arrive at the homepage (who may

respond to these information cues by looking only at the newest petitions or the petitions

with the most signatures contributing to the effects observed), or users following links

shared via email and social media, which would point to a specific petition that the

contact was supporting (who will avoid these information cues). These alternative effects

might be tested using an experimental approach in future research. In March 2012, the

Cabinet Office introduced a change to the e-petitions site which also shaped the

information environment of prospective petitioners, by introducing a ‘Trending e-

petitions’ with the highest signatures on the front page. The fact that our data was

captured both before and after this change provides us with a ‘natural experiment’

whereby we can test the effect of this change, which will be the next step in this

programme of research.

Conclusions

We have found that in online mobilizations, growth tends not to occur, meaning that

most mobilizations that are initiated, fail. But where it does, it proceeds in rapid bursts

followed by periods of stasis. Such a finding suggests that online mobilizations of the

kind covered here could play a role in the more general process of punctuated equilibria

in policy-making. For example Jones and Baumgartner (2005) found a high correlation

between public concern on an issue and Congressional attention. Our findings here could

be even more interesting. In the theory of punctuated equilibrium the media plays a key

role in terms of ‘lurching’ from one issue to another and having a complex feedback

relationship with public opinion. But the sort of mobilizations we are looking at here are

bubbling up relatively independently of the media, gaining media attention only when

they obtain significantly high levels of support—the petition on road-pricing that

successfully played a role in obtaining policy change, for example, received a great deal

16

of media attention once that it reached one million signatories. Research that develops

our understanding of the mechanics of this turbulence will be important for scholars and

policy-makers alike as collective action continues to move into online settings. If online

collective action is characterised by punctuations, then it looks as if such activity could

inject a further dose of instability into political systems. If mobilizations follow a pattern

of very low levels of attention punctuated by occasional ‘spurts’ which grow rapidly into

full scale mobilizations that merge with other elements of the political system to push

policy change on to the agenda and the institutional landscape, then we can expect to see

increasing turbulence in contemporary politics, adding to the ‘instability’ that

Baumgartner and Jones (1993; 2005) and their co-investigators have modelled so

extensively in previous research.

This paper has, we hope, demonstrated the potential for ‘big data’ approaches in

political science research. The data we report here was automatically and non-obtrusively

generated to provide a dataset of real-time transactional data of a kind that has rarely

been available to political science researchers before. One of the aims of the research

programme of which this analysis forms a part is to develop the methods we have used to

both harvest and analyse the data, which require skills and expertise and conceptual

approaches that span academic disciplines; of the authors of this paper, one is a physicist,

one a computer scientist and one a political scientist. As big data is used more extensively

in this kind of research, the ability to work across disciplines in this way will become

increasingly important.

Our future empirical work will also explore co-ordination with media coverage and

mentions of the petitions on social networking sites (such as Facebook, YouTube,

Google Search and Twitter); we are now systematically gathering data on any mention of

the petitions for which we have captured data across all these platforms. Much work on

online collective action is based on research into a single platform, whereas any online

activity tends to involve several. By looking carefully at the timing with which an issue

gains attention in different parts of the political system, including the activist activities

investigated here, we might get closer to establishing some sort of sequencing of

attention. In addition, one way of getting around the lack of causal inference in research

of this kind is to carry out experiments, as in Margetts et al (2011, 2012, 2013). Future

work will also use an experimental approach to analyse the effect of different information

environments surrounding petition websites on petition growth. Designers of web sites

that involve civic engagement, such as e-petition sites, must decide what social

17

information about existing levels of participation to include, for example the numbers of

people who have already signed and the timings of when they did so. Other design

decisions include whether participants are anonymous (as on the Cabinet Office site) or

whether their names are made visible (as on the earlier No. 10 Downing St site and the

German e-petitions platform), and whether input from other social media platforms is

incorporated into the petition site. Research of this kind, using both ‘big data’ and

experimental approaches, can inform such design decisions in ways that maximise

citizens’ input to policy debates.

References

Ackland, R. And Gibson, R. (2006) Hyperlinks and horizontal political communication on the

WWW: The untold story of parties online. The Australian National University.

Aral, S. and Walker, D. (2011) ‘Creating Social Contagion Through Viral Product

Design: A Randomized Trial of Peer Influence in Networks, Management Science.

Baumgartner, F. and Jones, B. (1993) Agendas and Instability in American Politics.

University of Chicago Press pp 1-298.

Bennett, L. and Segerberg, A. (2011) ‘Digital Media and the Personalization of

Collective Action: Social Technology and the Organization of Protests against the

Global Economic Crisis.’ Information, Communication & Society (2011) 14: 770-799.

Bennett, L., and Segerberg, A. (2012) ‘The Logic of Connective Action’, Information,

Communication & Society, 15 (5): 739-768.

Bimber, B. (2003) Information and American democracy: Technology in the evolution of political

power. New York: Cambridge University Press.

Bimber, B, Flanagin, A, Stohl, C (2006) ‘Reconceptualizing Collective Action in the

Contemporary Media Environment’, Communication Theory, Vol. 15 Iss. 4: 365-388.

Bøg, M., Harmgart, H., Huck, S. and Jeffers, A. M. (2012), Fundraising on the Internet.

Kyklos, 65: 18–30. doi: 10.1111/j.1467-6435.2011.00525.x

Borge, R. and Cardenal, A. (2011) ‘Surfing the Net: A pathway to participation for the

politically uninterested?’, Policy and Internet, vol 3, issue 1.

Centola, D. (2010) ‘The Spread of Behaviour in an Online Social Network Experiment’

Science 329, 1194.

Croson, Rachel and Jennifer. Shang (2009). ‘Field Experiments in Charitable

18

Contribution: The Impact of Social Influence on the Voluntary Provision of Public

Goods’, Economic Journal. 119 (540): 1422–1439.

Escher, T. 2011. Writetothem.com: Analysis of users and usage for UK Citizens Online

Democracy, London: My Society.

Etling, B. Kelly, J. Faris, R. and Palfrey, J. (2009) Mapping the Arabic Blogosphere: Politics,

Culture and Dissent, Berkman Center Research Publication NO. 2009-06.

Frey, Bruno S. and Stephan, Meier (2004). Social Comparisons and Pro-social Behavior:

Testing Conditional Cooperation in a Field Experiment, American Economic Review.

94: 1717–1722.

Goel, S. Watts, D. and Goldstein, D. (2012) ‘The Structure of Online Diffusion

Networks’, Proceedings of the 13th ACM Conference on Electronic Commerce, pp. 623-638.

Gonzalez Bailon, S. Borge-Holthoefer, J. Rivero, A. and Moreno, Y. (2011) ‘The

Dynamics of Protest Recruitment through an Online Network’, Scientific Reports,

December.

John, P., & Margetts, H. (2003). Policy punctuations in the UK: Fluctuations and

equilibria in central government expenditure since 1951. Public Administration, 81(3),

411-432. Blackwell Publishing Ltd. doi:10.1111/1467-9299.00354

Jones, B. D., & Baumgartner, F. R. (2004). Representation and Agenda Setting. Policy

Studies Journal, 32(1), 1-24. Blackwell Publishing Ltd. doi:10.1111/j.0190-

292X.2004.00050.x

Jones, B. and Baumgartner, F. (2005) The Politics of Attention: How Government Prioritizes

Problems, Chicago: University of Chicago Press.

Jungherr, A. Jurgens, A. (2012) ‘The Political Click: Political Participation through E-

Petitions in Germany’, Policy and Internet: 2 (4). Pp. 131-165.

Liben-Nowell, D., & Kleinberg, J. (2008). Tracing information flow on a global scale

using Internet chain-letter data. Proceedings of the National Academy of Sciences, 105(12),

4633. National Acad Sciences.

Lindner, R. and Riehm, U. (2011) ‘Broadening Participation Through E-Petitions? An

Empirical Study of Petitions to the Germany Parliament’, Policy and Internet: 3(1), pp.

63-85.

Lupia, A., & Sin, G. (2003). Which Public Goods are Endangered?: How Evolving

Communication Technologies Affect The Logic of Collective Action. Public Choice,

117(3), 315-331. Springer Netherlands. Retrieved from

19

http://dx.doi.org/10.1023/B:PUCH.0000003735.07840.c7

Margetts, H., John, P., Reissfelder, S., and Hale, S. 'Applying Social Influence to

Collective Action: Heterogeneous Personality Effects' (July 19, 2011,

http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1892805)

Margetts, H. John, P., Escher, T. and Reissfelder, S. (2011) 'Social Information and

Political Participation on the Internet: an Experiment', European Political Science

Review.

Margetts, H., John, P., Escher, T., & Reissfelder, S. (2009) Experiments for Web

Science: Examining the Effect of the Internet on Collective Action, Proceedings of

WebSci2009 Conference, Athens, 18-20 March.

Mayer-Schoenberger, V. and Cukier, K. (2013) Big Data: A Revolution That Will Transform

How We Live, Work and Think. John Murray.

Mosca, L., & Santucci, D. (2009). Petitioning Online. The Role of E-Petitions in Web

Campaigning. Political Campaigning on the Web, 121. transcript Verlag.

Schelling, T. C. (2006). Micromotives and macrobehavior. WW Norton & Company.

Smith, G. (2009). Democratic innovations: Designing institutions for citizen participation.

Cambridge, England: Cambridge University Press

Whyte, A., Renton, A., & Macintosh, A. (2005). e-Petitioning in Kingston and Bristol

Evaluation of e-Petitioning in the Local e-Democracy National Project.

Wright, S. (2012) Assessing (e-) Democratic Innovations: “Democratic Goods” and

Downing Street E-Petitions”, Journal of Information Technology and Politics, 9: 453-470.

Wu, F. and Huberman, B. (2007) ‘Novelty and collective attention’, Proceedings of the

National Academy of Sciences of the USA, 104 (45) 17599-17601.