20
An Empirical Study on End-users Productivity Using Model-based Spreadsheets Laura Beckwith acome Cunha Jo˜ ao P. Fernandes Jo˜ ao Saraiva HCIResearcher, Denmark, [email protected] Universidade do Minho, Portugal, {jacome,jpaulo,jas}@di.uminho.pt Universidade do Porto, Portugal, [email protected] EuSpRIG 2011 July 14 – 15 acome Cunha (UMinho) EuSpRIG 2011 1 / 19

Talk at EUSPRIG '11

Embed Size (px)

DESCRIPTION

 

Citation preview

Page 1: Talk at EUSPRIG '11

An Empirical Study on End-users Productivity UsingModel-based Spreadsheets

Laura Beckwith Jacome Cunha Joao P. Fernandes Joao Saraiva

HCIResearcher, Denmark, [email protected] do Minho, Portugal, {jacome,jpaulo,jas}@di.uminho.pt

Universidade do Porto, Portugal, [email protected]

EuSpRIG 2011

July 14 – 15

Jacome Cunha (UMinho) EuSpRIG 2011 1 / 19

Page 2: Talk at EUSPRIG '11

Agenda

1 Introduction

2 Model-based SpreadsheetsInferring ModelsRefactored SpreadsheetVisual Spreadsheet

3 Study Design

4 ResultsEffectivenessEfficiency

5 Conclusions

Jacome Cunha (UMinho) EuSpRIG 2011 2 / 19

Page 3: Talk at EUSPRIG '11

Introduction

Much research has been done to prevent spreadsheet end-user errors

Model-Driven Engineering (MDE) is a promising approach

We have created a set of different model-based spreadsheets

ww�Need to evaluate their impact

Jacome Cunha (UMinho) EuSpRIG 2011 3 / 19

Page 4: Talk at EUSPRIG '11

Introduction

Much research has been done to prevent spreadsheet end-user errors

Model-Driven Engineering (MDE) is a promising approach

We have created a set of different model-based spreadsheets

ww�Need to evaluate their impact

Jacome Cunha (UMinho) EuSpRIG 2011 3 / 19

Page 5: Talk at EUSPRIG '11

Original Spreadsheet - original

A project management system

Tabular spreadsheet (database like spreadsheet)

Jacome Cunha (UMinho) EuSpRIG 2011 4 / 19

Page 6: Talk at EUSPRIG '11

Functional Dependencies

A ⇀ B means that the values in column A uniquely determine valuesin column B

If two values two rows in column A are equal, then the correspondingvalues in B are also equal

For example, suppose A = {A1,A2} and B = {B1,B2}:

A1 A2 B1 B2

a1 a2 b1 b2

a1 a3 b1 b1

a1 a2 b1 b2

(a) Table inducing A ⇀ B.

A1 A2 B1 B2

a1 a2 b1 b2

a1 a3 b1 b1

a1 a2 b2 b2

(b) Table not inducing A ⇀ B.

Jacome Cunha (UMinho) EuSpRIG 2011 5 / 19

Page 7: Talk at EUSPRIG '11

Relational Model

From the spreadsheet data (also from formulas) we can inferfunctional dependencies

From these dependencies, we can compute a relational databaseschema

A ⇀ BC D ⇀ E...

Infer FDs

Original SS

RDB Schema

Infer RDB Schema

Reasoning about FDs

Jacome Cunha (UMinho) EuSpRIG 2011 6 / 19

Page 8: Talk at EUSPRIG '11

Refactored Spreadsheet - refactored

We use the relational model to generate a new refactored spreadsheet(respecting the relational model)Each entity separated in a different tableCombo boxes used to list allowed values (from other tables)

Jacome Cunha (UMinho) EuSpRIG 2011 7 / 19

Page 9: Talk at EUSPRIG '11

Visual Spreadsheet - visual

From functional dependencies we can generate edit assistanceA ⇀ B: A columns become green combo boxes and B redSimilar to originalCombo boxes help user introducing data

Jacome Cunha (UMinho) EuSpRIG 2011 8 / 19

Page 10: Talk at EUSPRIG '11

The Entire Picture

A ⇀ BC D ⇀ E...

Infer FDs

Original SS

RDB Schema

Infer RDB Schema

Reasoning about FDs

Generate Refactored SS

Refactored SS

Generate Visual SS

Visual SS

Goal: compare efficiency and effectiveness of original, refactored and visual

Jacome Cunha (UMinho) EuSpRIG 2011 9 / 19

Page 11: Talk at EUSPRIG '11

Research Questions

RQ1 Do end users introduce fewer errors when they use one of themodel-based spreadsheets versus the original unmodified one?

RQ2 Are end users more efficient using the model-based spreadsheets?

RQ3 Do particular models lead to fewer errors in particular tasks?

Jacome Cunha (UMinho) EuSpRIG 2011 10 / 19

Page 12: Talk at EUSPRIG '11

Study Design

Background questionnaire, followed by a list of tasks and a postquestionnaire

No initial tutorial was given to the participants

38 participants: 25 females and 11 males, all students

Tasks for inserting and editing data and computing statistics

3 different spreadsheet applications (projects, dishes andproperties)

3 different models (original, refactored, visual)

Each participant received 3 spreadsheets, each of a differentapplication/model

Jacome Cunha (UMinho) EuSpRIG 2011 11 / 19

Page 13: Talk at EUSPRIG '11

Global Effectiveness

original refactored visual

dishes 86% 76% 78%projects 73% 68% 78%properties 75% 64% 62%

Globally, models do not seem very competitive

Further analyses are necessary

Jacome Cunha (UMinho) EuSpRIG 2011 12 / 19

Page 14: Talk at EUSPRIG '11

Effectiveness: Inserting Data

original refactored visual

dishes 91% 90% 81%projects 76% 60% 75%properties 86% 67% 68%

Models always lose

But very similar to original

Jacome Cunha (UMinho) EuSpRIG 2011 13 / 19

Page 15: Talk at EUSPRIG '11

Effectiveness: Editing Data

original refactored visual

dishes 91% 82% 82%projects 54% 62% 50%properties 65% 98% 48%

refactored in general was better

Because there is less data to edit

Jacome Cunha (UMinho) EuSpRIG 2011 14 / 19

Page 16: Talk at EUSPRIG '11

Effectiveness: Computing Statistics

original refactored visual

dishes 52% 37% 57%projects 19% 76% 13%properties 44% 57% 51%

Again, refactored was better

And again, less data do analyze

Jacome Cunha (UMinho) EuSpRIG 2011 15 / 19

Page 17: Talk at EUSPRIG '11

Efficiency

original refactored visual

dishes 35′ 32′ 28′

projects 39′ 40′ 41′

properties 37′ 36′ 40′

Models are competitive

In further analyzes we discovered that users took more time understanding model,specially refactored

Jacome Cunha (UMinho) EuSpRIG 2011 16 / 19

Page 18: Talk at EUSPRIG '11

Answers to Research Questions

RQ1 Do end users introduce fewer errors when they use one of themodel-based spreadsheets versus the original unmodified one?

We believe there is potential for model-based spreadsheets

But a deep knowledge seems to be necessary to be effective

RQ2 Are end users more efficient using model-based spreadsheets?

Model-based spreadsheet can help in some cases

We believe better knowledge about models could improve this result

RQ3 Do particular models lead to fewer errors in particular tasks?

For inserting data, this was not the case

The models helped editing data and in computing statistics

Jacome Cunha (UMinho) EuSpRIG 2011 17 / 19

Page 19: Talk at EUSPRIG '11

Conclusions

We did not give a tutorial to participants

We believe better results could be achieve with initial training

Nevertheless, our model-based spreadsheet are competitive

Better for editing and statistics

We plan to run a new study similar to this, but with an initial tutorial

Jacome Cunha (UMinho) EuSpRIG 2011 18 / 19

Page 20: Talk at EUSPRIG '11

Thank you for your attention!

Jacome Cunha (UMinho) EuSpRIG 2011 19 / 19