The Development and Evaluation of a Model of Time-of ... · scientific discussions about time of arrival uncertainty, experimental methods, and data analyses; facilitating the development

THE DEVELOPMENT AND EVALUATION OF A MODEL OF

TIME-OF-ARRIVAL UNCERTAINTY

by

BECKY L. HOOEY

A thesis submitted in conformity with the requirements

for the degree of Doctor of Philosophy

Graduate Department of Mechanical and Industrial Engineering

University of Toronto

© Copyright by Becky Hooey 2009

ii

The Development and Evaluation of a Model of Time-of-Arrival Uncertainty

Becky L. Hooey

Doctor of Philosophy, 2009 Department of Mechanical and Industrial Engineering, University of Toronto

Abstract

Uncertainty is inherent in complex socio-technical systems such as in aviation, military, and surface

transportation domains. An improved understanding of how operators comprehend this uncertainty is

critical to the development of operations and technology. Towards the development of a model of time of

arrival (TOA) uncertainty, Experiment 1 was conducted to determine how air traffic controllers estimate

TOA uncertainty and to identify sources of TOA uncertainty. The resulting model proposed that operators

first develop a library of speed and TOA profiles through experience. As they encounter subsequent

aircraft, they compare each vehicle!s speed profile to their personal library and apply the associated

estimate of TOA uncertainty.

To test this model, a normative model was adopted to compare inferences made by human observers to

the corresponding inferences that would be made by an optimal observer who had knowledge of the

underlying distribution. An experimental platform was developed and implemented in which subjects

observed vehicles with variable speeds and then estimated the mean and interval that captured 95% of

the speeds and TOAs.

Experiments 2 and 3 were then conducted and revealed that subjects overestimated TOA intervals for

fast stimuli and underestimated TOA intervals for slow stimuli, particularly when speed variability was

high. Subjects underestimated the amount of positive skew of the TOA distribution, particularly in

slow/high variability conditions. Experiment 3 also demonstrated that subjects overestimated TOA

uncertainty for short distances and underestimated TOA uncertainty for long distances. It was shown that

subjects applied a representative heuristic by selecting the trained speed profile that was most similar to

the observed vehicle!s profile, and applying the TOA uncertainty estimate of that trained profile.

iii

Multiple regression analyses revealed that the task of TOA uncertainty estimation contributed the most to

TOA uncertainty estimation error as compared to the tasks of building accurate speed models and

identifying the appropriate speed model to apply to a stimulus. Two systematic biases that account for

the observed TOA uncertainty estimation errors were revealed: Assumption of symmetry and aversion to

extremes. Operational implications in terms of safety and efficiency for the aviation domain are

discussed.

iv

Acknowledgements

I would like to express my gratitude to the members of my thesis committee (Professors Paul

Milgram, John Senders, and Greg Jamieson) for their guidance and insightful comments throughout

the entire PhD process. I am particularly grateful to Paul who was willing to support a long-distance

mentoring relationship as we attempted to close the chasm between engineering and psychology,

and between theoretical and applied research. I am honoured to have had the opportunity to share

many discussions with John Senders who brought interesting insights to the TOA uncertainty

problem. I am grateful to Greg for inspiring my PhD studies, as it was because of Greg!s seminar at

NASA Ames Research Center in 2003 that I pursued graduate studies at the University of Toronto. I

also extend my appreciation to my external committee members, Professors Ian Spence and Esa

Rantanen, for their time and effort.

I have been very fortunate to have the support of my colleague and friend, Dr. David Foyle of NASA

Ames Research Center, who supported this endeavour in every way possible including: endless

scientific discussions about time of arrival uncertainty, experimental methods, and data analyses;

facilitating the development and testing of the experiments; and his continual moral support and

encouragement. I would also like to thank Ron Miller for his programming expertise and support in

developing the experimental platform.

I am very grateful for the financial support that I received from the following sources: NSERC (Canada

Graduate Scholarship), Queen!s University (Marty Memorial Scholarship), University of Toronto

(Dissertation Completion Grant), and NASA Ames Research Center (support for the human-in-the-

loop experiments).

Finally, I would like to thank Brian Gore for sharing in all of the trials and tribulations along the way.

This could not have been completed without Brian!s support and encouragement.

v

The Development and Evaluation of a Model of Time-of-Arrival Uncertainty

Table of Contents

!"#$%&'$((((((((((((((((((((((((((((((((((((((((((((((((((((((( ((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((())!

!'*+,-./01/2/+$# (((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((()3!

45!6789:;<::7=>8?@A?!99=B!C:D7@!E:FG4897!=G7H:=G:4@>6C8I::J@4=@?7845G=4!C:JHJ78>J(((((((((((((((;!

;(;:=G79@KF47=@G((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((;!

;(;(;:7)2/:,L:!%%)3&.:F+'/%$&)+$M:)+:$N/:!)%:7%&LL)':>&+&1/2/+$:JM#$/2 (((((((((((((((((((((((((((((((((((((((((((((; !

;(;(O::7)2/:,L:!%%)3&.:K/')#),+:JPQQ,%$:JM#$/2#((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((R!

;(;(R::K/')#),+:>&*)+1:&+0:J)$P&$),+:!-&%/+/## (((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((S!

;(;(S::K/L)+)+1:F+'/%$&)+$M(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((T!

;(O::98J8!945:@UV847=B8 ((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((W!

45!6789:O<::8I689=>8G7:;:X:!G:=GB8J7=Y!7=@G:!GK:698C=>=G!9H:>@K8C:@A:7@!:

FG4897!=G7H:=G:!B=!7=@G:JF9A!48:D7!I=E:@689!7=@GJ ((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((Z !

O(;::!B=!7=@G:JF9A!48:D7!I=E:@689!7=@GJ((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((Z!

O(O:8I689=>8G7:[=75:!=9:79!AA=4:4@G79@C:D!74E:JFUV847:>!7789:8I6897J((((((((((((((((((((((((((( ;\!

O(O(;::>/$N,0(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( ;;!

"#"#$#$!%&'()*)+&,(-################################################################################################################################################################## $$!

"#"#$#"!!.++&'&(/- #################################################################################################################################################################### $$!

O(O(O::6%,'/0P%/ (((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( ;;!

"#"#"#$!!0,('12/*()1,################################################################################################################################################################ $$!

"#"#"#"!!3)-(&,*456+442!7&8)9'&()1,################################################################################################################################# $"!

"#"#"#:!!;<+4')=4,(&8!>')&8-################################################################################################################################################# $:!

"#"#"#?!!%1-(@-(/2A!349')4B ################################################################################################################################################### $?!

O(O(R::9/#P.$# (((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( ;S!

"#"#:#$!!>C.!74'(&),(A!D&(),E-############################################################################################################################################ $F!

"#"#:#"!!0,(4'G)4H-#################################################################################################################################################################### $I!

O(R:!:>@K8C:@A:7@!:FG4897!=G7H:8J7=>!7=@G(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( ;Z!

O(S::JF>>!9H((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( O;!

45!6789:R<::>875@KJ:!GK:>879=4J:7@:8B!CF!78::7@!:FG4897!=G7H:8J7=>!7=@G(((((((((((((((((((((((((((((((OO!

R(;::!UJ79!47=GY:K=J79=UF7=@G!C:69@6897=8J (((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( OO!

R(O::8I689=>8G7!C:>875@KJ:!GK:>879=4J(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( O]!

R(O(;::8^Q/%)2/+$&.:7&#*(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( OZ!

R(O(O::>/$%)'#:$,:83&.P&$/:F+'/%$&)+$M:4&.)"%&$),+ ((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( R\!

45!6789:S<::8I689=>8G7:O:X:758:8AA847:@A:J688K:!GK:B!9=!U=C=7H:@G::7@!:FG4897!=G7H:

8J7=>!7=@G((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((R_!

S(;::J684=A=4:98J8!945:`F8J7=@GJ:!GK:5H6@758J8J (((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( R_!

S(O::8I689=>8G7!C:K8J=YG(((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( RT!

S(R::>875@K ((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( RW!

S(R(;::JP"a/'$#(((((((((((((((((((((((((((((((((((((((((((((((( ((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( RW!

S(R(O::>&$/%)&.# (((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((((( R]!

?#:#"#$!!.++&'&(/- #################################################################################################################################################################### :J!

?#:#"#"!!6()=/8) ########################################################################################################################################################################### :J!

vi

!"#"#$$%&'()*+&) """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" #,!

"#$#$#%!!&'()*+,-(.*'!/'+!0)/.'.'1#################################################################################################################################### $2!

"#$#$#$!!345).46!/'+!7*88*9:;<########################################################################################################################################### "%!

!"!$$-./012/ """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" !3!

!"!"4$$.567896)*$/:))* """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" !#!

!"!"3$$.567896)*$/:))*$;7<*'=5""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" !>!

!"!"#$$.567896)*$2?@"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" !A!

!"!"!$$.567896)*$2?@$;7<*'=5 """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" BC!

!"!"B$$.567896)*$2?@$;7<*'=$/D88)6&D""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" B3!

!"B$$EF/G0//F?H """""""""""""""""""""""""""""""""""""""""""""""" """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" BB!

!"B"4$$/+889&D$'I$-)5+J65"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" BB!

!"B"3$$17876967'<5$'I$6K)$G+&&)<6$.L:)&78)<6$9<*$/+MM)567'<5$I'&$N+6+&)$/6+*7)5"""""""""""""""""""""""""" B>!

!"B"#$$F8:J7(967'<5$I'&$6K)$H)L6$O)<)&967'<$@P7967'<$/D56)8"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" BA!

GQ@%2.-$BR$$.S%.-FT.H2$#$U$2?@$0HG.-2@FH2V$./2FT@2F?H$@HE$.S2-@%?1@2F?H"""""""""""""""""""""""""""""""B,!

B"4$$FH2-?E0G2F?H"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" B,!

B"4"4$$@**&)557<M$17876967'<5$'I$.L:)&78)<6$3 """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" B,!

B"3$$-./.@-GQ$W0./2F?H/ """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" >C!

B"#$$T.2Q?E """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" >4!

B"#"4$$/+XY)(65"""""""""""""""""""""""""""""""""""""""""""""""" """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" >4!

B"#"3$$.L:)&78)<69J$E)57M<"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" >4!

=#$#>#%!?@/A4!$/B!!34C48*<.'1!/!D.5)/)E!*6!F<44+!?)*6.84A###################################################################################### G%!

=#$#>#>#!!?@/A4!$5B!!HA(.I/(.'1!0JK!;'-4)(/.'(E!6*)!0)/.'4+!?)*6.84A############################################################### G"!

=#$#>#$#!!?@/A4!$-B!!HA(.I/(.'1!0JK!;'-4)(/.'(E!6*)!L49M!;'()/.'4+!?)*6.84A ############################################### GN!

B"!$$-./012/ """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" >,!

B"!"4$$%K95)$#9R$E)P)J':7<M$9$17X&9&D$'I$/:))*$%&'I7J)5"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" >,!

=#"#%#%!!HA(.I/(4+!O4/'!F<44+########################################################################################################################################### G2!

=#"#%#>!HA(.I/(4+!F<44+!P.'+*9A#################################################################################################################################### NQ!

B"!"3$$%K95)$#XR$.5678967<M$2?@$+<()&697<6D$I'&$6&97<)*$:&'I7J)5 """"""""""""""""""""""""""""""""""""""""""""""""""""""""" A3 !

=#"#>#%#!!HA(.I/(4+!0JK!;'-4)(/.'(E ################################################################################################################################ N>!

=#"#>#>#!!0JK!;'-4)(/.'(E!HA(.I/(.*'!H))*) ################################################################################################################## NG!

=#"#>#$!!HA(.I/(4+!0JK!&'(4)C/8!FEII4()E################################################################################################################### NN!

B"!"#$$%K95)$#(R$.5678967<M$2?@$0<()&697<6D$I'&$0<6&97<)*$%&'I7J)5"""""""""""""""""""""""""""""""""""""""""""""""""""" A, !

=#"#$#%!!&+4'(.6.-/(.*'!*6!0)/.'4+!/'+!;'()/.'4+!?)*6.84A ####################################################################################### RQ!

=#"#$#>!!S*))4-(!&+4'(.6.-/(.*'!*6!F<44+!?)*6.84############################################################################################################ RQ!

=#"#$#$!!F.I.8/).(E!T/(.'1A ##################################################################################################################################################### R%!

=#"#$#"!!HA(.I/(4+!0JK!P.'+*9######################################################################################################################################## R>!

B"B$$G?HG10/F?H$@HE$?%.-@2F?H@1$FT%1FG@2F?H/ """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" ZA!

GQ@%2.-$>R$$T?E.1$-.NFH.T.H2 """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""Z,!

>"4$2FT.$?N$@--F[@1$0HG.-2@FH2V$T?E.1$-./[email protected]$@HE$-.NFH.E """""""""""""""""""""""""""""""""""""""""""""""" Z,!

>"3$$/?0-G./$?N$2FT.$?N$@--F[@1$0HG.-2@FH2V$./2FT@2F?H$.--?-""""""""""""""""""""""""""""""""""""""""""""""" ,C!

>"#$$G@0/./$?N$2FT.$?N$@--F[@1$0HG.-2@FH2V$./2FT@2F?H$.--?-"""""""""""""""""""""""""""""""""""""""""""""""""" ,4!

>"#"4$$@55+8:67'<$'I$/D88)6&D """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" ,4!

>"#"3$$@P)&57'<$6'$.L6&)8)5""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" ,!!

GQ@%2.-$AR$$/0TT@-V\$1FTF2@2F?H/$@HE$FT%1FG@2F?H/""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""",>!

A"4$$T?E.1$@//0T%2F?H/$@HE$1FTF2@2F?H/ """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" ,A!

A"4"4$$/:))*$75$H'&89JJD$E756&7X+6)*"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" ,A!

A"4"3$$@P97J9X7J76D$'I$@((+&96)$/:))*$-)9*'+65 """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" ,Z!

vii

!"#"$%&'()%*+%,--'./0%123)-4/'245%674'(/4'*2%,89)-)7%4*%49)%:-*;*7)8%&<*=74);%:-*3)77%>*8)0

""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""" ??!

!"@%%A:6B,&CAD,E%C>:ECF,&CADG%HAB%,IC,&CAD""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""#JJ!

!"@"#"%%C(;0'3/4'*27%+*-%KL(/2%6--*- """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""#JJ!

!"@"@%%C(;0'3/4'*27%+*-%KL(/2=F)24-)8%,L4*(/4'*2%4*%GL;;*-4%,'-%&-/++'3%F*24-*0 """""""""""""""""""""#J@!

!"$%%H1&1B6%MCB6F&CADG """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""#JN!

FK,:&6B%OP%%B6H6B6DF6G """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""#J!!

,::6DMCF6G""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""##$!

,::6DMCQ%,#P%%6Q:6BC>6D&%#%:BA&AFAE """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""##$!

,::6DMCQ%R@P%%6Q:6BC>6D&%@%M6>ASB,:KCF%T16G&CADD,CB6""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""##O!

,::6DMCQ%R$P%%6Q:6BC>6D&%@%M6RBC6H%T16G&CADG"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""##?!

,::6DMCQ%RUP%>6&KAM%HAB%M6&6B>CDCDS%,F&1,E%G:66M%,DM%&A,%GFAB6G"""""""""""""""""""""""""""""""""#@J!

,::6DMCQ%F#P%%6Q:6BC>6D&%$%:BA&AFAE """"""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""#@#!

,::6DMCQ%F@P%%6Q:6BC>6D&%$%M6>ASB,:KCF%T16G&CADD,CB6""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""#@O!

,::6DMCQ%F$P%%6Q:6BC>6D&%$%M6RBC6H%T16G&CADG """""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""""#@?!

1

CHAPTER 1: TIME-OF-ARRIVAL (TOA) UNCERTAINTY IN COMPLEX

SOCIO-TECHNICAL SYSTEMS

1.1 INTRODUCTION

The need to predict time of arrival (TOA) is becoming increasingly prevalent in complex

socio-technical systems. For example, pilots and air traffic controllers predict TOA in order

to identify and prevent potential collisions, taxi dispatchers predict driver arrival times, and

weather forecasters use complex meteorological and oceanographic models to predict the

time at which a hurricane is expected to make landfall. These diverse domains all share the

common problem of TOA prediction uncertainty. In these cases, even when the data used

to make TOA predictions are essentially accurate at the specific point in time that they are

collected, there is some degree of uncertainty inherent in the prediction, since the world is

dynamic and continues to change. That is, even if the current location, course, and speed

are known, an object or event can change course and speed in the future, and this

introduces uncertainty into the predictions.

1.1.1 Time of Arrival Uncertainty in the Air Traffic Management System

The aviation domain has been selected for investigation in the current research in order to

ground it in a real-world problem. In the Air Traffic Management (ATM) system it is

necessary for air traffic controllers (ATC) to have an accurate view of possible future events

to prevent dangerous or undesirable situations and to optimize traffic flow. ATM is greatly

vulnerable to uncertainty, and controllers and pilots are frequently required to make

important decisions based on uncertain or incomplete information, especially when they

involve predictions of future system states. The sources of TOA uncertainty include factors

such as wind shifts, weather, route complexity, turbulence, pilots! intentions, pilot variability,

and prediction time span. Erroneous interpretation of this uncertain information carries with

it a potential for serious consequences.

Many attempts have been made to minimize uncertainty in the aviation system, such as

reliance on procedures and standardized routes intended to minimize variability and

increase predictability of aircraft movements. Also, great efforts have been taken to improve

the reliability of automation algorithms to minimize uncertainty (Kuchar, 2001; van Doorn,

2

Bakker & Meckiff, 2001). However, it is not possible to eliminate uncertainty from human-

automation systems altogether, particularly because of the latent variability in human and

environmental factors. Furthermore, this approach may not be sustainable, as new ATM

concepts that require greater precision and greater accuracy, such as reduced separation

minima and coordinated runway crossings, are introduced. These concepts require pilots

and controllers to anticipate trajectories of traffic to facilitate detection and avoidance of

potential conflicts and to make more efficient use of airspace and runways. However, all

these tasks inherently involve uncertainty, as future states within such complex system

cannot be known in advance.

Much research has been conducted to explore planning tasks that require anticipation and

prediction in uncertain, dynamic environments. Boudes and Cellier (2000) studied human

anticipation when controlling dynamic environments (such as ATC or power plants) in which

the operator does not have complete control over the evolution of the system. As per their

definition, !anticipating" involves evaluating the future state of a process and deciding,

according to the subsequent representations developed, which action is to be undertaken

and when. It is, of course, necessary to have an accurate view of possible future events to

prevent dangerous or undesirable situations and to better prepare for favourable conditions

for traffic evolution and optimization.

However, prediction and anticipation are not functions that people perform with high

precision, given known perceptual and cognitive limitations. For example, human operators

are limited in their ability to integrate information over time. Such tasks normally require

mental computation to extrapolate the future from the present and past, and may require that

future estimates be stored in working memory (Wickens & Hollands, 2000). Due to limits on

memory, people tend to give undue weight to early cues (primacy), those that support the

initial hypothesis (anchoring), and very recent cues (recency) (Kahneman, Slovic & Tversky,

1982; Wickens & Hollands, 2000). Furthermore, humans are not good estimators of

variability and are limited in their ability to perceive trends emerging in data that require

extrapolation of exponential or accelerating growth trends.

Non-specialized subjects (Schiff & Oldak, 1990) and trained experts, such as air traffic

controllers (Boudes & Cellier, 2000), also tend to exhibit a conservative bias in that they

make decisions that are risk averse. For example, the conservative bias is demonstrated by

3

an air traffic controller who is unaware of the exact location or speed of an aircraft and

leaves very large !space envelopes" around each aircraft. In essence, this decision bias is a

way of preparing optimal solutions to absorb unforeseen events, because it allows for a time

margin for solving problems. With large space envelopes, the controller has more time to

issue commands to the aircraft to resolve and avoid potential conflicts. However, it is

important to note that, although functional, the bias may also produce temporal errors and

introduce risk into the ATM system. For example, the temporal errors caused by the

conservative bias may lead controllers to ignore potential conflicts because he/she may

have a false sense that the aircraft are safely separated despite his/her lack of actual

knowledge of the aircraft"s precise location or speeds. The controller"s !safer" space

envelopes may also increase system risk in some cases, for example when the pilot must

carry out a go-around manoeuvre rather than land the aircraft to maintain the controller"s

conservative space envelope. Furthermore, the conservative decision bias may also limit

system efficiency and throughput by creating the need for holding patterns and departure

queues, both of which may be particularly undesirable as we face increasingly congested

airspace and runways.

1.1.2 Time of Arrival Decision Support Systems

To support TOA prediction in dynamic, uncertain environments, Decision Support Systems

(DSSs) are being developed and introduced into many complex domains, in an attempt to

reduce operator workload, improve decision-making, and reduce errors. For example,

automated conflict detection systems have been developed using predictor algorithms and

dynamic models to propagate aircraft state forward in time and predict conflicts over time

and space (Kuchar, 2001; Wickens & Morphew, 1997; Johnson, Battiste & Holland, 1999).

This requires some assumptions regarding the future intentions of each aircraft, such as

assuming that each aircraft will continue to fly at its current heading, speed, and altitude.

When such assumptions are violated, however, the inaccuracies may result in unpredicted

conflicts or unnecessary manoeuvres. In reality, in other words, such predictive conflict

detection systems are rarely 100% reliable. A recent estimate suggested that 85% reliability

is typical of a conflict detection system that predicts conflicts with long look-ahead times (i.e.,

20 minutes) in airspace subject to uncertainties arising from turbulence and future pilot

control actions between the time of alert and the predicted conflict (Xu, Wickens &

Rantanen, 2004).

4

Much of the research on automation reliability (see Wickens, 2000) has assumed a cueing

system that notifies the operator of impending events. In these systems, automation is often

considered to be either correct or incorrect – that is, the automation correctly cues the

human operator to an object that exists in the world, or does not. However, a DSS that

provides TOA predictions is more likely to have varying degrees of error, rather than error

versus no error. Yet, despite the known lack of perfect reliability and the inherent uncertainty

due to factors that make perfect prediction impossible (i.e., wind shifts, turbulence, pilot

variability, prediction time span, etc.), information about uncertainty associated with

prediction errors is not normally presented to the human decision maker (For one exception,

however, see Lee & Milgram, 2008).

1.1.3 Decision Making and Situation Awareness

In ill-structured, uncertain, and dynamic environments, automation has been less-than-

successful, and has even created serious problems, as the automation becomes "brittle"

and cannot adapt to unknown or unforeseen situations (Cohen, 1993). It fact, the very

features of real-world environments, such as their ill-structuredness, uncertainty, shifting

goals, dynamic evolution, time stress, multiple players, etc., typically defeat the static,

bounded models typically used by decision support systems (Cohen, 1993). Cohen

suggests that each decision involves a unique and complex combination of factors, which

seldom fits easily into a standard decision analytic template, a previously collected body of

expert knowledge, or a predefined set of linear constraints, and that users are sometimes

better than the aids in their ability to recognize and adapt to such complex patterns.

Hansman and Davison (2000) suggest that when decisions must be made based on

incomplete data [or uncertain information], humans are better at decision making than is

automation that follows rule-based strategies, because human operators can formulate a

more comprehensive picture and detect patterns indicating that a system is straying outside

normal limits. It is proposed that, even though humans are better at managing their tasks in

ill-structured, uncertain, dynamic environments, their performance in these environments

may be improved if provided with visualization tools that enable them to forecast and predict

future spatial-temporal states and their associated uncertainty.

5

Another rationale for placing the human in the role of estimating TOA uncertainty associated

with a TOA prediction is evident from Wickens! (2000) research on automation reliability. It

has been shown that, when the operator is unaware that an automated system is less-than-

perfectly reliable, performance decrements, such as slowed failure detection time and lower

situational awareness (SA), are observed. This occurs because human operators tend to

become complacent and overly trusting of the automated systems when they should be

monitoring operations, considering alternative hypotheses, or preparing for alternative

courses of action. When users are aware of automation unreliability, however, they tend to

calibrate their cognitive strategies with the actual level of imperfection (Wickens, 2000) and

exhibit improved allocation of visual attention to raw data, as manifested by their visual scan

patterns, ultimately yielding improved performance. Paradoxically, because an operator

may attend more to the raw data underlying an automated cue, or implement a wider scan

pattern when they are uncertain of the outcome, they may actually perform better than when

they are certain that the automation is perfectly reliable.

Endsley, Bolte & Jones (2003) proposed that asking operators to manage TOA uncertainty

explicitly may help operators maintain operator attention and situation awareness, and

actually alleviate some of the negative consequences frequently attributed to increasing

levels of automation. In essence, it may not be desirable to totally eliminate uncertainty from

the environment; rather, it is important for operators to maintain an accurate awareness of

the state of the world in which they are operating, and that includes an awareness of the

level of certainty of the world or system. Endsley et al. proposed that the relationship

between confidence and situation awareness (SA) plays a significant role in the quality of

decisions. If confidence is high, and SA is high, an operator is more likely to achieve a good

outcome. However, given the same level of SA, but lower levels of confidence, the operator

will be less likely to act, choosing to gather more information, and thus be ineffective. Lack

of confidence therefore can contribute to indecisiveness and delays and the operator in this

case may be less likely to revise a plan when appropriate. On the other hand, if an operator

with poor SA recognizes that it is poor (i.e., has low confidence) he will correctly choose not

to act or will continue to gather more information to improve SA, and avert a potentially bad

outcome. The worst situation is the operator who has poor SA, but high confidence in his

erroneous picture of the world. This operator will likely act boldly but inappropriately. A

critical issue, according to Endsley et al., is ensuring not only that operators have a good

6

picture of the situation, but also that they are able to maintain an appropriate amount of

confidence with respect to that picture. As such, it is very important that operators

understand the degree and nature of uncertainties in the system.

Endsley et al. discussed the relationship between situation awareness (SA) and decision

making (Endsley, Bolte & Jones, 2003), together with the task/system factors and individual

factors that influence both of them. They also showed that system capability, interface

design, operator stress, workload, task complexity, and degree of automation can influence

an operator!s SA, decisions, and actions. According to Endsley et al., SA comprises three

levels. Level 1 refers to perception of the elements of the current situation. In an aviation-

related TOA scenario, an operator would be considered as having good level 1 SA if he/she

knows the location of each aircraft, its speed, and distance to go. Level 2 refers to

comprehension of the current situation, or the ability of the operator to determine whether

the aircraft are ahead, behind, or on schedule. Level 3 refers to projection of future states,

or the ability for the operator to survey the airport surface and predict whether all of the

aircraft under his control will arrive at the final destination on time, early, or late, based on

potential traffic and/or route complexities.

1.1.4 Defining Uncertainty

Before embarking on research to determine how operators comprehend TOA uncertainty, it

is imperative to first define TOA uncertainty. There is no clear consensus in the literature as

to how to define or quantify uncertainty associated with a TOA prediction, or what factors

should be included in an uncertainty metric. As a starting point, Schunn, Kirschenbaum and

Trafton (2005) have produced a potentially useful taxonomy of informational uncertainty that

is arguably the most comprehensive to date. Informational uncertainty is defined as

objective uncertainty in the information a person uses to make decisions. Drawing on

expertise in three domains – weather forecasting, submarine / sonar operations, and

medical imaging – they defined four classes of informational uncertainty that are prevalent

and generalize across many complex domains and labelled them as Physics,

Computational, Visualization, and Cognitive uncertainty. The focus of this research is

cognitive uncertainty; however at the same time it is acknowledged that this interacts with

several other forms of uncertainty, such as future prediction uncertainty, and physical

7

uncertainty associated with aircraft dynamics and environmental factors.

Physics Uncertainty refers to uncertainty in the raw, measured data. This is further

subdivided into three sub-types: Absence of signal being measured, noise or bias in the

signal, and noise or bias introduced as the signal is transduced (or converted into data).

Computational Uncertainty refers to the computational procedures that are applied to data

before they are presented to the human operator. There are three ways that computational

procedures can add new sources of uncertainty. The first (and the focus of the present

research) is future prediction uncertainty. Data are collected and may be accurate at a

specific point in time; however, the world is dynamic and continues to change.

Computational procedures either make no corrections for these changes or make only

partially accurate corrections, and this introduces a potentially large source of uncertainty. A

second form of computational uncertainty is statistical artefact uncertainty, in which

statistical algorithms such as data smoothing introduce artefacts in the displayed data and

distort reality. A third type of computational uncertainty is that related to the use of fast and

cheap algorithms that present the current best guess so that the problem solver can make

quick (though approximate) judgements if necessary. The third category of uncertainty

information within the taxonomy is Visualization Uncertainty. Here, the presentation of data

in the form of a map, table, or graph creates uncertainty, either by failing to represent

relevant information or by presenting it in a misleading way. The final category is Cognitive

Uncertainty. Inclusion of this category in the taxonomy acknowledges that humans, in acting

as an encoding device, information storage/retrieval device, and procedure enactors, are a

key, but error-prone, part of the information system. As possible errors are introduced into

these tasks, sources of information uncertainty are introduced. Cognitive uncertainty can be

created because of perceptual errors, memory encoding errors, information overload,

retrieval errors, background knowledge errors, and skill errors1.

1.2 RESEARCH OBJECTIVE

Given that uncertainty is inherent in systems, it is important to consider how operators in

general comprehend such uncertainty, and to identify conditions under which an operator

1 Because skilled behaviour is conventionally modelled as being automatic, "skill errors" should arguably not be

included as a type of cognitive error.

8

believes there to be more or less uncertainty than actually exists in the world. With regards

to TOA uncertainty in particular, an improved understanding of how operators comprehend

this uncertainty and the biases inherent in the process is critical to the development of

operations and technology in complex, uncertain, dynamic environments that involve

estimating time of arrival. A model of TOA uncertainty estimation is a critical first step

toward the development of user-centred DSSs. It is expected that the findings of this

research will extend to operators who make decisions in a variety of complex, dynamic, and

uncertain environments – i.e., not only aviation, but also process control, surface

transportation, and military command and control domains.

9

CHAPTER 2: EXPERIMENT 1 – AN INVESTIGATION AND PRELIMINARY MODEL OF

TOA UNCERTAINTY IN AVIATION SURFACE (TAXI) OPERATIONS

2.1 AVIATION SURFACE (TAXI) OPERATIONS

The concept of TOA uncertainty is applicable to many domains; however, it is helpful to work

in a single problem domain in order to ground the problem in reality. The chosen domain is

that of airport surface (taxi) operations. In current day airport surface operations, a

controller is responsible for both issuing taxi clearances and monitoring aircraft for

conformance to the issued taxi route. Controllers monitor for potential conflicts and act in a

tactical / reactive manner by issuing a hold command to aircraft when needed to prevent

conflicts. The ability of a controller to predict the time of arrival of an aircraft is limited, mostly

because the controller lacks the tools and information needed to make such predictions.

This leads to ultra-conservative, risk averse behaviour, as exhibited by the tendency for

controllers to require larger than necessary separations between aircraft. For example, when

clearing taxiing aircraft to cross an active runway, the controllers tend to build a queue of

several aircraft to cross the runway all at once during a single large break in arriving traffic,

rather than attempting to sequence the taxiing aircraft in a coordinated fashion between

arriving aircraft. This ultimately reduces airport capacity and increases the delay

experienced by passengers on the airport surface, with the first aircraft in the queue often

waiting 20 minutes or more. Furthermore, an aircraft that is stopped requires twice as long to

!spool up" and cross a taxiway/runway than an aircraft that arrives !just in time" without

stopping. This effectively doubles runway occupancy time and reduces arrival and

departure throughput. Upon arrival at its destination (departure runway or gate), an aircraft

will often have to stop and wait (either for take-off or for the gate to become available). This

creates inefficiencies, as a stopped aircraft may block a gateway, a taxiway, or runway exit

and restrict movements.

The future aviation system is moving to 4-D operations2, which will involve the issuance of a

time- or speed-based clearance by the controller, with the additional requirement to monitor

2 The term "4-D operations" is currently used in the aviation industry in reference to the time/speed

element; however, in airport surface (taxi) operations, the third dimension, altitude, is of course constant.

10

for TOA conformance. It is expected that pilots will be required to arrive at the runway (for

crossing or take-off) within a specified TOA window. The acceptable duration of this

window, and therefore the precision requirements, have not yet been determined.

Currently, automated systems are being developed to assist the controller in issuing

commanded TOA, and also to predict aircraft TOA, but these predictions will be inherently

uncertain, given the dynamic nature of the environment. Pilots! taxiing ability, weather,

traffic, and route complexity are among the factors that may cause an aircraft to alter its

speed and thus may influence its probability of arriving on time. The degree to which

controllers can effectively estimate TOA uncertainty will impact their situational assessment

and decision-making abilities, yielding safer, more efficient operations.

As a first step in creating a model of how ground controllers actually deal with TOA

uncertainty in their environment, experiment 1 was carried out with expert ground

controllers. The goal of this experiment was to understand factors that are relevant to

estimating TOA uncertainty in real-world settings and which should be considered for further

evaluation in future studies. A second goal was to gain an understanding of the task of TOA

uncertainty estimation, as is characterized in a preliminary process model of TOA

uncertainty estimation later in this chapter, and evaluated in more rigorous experimental

studies presented in Chapters 4 and 5.

2.2 EXPERIMENT WITH AIR TRAFFIC CONTROL (ATC) SUBJECT MATTER EXPERTS

This experiment explored factors that contribute to air traffic controllers! certainty that a time-

based taxi plan issued to pilots would be carried out as scheduled. The experiment was

conducted in two parts: First a desktop simulation experiment was conducted to collect

subjective ratings, given a number of factors believed to influence TOA uncertainty. Second,

the same controllers participated in a debrief interview to further understand the factors that

contribute to TOA uncertainty in actual current-day and potential future time-based taxi

operations.

11

2.2.1 Method

2.2.1.1 Participants

One current and three recently retired air traffic controllers with extensive major airport tower

experience were recruited. Their mean age was 56.75 years and their mean years of

experience at major international and domestic airports was 23.5. All participants had

worked in local controller and ground controller positions, while two of the participants had

additional experience in TRACON (tower control). They were all familiar with tower

operations and procedures and common tower radar and computer systems.

2.2.1.2 Apparatus

Static pictures of typical taxi routes (as shown in Figure 2-1) at a complex airport, Dallas

Forth Worth (DFW), were presented using SuperLab software on a desktop Macintosh G3

computer with a 23” Apple cinema display. Each picture depicted a taxi route (highlighted in

magenta) and an aircraft icon (yellow triangle) that showed the aircraft!s position along the

route. The time that had elapsed since the aircraft started the route was shown in a white

box beside the aircraft icon (shown as 2 minutes in Figure 2-1). Each taxi route ended at a

runway that was highlighted in yellow. An assigned runway crossing time window (a window

of time within which the aircraft had to reach the runway threshold) was presented in a

yellow box beside the runway crossing point (shown as 5:41 to 6:11 in Figure 2-1).

2.2.2 Procedure

2.2.2.1 Introduction

The instructional protocol for this experiment is presented in Appendix A1. The concepts of

coordinated runway crossings and time-based taxi clearances were explained to each

participant, as well as the rationale in terms of both system-wide safety and efficiency

benefits. Participants were told that, if an aircraft arrives early at the runway-to-be-crossed,

the aircraft will have to stop and wait for the assigned crossing time, which doubles the time

required for an aircraft to cross a runway due to engine spool-up times. Furthermore,

arriving late will mean that the aircraft has missed the crossing window and this will

inevitably cause a delay.

12

Figure 2-1. Experiment 1 stimulus: Dallas Fort Worth (DFW) Airport layout, shown with an aircraft (yellow triangle) taxiing along a taxi route (magenta path) en route to a to-be-crossed runway (yellow highlight). Elapsed time is shown in the white box and assigned runway-crossing window is presented in the yellow box.

2.2.2.2 Distance/Speed Calibration

In order to calibrate the air traffic controllers to the approximate times required to taxi along

different routes at DFW, participants were shown four different taxi routes overlaid on the

DFW airport (as in Figure 2-1 above). Two routes (6000! and 12,000!) running east/west and

two running north/south across the airport were displayed. The time to complete each route

at three different speeds (16 kts, 20 kts, and 24 kts) were also provided, assuming constant

speed, as shown in Table 2-1, along with the graphical route depictions. The controllers

were instructed that typical taxi speeds ranged from 16 to 24 kts, with an average speed of

20 kts.

13

Table 2-1. Distance and TOA for each route distance and speed combination.

Time to complete route per route distance (in Minutes:Seconds)

Taxi Speed

6,000! 12,000!

16 kts 3:42 7:24

20 kts 2:58 5:56

24 kts 2:28 4:56

2.2.2.3 Experimental Trials

Subjects were shown 162 static pictures of taxi plans in progress as in Figure 2-1. Each

picture showed the taxi plan and an aircraft!s progress in carrying out the plan – both its

current location along the route and the elapsed time since the aircraft started taxiing. Each

picture was a combination of the five independent variables presented in Table 2-2: route

distance, route complexity, runway crossing window size, position along the route, and

speed. Route Distance was expected to influence TOA uncertainty estimations, with longer

routes producing lower TOA certainty than shorter routes. Route complexity was

represented as the number of turns required to complete the route. Turns add variability to

the taxi speed, as pilots must reduce speed to safely manoeuvre the turn and accelerate

after the turn to resume taxi speed. It was expected that the increased speed variability

would increase TOA uncertainty. The duration of the runway-crossing window was a

manipulation of the degree of precision required by the taxiing pilot. It was expected that

controllers! TOA uncertainty would be higher when the aircraft was required to make a very

small crossing window than for a larger crossing window. The aircraft!s position along the

route was also manipulated. It was expected that TOA uncertainty would be larger for

aircraft at the start of their route than for those that had progressed appropriately along their

route towards their destination. The final independent variable was the nominal taxi speed

of the aircraft.

The subjects! task was to assess the information available in the depicted taxi plan and

provide a rating of how certain he/she was that the aircraft would arrive at its destination

within the allotted runway crossing window – neither too early, nor too late. Following the

sizeable literature (e.g., see Soll & Klayman, 2004; Block & Harper, 1991) that use subjects!

14

ratings of certainty as a proxy for estimating probability, subjects in this experiment provided

ratings on a seven-point scale from 1 (Very Low Certainty) to 7 (Very High Certainty) by

entering a number on a standard keyboard, and were encouraged to use the entire range of

the seven-point scale. Subjects were told to assume that it was a clear day at DFW airport

and that no traffic conflicts were anticipated. Also, they were told that the aircraft represent

a Boeing 757 that taxis at 20 kts on average but that any number of factors could influence

their ability to maintain a constant 20 kts.

Table 2-2. Independent Variables

Independent Variable Levels

Route Distance 6000!, 12000!

Route Complexity 1, 2, 3 turns per 6000!

Runway Crossing Window Size 15, 30, 45 seconds

Position along route Start, 1/3rd, 2/3rd

Taxi Speed 16, 20, 24 kts

Subjects first completed 18 practice trials that exposed them to each level of the five factors.

Then, after a short break, subjects completed 162 experimental trials comprising each

combination of the five factors. The order of the 162 trials was randomized, and they were

presented in 4 blocks, each separated by a short break. The task was self-paced.

2.2.2.4 Post-study Debrief

Upon completion of the experimental trials, a semi-structured interview was conducted with

each of the ground controllers to better understand their current tasks, how these might

change with the introduction of 4-D clearances, and their perceptions of TOA uncertainty on

the airport surface. Specifically, the purpose of the interviews was to explore factors that

contributed to their perceptions of certainty that a time-based taxi plan once issued to a pilot

would be carried out as scheduled. The interview questions are presented in Appendix A2.

2.2.3 Results

Prior to presenting the results of this experiment, is it noteworthy to recall that the main goal

of this experiment was to determine which factors are relevant to the task of estimating TOA

15

uncertainty, for the purposes of developing a model of the TOA uncertainty estimation task

for future empirical evaluation.

2.2.3.1 TOA Certainty Ratings

Recall that the dependent measure was a subjective rating of TOA certainty, with lower

numbers representing low certainty, and higher numbers representing higher certainty. The

mean and standard error of the ratings for each level of the five independent variables are

presented in Table 2-3. A Distance (3) x Complexity (3) x Runway window crossing size (3)

x Position (3) x Speed (3) repeated measures analysis of variance (ANOVA) was conducted.

Given the exploratory nature of this experiment, an alpha level of 0.1 was used in

interpreting these results.

There was some evidence that, as route complexity (number of turns) increased, the

subjects! TOA certainty ratings decreased, F(2,4)=5.00, p=.08. In particular, the high

complexity condition (3 turns) yielded lower certainty scores than both low (1 turn) and

medium complexity (2 turns; p<.05). Recall that route complexity represents speed

variability, thus this suggests that factors that contribute to speed variability on the airport

surface may reduce TOA certainty. (These factors were explored further in the debrief

interviews, as discussed in section 2.2.2).

Table 2-3. Certainty Ratings Mean and Standard Error

Independent Variable Condition Mean Rating

Standard Error

Route Distance 6000!

12000! 4.9 5.2

.627

.415

Route Complexity 1 turn 2 turn 3 turn

5.3 5.2 4.7

.367

.602

.566

Runway Crossing Window 15 sec 30 sec 45 sec

4.9 5.1 5.1

.496

.534

.502

Aircraft Position Start 1/3rd 2/3rd

5.3 4.6 5.3

.617

.406

.555

Speed 16 20 24

5.5 5.1 4.5

.388

.424

.730

16

There was a small, but significant, increase in subjectively rated TOA certainty as a function

of runway crossing window size, F(2,4) = 15.45, p=.013, with lower certainty ratings for the

15 sec window than both the 30 and 45 second window. This finding suggests a possible

threshold level associated with the required TOA precision at which the subjects! certainty

associated with an aircrafts! TOA may drop.

Speed yielded interesting results, with subjects! certainty scores decreasing as speed

increases, F(2,4)=5.829, p=.065. Comments made by the subjects during the debrief

interview suggested that certainty scores were lower under high-speed conditions because

they were unsure if the aircraft could maintain the high speed throughout the entire route,

especially given the required turns in the route.

Although there was no significant effect of route distance, there was a significant effect of

the aircraft position along the route on the subjects! certainty ratings, F (2,4)=4.84, p=.086.

As can be seen in Table 2-3, when the aircraft was 1/3rd of the way along the route, certainty

ratings were lower than when the aircraft was either at the start or 2/3rd along the route.

This suggests that the most uncertainty was perceived when the aircraft had not travelled

very far (thus the ability to assess mean speed was limited) and when there was a longer

distance remaining (thus the potential for increased variability in the speed on the route

remaining).

In summary, turn complexity, or factors expected to increase speed variability reduced the

subjects! certainty ratings, as did aircraft speed, short runway window crossing sizes (or high

TOA precision requirements), and unknowns regarding aircrafts! speed variability when only

a short distance has been travelled and larger distance remained. These results indicate

that future empirical studies should manipulate speed variability and distance remaining.

Related, feedback from the SMEs interviewed after this experiment revealed the need for

dynamic simulations so that subjects could better understand the amount of speed variability

for a given aircraft. Showing the aircraft position along the route with elapsed time (see

Figure 2-1) provided no indication as to whether the aircraft!s speed was constant or

variable.

17

Also, while the precision required for an aircraft to arrive at its destination (runway crossing

window) affected the subjects! ratings of certainty, results from the debrief interviews

suggested that this was a coarse level of granularity. Future studies should consider using

TOA estimation as a dependent variable rather than an independent variable – that is, rather

than asking subjects to rate their certainty associated with the aircraft arriving within a

certain time window, future studies should ask subjects to estimate the TOA window in

which they are certain the aircraft will arrive.

2.2.3.2 Interviews

The post-study interviews with the air traffic controllers were useful to provide insights into

the ground controller!s task. First, it is clear that, at any given time, a controller is

responsible for monitoring several aircraft simultaneously, often more than 20 aircraft, each

with its own starting point, destination, and route. Second, it was noted that, in time-metered

situations, it is generally beneficial for a controller to detect an aircraft that will arrive earlier

or later than the commanded time, so that the controller can issue a new speed command,

reroute the aircraft, or reroute another aircraft. Third, the uncertainty associated with on-

time arrival is due primarily to the variability of speed of the aircraft, which is determined by

the pilot in control of the aircraft. An important assumption for future operations is that each

pilot will be made aware of the optimal speed to travel and uncertainty with regard to the

TOA will be based on the pilot!s ability to maintain that speed.

The debrief interviews from the four air traffic controllers were synthesized into three

relevant topic areas, as discussed next.

1. How controllers determine TOA and TOA uncertainty

At the outset of this experiment, it was assumed that air traffic controllers calculate time of

arrival, albeit with rough approximations, by considering approximate route distances and

aircraft speeds. However, results of the debrief clearly suggest otherwise. Instead, it was

revealed that controllers develop heuristics to determine how long a taxiing aircraft should

take to complete a given route. These heuristics are built over years of on-the-job

experience. For example, the controllers stated:

18

• “I never thought about how long a 12000! route at SFO takes. I really don!t

know. I just know by experience how long it takes to get from one side of the

airport to the other”

• “Today, I found myself using math calculations [to do the experimental task]. I

would never do this in the real world.”

• “With experience at an airport, you just know how long it takes to travel a taxi

route”

These TOA heuristics developed by the controllers take a number of factors into

consideration, including the airport layout, aircraft type, airline culture, and traffic flow. For

example, controllers had developed expectations that pilots of a certain airline could

consistently be relied on to make an expedited runway crossing, whereas for others they

would take a different, more conservative approach, because the controller was uncertain

whether those pilots would comply. They cited issues such as corporate culture, experience

at the airport, and aircraft type as factors in their decision. The following are direct quotes

from the air traffic controllers selected to shed light on this process:

• “Pilots flying out of their own base [airport] are very familiar with airport

operations. They know the system and I can be more certain of their TOA.

They are easier than the "once a month! pilots, who tend to be more cautious

and apprehensive”.

• “We [ATC] have running jokes about airlines and which ones will comply to

expedited clearances. Some we know and others … maybe they will, maybe

they won!t”

• “I never told [Airline X] to do anything [with a time element to it] but [Airline Y]

would always be able to do it”

2. Sources of TOA uncertainty

The experiment reported above considered four factors that were expected to contribute to a

controller!s uncertainty of an aircraft!s TOA: route distance, route complexity, runway-

crossing window size, and position along the route. One of the findings of that experiment

was that speed variability, as manipulated by route complexity, was a large contributor to

TOA uncertainty. This finding was echoed by the controllers in the debrief interviews. One

19

pilot stated “It is the human element that is the big problem. The pilot has to be comfortable

with the visibility, the equipment, and the taxi route/ airport layout. The pilot has the final

say”. Other factors, in addition to those tested as independent variables in experiment 1,

that contribute to speed variability, and that controllers consider in estimating TOA

uncertainty include:

• Traffic patterns/ volume

• Aircraft type

• Weather

• Visibility

• Airline (familiarity with airport, past experience)

• Arrival traffic, separation on final

• Departure queue

• Flow control

• Aircraft equipage (with speed/time displays)

3. Effect of TOA uncertainty on ATC tasks

As aviation operations move from current-day operations to future time-based operations, it

is only reasonable to expect a change to ATC tasks and the way controllers operate. When

asked during the debrief interviews, each controller expressed concern about a need for

increased monitoring. For example, one controller stated that “[in cases of high TOA

uncertainty] controllers would have to monitor the aircraft more closely, and more frequently.

It may require re-routing to ensure conflict-free routing”. Similarly, another controller stated

that “it will change the way controllers monitor”, and also added that this change in

operations would mandate new displays and technologies to support the task. “We would

need more information, such as expected TOA and speed readouts, to know who!s not

going to make their commanded arrival time. A speed readout would be important, so that

we can see if the speed is reasonable and how much it fluctuates”

2.3 A MODEL OF TOA UNCERTAINTY ESTIMATION

Based on the results of this first experiment and its debrief interviews, a preliminary model of

the TOA uncertainty estimation task is proposed. This model will be used to guide

subsequent experiments. It is proposed that the process of estimating TOA uncertainty is a

20

two-step process, as depicted in Figure 2-2 and described next. Controllers, through years

of experience, develop a library of speed profiles that describe vehicles or situations that

they encounter, based on particular airlines, traffic patterns, or weather conditions. Each

speed profile consists of both an average speed and amount of speed variability (for

example, “this aircraft tends to be fast, but very variable – or consistently slow”). This speed

profile may be generated based on previous experience and expectations developed over

time and exposure, real-time observation, or training. At one extreme, if the variability is

zero, the aircraft will travel at a constant speed, there is no uncertainty in TOA, and the

operator should be able to predict TOA perfectly. At the other extreme, if the aircraft speed

is very variable, the uncertainty will be high, and the controller cannot easily predict TOA.

Figure 2-2. A model of the TOA uncertainty estimation process

During operations, controllers determine if each vehicle matches a known speed profile in

their personal library. For example, debrief comments suggested that controllers considered

all aircraft from a particular airline as sharing the same profile. Generally, airlines that flew

into an airport very infrequently were considered to have a speed profile that was slow and

highly variable, whereas those airlines that were based out of that airport were faster and

more consistent.

21

Operators then use knowledge of the speed profile (e.g., fast and variable) combined with

an estimate of the distance to travel, and an associated estimate of TOA and TOA

uncertainty to make decisions and act on those decisions, depending on factors such as

personal risk tolerances and consequences of error. These decision-making processes,

however, are beyond the scope of this research effort.

The model must also anticipate the situation in which the operator is faced with a new

vehicle profile with which he/she is not familiar. In this case, it is expected that the controller

would apply a heuristic to estimate TOA uncertainty for unknown profiles, based on their

similarity to known profiles. This was suggested by the controllers! comments, that

suggested a tendency to lump all aircraft from a single airline together – even though each is

piloted by a different pilot, whom they may or may not have encountered before.

2.4 SUMMARY

Experiment 1 resulted in a simple model of TOA uncertainty based on insights gathered

from skilled professionals who engage in TOA uncertainty tasks. The next step in the

process was to evaluate the model using empirical human-in-the-loop experiments. To

accomplish this, first an appropriate experimental method and platform was developed, as is

discussed next in Chapter 3. Two additional experiments, presented in Chapters 4 and 5,

were conducted to evaluate the above TOA uncertainty model, and a refined model is

presented and discussed in Chapter 6.

22

CHAPTER 3: METHODS AND METRICS TO EVALUATE

TOA UNCERTAINTY ESTIMATION

No literature was identified to date that has attempted to develop experimental methods and

metrics to assess an operator!s comprehension of TOA uncertainty. Indeed, there is a

small body of human factors literature that has investigated uncertainty displays. However,

much of that work was conducted to evaluate the effectiveness of a specific uncertainty

display for a domain-specific need (e.g. Andre & Cutler, 1998; Banbury et al., 1998) and

thus the generalizability of those findings, and methods, is limited. Other researchers, such

as Schaefer, Gizdavu & Nicholls (2004) have evaluated the usability of uncertainty displays

and assessed the impact of the uncertainty display characteristics on operator workload and

situation awareness. Just as with the subjective certainty ratings used in Experiment 1 of

the current research effort (reported in Chapter 2), those studies provided valuable insights,

but they are not sufficient to further our knowledge of the understanding and management of

uncertainty.

To objectively evaluate TOA uncertainty, methods and metrics that compare objective

uncertainty to subjective uncertainty are necessary. Objective uncertainty is a property of

the system under study, in this case derived from the dynamics of the aircraft and other

environmental factors. As such, objective uncertainty can be objectively computed, given

knowledge of the variability in the speeds of the aircraft. On the other hand, subjective

uncertainty refers to estimates by an observer of the system given his/her state of

information at the time. When perfectly calibrated, subjective uncertainty equals objective

uncertainty. On the other hand, if the observer is not well calibrated, he/she will either

overestimate or underestimate TOA uncertainty. Thus, the first challenge of the present

research is to develop an experimental task and metric that will enable evaluation of

uncertainty calibration.

3.1 ABSTRACTING DISTRIBUTIONAL PROPERTIES

The ability to learn the distributional properties of probabilistic information is critical to the

task of estimating TOA uncertainty. In daily experience with probabilistic processes, it is

very rare that we are told explicitly what an underlying distribution is – i.e., whether it is

normal or skewed, or what are the mean and standard deviation that characterize the

23

distribution (Pitz, 1980). In these cases, it is necessary for an observer to infer properties of

the population from information that is contained in the samples of objects that they observe.

Such tasks require a person to engage in !abstraction," or the process of incorporating into

the representational structure information that has not been presented directly (Pitz, 1980).

Pitz used the term !prototype" to refer to the structure that describes the representation of

information about a population that is inferred from sample information. A prototype is a

theoretical concept that has been employed to describe the end product of abstraction.

Although it is still largely an open question what properties of the population might be

abstracted from a sample, it is known that the process of inferring an average value for

uncertain quantities is a very general process, and one that humans tend to do quite well

(Pitz, 1980; Peterson and Beach, 1967). On the other hand, humans have more difficulty

estimating statistical variance, while skewness and bimodality of a distribution were

recognized and used by the subjects in making their predictions. Pitz concluded that

information about central tendency is abstracted as a prototype; however it appeared that

certain critical features of the sample information are stored directly – such as the extreme

values, the smallest and the largest that have been observed.

Peterson and Beach (1967) found that estimates of variance tend to be influenced by the

mean value of the stimuli. They provide the analogy of comparing the smoothness of the

surface of a forest to that of a desktop. The treetops seem to form a fairly smooth surface,

however, the items on the surface of a desktop would seem very bumpy and variable. This

forest top is far more variable than the surface of the desk, but not relative to the sizes of the

objects being considered. They concluded that, instead of estimating variance, subjects tend

to estimate the coefficient of the variation (the standard deviation divided by the mean).

The issue of how a person represents knowledge about variability and skewness is less

clear. Pitz (1980) proposed that it is possible that measures of variability are inferred and

stored as part of the prototype, along with information about the central tendency. Estimates

of variability can be generated by finding the (absolute) difference between each value and

the currently estimated average value, and taking the average of these differences.

However, such a procedure would place a fairly heavy load on short-term memory and is

unlikely to occur as a spontaneous strategy. Alternatively, it is possible that people use

24

information about the central tendency together with their knowledge of the extreme values.

The difference between the two extremes provides information about variability – while the

location of the central tendency relative to the extremes provides information about

skewness.

Related to this is the issue of how human operators factor in extreme values and outliers.

Alpert and Raiffa (1982) found that subjects were unwilling to weight large deviations

heavily. They gave five groups of subjects almanac questions. All subjects were asked to

report 25th, 50th, and 75th percentiles. In addition, Group 1 was asked to report 1st and 99th

percentiles; Group 2 was asked to report 0.1th and 99.9th percentiles; Group 3 was asked to

report “minimum" and “maximum" values; and Group 4 was asked to report “astonishingly

low" and “astonishingly high" values. In every case, the estimated spread of the tails of the

distributions was too small, regardless of the definition of the extremes and, although

feedback did improve the spread, it did not completely eliminate the overconfidence bias.3

O!Connor and Lawrence (1989) utilized a task that involved time series predictions and they

found that calibration of subjects! estimated confidence intervals was influenced by the

degree of forecasting difficulty. For simple series, subjects were underconfident (that is, the

subjects! estimated intervals were too large), but for medium to high difficulty series, the

subjects were overconfident (the estimated intervals were too small).

The research discussed above evaluated operators! ability to abstract mean and variance

information of a population from samples of numbers. However, the ability to abstract the

mean and variance of a series of times of arrival (TOAs) has evidently not been considered

in the literature, in spite of the fact that understanding human capability to abstract TOA

information from observing a sample of multiple objects with variable speed is particularly

important to understanding how human operators comprehend TOA uncertainty. This

becomes a particularly interesting problem when one considers that TOA is a non-intuitive

metric, due to the fact that, because TOA varies inversely with speed for a given distance

3 In statistics, a confidence interval is an interval estimate of a particular population parameter. For example, a

95% confidence interval means that it is estimated that 95% of samples from the population will lie within this

interval. If a person’s subjective 95% confidence interval were to be narrower than the actual interval, that is

they underestimated the confidence interval, then it can be said that the subject is overconfident with respect to

the actual corresponding interval.

25

(T=D/V), TOA, by definition, is not a linear function of speed. Rather it exhibits a function as

depicted in Figure 3-1, which plots TOA as a function of speed for variables that are along

the order of those encountered in the ground controller!s world.

As can be seen in Figure 3-1, because TOA is the reciprocal of speed, slower speeds have

a bigger effect on TOA. That is, a difference of 1 mph at slow speeds (for example, an

increase from 1 to 2 mph) has a large impact on TOA (in this example, assuming a distance

of 1 mile, TOA at 1 mph = 1 hour, while TOA at 2 mph = .5 hour.) However a difference of 1

mph at a higher speed (for example, from 60 to 61 mph) changes the TOA by only .003

hours (TOA at 60 mph = .0167, while TOA at 61 mph = .0164 for a distance of 1 mile).

Related to this, if one considers a range of speeds that spans 5 to 15 mph, one can see in

Figure 3-1 that the resultant range of TOAs is much larger than if the same 10 mph range

were centred around a speed of 85 mph. It follows then that speeds selected from a normal

distribution of slower speeds would yield a more positively skewed4 distribution when

converted to TOAs than would speeds selected from a normal distribution of faster speeds.

Figure 3-1. The relationship between speed (mph) and TOA (hours) assuming distance = 1 mile

4 A positively skewed distribution is one whose tail is elongated to the right; that is, more data are encountered in

the right tail than would be expected in a symmetrical distribution.

!"

!#$"

!#%"

!#&"

!#'"

!#("

!#)"

!#*"

!#+"

!#,"

$"

!" $!" %!" &!" '!" (!" )!" *!" +!" ,!" $!!"

Tim

e o

f A

rriv

al (h

ours

)"

!"#$%&'()*+,-.)Speed (mph)

26

Furthermore, as the standard deviation (SD) associated with the speed distribution

increases, the positive skew of the resulting TOA distribution would be exaggerated. This

becomes clear if one considers that numbers drawn randomly from a distribution with a

larger SD will consist of more extreme speeds than if drawn from a distribution with a

smaller SD. This is demonstrated in Figure 3-2 that shows in the left panel, histograms

derived from three normal distributions of speed with the same mean (20 mph) and different

standard deviations (2, 4, and 8). The right panel shows histograms of the corresponding

times of arrival (hours) assuming a distance of 1 mile. As can be seen, the positive skew of

the TOA distribution increases considerably as the standard deviation increases from 2 to 8.

Also, consider Table 3-1, which shows the Mean, SD and skewness for distributions of TOA

that are defined by various combinations of means and standard deviations of speed.

Clearly the distribution is much more skewed (skew = 20.1) at lower speeds (M = 20) with a

high standard deviation (SD = 8), than for higher speeds (M = 60) with the same standard

deviation, in which case the skew was 1.2.

Table 3-1. Effect of mean and standard deviation of speed on TOA

Speed Distributions

TOA M 20 SD 2

M 20 SD 4

M 20 SD 8

M 30 SD 2

M 30 SD 4

M 30 SD 8

M 60 SD 2

M 60 SD 4

M 60 SD 8

M .05 .05 .07 .03 .03 .04 .02 .02 .02

SD .005 .01 .23 .002 .005 .013 .0006 .0011 .0025

SKEW .9 2.5 20.1 .5 1.2 3.9 .4 .6 1.2

Notes: For each speed distribution (defined by a Mean and SD of Speed), 1000 samples were drawn using the Regress+ software package, which possesses a normal distribution random number generator. Each speed was converted to a TOA assuming a distance of 1 (i.e., TOA = 1 / speed) yielding 1000 TOAs for each combination of Mean and SD of speed. Descriptive statistics (mean, SD, and skew) were calculated for the 1000 TOAs of each speed distribution condition.

27

Figure 3-2. Left panel shows three histograms of speeds drawn from a normal distribution with mean speed 20 and (top to bottom) SD=2, 4, or 8. Right panel shows corresponding TOA distributions with increasingly positive skew as standard deviation increases.

!"#$%&%'(%

)*%&%'%

)+,-%&%(%

!"#$%&%.(/0%

)*%&%.((/%

)+,-%&%.1%

!"#$%&%'(%

)*%&%2%

)+,-%&%(%

!"#$%&%.(/'%

)*%&%.(0'%

)+,-%&%'./%

!"#$%&%'(%

)*%&%3%

)+,-%&%(%

!"#$%&%.(40%

)*%&%.(23%

)+,-%&%'(.0%

Speed Time of Arrival

%%%%%.('%%%%%%%%%%%%%%%%%%%.(2%%%%%%%%%%%%%%%%%%.(4%%%%%%%%%%%%%%%%%%.(3%%%%%%%%%%%%%%%%%%.0%%%%%%%%%%%%%%%%%%%.0'%

.(2%%%%%%%%%%%%%%%%%%%%%%%.(/%%%%%%%%%%%%%%%%%%%%%%%%.(4%%%%%%%%%%%%%%%%%%%%%%%%.(5%

%%%%%.((% .'%%%%%%%%%%%%%%%%%%%%%%%%%%%.2%%%%%%%%%%%%%%%%%%%%%%%%%%.4% %%%%%%%%%%%%%%%%%%%%%%%.3%%%%%%%%%%%%%%%

%%%%%%%%Velocity (mph) Time of Arrival (hours) %

Speed Time of Arrival

Speed (mph) Time of Arrival (hours)

Fre

qu

en

cy

Fre

qu

en

cy

Fre

qu

en

cy

Fre

qu

en

cy

Fre

qu

en

cy

Fre

qu

en

cy

28

3.2 EXPERIMENTAL METHODS AND METRICS

To evaluate the model of TOA uncertainty, experimental task and metrics are required that

capture the essence of the real-world task faced by air traffic controllers in time-based

environments. Based on the findings of the experiment reported in Chapter 2, the task

should possess the following characteristics.

1) Provide an opportunity for subjects to build speed profile models based on

experience. The air traffic controller interviews clearly revealed that, rather than

real-time calculations, controllers estimate TOA and TOA uncertainty based on

expectations, gained through on-the-job experience. To replicate this in a controlled

experiment requires dynamic simulations (as opposed to the static route plans used

in experiment 1 in Chapter 2) that allow subjects to build the experience needed to

generate estimation heuristics. Providing speed read-outs will also facilitate the

subjects! understanding of the vehicle!s speed profile and remove issues

associated with speed perception.5

2) Avoid confounds associated with subjects! previous experiences. Since each

controller has already built expectations based on their unique individual

experiences, using actual controllers and real-world ATC tasks will likely introduce

confounds with previous experience that cannot be experimentally controlled.

Thus, there is a need for experimental controls that dictate the subjects! experience,

rather than relying on the individual experiences that each subject may bring to the

experiment from their own real-world experience, as these are widely variable

across individuals. This will require that speed variability be modelled in a generic

way not tied to expectations of airport operations, airlines, traffic etc.

3) Consider the interaction among speed, distance, and time. To estimate TOA

and TOA uncertainty, air traffic controllers consider both speed and distance.

5 The implications of this premise are discussed in Chapter 7. While issues related to velocity perception are an

interesting aspect of the TOA estimation problem, relying on velocity perception rather than velocity read-outs

would have the potential of confounding the main experimental findings related to how subjects estimate TOA

uncertainty. Furthermore, it can be safely assumed that accurate ground velocity readouts will be available to

both pilots and ATC in future aviation operations.

29

Furthermore, there are two factors associated with vehicle speed that must be

considered: average speed and speed variability. Inappropriate estimates of TOA

or TOA uncertainty can result from misestimating either speed or distance. This

must be addressed, either by controlling (holding constant) one or more factors or

experimentally manipulating each factor.

4) Approximate ATC workload. While it is not believed to be necessary to replicate

the entire ATC experience by including all task elements in high fidelity, it was

determined that approximating the visual monitoring workload was important. The

interviews with the ATC subjects suggested that ground controllers at major airports

are regularly required to monitor approximately 20 aircraft at a time.

5) Include known and unknown profiles. Controllers must work with both pilots and

airlines that fly into their airports regularly as their base operations and those that

fly in very infrequently. This means that matching the observed speed profile to

those in their library of profiles will be more difficult for those vehicles with

infrequent exposure. The experimental method should address both cases.

3.2.1 Experimental Task

An experiment was designed to simulate the critical elements of the task of an air traffic

ground controller (or other operator), who is responsible for monitoring and predicting the

ability of an aircraft (or object) to arrive at a defined location at a specific time, or within an

allowable time window around a specific time. The task involved several objects, each

travelling along a separate path (with different lengths). All objects started at the same time

and shared a common commanded TOA. Each was assigned a mean speed (which was

computed based on the speed required for on-time arrival) and a standard deviation. Each

object changed speed at pre-determined intervals, with the new speed selected from a

normal distribution defined by the assigned mean and standard deviation. Based on this

distribution, the TOA uncertainty could be computed objectively. Given a symmetrical

distribution of speeds about the mean speed, the expected TOA of each aircraft will be

correct and will not change (that is, if each is given a mean speed at the outset, based on a

proper estimated TOA). However, the probability of arriving within the permitted time

window will change, depending on the variability of the speed.

30

3.2.2 Metrics to Evaluate Uncertainty Calibration

A metric is required that can be used to determine how well human operators can estimate

measures of central tendency and uncertainty for both speed and TOA of aircraft that move

with variable speed. The approach adopted within the current research was to use a

normative model, in order to identify factors relevant to the inference process for speed and

TOA. In particular, the approach was to examine the relationship between inferences made

by human observers and corresponding inferences that would be made by an optimal

observer who had knowledge of the underlying distribution. This approach is analogous to

the use of the ideal observer used in signal detection theory and !economic man" in

economics (Peterson and Beach, 1967). In this sense, Peterson and Beach argue,

probability theory and statistics fulfil a role similar to that of optics and acoustics in the study

of vision and hearing. Statistics can be used as a theory of the uncertain environment and

provide a basis for a descriptive theory of imperfect human inference (Peterson and Beach,

1967).

Peterson and Beach refer to two measures: accuracy and optimality. Accuracy is assessed

when subjects" estimates are compared to descriptive statistics of an observed sample. For

example, if subjects observe a sample of 20 aircraft, each with a speed drawn from a

population with a known distribution, accuracy would be assessed by comparing the

subjects" estimate of the mean speed of the 20 aircraft to the objective mean of the 20

aircraft speed samples. Optimality, on the other hand, is assessed when subjects"

estimates are compared to parameters associated with the underlying population from which

the sample is drawn (rather than with the computed sample estimates). That is, in

assessments of optimality, the subjects" estimates are compared to the actual mean of the

population distribution. The current experiments set out to assess the optimality of subjects"

estimates. Recall that the simple TOA model presented in chapter 2, based on input from

ATC subjects, posits that operators build experience over many exposures and develop a

library of TOA uncertainty expectations that they then apply, rather than assessing the

variability of each object individually. Thus, it is argued that assessing optimality is the more

appropriate variable to evaluate for the current research effort.

31

The optimality of subjects! estimates can be explored using any of a number of population

parameters, including measures of central tendency, variance, skewness, and confidence

intervals. Given the fairly consistent finding from existing literature that subjects perform

quite well at the task of inferring means, this was chosen as one measure of optimality for

this experiment, in order to determine if these past findings extend to dynamic stimuli and for

the variables of speed and TOA. Subjects! estimates of speed and expected TOA were

compared to the mean speed and TOA of the population distribution.

More relevant to understanding TOA uncertainty, however, is to assess the optimality of

subjects! estimates of TOA variability. No literature was identified that has attempted to

develop experimental methods or metrics to assess an operator!s understanding or

inference of TOA uncertainty. Nevertheless, we can draw upon the literature on interval

estimation for insights into this problem.

The fractile method (Block and Harper, 1991) is a widely used method (see for example:

Juslin, Wennerholm, Olsson, 1999; Soll & Klayman, 2004; McKenzie, Liersch, & Yaniv,

2008; Teigen, Halberg, & Fostervold, 2007) that requires participants to provide the smallest

intervals that include an existing true value with a pre-stated probability. The participant is

considered calibrated if the proportion of values that fall within the interval equals the given

probability. An example of a question that would be provided to a subject using the fractile

method is: “Give the boundaries of the interval for which you are 90% confident that it

encloses the number of chickens consumed by American households daily” (Juslin,

Winman, & Olsson, 2003). Although there are exceptions, the literature suggests that

subjects tend to be over-confident in their ratings, in that the proportion of true values

included by the subjects! estimated intervals is typically lower than the pre-stated probability

– that is, the intervals tend to be too small, such that less than the pre-stated probability of

the true values falls within the subjects! intervals.

However, subsequent studies have shown two factors that reduce this overconfidence: 1)

the format of the question and 2) the type of stimuli. Each is discussed next.

32

Format of Question.

Soll and Klayman (2004) conducted a series of three studies with the goal of determining

how to best elicit numerical estimates of intervals from subjects. They compared subjects!

estimates across three methods of interval elicitation:

1) Range interval; (e.g. “I am 80% sure Charles Dickens was born between 1750 and

1860”),

2) Two-point method, involving lower and upper boundary; (e.g. “I am 90% sure that

Dickens was born after 1750” and “I am 90% sure that Charles Dickens was born

before 1860”), and

3) Three-point method, involving a measure of central tendency (mean or median),

lower and upper boundary; (e.g. “I think that it is equally likely that Charles Dickens

was born before or after the year 1805” and “I am 90% sure that Dickens was born

after 1750” and “I am 90% sure that Charles Dickens was born before 1860”). 6

They found that the three-point method – asking for an estimate of central tendency in

addition to both a lower and upper boundary – yielded the lowest overconfidence and the

highest degree of calibration. Similarly, Block and Harper (1991) found that, when

participants were required to produce an explicit point-estimate first, before the lower and

upper boundary, overconfidence decreased. This was also the case when an external point-

estimate was provided by the experimenter. Paese and Sniezek (1991) concurred with this

finding by suggesting that, when the point-estimate is required first, the subject is required to

spend more time processing the task and the stimuli before assessing the probability

interval, and thus produces better-calibrated intervals.

Type of Stimuli.

The fractile method, when used with general knowledge content questions such as the

above example of estimating the interval that contains the daily consumption of chicken with

90% certainty, has been shown to generally produce over-confident ratings (Lichtenstein,

Fischhoff, & Phillips, 1982; Juslin, Wennerholm, & Olsson, 1999). Typically in those studies,

questions are asked of undergraduate students and are selected to be values that the

students are unlikely to know much about (McKenzie, Liersch, & Yaniv, 2008). Thus over-

6 Charles Dickens was born on Feb. 7, 1812.

33

confidence has been attributed to the use of such !general knowledge" questions for which

the subjects" judgment likely involves elaboration of general knowledge through inference

(Juslin, Winman, & Olsson, 2003). In contrast, tasks that require an assessment of sensory

input (e.g., which of two vertical lines is longer) do not show the same over-confidence bias

(Juslin, Winman, & Olsson, 2003). Erev, Wallsten, and Budescu (1994) also report that the

typical over-confidence bias is present when the uncertainty is internal to the subject (i.e.,

related to their knowledge), but not when the uncertainty is external (i.e., a stochastic

property of the environment). Furthermore, the overconfidence bias has been shown to

decrease or disappear when the objects of judgement are randomly sampled from a natural

environment (Gigerenzer, Hoffrage, & Kleinbolting, 1991; Juslin, 1994). Thus, it is believed

that this method is an appropriate method for use when subjects are asked to create

estimates based on an assessment of sensory input.

In the studies that follow, the three-point fractile method was used to elicit estimates of the

TOA interval. Subjects were asked to first provide an estimate of the TOA, and then provide

both a lower boundary and an upper boundary that would produce the smallest window that

they were 95% sure would contain the actual TOA. This measure was chosen because it

reflects both the standard deviation of the distribution and the skewness of the distribution.

The size of the subjects" TOA window was then compared to the objectively computed

intervals to assess the degree to which the human operators were calibrated to the objective

level of TOA uncertainty in the environment. Objective uncertainty was operationalized in the

studies that follow as the 95% confidence intervals around the mean speed or TOA. These

were compared to the subjective TOA window estimates to determine the degree of

calibration. As the variability (standard deviation) of the stimuli to be observed was

increased, so too should the uncertainty associated with it, as reflected by an expected

increase in the size of the estimated window.

The metric chosen to assess TOA uncertainty required that the subject estimate the lower

bound and the upper bound of the smallest time window within which they are 95%

confident the aircraft will arrive at its destination with a specified probability – for example, “I

am 95% confident that the aircraft will arrive between 1:55 and 2:05”. The goal was to

assign the smallest time window possible, while ensuring that the aircraft does not arrive

34

outside the window. An operator who is “optimally calibrated” will estimate time windows of

size equal to the objectively computed TOA windows, computed as the 95% CI of the normal

distribution. If the subject is not perfectly calibrated, he/she will either overestimate or

underestimate the duration of the subject!s TOA window. If the time window is overestimated

(that is, larger than would ordinarily be needed to be 95% certain), that suggests that the

operator is assuming a variance in TOA arrival times that is larger than the actual objective

variance, which therefore suggests that he/she is in fact underestimating the probability that

the aircraft will arrive within the objectively correct (95%) time window.

35

CHAPTER 4: EXPERIMENT 2 – THE EFFECT OF SPEED AND VARIABILITY ON

TOA UNCERTAINTY ESTIMATION

The goal of the present experiment was to determine how well human operators estimate

means and uncertainty for both speed and TOA of vehicles that move with variable speed.

Specifically, the goal was to determine how well subjects estimate the mean and 95%

Confidence Intervals of a population distribution based on observing a sample of 20 vehicles

drawn randomly from the population distribution. This research makes the assumption that

speeds are normally distributed.

4.1 SPECIFIC RESEARCH QUESTIONS AND HYPOTHESES

The specific research questions of interest, and their associated hypotheses, are:

1) How well do subjects estimate means of both speed and TOA (computed as the

difference between the objectively computed mean of the population distributions

and the subjects! estimate of the mean speed and TOA)?

Hypothesis 1: It was expected, based on the findings of Pitz (1980), that

subjects will estimate means with little estimation error; that is, no systematic

biases are expected.

2) How well do subjects estimate uncertainty associated with speed and TOA –

computed as the difference between the objectively computed 95% Confidence

Interval (CI) of the population mean and the subjects! estimate of the smallest

window that they are 95% sure will contain the average speed or actual TOA?

Hypothesis 2: Across all conditions, subjects! estimates of both speed and

TOA intervals will be smaller than objective 95% CIs. This is based on the

previously discussed research (e.g., Alpert & Raiffa, 1982; Juslin,

Wennerholm, & Olsson, 1999; Soll & Klayman, 2004; McKenzie, Liersch, &

Yaniv, 2008; Teigen, Halberg, & Fostervold, 2007) that suggests that human

decision-makers tend to be over-confident, and thus estimate less uncertainty

than actually exists.

3) Do the subjects! estimates differ as a function of the mean speed of the observed

stimuli?

36

Hypothesis 3: It is expected (based on Peterson and Beach, 1967) that the

difference between actual 95% CIs and subjects! estimated speed and TOA

windows will be larger for objects with slower speeds than for those with

faster speeds, due to the non-linear relationship between speed and time-of-

arrival (as shown in figure 3-1).

4) Do the subjects! estimates differ as a function of the standard deviation of the speed

of the observed stimuli?

Hypothesis 4: The difference between actual 95% CIs and subjects!

estimated speed and TOA windows will be larger for speeds with high

variability than for those with low variability. This is based on Alpert and

Raiffa!s (1982) findings that subjects inadequately account for "extreme!

values, of which there will be more in higher variability conditions.

5) Are subjects able to infer skewness in the TOA distribution and apply this

appropriately in their estimation of the TOA intervals?

Hypothesis 5: It was expected, based on Pitz (1980), that subjects will

assume a linear transformation from speed to TOA and therefore will

estimate TOA intervals that are symmetrical as opposed to the objectively

generated positively skewed CIs.

4.2 EXPERIMENTAL DESIGN

This experiment employed a fully-within-subjects experimental design with three

independent variables, summarised in Table 4-1: speed, variability, and repetition. The three

levels of speed7 (slow, moderate, and fast) map to 35 units/time, 45 units/time, and 75

units/time respectively. These speed levels were chosen based on three constraints:

1) Given the relationship between speed and TOA shown in Figure 3-1, an attempt was

made to span a wide range of speeds, in order to capture subjects! ability to estimate

TOA and TOA uncertainty at different points along the speed/TOA curve;

2) The speeds were chosen to yield three expected TOAs that were approximately

equally spaced;

7 While it may be more correct to refer to low and high speeds rather than slow and fast speeds, the latter term

were nevertheless used in the remainder of the thesis to differentiate the levels of speed (slow, moderate, and

fast) from the levels of variability (low, medium, and high).

37

3) Given concerns over the salience of extreme numbers, an attempt was made to

choose !like numbers"; specifically a decision was made to choose three speeds that

ended in !5". It was believed that the psychological difference between (for example) 35

and 37 is less than between 39 and 41.

The three levels of variability (low, medium, and high) map onto standard deviations of the

speed distributions. Specifically, the standard deviations chosen were 3, 6, and 9

respectively. These were chosen to ensure a wide range of speeds without producing

negative values, given the mean speed values chosen.

Each subject completed six repetitions of each speed x variability combination. This was

deemed a sufficient amount based on pilot testing, which revealed very little difference

among estimations across repetitions. Subjects completed six blocks of trials. Each block

consisted of one trial, representing each of the nine unique experimental conditions –

comprising each factorial combination of three levels of speed and three levels of variability.

The presentation order of the nine conditions within each block was randomized.

Table 4-1. Independent variables

Independent Variables Levels

Speed Slow, Moderate, Fast

Variability Low, Medium, High

Repetition 1, 2, 3, 4, 5, 6

4.3 METHOD

4.3.1 Subjects

Fifteen subjects were recruited from colleges and the community local to NASA Ames

Research Center, where the experiment was conducted. Subjects" ages ranged from 18 to

53, with the mean age being 35. The range of education levels possessed by the

participants included high school (3 participants), two-year college diploma (3 participants),

four-year university (7 participants), and graduate school (2 participants). Eight of the

subjects had taken at least one course in statistics. Prior to participation, all subjects

successfully completed a vision test to ensure that they could read text in the same font

style, size, and colour used for the experimental stimuli under lighting conditions that were

38

identical to those of the actual experimental trials. The experiment lasted approximately 2.5

hours and subjects were paid $10 an hour for their participation.

4.3.2 Materials

4.3.2.1 Apparatus

The experiment was conducted in a noise-proof, human-subject testing booth. Subjects

viewed a large screen monitor (32 in. x 18 in.) that subtended a visual angle of 60º from the

average viewing distance of 32 in. Subjects input their responses using a standard

keyboard and mouse. They were also provided a pen and paper and were told they could

use them to take notes during the experiment, if they wished. The monitor brightness and

resolution settings were held constant across all subjects.

4.3.2.2 Stimuli

Experimental stimuli, shown in Figure 4-1, consisted of 20 vehicles (shown as round circles

with a diameter of .5 in) travelling simultaneously from left to right along identical routes, with

one vehicle on each route. All vehicles started at the same time, at trial start. Each route

was 18 in. long and was divided into 10 equal length segments (but the segments were not

marked or visible to the participant). Each vehicle!s speed was constant during each

segment but changed from segment to segment according to pre-determined numbers that

were drawn from a normal distribution with a designated mean and standard deviation

according to the experimental condition. Data tags (1.5 in. x .25 in. in size) displayed the

current ground speed (GS) on a digital readout that updated at the beginning of every

segment. A digital clock, centred vertically, immediately to the right of the 10th vehicle,

displayed elapsed time, with a precision of one decimal point.

A numerical simulation was first conducted to create a population or parent distribution,

which was randomly sampled to select the speeds for any given vehicle. This allowed for

later determination of the 95% intervals, by selecting the 2.5 percentile and 97.5 percentile

from the parent distribution. The stimuli were created following the steps outlined below:

1. Using a random number generator (Regress+ software package) 250,000 speeds

were drawn randomly (with replacement) from a normal distribution with a given

mean and standard deviation.

39

2. The 250,000 speeds were then randomly assigned to 25,000 vehicles providing 10

speeds - one for each segment the vehicle would travel.

3. Assuming a segment distance of .05 miles, the elapsed time for each segment was

computed.

4. The 10 segment elapsed times were summed to provide a total route TOA for each

of the 25,000 vehicles. The 25,000 TOAs were sorted in ascending order, and then

binned into 20 equal groups each representing 5% of the TOAs.

5. For each sample of 20 vehicles, one TOA was randomly selected from each bin.

This ensured that the resulting sample of 20 vehicles was representative of the

distribution. This was necessary because of the relatively small sample size of 20

vehicles.

4.3.3 Procedure

4.3.3.1 Introduction and Training

Participants were welcomed to the test facility at NASA Ames Research Center. First, they

were asked to read the Instruction Protocol (Appendix B1) and then read and sign the

informed consent form. After completing a short demographic questionnaire (Appendix B2),

each participant completed a brief vision test using a standard Snellen eye chart (read from

a distance of 10 feet), and a computer-based vision test which presented ten three-digit

numbers on the experimental apparatus with the same font style, size, and colour used in

the experiment. Participants were asked to read the ten numbers out loud to verify that they

could read them without difficulty.

Next participants were shown the experimental apparatus, as described above, and the

experimenter verbally explained the purpose of the experiment, the experimental stimuli, and

specific details of the task to be performed, including how to input responses using the

keyboard and mouse, as described in the written instructional protocol. The participants

completed three practice trials while the experimenter watched. The three training trials

comprised the following Speed/Variability combinations:

Practice Trial 1: Slow/Low

Practice Trial 2: Moderate/Medium

Practice Trial 3: Fast/High

Participants were permitted to ask questions throughout the three practice trials.

40

Figure 4-1. Experimental stimuli shows 20 vehicle icons with current ground speed data tags. Actual route length was 18 in. Data tags showing ground speed (GS) were 1.5 in. X .25 in. Elapsed time was shown in a box on the right (shown as 21.1 above).

41

4.3.3.2 Experimental Trials Each trial consisted of twenty vehicles traversing routes of the same length. All twenty

vehicles shared the same speed profile – that is, their speeds were drawn from the same

population distribution. All vehicles started at the same time at trial start. After the last of the

20 vehicles completed the route, a question screen appeared, with the following question:

A vehicle!s speed profile is similar in nature to the vehicle you just observed and it is

about to traverse the same length route.

Please estimate this vehicle!s average (mean) speed.

Please provide the smallest window that you are 95% sure will contain the

vehicle!s actual average speed

Participants used a standard keypad to enter the average speed and then used the mouse

to manipulate window sliders to set the size of the window by setting the lower and upper

bounds. Once satisfied with these answers, the subject pressed !continue" and the second

query screen appeared with the following question:

A vehicle!s speed profile is similar in nature to the vehicle you just observed and it is

about to traverse the same length route.

Please estimate this vehicle!s time of arrival (TOA).

Please provide the smallest TOA window that you are 95% sure will contain

the vehicle!s actual TOA

Each participant responded in the same manner as above, and once completed, the

participant pressed !continue" to begin the next trial.

Participants completed six blocks of experimental trials, each consisting of 9 trials – one for

each combination of the three mean speeds and three speed standard deviations.

Participants took a short break after the second and the fourth block. The time to complete

each trial varied from 1 minute (in the shortest condition) to 5 minutes (in the longest

condition). The total duration of the experiment, including training, ranged from 2.5 hours to

3 hours.

4.3.3.3 Debrief and Follow-Up

Upon completion of the last trial, a verbal debrief session was conducted (see Appendix

B3). Participants were asked questions to ascertain the strategies that they developed,

42

whether they used memory aids, and whether they found one task (i.e. estimating speed or

time) easier than the other. The general purpose and objective of the experiment was

explained, the experimenter answered any questions, and the participant was thanked for

his/her time.

4.4 RESULTS

For each trial, subjects estimated the average speed, speed windows, TOA, and TOA

windows of a new !target" vehicle that shared the speed profile of the 20 vehicles that they

had just viewed. The subjects" estimate of the target vehicle"s speed and TOA was

compared to the objectively computed mean speed and TOA of the population distribution

from which the samples of 20 vehicles were drawn. The subjects" estimates of the smallest

window that they were 95% sure contained the vehicles" average speed (estimated speed

window) and TOA (estimated TOA window) were compared to the objectively computed

95% CIs (referred to as speed intervals and TOA intervals) of the population distribution

from which the samples of the 20 vehicle were drawn. The !actual" mean, intervals, and

skew for both speed and TOA are presented below in Table 4-2, and the method by which

the actual scores were calculated is presented in Appendix B4.

The following analyses were conducted:

1) Four separate analyses (3 x 3 x 6 Repeated Measures Analysis of Variance, ANOVA)

were conducted to determine the degree to which the subjects" estimates of speed,

speed windows, TOA, and TOA windows, differed as a function of the independent

variables: speed (3), variability (3), and repetition (6). These analyses were conducted

to assess whether subjects could differentiate among the chosen levels of speed and

variability. The inclusion of repetition in these analyses allowed for the determination of

the effect of learning, fatigue, or scenario differences across trials. Repetition was not

statistically significant in any analysis, suggesting that these effects had minimal

impact, if any, on the data and therefore will not be discussed further.

2) To determine if the subjects" estimates differed from the actual values, the 95% CI

around the mean of the subjects" estimates – of speed, speed windows, TOA, and TOA

windows – were analyzed to determine if the actual score was contained within the CI.

43

If the actual score was not contained within the CI, the subjects! estimates were

deemed to be significantly different than the actual score.

3) To determine if subjects were more likely to produce sub-optimal estimates as a

function of the independent variables, Estimation Error scores were calculated, by

subtracting the subjects! estimated value from the objectively computed, or actual,

value. A separate 3 x 3 x 6 repeated measures ANOVA was conducted on the absolute

values of the Estimation Error scores, to determine if subjects tend to make larger

estimation errors as a function of the independent variables speed, variability, and

repetition. Absolute values were used as these allow an assessment of estimation

errors regardless of direction – i.e. too fast and too slow, or too early and too late – in

light of the fact that, in the ATC operational setting under consideration, errors of both

kinds are detrimental to system performance. Again, there were no significant effects of

repetition and these results will not be discussed further.

4) To determine the degree to which subjects produce symmetrical or skewed estimated

TOA windows, a symmetry score was developed (defined below) and applied to both

the actual and estimated TOA intervals. The 95% CIs around the subjects! TOA

symmetry score was analyzed to determine if the actual symmetry score was contained

within the CI. Instances in which the actual symmetry score was not contained within

the 95% CI suggested that the subjects! estimated intervals were different in symmetry

than the actual intervals.

4.4.1 Estimated Speed

Table 4-3 presents the objectively computed (actual) mean speed and the size of the 95%

confidence intervals for each combination of speed and variability, the subjects! mean

estimated values for speed, and the mean size of their estimated speed windows. It also

includes the mean estimation error scores, calculated as the mean of the differences

between the actual and estimated values, for both mean and window size.

44

Table 4-2 Actual mean values, interval sizes, and skew for speed and TOA

Speed TOA Independent

Variables

Speed/Variability Mean

Lower Bound

Upper Bound

Interval Size

Skew Mean Lower Bound

Upper Bound

Interval Size

Skew

Slow/Low 35 33.2 36.9 3.7 .01 51.8 49.1 54.8 5.7 .14

Slow/Medium 35 31.3 38.7 7.4 .00 53.1 47.4 60.1 12.7 .42

Slow/High 35 29.4 40.6 11.2 .00 56.2 46.3 71.5 25.2 34.32

Moderate/Low 45 43.1 46.9 3.7 .00 40.2 38.6 41.9 3.4 .12

Moderate/Medium 45 41.3 48.7 7.4 .01 40.8 37.5 44.6 7.1 .28

Moderate/High 45 39.4 50.5 11.1 .00 41.9 36.6 48.8 12.3 .73

Fast /Low 75 73.1 76.9 3.8 .01 24.1 23.5 24.7 1.2 .08

Fast /Medium 75 71.3 78.7 7.4 .00 24.2 23 25.4 2.4 .17

Fast /High 75 69.4 80.5 11.1 .00 24.4 22.6 26.4 3.8 .47

First, an analysis was conducted to determine if the subjects! estimated mean speed differed

as a function of the experimental conditions. This was to verify that the task conditions

(speed and variability) had been perceived as being significantly different. Subjects!

estimated mean speed, shown as a dashed line in Figure 4-2, did indeed differ as a function

of speed (F (2,28) = 7306.75, p < .001). Post-hoc paired t-tests revealed significant

differences among all three levels of Speed (p < .001 for all pairwise tests), providing

evidence that subjects were able to differentiate among the three levels of speed. There

were no differences in the subjects! estimates of mean speed as a function of variability or

repetition.

Next, the subjects! estimated mean speed was compared to the actual mean speed by

examining the 95% CI around subjects! estimated mean speed to determine if the 95% CI

contained the actual mean speed. Figure 4-2 presents both the actual mean speed and the

45

subjects! estimated mean speed, the latter plotted with error bars which reflect the 95% CI

around the mean. In conditions in which the 95% CI did not contain the actual speed, it was

concluded that the subjects! estimates were significantly different than the actual speed.

This was the case for three conditions: moderate speed/medium variability, high

speed/medium variability, and high speed/high variability. In each of these three cases, the

actual speed was greater than the estimated values, revealing that subjects underestimated

speed in these three conditions.

Table 4-3. Actual and estimated speed means, intervals, and estimation errors

Actual Speed Subjects! Estimated

Speed

Estimation Error (Actual –

Estimated) Independent Variables (Speed/Variability)

Mean 95% CI Mean Window

Size Mean

Window Size

Slow/Low 35 3.7 34.7 6.0 .3 -2.3

Slow/Medium 35 7.4 35.1 9.0 -.1 -1.6

Slow/High 35 11.2 34.2 12.6 .8 -1.4

Moderate/Low 45 3.7 44.4 6.0 .6 -2.3

Moderate/Medium 45 7.4 43.4 8.8 1.6 -1.4

Moderate/High 45 11.1 44.4 11.9 .6 -.8

Fast /Low 75 3.8 74.7 5.1 .3 -1.4

Fast /Medium 75 7.4 73.9 8.4 1.1 -1.0

Fast /High 75 11.1 73.7 11.4 1.3 -.3

Speed estimation error scores were computed by subtracting the subjects! estimate of the

average speed from the objectively calculated average speed of the original distribution

(presented in Table 4-2). An ANOVA was conducted on the absolute values of the speed

estimation error scores in order to determine if the size of error (regardless of direction)

differed as a function of the experimental variables of interest. The amount of speed

estimation error increased as variability increased (F(2,28) = 119.889, p < .001). All pair-

wise comparisons were significant (p < .001), suggesting that subjects made larger errors in

46

estimating speed in the high variability condition (M=3.5) than the medium variability

condition (M=2.4), and both the high and medium conditions were larger than the low

variability condition (M=1.2).

Figure 4-2. Actual and estimated mean speed as a function of speed and variability.

4.4.2 Estimated Speed Windows

An analysis was conducted to determine if the size of the subjects! estimated speed

windows differed as a function of the experimental conditions. Recall that subjects were

asked to provide the smallest window that they were 95% sure would contain the actual

average speed of a target vehicle that shared a speed profile with the 20 vehicles just

observed. This measure was intended to assess how well subjects were able to

differentiate among the three levels of variability and provide speed windows that reflected

this variability. The width of the subjects! estimated speed window differed as a function of

variability (F(2,28)=11.12, p < .001), shown by the dashed lines in Figure 4-3. The size of

the estimated speed window was smallest when variability was low (M=5.7), followed by

medium variability (M=8.7), and then by the high variability condition (M=12.0). Paired t-

47

tests revealed significant differences among all three levels of variability (p < .001). This

provides evidence that subjects! estimated speed windows were sensitive to the variability in

the stimuli and supports the use of 95% CI as a measure of uncertainty. In other words, this

finding is evidence that subjects can indeed create intervals that reflect the amount of

variability inherent in the stimuli. No other main effects or interactions were observed.

To determine if the subjects! estimated speed window size differed significantly from the

actual speed interval, the 95% CIs around the mean subjects! estimated speed window

widths were examined. As can be seen in Figure 4-3, there were only two conditions for

which the 95% CI did not contain the actual speed interval width – the slow speed/low

variability and moderate speed/low variability conditions. In both of these conditions, the

subjects! estimated speed windows were wider than the actual speed intervals. This shows

that, for the most part, the subjects were capable of developing calibrated windows for the

speed estimation task.

Figure 4-3. Estimated speed windows relative to actual speed interval.

4.4.3 Estimated TOA

Table 4-4 presents the actual mean TOA and TOA interval size, the subjects! estimated

mean TOA and TOA window size, and the difference between the actual and estimated

scores. A 3 x 3 x 6 repeated measures ANOVA was conducted to determine if the subjects!

estimated TOA differed significantly as a function of the experimental conditions. Again, this

was conducted to determine whether the participants could distinguish among the

48

experimental conditions (speed and variability), and to evaluate whether performance

differed across the six repetitions due to learning, fatigue, or scenario effects.

A significant speed x variability interaction (F(4,56) = 96.66, p < .001) provided evidence that

subjects! estimated TOAs differed as a function of the experimental independent variables.

As can be seen by the dashed lines in Figure 4-4, in the slow speed condition subjects

estimated longer TOAs in the high variability condition than for the medium condition (p <

.001) and both were higher than the low variability condition (p < .001 for both). In the

moderate speed condition, estimated TOAs in both the high and medium variability condition

were significantly higher than the low variability condition (p < .05), but there was no

difference between the medium and high variability conditions (p < .05). The same pattern

was observed for the fast speed condition – estimated TOAs in the high and medium

variability conditions were significantly higher than in the low variability condition (p < .05 for

both), but the high and medium variability conditions were not significantly different. These

results demonstrate that the subjects! TOA estimates were sensitive to changes in the

speed and variability of the stimuli.

Table 4-4. Actual TOA scores, estimated TOA scores & TOA estimation error scores

Actual TOAs Subjects! Estimated

TOAs

TOA Estimation Error (Actual –

Estimated)

Independent Variables (Speed / Variability) Mean

Interval Size

Mean Window

Size Mean

Window Size

Slow/Low 51.8 3.7 55.1 4.9 -3.3 -1.2

Slow/Medium 53.1 12.7 56.3 7.2 -3.2 5.5

Slow/High 56.2 25.2 59.5 11.2 -3.3 14.0

Moderate/Low 40.2 3.4 42.7 3.7 -2.5 -0.4

Moderate/Medium 40.8 7.1 43.7 5.2 -2.9 1.9

Moderate/High 41.9 12.3 44.4 6.9 -2.5 5.4

Fast /Low 24.1 1.2 24.6 2.7 -0.5 -1.5

Fast /Medium 24.2 2.4 25.1 3.5 -0.9 -1.1

Fast /High 24.4 3.8 25.3 4.1 -0.9 -0.4

49

Figure 4-4. Actual and Estimated TOA as a Function of Speed and Variability Graph shows 95% Confidence Intervals around the Estimated TOA.

Next, the subjects! estimated TOAs were compared to the actual mean TOAs by examining

the 95% CI around the mean of the subjects! estimated TOA to determine if the CI contained

the actual mean TOA (as shown in Figure 4-4). In conditions in which the CI does not

contain the actual TOA, it was concluded that the subjects! estimate was significantly

different than the actual TOA. In stark contrast to speed, the actual TOA was not contained

within any of the 95% CIs. This indicates that subjects! TOA estimates were significantly

different than the actual TOA for every condition. This suggests that subjects may perform

less well at the task of estimating TOA than they do at estimating speed. Subjects

overestimated TOA in all conditions.

TOA estimation error scores, given in Table 4-3, were computed by subtracting the subjects!

estimated TOA from the actual TOA of the population distribution. A repeated measures

ANOVA was conducted on the absolute values of the TOA estimation error scores to

50

determine if the amount of estimation error (regardless of direction) differed as a function of

the independent variables. The ANOVA yielded a speed x variability interaction (F(4, 56) =

3.941, p < .01). As seen in Figure 4-5, at slow speeds, absolute TOA estimation error was

greater in the high variability condition than the medium variability condition (p < .01) and

low variability condition (p < .05), but there was no difference between the low and medium

variability conditions (p > .05). At moderate speed, the same pattern was found – that is,

TOA estimation error was greater when variability was high than medium (p < .01) and low

(p < .05), but there was no difference between the low and medium variability conditions (p >

.05). (However, this last finding approached significance (p = .053).) At fast speeds, there

were no significant differences in the amount of TOA estimation error as a function of

variability.

Figure 4-5. Absolute TOA estimation error as a function of speed and variability.

(Absolute values are plotted, with +/- 1 SE).

4.4.4 Estimated TOA Windows

As shown in Figure 4-6, subjects! estimated TOA windows differed as a function of the

interaction between speed and variability (F(4,56) = 7.241, p < .001). Subjects! estimated

TOA windows were sensitive to the independent variables tested and reflected the relative

amount of variability (uncertainty) in the TOA of the stimuli. Of most interest was whether

51

subjects! TOA windows increased as a function of variability at each level of speed. Simple

effect tests revealed that this indeed was the case. Subjects! estimated TOA windows were

larger for the high variability condition than the medium variability condition, which in turn

were larger than the low variability condition (p < .05), for each level of speed.

Figure 4-6. Estimated TOA windows as a function of speed and variability. Graph shows 95% CIs around the estimated TOA window.

In order to determine whether the widths of the subjects! estimated TOA windows were

significantly different than actual TOA intervals, the 95% CIs for the three levels of variability

in each level of speed, shown in Figure 4-6, were examined to determine if they contained

the actual TOA interval width. In the slow speed condition, the width of the subjects!

estimated TOA window was significantly smaller than the actual TOA interval width in both

the medium and high variability conditions, but not the low variability conditions. Similarly, in

the moderate speed condition, the width of the subjects! estimated TOA window was again

significantly smaller than the actual TOA interval width in both the medium and high

variability conditions, but not the low variability conditions. In the fast speed condition,

however, the width of the subjects! estimated TOA window was significantly larger than the

actual TOA interval, in the low variability condition only.

A TOA window estimation error score (presented in Table 4-3) was calculated by subtracting

the width of the subjects! TOA window from the width of the actual TOA interval – in other

words, such that positive values represent overestimates of TOA window width. A 3 x 3 x 6

repeated measures ANOVA conducted on the absolute value of the TOA window estimation

Slow Moderate Fast

TO

A I

nte

rva

l W

idth

Low Medium High Low Medium High Low Medium High Variability Variability Variability

TO

A I

nte

rva

l W

idth

TO

A I

nte

rva

l W

idth

52

error score revealed a speed x variability interaction (F(4,56) = 98.503, p < .001), as seen in

Figure 4-7. Follow-up paired t-tests revealed that, at slow speeds, TOA window estimation

error was significantly higher when variability was high than medium (p < .001) or low (p <

.001) and error for the medium variability condition was significantly higher than low

variability condition, p < .001. At moderate speeds, the same pattern was observed, with

TOA window estimation error being significantly higher when variability was high than

medium (p < .001) or low (p < .001) and medium variability being significantly higher than

low variability (p < .001). However, for fast speeds, there was no significant difference in

TOA window estimation error as a function of variability.

Figure 4-7. TOA interval estimation error as a function of speed and variability (Absolute values, plotted with +/- 1 SE)

4.4.5 Estimated TOA Window Symmetry

In addition to the size of the subjects! estimated TOA window, the symmetry of the estimated

TOA window was also of interest. As was discussed previously, given a normal distribution

of speeds, the resulting TOA distribution is necessarily positively skewed – and furthermore

the amount of positive skew changes as a function of speed and variability – with greater

53

positive skew for stimuli with low speeds and high variability. To assess the degree to which

the subjects! estimated TOA window appropriately reflected this positive skew, a TOA

symmetry score was calculated for the actual distribution and for each subject!s TOA

window estimate, using the following equation:

TOA Symmetry Score = |UB –TOA| - |LB-TOA|;

where TOA = subjects! estimates of TOA,

LB = TOA Interval Lower Bound,

UB = TOA Interval Upper Bound (UB)

If the TOA symmetry score is equal to zero, this means that the window created by the

subject was perfectly symmetrical; if it is positive, then the window was positively skewed,

with the upper portion larger than the lower portion; and if negative, then the window was

negatively skewed with the lower portion larger than the upper portion.

Symmetry scores were first computed analytically from the underlying distribution for each

combination of speed and variability, as shown by the solid lines in Figure 4-8. The mean

TOA value was used as the measure of central tendency of TOA. As can be seen, the

actual TOA intervals were either symmetrical or positively skewed. The skew was highest in

the slow speed/high variability condition. This is consistent with the inverse relationship

between speed and TOA, as described previously.

Next, TOA symmetry scores were calculated for each subject!s estimated windows around

the subject!s own estimate of TOA. As can be seen by the dashed lines in Figure 4-8, the

subjects! symmetry scores also tended to be positive (overall mean = 0.2), therefore

suggesting that TOA windows provided by subjects were slightly positively skewed. Note

that for fast speeds the amount of skew was low and quite comparable to the actual

symmetry score; however, for slow speeds with high variability, symmetry scores were

positively skewed, but substantially less skewed than the actual symmetry scores.

54

Figure 4-8. Actual and estimated TOA symmetry scores. Estimated scores are plotted with 95% CIs.

The 95% CIs around the subjects! symmetry scores were examined to determine if they

contained the actual symmetry score. If the 95% CI did not contain the actual symmetry

score, the subjects! symmetry score was considered significantly different than the actual

symmetry score. This was the case for four speed/variability combinations: slow/medium,

slow/high, moderate/medium, and moderate/high. As can be seen in Figure 4-8, in these

four conditions for which the 95% CI did not contain the actual symmetry score, the actual

symmetry score was higher than the upper-bound of the subjects! 95% CI, reflecting that

subjects underestimated the amount of positive skew of the TOA interval. The graph shows

that subjects! estimated windows increased in positive skew as variability increased,

particularly in the slow condition, just as for the actual distributions; however, they

underestimated the magnitude of the skew.

Based on subjective comments made during the debrief session, most subjects detected

that the distribution was skewed. For example, many referred to "laggers! or "stragglers! –

the one or two or three vehicles that seemed substantially later than the rest of the sample.

However, most subjects suggested that they did not know how to factor these extreme

values into their interval estimations. Two strategies were prevalent. Some subjects took the

strategy of excluding the outliers altogether and considered them statistical anomalies.

Others chose to factor the outliers in by adjusting the window boundary "a little bit!, but again

expressed uncertainty as to how much to adjust the window. A hybrid strategy was also

55

implemented by other subjects, in that they would disregard a single outlier, but adjust the

interval boundary if there were 2 or 3 extremes.

4.5 DISCUSSION

4.5.1 Summary of Results

One objective of this research was to determine how well subjects estimated speed and

TOA. As expected, subjects! estimates of both speed and TOA differed among the levels of

speed, suggesting that the mean values chosen for the three distributions tested in this

experiment were perceived by the subjects as coming from different distributions, even for

the highly variable conditions. As expected, based on research of estimates of central

tendency, as summarized by Pitz (1980), subjects performed quite well at the task of

estimating mean speed, with only two conditions in which subjects significantly

underestimated speed as compared to the actual, or objectively computed, speed. In

estimating TOA, on the other hand, subjects over-estimated TOA compared to the actual

TOA for all conditions. This is a marked deviation from Pitz!s research, which would have

predicted that, when asked to estimate central tendency for other tasks, estimation error is

minimal.

It could be that TOA is inherently different from the types of stimuli used in other studies.

For example, it is likely that the positively skewed nature of the TOA distribution could have

increased TOA estimation error, as will be described later. Alternatively, it may be the case

that subjects weren!t attempting to estimate the mean of the distribution, but instead were

estimating some other measure of central tendency, such as the mode or median. During

the debrief, some subjects stated that they selected the time when the "middle group! of

vehicles finished the route – which supports the possibility that subjects were estimating the

median.

A second, and more important, goal was to determine how well subjects estimated the

amount of uncertainty associated with both speed and TOA. Estimated window size was

used as a measure of the subjects! estimate of uncertainty, in that larger windows reflected

larger estimates of uncertainty. For both speed and TOA, subjects! estimated window sizes

increased as a function of stimulus variability, providing support for the use of 95% intervals

56

as a measure of uncertainty. It was hypothesized that the size of the subjects! estimated

windows would be smaller than the actual intervals, since other subjects in the literature

tended to be overconfident in their estimates of uncertainty and not factor in extreme values

appropriately. This was not the case for speed windows. In fact, subjects produced well-

calibrated speed intervals and in only two conditions was there an indication of error in the

size of the estimated speed window – and in both cases the subjects overestimated, rather

than underestimated, the size of the window. TOA window estimation error, however,

depended on the speed of the stimuli. In general, for fast stimuli, subjects slightly

overestimated window size; however, for moderate and low speeds, subjects tended to

underestimate window size, as hypothesized. The amount of this underestimation was

greater for higher variability conditions than medium and low variability. The fact that

subjects did produce well-calibrated speed intervals also provides support for the

experimental technique. Any mis-calibration of TOA intervals can thus more likely be

attributed to real bias in TOA uncertainty estimation, as opposed to a general over- or under-

confidence bias in interval estimation due to the experimental technique.

Finally, it was hypothesized that subjects would create TOA windows that were symmetrical,

neglecting to account for the positive skew of the distribution. Although subjects! windows

were positively skewed, they still underestimated the amount of positive skew, particularly in

the slow/high variability condition.

4.5.2 Limitations of the Current Experiment and Suggestions for Future Studies

Being the first in a series of studies to explore how subjects estimate TOA and TOA

uncertainty, the stimuli and the task were quite simple. For example, the routes were

straight lines of equal length and all vehicles started at the same time. This simplicity may

have produced experimental artefacts that the subjects used in their estimates. During the

debrief, some subjects talked about "watching time! or using "visual patterns! of the vehicles

as they finished their routes. This allowed them to "see! the distribution of TOAs as it

unfolded. In essence then, subjects may have been simply describing the visible profile of

TOAs of the 20 vehicles, rather than applying their knowledge of speed variability to

estimate TOA. Future studies with added stimulus complexity, such as staggering the start

times or positions of the vehicles, should add to the robustness of these findings. In addition,

57

a different experimental paradigm in which subjects undergo a substantial training period to

learn to characterize speed profiles, which are later !applied" in experimental trials, may also

be more informative. This experimental paradigm will be used in the following experiment.

One particularly interesting finding in this experiment was how subjects factored in !extreme"

vehicles – that is those that are much faster or much slower than the others. Both the

objective data and the subjective comments made during the debrief session revealed that

subjects were uncertain how to treat extreme values and how to determine their effect on

TOA or TOA uncertainty. Indeed, this is likely the most difficult task associated with

estimating TOA and TOA uncertainty. With samples of 20 vehicles, there was usually only

one extreme vehicle, and many subjects (but not all) noted that they tended to ignore this

vehicle as a statistical anomaly. The strategy of dealing with extremes, however, was less

clear when there were two or three vehicles that were much slower or faster than the others.

Here, subjects expressed some uncertainty as to how to account for these vehicles.

Therefore, subsequent studies that show larger samples of vehicles to the subjects (i.e., 100

or more) may provide a more realistic opportunity to learn how subjects factor small

numbers of extremes into their estimation process. Having said this, however, it is believed

that 20 vehicles may be at or above the limit of the number of vehicles that a human subject

can process at one time. As such, the follow-on experiment will display a set of 100 vehicles

across five screens, rather than asking subjects to process 100 vehicles at the same time.

4.5.3 Implications for the Next Generation Aviation System

The airspace system is advancing to a !4-D" navigation environment, in which aircraft are

required to not only follow the conventional spatial routes, but also conform to a commanded

arrival time. Of course, for overall system efficiency, the notion of 4-D control has to be

applied to airport surface (taxi) operations as well, to ensure that aircraft are able to depart

on time and meet their next 4-D waypoint. (Note that current convention in the aviation

community utilizes the term “4-D navigation” also on the surface, even though altitude is

fixed.) On the surface, the goal is to minimize wait time by sequencing aircraft so they arrive

at each active intersection !just-in-time" to be able to cross it without waiting for other aircraft

to pass. In this environment, arriving early is as detrimental to system traffic flow as is

arriving late. For airport surface operations, this move to 4-D navigation will mean that pilots

will not only have to navigate the taxiways in order to get from gate to runway (and vice

58

versa), but also manipulate the aircraft speed (without the advantage of auto-pilot) so as not

to arrive too early or too late. Also, controllers must be able to estimate TOA and assign an

appropriate degree of uncertainty to the TOA estimate, based on the speed profile of the

aircraft and adjust traffic sequencing, or re-route traffic accordingly.

The results of this experiment, if they hold true after further, more robust, investigations,

suggest that, if an air traffic controller knows that a particular aircraft adheres to a certain

speed profile, such as slow and variable, or fast and consistent, they will tend to

overestimate the aircraft!s TOA – that is, a controller will estimate that the aircraft will arrive

later than it actually will – and more so for slower aircraft than faster aircraft. Furthermore,

for slow and moderate speed conditions, controllers will have a tendency to underestimate

uncertainty associated with the estimated TOA, which could result in sequencing aircraft too

close together, or not leaving a wide enough time window (or space envelope) around each

aircraft. Finally, the results of the symmetry analysis suggest that, particularly at slow and

moderate speeds with high variability, controllers will underestimate the likelihood that an

aircraft will arrive late to its destination.

59

CHAPTER 5: EXPERIMENT 3 – TOA UNCERTAINTY ESTIMATION AND

EXTRAPOLATION

5.1 INTRODUCTION

This experiment explored participants! ability to develop speed profile expectations by

learning a speed profile through training with feedback, and then to apply this speed profile

expectation to estimate TOA uncertainty. In addition, the ability to identify and apply the

appropriate speed profile based on observation of an individual vehicle was examined.

Thirdly, the ability to extrapolate TOA uncertainty estimates from known to unknown speed

profiles was examined.

5.1.1 Addressing Limitations of Experiment 2

This experiment addressed two limitations that were identified in experiment 2 (Chapter 4):

• Subjects appeared to describe the characteristics of TOA data displayed for each of

the 20 vehicles just observed, rather than apply their global knowledge of each

speed profile to estimate TOA.

• With only 20 vehicles per sample in experiment 2, subjects may not have had

enough data to properly understand and characterize the TOA distribution.

Separating the process of learning speed profiles in the training session from applying them

to estimate TOA uncertainty in the experimental trials was expected to minimize the

limitations of experiment 2, in which subjects were simply describing the TOA distribution of

20 vehicles at a time. In the present experiment, subjects were exposed to a training

program in which they learned speed profiles by exposure to a larger number of vehicles

(100 vehicles per speed profile) than in experiment 2. After the training session, subjects

were asked to apply their knowledge of the speed profiles by estimating TOA uncertainty for

subsequent vehicles – which either followed a trained speed profile or a new speed profile.

A third change in the experimental protocol was made in an attempt to improve the subjects!

understanding of the task by clarifying the concept of the intervals. In the previous

experiment, subjects were asked to provide “the smallest TOA window that they were 95%

sure would contain the vehicle!s actual TOA.” In this follow-on experiment, these

instructions were augmented to better define “95% sure”. Specifically, subjects were

instructed to select a window that would capture 95 of 100 vehicles. Because subjects saw

60

100 vehicles of each speed profile it was expected that this would make the task more

concrete and meaningful.

5.2 RESEARCH QUESTIONS

This experiment set out to address 5 research questions as listed below, and shown

schematically in Figure 5-1, in relation to the model of TOA Uncertainty developed in

Chapter 2. These research questions were addressed across three phases of a single

within-subjects experiment. The experiment phase is listed in parentheses after each

question.

1. Can subjects develop an internal model of a speed profile based on the average speed and speed variability? (Phase 3a)

2. Can subjects apply their internal model of a speed profile to estimate TOA uncertainty – and does this differ as a function of average speed or speed variability? (Phase 3b)

3. Can subjects extrapolate from a trained distance to estimate TOA uncertainty for a new, untrained distance? (Phase 3b)

4. Based on a period of observation of a single vehicle, can subjects identify the vehicle!s speed profile from their repertoire of known speed profiles and apply knowledge of this profile to accurately estimate TOA uncertainty? (Phase 3c)

5. Can subjects extrapolate from trained speed profiles to accurately estimate TOA uncertainty for route distances and speed profiles on which they were not trained. (Phase 3c)

Figure 5-1. Research questions mapped to the preliminary model of TOA uncertainty (Numbers in parentheses refer to phases of the experiment)

61

5.3 METHOD

5.3.1 Subjects

Sixteen subjects were recruited from colleges and the community local to NASA Ames

Research Center, where the experiment was conducted. Subjects were pre-screened for

colour blindness and instructed to wear glasses/contact lenses normally used for computer

work. Subjects! ages ranged from 21 to 54, with a mean age of 35. The range of education

levels possessed by the participants included high school (3 participants), two-year college

diploma (2 participants), four-year university (7 participants), and graduate school (4

participants). Seven of the subjects had taken at least one university-level course in

statistics. Prior to participation, all subjects successfully completed a vision test to ensure

that they could read text in the same font style, size, and colour used for the experimental

stimuli under lighting conditions that were identical to that of the actual experimental trials.

The experiment lasted approximately 3.5 hours and subjects were paid $10 per hour for

their participation.

5.3.2 Experimental Design

This experiment consisted of three phases, as described below. All subjects completed

each of the three phases in a single session, and in the same order. Prior to embarking on

the first phase, subjects first read a set of instructions (see Appendix C1) that outlined the

purpose and tasks required for each phase, signed an informed consent form, and

completed a short demographic questionnaire (see Appendix C2). Upon completion of the

third phase, subjects participated in a post-study debriefing session consisting of a semi-

structured interview (see Appendix C3).

5.3.2.1 Phase 3a: Developing a Library of Speed Profiles

The goal of this first phase of the experiment was to train subjects to accurately characterize

four different speed profiles through exposure to a large number of vehicles and by providing

feedback.

Task Description. As shown in Figure 5-2, subjects observed on a single screen 20 vehicle

icons that shared a common speed profile (i.e., all speeds were randomly drawn from the

same normal distribution), travelling simultaneously along the same length route (medium

62

length: 500 pixels). Throughout this phase, all vehicles that shared the same speed profile

also shared the same colour-coded icon. Each route was divided into 10 segments with

visible vertical hatch marks, and the speed changed for each segment. This was similar to

the stimuli used in experiment 2, except the segment hatch marks were not shown in

experiment 2 and the elapsed time readout shown in experiment 2 was not shown in this

current experiment.

Figure 5-2. Phase 3a Stimuli. Each red dot represents a vehicle. Each three-digit label (e.g., 36.4) represents current ground speed. The twenty routes were identified sequentially, with 1 at the top and 20 at the bottom.

63

After the last vehicle had completed the route, subjects were asked to report the number of

the vehicle that arrived first and last. (This task was aimed at ensuring that subjects actually

attended to the distribution of arrival times.) Next, subjects were asked to characterize the

speed profile by estimating the average speed of the profile, and providing a lower bound

and upper bound for the smallest range of speeds that would capture 95% of the individual

average speeds, in the same manner as experiment 2. Subjects were given feedback after

every trial that indicated whether their answers were “Correct”, “Too High” or “Too Low”

relative to the mean speed and 95% CIs of the population distribution. The feedback was

provided on the subject!s estimate of average speed as well as both the lower bound and

the upper bound individually. The feedback algorithm had a buffer of 1 unit, such that, if a

subject!s answer was within plus or minus 1 of the actual answer, the algorithm reported

“Correct”. In total, subjects experienced five consecutive trials of each colour-coded speed

profile – for a total of 100 vehicles per speed profile. Upon completion of the fifth trial for

each speed profile, subjects also received the actual mean, lower bound and upper bound,

as computed objectively from the speed profile distributions. Subjects then answered the

three questions below pertaining to the speed profile, using their full knowledge of all of the

vehicles that shared that profile – that is, all 100 vehicles observed.

Q1. Based on their average speed, are the vehicles within this profile more likely travelling on a six-lane freeway (i.e., fast) or a two-lane rural road (i.e., slow)? Please circle one:

1 2 3 4 5 6 7 Definitely Definitely two-lane rural six-lane freeway

Q2. Based on their speed fluctuations, are the vehicles within this profile more likely

travelling with cruise control ON or OFF? Please circle one: 1 2 3 4 5 6 7 Definitely Definitely cruise control ON cruise control OFF

Q3. Please characterize the time of arrival of vehicles that share this speed profile.

(use words, symbols, numbers, or pictures).

The purpose of the first two questions was to assess how subjects categorized each profile

on the two dimensions: speed (low or high) and variability (low or high) and to determine if

64

their categorization matched the stimuli. With regards to Question 3, subjects were

encouraged to make notes to characterize the TOA distribution using words, symbols,

numbers, or pictures. For example, subjects could note that the TOAs were !spread out" or

!clumped together", estimate a number of seconds from the first to the last vehicle, or use

symbols or pictures to depict the spread graphically. They were instructed that these notes

would be important for completing Phase 2 and 3 of the experiment, in which they would be

required to estimate TOA, and that these were the only notes that they could carry through

to the subsequent experimental phases. These notes also served to emphasize the

importance of utilizing their knowledge of the speed profile of the full set of 100 vehicles, not

just of a single vehicle.

Subjects completed this task for four different speed profiles made up of speed/variability

pairings adopted from Experiment 2: slow/low, slow/high, fast/low, fast/high, summarised in

Table 5-1. Each speed profile was labelled by colour-coded icons, allowing subjects to

differentiate among the four speed profiles – slow/low = red; slow/high = yellow; fast/low =

green; fast/high = blue. The five trials for each speed profile were blocked, and the order in

which the speed profile blocks was presented to each subject was determined using a Latin

Square design.

Table 5-1. Phase 3a Experimental Design

Speed Profile

(Speed/Variability)

Profile

Labelled

Number

of Trials

Number of

Vehicles per

Trial

Route

Distances

(in pixels)

Slow/Low Yes 5 20 Medium (500)

Slow/High Yes 5 20 Medium (500)

Fast/Low Yes 5 20 Medium (500)

Fast/High Yes 5 20 Medium (500)

5.3.2.2. Phase 3b: Estimating TOA Uncertainty for Trained Profiles

The goals of this phase were to:

1) Assess whether subjects could apply their knowledge of the four learned speed

profiles to accurately estimate TOA uncertainty; and

65

2) Assess subjects! ability to extrapolate to distances other than that distance (medium)

for which they were trained.

Task Description. In this block of trials, subjects were shown a dynamic simulation of a

single vehicle with a digital speed readout as it traversed a medium length route (500 pixels,

as used in Phase 3a; see Figure 5-3 top). All vehicles were colour coded to match one of

the four speed profiles learned in Phase 3a. Subjects were instructed that this was a

prototypical vehicle that was representative of all of the vehicles that shared that profile, and

when answering the questions they should consider all vehicles that shared that same

colour-coded speed profile. The stimuli in this phase, shown in Figure 5-3, were the same

as in Phase 3a, except that only one vehicle was shown at a time. The speed readout was

displayed, but the elapsed time was not displayed. The speed profile always matched one of

the four trained profiles, and was identified using the same colour-coding scheme used in

Phase 3a. The vehicle!s speed changed with each segment of the route, and speeds were

drawn from the normal distribution that characterized the speed profile.

Figure 5-3. Phase 3b experimental stimuli. The top panel shows the dynamic simulation of one vehicle traversing a medium-length route adhering to the "red! speed profile (Slow/Low Variability). The icon changes speed at each segment (vertical hatch mark). The bottom panel shows the static probe screen presented after the dynamic simulation (top) was completed, with the same speed profile (red colour-code) and a visual representation of the route!s distance remaining (in this case a short, 250 pixel route). The top dynamic simulation was shown first, followed by the bottom static screen.

Static Probe Screen

Dynamic Simulation

66

After the vehicle had completed 10 speed change segments and reached the end of the

route, the vehicle looped to the beginning of the route (see Figure 5-3, bottom) and its

movement was paused with a new distance remaining that was either short (250 pixels),

medium (500 pixels), or long (1000 pixels). An !expected TOA" was presented to the

subjects that reflected the time (in seconds) that the vehicle was expected to complete the

route. (The time shown was the mean of the TOA distribution for the distance remaining.)

Subjects were asked to produce TOA intervals that reflected the smallest window that they

were 95% sure would contain the actual TOA, in the same manner as experiment 2. It was

emphasized in the written and on-screen instructions that the subjects should aim to

produce intervals that capture 95 of 100 vehicles in the profile.

As shown in Table 5-2, stimuli consisted of four speed profile conditions at each of three

distances (250, 500, and 1000 pixels). Recall that routes in the training block were 500

pixels long – thereby requiring subjects to apply their knowledge of speed to estimate TOA

windows for the same distance (500 pixels) and also to extrapolate to a shorter route (250

pixels) or a longer route (1000 pixels).

Subjects completed two familiarization trials with the experimenter present: the slow/high-

short route and the fast/low–long route. Then subjects completed one block of 12 training

trials (4 speed profiles x 3 distances), followed by four blocks of 12 experimental trials. The

presentation order of the twelve trials within each block was determined randomly.

Table 5-2. Phase 3b experimental design

Speed Profile (Speed/Variability)

Profile Trained

Profile Labelled

Number of Trials Route Distance

(pixels)

Slow/Low Yes Yes 1 training +

4 experimental 250, 500, 1000

Slow/High Yes Yes 1 training +


Fast/Low Yes Yes 1 training +


Fast/High Yes Yes 1 training +


67

Hypotheses. If the findings from Experiment 2 hold, it was expected that subjects would

overestimate TOA uncertainty in the fast conditions, particularly the low variability condition

(that is, they were expected to produce TOA windows that were larger than the actual TOA

intervals). For slow speeds, it was expected that estimated TOA windows would be smaller

than actual TOA intervals, and more so for the high variability condition than low variability

condition. The magnitude of TOA estimation error was expected to be smaller than for

experiment 2 because the increased training protocol should yield improved estimates.

Further, if subjects were able to extrapolate from the trained route distance (500 pixels) to

new, untrained distances, a main effect of route distance would be expected, with smaller

estimated TOA windows for shorter distances, and longer TOA windows for longer

distances. Absence of a significant main effect, assuming sufficient power, would suggest

that subjects could not extrapolate from trained distances to untrained distances.

5.3.2.3. Phase 3c: Estimating TOA Uncertainty for New, Untrained Profiles

The goal of this phase was to determine if subjects could identify a known speed profile

based on a short period of observation and apply knowledge of this profile to estimate TOA

uncertainty. A second goal was to assess how well subjects could extrapolate from the

trained speed profiles to assess TOA uncertainty for speed profiles on which they were not

trained.

Task Description. These experimental trials were identical to the trials in Phase 3b, with

three exceptions:

1) The speed profile was not labelled (through colour coding) in any trials;

2) Some trials matched the four trained speed profiles, but some did not, as shown in Table

5-3;

3) All route distances were medium length (500 pixels, the same as the trained route

distance).

This enabled an evaluation of the subjects! ability to extrapolate from trained profiles to new

untrained profiles, presented in Table 5-3 as shaded rows. Subjects completed three trials

for each speed profile. The nine speed profiles were presented in random order in three

blocks. Subjects were informed that not all vehicles shared one of the four trained speed

profiles.

68

Prior to estimating the TOA windows, subjects were asked to 1) determine if the speed

profile was one of the trained profiles; 2) identify the speed profile (by colour) that it most

closely resembled (or state the colour of the trained profile they believed it matched), and 3)

rate the similarity of the vehicle!s profile to the selected speed profile on a scale from 1 (very

low) to 5 (very high).

Table 5-3. Phase 3c experimental design. (Shaded rows are new, untrained, speed profiles)

Hypotheses. For trained profiles, it was expected that subjects would overestimate TOA

uncertainty in the fast conditions (that is, estimate TOA windows that are larger than the

actual TOA intervals) and underestimate TOA uncertainty (estimate TOA intervals that are

smaller than actual TOA intervals) in the slow conditions, and more so for the high variability

condition than low variability condition. TOA estimation errors that are larger for Phase 3c

trials than 3b trials would indicate that subjects were not able to identify the speed profiles

solely from the observed stimuli.

Speed Profile

(Speed/Variability)

Profile

Trained

Profile

Labelled

Number of

Trials

Route

Distance

(in pixels)

Slow/Low Yes No 3 500

Slow/Medium No No 3 500

Slow/High Yes No 3 500

Moderate/Low No No 3 500

Moderate/Medium No No 3 500

Moderate/High No No 3 500

Fast/Low No No 3 500

Fast/Medium Yes No 3 500

Fast/High Yes No 3 500

69

5.4 RESULTS

5.4.1 Phase 3a: Developing a Library of Speed Profiles

Recall that the goal of phase 3a was to expose subjects to 100 vehicles travelling with a set

of shared speed profiles to enable them to develop a mental model of the profile in terms of

average speed and speed variability. Analyses were conducted to evaluate how well

subjects were able to estimate the mean speed and speed windows for each speed profile.

Table 5-4 shows actual and estimated mean speed and speed windows for the fifth (final)

trial of each speed profile, and a difference score representing actual minus estimated

values.

Table 5-4. Actual speed scores, estimated speed scores, and speed estimation error scores

Actual Subjects! Estimates Estimation error

(Actual – Estimate) Speed

Profile Mean Interval Mean Window Mean Window

Slow/Low 35 3.75 34.7 4.13 0.3 -0.38

Slow/High 35 11.15 34.7 9.76 0.3 1.39

Fast/Low 75 3.75 75.5 4.26 -0.5 -0.51

Fast/High 75 11.15 73.8 9.38 1.2 1.77

*Data shown in table are for last trial of each speed profile

5.4.1.1 Estimated Mean Speed

The first element required to develop a model of the speed profile was to estimate mean

speed. Figure 5-4 shows the subjects! estimated mean speed (dashed lines) on the fifth

(final) trial for each speed profile, as compared to the actual mean speed (solid lines). A 2 x

2 x 5 ANOVA revealed no significant three-way interaction. But, subjects were sensitive to

the difference in speed, estimating the mean speed for fast stimuli (M = 74.7) as faster than

for slow stimuli (M=34.7) (F(1,15)=5630.17, p<.001). The difference between subjects!

estimated mean speed and the actual mean speed was negligible (p>.05) in all conditions.

The subjects! estimates of mean speed did not differ as a function of variability

(F(1,15)=1.124, p>.05). Although not shown in Figure 5-4, mean speed estimates did differ

70

as a function of trial (F(4,60)=3.212, p<.05), with subjects underestimating mean speed on

the first trial, but quickly calibrating toward the actual mean for the remaining trials.

Figure 5-4. Estimated mean speed on 5th and final trial (shown with 95% CI)

5.4.1.2 Estimated Speed Windows

The second element required to develop an internal model of a speed profile was the

subjects! ability to estimate the speed windows (or the window that captures 95% of the

mean speeds). A 2 x 2 x 5 ANOVA was conducted. Neither a three-way interaction nor a

main effect of average speed on speed window estimation was found (p>.05). This

supports the premise that the subjects understood that actual speed windows do not differ

as a function of speed, but only as a function of variability, as can be seen in Table 5-4. A

main effect of variability (F(1,15) = 47.63, p<.001), on the other hand, revealed that subjects

did estimate larger speed windows for the high variability condition (M = 8.85) than for the

low variability condition (M = 4.82), reflecting an understanding of the relationship between

interval size and variability. A significant trial x variability interaction (F (4,60)=12.61,

p<.001) revealed that estimated window sizes increased as a function of trial repetition in the

high variability condition, but decreased in the low variability condition. Similarly, a Trial x

Speed interaction (F (4,60)=2.87, p<.05) revealed that subjects! window size estimates

increased as trials progressed for fast speeds and decreased for slow speeds. These

71

interactions reflect an improvement in performance due to feedback, as will be shown in the

following section.

Figure 5-5 presents the speed window difference scores (actual minus estimated speed

window size) as a function of speed, and trial, where a perfectly calibrated speed window

estimate yields a difference score of zero. A 2 x 2 x 5 ANOVA revealed a significant speed x

trial interaction (F(4,60)=2.874, p<.05). (No significant three way interaction was found.) As

can be seen in Figure 5-5, initially subjects tended to overestimate the size of the speed

windows for the slow speed trials and underestimate the speed windows for the fast speed

trials – however, with feedback after each trial, both converged towards the actual speed

interval size, with a low difference score by the fifth trial. Analyses of just the fifth trial

revealed no significant difference in speed window difference scores as a function of speed

(p>.05).

Figure 5-5. Speed window difference scores as a function of speed (slow, fast) and trial number (shown plotted with ±1 Std Error)

A variability x trial interaction was also observed (F(4,60)=12.607, p<.001), as seen below in

Figure 5-6. Subjects overestimated speed windows for low variability stimuli and

underestimated speed windows for high variability stimuli. A subsequent analysis of the final

trial showed that the speed window difference score was significantly higher for low

variability conditions than high variability conditions (F(1,15)=9.218, p<.05).

72

Figure 5-6. Speed window population difference scores as a function of variability (low, high) and trial (shown plotted with ± 1 std error)

In summary, in a small number of trials with performance feedback, subjects were able to

effectively develop an internal model of each speed profile and characterize both mean and

speed intervals with little error.

5.4.2 Phase 3b: Estimating TOA uncertainty for trained profiles

Recall that the goal of this second phase of the experiment was to determine if subjects

could apply their internal model of the four speed profiles to accurately estimate TOA

uncertainty for trained distances. A second goal was to determine if subjects could use their

knowledge of trained speed profiles to estimate new (untrained) distances. Each subject

completed five trials for each speed profile x distance combination, in a random order. The

first trial of each condition was considered practice and the analyses that follow were

conducted on the remaining four trials.

5.4.2.1. Estimated TOA Uncertainty

Table 5-5 presents the actual TOA interval size, the subjects! estimated TOA window size,

and the difference between the actual and estimated window size. Recall that the expected

TOA was provided to the subjects to eliminate potential confounds associated with

misestimating the TOA itself.

Trial

73

Table 5-5. Actual TOA scores, estimated TOA scores, and TOA estimation error scores

Independent Variables

Speed Variability Distance

Actual TOA Interval

Estimated TOA Window

TOA Estimation Error (Actual –

Estimated)

Slow Low Short 2.8 4.6 -1.8

Slow Low Medium 5.6 5.5 0.1

Slow Low Long 11.2 6.6 4.6

Slow High Short 12.5 9.3 3.2

Slow High Medium 25.0 10.3 14.7

Slow High Long 50.0 12.4 37.6

Fast Low Short 0.6 2.7 -2.1

Fast Low Medium 1.2 3.4 -2.2

Fast Low Long 2.4 3.8 -1.4

Fast High Short 1.9 4.6 -2.7

Fast High Medium 3.8 5.4 -1.6

Fast High Long 7.6 6.6 -1.0

A 2 x 2 x 3 x 4 repeated measures ANOVA was conducted to determine if the subjects!

estimated TOA window differed as a function of the experimental conditions (speed,

variability, distance, repetition). Again, this was conducted to investigate whether the

subjects could distinguish among the experimental conditions, and to evaluate whether

performance differed across the four repetitions due to learning, fatigue, or scenario

differences. The analysis revealed a significant main effect of distance (F(2,30)=13.979,

p<.001), and post-hoc comparisons revealed that subjects! estimated TOA window size

increased as a function of distance. TOA windows were smallest for short routes (M=5.29),

followed by medium routes (M=6.15), and long routes (M=7.34). All pairwise comparisons

were significantly different (p<.01). This suggests that subjects did factor the distance of the

route into their TOA uncertainty estimation.

A significant speed x variability interaction (F(1,15)=8.95, p=.009) provided evidence that

subjects! estimated TOA uncertainty differed as a function of the two main experimental

independent variables. As can be seen by the dashed lines in Figure 5-7, at slow speeds

74

the difference in estimated window size between low (M=5.55) and high (M=10.66)

variability was greater than the difference between low (M=3.3) and high (M=5.5) variability

at fast speeds.

Figure 5-7. Estimated TOA window size as a function of speed (slow, fast) and

variability (low, high). Error bars represent ±1 std error

Next, the subjects! estimated TOA window was compared to the actual TOA interval by

examining the 95% CI around the subjects! estimated TOA window size, to determine if the

actual TOA window size was contained within the 95% CI. Figure 5-8 shows the subjects!

estimated TOA window size (dashed lines) and the actual TOA interval size (solid lines) as a

function of distance for each combination of speed and variability. As indicated by the circles

in the figure, in all but two conditions (slow/low variability/medium distance and fast/high

variability/long distance) the 95% CI around the subjects! estimated window size did not

contain the actual TOA interval size, indicating that the subjects! estimate was significantly

different than the actual interval size. As can be seen in the upper left graph in Figure 5-8,

for the slow/low variability condition, the subjects overestimated TOA window size for short

distances and underestimated TOA window size for long distances. For the slow/high

variability condition (upper right), the subjects underestimated TOA window size for all

distances. The opposite was true for the fast/low variability condition (lower left) and

75

fast/high variability conditions, in which the subjects overestimated TOA window size for all

distances (with the exception of fast/high variability/long distance).

Figure 5-8. Actual (solid lines) and estimated (dashed lines) TOA windows as a function of speed (slow, fast), variability (low, high), and distance (short, medium, long). Graph shows 95% CIs around the estimated TOA window. Circles show data where actual interval widths fall outside of the 95% CI of the subjects! estimates.

The pattern of overestimated TOA window size for short distances and underestimated TOA

window size for long distances can be seen more readily in Figure 5-9, which adjusts the

subjects! estimates to correct for the initial estimation error seen in the trained (medium)

distance. That is, when the subjects! TOA uncertainty estimate for the medium distance is

shifted to match the actual TOA uncertainty, one can clearly see that the subjects

systematically overestimate TOA uncertainty for short distances, and underestimate TOA

uncertainty for long distances.

Related to this finding, in the debrief session, all subjects were shown three scenarios of a

vehicle with the same speed profile travelling routes of different lengths (see Appendix C3),

to assess their understanding of the effect of distance on TOA uncertainty. In the first

scenario, the route distance was 4 miles, and the TOA window around an expected TOA of

320 seconds was 80 seconds. In the second scenario, subjects were asked to estimate the

TOA window for a vehicle with the same profile travelling a route that was half as long – 2

miles. The third scenario asked subjects to estimate the TOA window for an 8 mile route.

For the second (2 mile route) scenario, all but two subjects correctly estimated a TOA

76

window of 40 seconds (which was half the TOA window provided in scenario 1). One

subject estimated a shorter window (20 seconds) and one estimated a longer window (60

seconds). For scenario 3 (8 mile route, which was twice the distance of scenario 1), all but

four subjects correctly estimated the TOA window to be 160 seconds (or twice as long as

provided in Scenario 1). One subject estimated a shorter TOA window (90 seconds), while

three estimated longer TOA windows (180, 200, 320 seconds). While there were some

exceptions, the majority of the subjects demonstrated an accurate understanding of the

relationship between TOA uncertainty and route distance. Yet in the experiment, the

subjects systematically misestimated TOA uncertainty as a function of distance.

Figure 5-9. Actual (solid lines) and estimated (dashed lines) TOA windows as a function of speed (slow, fast), variability (low, high), and distance (short, medium, long), corrected for

initial estimation error at the trained (medium) distance.

5.4.2.2. TOA Uncertainty Estimation Error

TOA estimation error scores, shown in Table 5-5, were computed by subtracting the

subjects! estimated TOA window size from the actual TOA interval size generated from the

population distribution. A TOA window estimation error score of zero reflects a perfectly

calibrated TOA window estimate. A repeated measures ANOVA was conducted on the

absolute values of the TOA window estimation error scores to determine if the amount of

estimation error (regardless of direction) differed as a function of the independent variables.

As evident in Figure 5-10, the ANOVA yielded a speed x variability x distance interaction

(F(2,30) = 202.37, p<.01). Subsequent simple effect tests revealed a significant variability x

77

distance interaction for the slow speed condition (F(2,30)=258.364, p<.001). In the slow/low

variability condition, TOA window estimation error was not significantly different for short

(M=2.02) and medium (M=1.68) distances, p>.05), but TOA window estimation error was

larger for long distances (M=5.18) than both medium and short distances, (p<.001). In the

slow/high variability condition, TOA window estimation error was highest for long distances

(M=37.99), followed by medium distances (M=14.87) and short distances (M=5.27). All

pairwise comparisons were significantly different (p<.001). For fast speeds, no interaction

was found for variability x distance (p>.05). That TOA uncertainty estimation error was

greatest in the slow/high profile for long distances suggests that subjects treated this

condition as a symmetrical distribution, and failed to recognize the substantial increase in

TOA interval size associated with this condition.

Figure 5-10. TOA window estimation error (absolute values), as a function of speed (slow,

fast) and variability (low, high). (Plotted with ±1 SE).

5.4.2.3 Estimated TOA Interval Symmetry

In addition to the size of the subjects! estimated TOA window, the symmetry of the estimated

TOA window was also of interest. As was discussed previously, given a normal distribution

of speeds, the resulting objectively determined distribution of TOAs is necessarily positively

skewed – and furthermore the amount of positive skew changes as a function of speed and

variability – with the greater amount of positive skew for stimuli with slow speeds and high

78

variability. To assess the degree to which the subjects! estimated TOA windows

appropriately reflected this positive skew, a TOA symmetry score was calculated using the

following equation (and also used in experiment 2):

TOA Symmetry Score = |UB –TOA| - |LB-TOA|;

where TOA = subjects! estimates of TOA,

LB = TOA Interval Lower Bound,

UB = TOA Interval Upper Bound (UB)

If the TOA symmetry score is equal to zero, the interval created by the subject was perfectly

symmetrical; if positive, then the window was positively skewed with the upper portion larger

than the lower portion; and if negative, then the window was negatively skewed with the

lower portion larger than the upper portion.

Symmetry scores were first computed on the population data for each combination of speed

and variability as shown in Figure 5-11. As can be seen from the solid lines, the intervals

were either symmetrical or positively skewed. The skew was highest in the slow speed/high

variability condition. This is consistent with the inverse relationship between speed and

TOA, as discussed previously. Next, TOA symmetry scores were calculated for each

subject!s estimated TOA windows. As can be seen from the dashed lines in Figure 5-11, the

subjects! symmetry scores also tended to be neutral or slightly positive (Overall Mean = .05),

therefore suggesting that subjects provided TOA windows that were slightly positively

skewed. Note that for fast speeds the amount of skew was low and quite comparable to the

actual symmetry score; however, for slow speeds with high variability, symmetry scores

were substantially less skewed than the actual symmetry scores.

The 95% CIs around the subjects! symmetry scores were examined to determine if the CI

contained the actual symmetry score. If the 95% CI did not contain the actual symmetry

score, the subjects symmetry score was considered significantly different than the actual

symmetry score. This was the case for all combinations of variability and distance for the

slow condition, in which the actual TOA windows were more positively skewed than the

subjects! estimates, with the exception of slow/low variability/short distance, which was not

different than the actual symmetry score. In contrast, subjects! estimates of TOA symmetry

for the fast stimuli were not significantly different than the actual symmetry score for any

variability or distance combination.

79

Figure 5-11. Actual and estimated TOA symmetry scores. Estimated scores are plotted with 95% CIs. Top graph: slow speed; bottom graph: fast speed.

5.4.3 Phase 3c: Estimating TOA Uncertainty for Untrained Profiles

The goal of this third phase was to determine if, after observing an unlabelled vehicle

travelling a medium length route with 10 speed changes, subjects could determine whether it

was a speed profile for which they had been trained – and apply TOA windows accordingly.

A second goal was to determine whether subjects could extrapolate from the trained speed

profiles to assess TOA uncertainty for speed profiles on which they were not trained. In

contrast to Phase 3b, only one distance (medium route, 500 pixels) was used for all trials in

this phase. Recall that after viewing the vehicles, subjects were asked to first determine

80

whether the vehicle!s speed profile was new or old (i.e., one of the four they had learned in

training). Subjects were then asked to select which of the four trained profiles it was most

similar to – or state the profile they believed it matched – and provide a similarity rating from

1 (very low) to 5 (very high). Then, subjects estimated the TOA windows and entered them

in the same manner as Phase 3b.

5.4.3.1 Identification of Trained and Untrained Profiles

Subjects correctly identified 63% of the profiles as trained or untrained, with a mean of 17

correct identifications out of 27 trials (with a range of 12 – 21). As shown in Figure 5-12, the

correct identification rate for trained profiles (76%) was higher than for new, untrained

profiles (52%) (X2 = 26.27, p<.05).

Figure 5-12. Identification of "trained! and "untrained! profiles

5.4.3.2 Correct Identification of Speed Profile

Next, subjects were asked to identify which of the original four speed profiles the vehicle

was most similar to – or, if the subject stated that it was a trained profile, they were asked to

identify which one of the four profiles it was. Figure 5-13 shows the percent of trials that

were identified correctly for each of the four trained profiles. (Note that this analysis uses

only the four trained profiles, since there was no "correct! closest profile for the untrained

profiles). Identification rates were highest for the slow/low (88%), followed by slow/high

(73%), fast/high (67%) and fast/low (65%) profile. Chi square residual analyses revealed

Trained Untrained Speed Profile

Pe

rce

nt

Co

rre

ct

81

that the identification rate for the slow/low condition was above the mean percent correct

value and the fast/low and fast/high rates were below the mean percent correct value (X2=

7.8, p<.05). The higher identification rate for the slow conditions is likely attributed to the

skewed distribution yielding a range of TOAs that was much larger, and thus arguably more

salient, than for all other conditions.

Figure 5-13. Correct Identification of Speed Profile

5.4.3.3 Similarity Ratings

As summarised in Table 5-6, when asked to rate the similarity of the observed profile to the

trained profile that the subjects identified as being the closest, subjects rated the trained

profiles higher (more similar) than the untrained profiles. That is, when subjects reported that

it was one of the four trained profiles and it actually was one of the four trained profiles,

subjects rated the similarity rating higher (M = 4.2) than if subjects reported that it was a new

profile that they had not seen before and it was indeed a new profile (M = 2.6)

Table 5-6. Mean similarity ratings

Actual profile was: Subject Response

Trained Untrained

Trained 4.2 3.8

Untrained 3.0 2.6

Pe

rce

nt

Co

rre

ct

Slow/Low Slow/High Fast/Low Fast/High Speed Profile

82

5.4.3.4 Estimated TOA Window

Next, subjects were asked to estimate the TOA uncertainty window in the same manner as

in Phase 3b. Figure 5-14 shows the subjects! estimated TOA windows (dashed lines)

plotted with 95% CIs, and the actual TOA intervals (solid lines). A 3 x 3 x 3 ANOVA was

conducted using all trials (trained and untrained speed profiles) to determine if the subjects!

estimated TOA window size differed as a function of speed, variability, and trial. (Recall that

for this phase, a new speed (moderate/45) and a new level of variability (medium/6) were

introduced, creating three levels of each variable. Distance was held constant at medium, as

used for Phase 3a.)

Figure 5-14. Estimated and Actual TOA windows for both trained and untrained profiles

Results revealed a significant main effect of speed (F(2,30) = 17.24, p<.05). (No three-way

or two-way interactions were found.) Follow-up post-hoc tests did not reveal any significant

difference between the subjects! estimated TOA window for the slow speed condition

(M=6.7) and the estimated TOA window for the newly added moderate speed condition

(M=7.0), but it did reveal that both were significantly larger than the TOA window for high

speeds (M=4.3) (p<.001). This result suggests that subjects may have simply applied the

TOA window for slow speeds to the new moderate speed condition, rather than interpolating

between the slow and fast conditions.

83

Subjects! estimated TOA windows differed as a function of variability (F(2,30)=10.84,

p<.001). TOA window estimates for the low variability condition (M=4.8) were lower than for

the new medium variability condition (M=6.353) (p<.05), and for the high variability condition

(M=6.9) (p<.05). Subjects' TOA window estimates for medium and high variability were not

significantly different (p>.05). This result suggests that subjects may have applied the TOA

window for high variability to the new medium variability condition.

There was a significant effect of repetition (F(2,30)=4.33, p<.05). Results revealed that

estimated TOA windows were shorter on Trial 1 (M=5.7) than Trial 2 (M=6.4) (p<.05), but

Trial 3 (M=6.0) was not significantly different than either Trial 1 or 2 (p >.05).

Next, subjects! estimated TOA windows (dashed lines in Figure 5-14) were compared to the

actual TOA intervals (solid lines in Figure 5-14). In the slow speed condition, subjects

underestimated the TOA window size for medium and high variability conditions. In the

moderate speed condition, subjects overestimated the TOA window for low variability and

underestimated the TOA window for the high variability condition. In the fast speed

condition, subjects overestimated the TOA window for both low and medium variability.

An analysis was conducted to determine if the subjects! estimated TOA window differed as a

function of whether the subject thought the speed profile was new or old. For each profile,

the mean TOA window was calculated for those that answered “old/trained” and for those

that answered “new/untrained”. Due to unequal numbers of samples across conditions,

statistical analyses were not conducted, but mean TOA windows are presented in Figure 5-

15 below. Averaged across all speed profiles, the mean estimated TOA window for trials in

which the subject thought the speed profile was "new! was 6.9 (170 trials, SD = 3.9), as

compared to an average of 5.44 (261 trials, SD = 3.7) for trials in which the subject thought

that the speed profile was old. This finding reflects presumably greater uncertainty

associated with an unknown profile. As can be seen in Figure 5-15, the TOA window

estimates for each profile were very similar for those that the subjects thought had a new

speed profile and those which they thought were old, with the exception of two new speed

profile conditions, 45/6 and 45/9. In both cases when subjects correctly identified these as

new profiles, the estimated TOA uncertainty was much larger than when they incorrectly

identified them as old profiles.

84

Figure 5-15. TOA window estimates for each speed profile plotted by subjects! identification of old/trained or new/untrained profile (shown with +/- 1 Std. Error). Parentheses at the bottom indicate whether the speed profile was actually trained or untrained.

Lastly, subjects! estimated TOA windows from Phase 3c were compared to those from

Phase 3b. The only difference between the two conditions was that the speed profiles were

labelled (by colour code) in 3b, but was not labelled in 3c. Figure 5-16 compares the TOA

window estimation error scores for Phase 3b and 3c, for each of the four trained speed

profiles, revealing a phase x profile interaction (F(3,45)=4.92, p=.005). The difference in

TOA window estimation error scores between Phase 3b (labelled) and Phase 3c (not

labelled) was significant only in the slow/high profile condition (F(1,15)=7.24, p=.017), with

estimation error higher when the profile was not labelled (Phase 3c). This result suggests

that there may be an operational benefit to providing display technology or otherwise

identifying the speed profile to operators, since TOA uncertainty estimation error was larger

in the absence of the profile identification. There were only small differences (p<.05)

between the two studies for the other three profiles, suggesting that, for these profiles,

subjects! TOA window estimates were not altered if the speed profile was labelled or

unlabelled.

85

Figure 5-16. TOA window error scores when the profile was colour-coded (phase 3b) and not colour-coded (phase 3c) (shown plotted with +/- 1 Std. Error)

Understanding the process of extrapolating to untrained speed profiles. Further analyses

were conducted to explore how subjects extrapolated from trained profiles to assess TOA

uncertainty for a new speed profile. In particular, given that subjects were trained on low

and high variability conditions, this analysis set out to determine how subjects estimated

TOA windows for the new, untrained, medium variability conditions. Nisbett, Krantz, Jepson

& Fong!s (1982) research on inductive reasoning suggests two mechanisms that could

explain how subjects went about estimating TOA uncertainty for untrained conditions,

yielding two competing hypotheses for consideration. First, subjects could have estimated

the TOA window for low and high variability based on their model of uncertainty developed

in Phase 3a, and averaged the two numbers to determine the TOA window size for the

medium variability condition. This is the simplest form of a mathematical model, or statistical

heuristic (Nisbett, Krantz, Jepson & Fong, 1982), that could reasonably be expected to be

employed by the subjects during the experiment. Alternatively, subjects could have selected

the trained condition that they felt was closest to the new condition, and simply assume the

same TOA window. This is commonly referred to in the literature on judgment under

uncertainty as the representativeness heuristic (Tversky & Kahneman, 1982).

Speed Profile

TO

A W

ind

ow

Err

or

Sco

re

86

To test these two hypotheses, a multiple regression analysis was conducted to determine

which hypothesis better predicted the outcome measure of estimated TOA window for the

new, untrained profiles. The first hypothesis was modelled as a predictor variable that was

computed as the average of each subject!s estimated TOA window for the low and high

variability conditions. The second hypothesis was modelled as a predictor variable that

represented the subject!s estimated TOA window for the speed profile that he/she stated

most closely matched the stimuli. With both predictor variables in the equation, the model

accounted for 80% of the variance in estimated TOA window times.

Referring to Table 5-7, examination of each predictor variable revealed that only the

predictor variable that represented the subject!s estimate for the closest profile was

significant, with a standardized beta score of .878. This suggests that subjects

overwhelmingly employed the representativeness heuristic – that is, they applied the TOA

window of the trained profile that was most similar rather than the average of the low and

high variability. However, the reader must be aware of one potential limitation of this finding.

Because subjects were first asked to identify the closest trained profile and then determine

the TOA window, it is possible that this experimental method cued the subjects to use the

representativeness heuristic.

Table 5-7. Multiple regression analysis to determine which model best explains how subjects estimated TOA uncertainty for untrained profiles.

Model B Beta t sig

Closest Profile .833 .878 11.012 .000

Average of Low and High 0.29 .22 .278 .782

R2 = .804

A subsequent multiple regression analysis was conducted that included the subjects!

similarity rating scores, in addition to the two heuristics as predictor variables. This was

conducted to determine if the degree of rated similarity to a trained profile accounted for any

more variance in estimated TOA window size. Results revealed that it did not – producing a

R2 of .805, as compared to the previously obtained .804. The similarity score predictor

variable was not significant (t=.681, p>.05) with a standardized beta weight of -0.032.

87

5.5 CONCLUSION AND OPERATIONAL IMPLICATIONS

This experiment addressed two methodological limitations identified in experiment 2. The

inclusion of Phase 3a, an extensive training session that exposed subjects to over 100

vehicles for each of four profiles, served to increase the sample size from which subjects

could understand and characterize the TOA distribution, and also changed the task from one

of (potentially) simply observing and describing a single sample of displayed speed

readouts, to one of developing an internal model for application in subsequent phases of the

experiment. Additionally, changes were made to the experimental protocol to improve

subjects! ability to understand the task.

Not only did this experiment replicate the results of experiment 2 (reported in Chapter 4), it

also revealed further important insights into the TOA uncertainty estimation problem. The

results of Phase 3a revealed that using a training-with-feedback paradigm, subjects can

develop accurate models of speed profiles based on relatively short exposure to only 100

vehicles. Phase 3b then asked subjects to apply their models of each speed profile to

estimate TOA uncertainty, for both the trained distance and two new, untrained distances

(one shorter and one longer). For the trained distance, subjects estimated the TOA

Uncertainty for the slow/low profile very accurately, but underestimated TOA Uncertainty for

slow/high while overestimating for the fast/low and fast/high profiles. The untrained distance

conditions revealed systematic biases – overestimating TOA uncertainty for short distances

and underestimating TOA uncertainty for long distances. The TOA uncertainty estimation

error was greatest for the slow/high profile, and this was attributed to subjects! assumption

that the TOA distribution was symmetrical when in fact it is highly positively skewed.

Phase 3c set out to determine how well subjects could identify an unlabeled speed profile by

observing speed fluctuations and then apply the appropriate TOA uncertainty estimate.

Results revealed that subjects could indeed identify the correct profile about 73% of the

time, and consistently applied TOA uncertainty estimates that were similar to estimates of

the labelled vehicles in Phase 3b; however TOA estimation error was higher when the

vehicle was unlabelled in the slow/high profile. Further, Phase 3c examined how subjects

88

estimated TOA uncertainty for new, untrained speed profiles. A series of analyses

suggested that subjects applied a representative heuristic by selecting the trained speed

profile that was most similar to the observed vehicle!s profile, and applying the TOA

uncertainty estimate of that trained profile.

Implications of these results for the proposed model of TOA uncertainty are discussed next

in Chapter 6 and their implications for the operational environment of future 4-D air traffic

control operations are then discussed in Chapter 7.

89

CHAPTER 6: MODEL REFINEMENT

6.1 TIME OF ARRIVAL UNCERTAINTY MODEL RESTATED AND REFINED

The preliminary model of TOA uncertainty, first presented in chapter 2, and reproduced

below in Figure 6-1, was generated based on input from subject matter experts (ATC), after

the first experiment.

Figure 6-1. Original Model of TOA Uncertainty

After exploring the model systematically in two tightly controlled human-in-the-loop

experiments, a proposed modification to the model is presented in Figure 6-2. Results of

experiment 3c revealed that subjects did not change the way they estimated TOA

uncertainty as a function of whether they had seen the profile before or not, as was

previously hypothesised. Rather in all cases, after observing the vehicles, subjects appeared

to determine which of the profiles in their personal library that it most closely matched, and

applied that estimate of TOA uncertainty directly.

Figure 6-2. Refined Model of TOA Uncertainty

90

6.2 SOURCES OF TIME OF ARRIVAL UNCERTAINTY ESTIMATION ERROR

Based on the results of experiments 2 and 3, it is now possible to explain this model further,

by characterizing the sources of TOA uncertainty error and determining how each of the

three processes represented in the refined model of Figure 6-2 (i.e., Build library of speed

profiles; Identify closest speed profile, and Apply estimate of TOA uncertainty) contribute to

TOA uncertainty estimation error observed in study 3c. A multiple regression analysis was

conducted to determine the relative contribution of each of the model processes in

explaining the observed TOA estimation error. In the multiple regression analysis, the

subjects! ability to build a library of speed profiles was modelled using two variables from

study 3a: 1) speed mean estimation error and 2) speed window estimation error. Subjects!

ability to identify the closest speed profile was modelled using two variables from study 3c:

1) error associated with identifying whether the profile was a trained one, or a new one, and

2) error associated with identifying the speed profile for trained profiles only. Finally,

subjects! ability to estimate TOA uncertainty was modelled as TOA uncertainty estimation

errors from study 3b.

The multiple regression revealed that, collectively, the five predictor variables accounted for

75% of the variance. However, as can be seen in Table 6-1, the variable that contributed

most significantly to this was the subjects! TOA uncertainty estimation error score (t=4.65,

p=.001). This suggests that, based on the experimental studies, TOA uncertainty estimation

error can be attributed primarily to the actual TOA estimation process itself, and not to the

process associated with developing the library of speed profiles, or comparing observed

vehicles to the library and identifying the closest profile.

It is prudent to raise one caveat associated with this conclusion. That is, given the

experimental method employed, it was necessary to provide feedback to subjects in study

3a, during which they built their library of speed profiles. While it is believed that this

feedback would be obtained implicitly over time by operators in the actual environment, it is

unknown how long it would take to achieve the same level of performance without the

explicit feedback. Actual operations may yield more error in the first two processes (Building

a library of speed profiles and Identifying the closest profile from the library) than was

observed in these studies.

91

Table 6-1. Multiple Regression to determine source of TOA Uncertainty Error

Variable B Beta t Sig

Speed Mean Estimation Error 0.082 0.032 0.187 0.855

Speed Window Estimation Error 0.203 0.192 1.127 0.286

Old/New Identification 0.034 0.037 0.218 0.832

Speed Profile Identification -0.192 -0.163 -0.914 0.38

TOA Uncertainty Estimation Error for labelled profiles

0.845 0.793 4.65 0.001

R2 = .75

6.3 CAUSES OF TIME OF ARRIVAL UNCERTAINTY ESTIMATION ERROR

Having identified the process of TOA uncertainty estimation as the potentially largest

contributor to error, the next step is to understand the causes of these errors. Further

examination of the results from experiments 2, 3b, and 3c reveal two systematic biases that

contribute to TOA uncertainty estimation errors.

6.3.1 Assumption of Symmetry

First, data in both experiments 2 and 3 revealed that subjects typically failed to recognize

the asymmetrical shape of the TOA distribution and thus underestimated the amount of

actual TOA uncertainty. (Recall, that approximately half of the subjects had completed at

least one university-level course in statistics). As discussed previously, if speed is normally

distributed, then by definition TOA is positively skewed, since TOA is the reciprocal of

speed. Furthermore, this amount of positive skew is more pronounced for vehicles travelling

at slower speeds and with higher variability. However, subjects appeared to treat all

distributions as symmetrical, or in any case to underestimate the amount of positive skew.

This can be seen clearly in the symmetry metric calculated in both experiment 2 (shown

previously in chapter 4) and experiment 3b, shown previously in chapter 5 and reproduced

here in Figure 6-3.

A second piece of evidence that points to the assumption of normality explanation as a

cause of TOA uncertainty estimation error is the speed x variability interaction observed in

92

experiment 3b, shown previously in chapter 5 and reproduced here in Figure 6-4. As can be

seen, subjects estimated the TOA window to be larger for slower vehicles than faster

vehicles, but failed to recognize the sharp increase in TOA window size associated with the

slow/high variability condition due to the positively skewed TOA distribution.

Figure 6-3. TOA window symmetry scores from experiment 3b show that subjects

underestimated the magnitude of positive skew for slow speeds.

93

Figure 6-4. Estimated and actual TOA window size from experiment 3b shows that subjects underestimated the window size for the slow/high condition, which is highly skewed.

(Error bars represent ±95% CI)

Lastly, subjective reports elicited during the debrief interview also support the assumption of

normality hypothesis. Most subjects expressed a strong bias towards symmetrical TOA

windows, as evidenced by the following comments:

• “My TOA windows were always symmetrical, because speed was always

symmetrical.”

• “TOA windows were symmetrical. There is no point in making asymmetrical

windows. Asymmetrical windows would have been strange.”

• “TOA windows were symmetrical, I have an obsession with symmetry.”

• “I thought all TOA windows should be symmetrical.”

• “I made nice even windows around the TOA.”

• “Symmetrical windows just seemed natural.”

In some cases, subjects expressed confusion or uncertainty as to whether the windows

should be symmetrical, positively skewed, or negatively skewed. The following quotes

demonstrated that these subjects did not clearly understand that TOA distributions are

positively skewed:

94

• “As variability increased, the TOA windows tended to be more asymmetrical, but not

always in the same direction. The red profile [slow/low] had more early vehicles, and

the blue profile [fast/high] had more later vehicles.”

• “My TOA windows tended to be asymmetrical – sometimes bigger on the left [more

earlier than later] and sometimes bigger on the right [more later than earlier]. I don!t

know why.”

However two subjects did recognize that TOA windows were asymmetrical, as evidenced by

the following quotes:

• “ The TOA windows were symmetrical, except for the yellow profile [slow / high

variability] because they lagged a bit.”

• “TOA windows were asymmetrical. In actuality, there is no way that arrivals would

be symmetrical – there is more of a chance of being slow and taking longer.”

6.3.2 Aversion to Extremes

A second potential source of TOA Uncertainty estimation error was an apparent aversion to

extremes. In both experiment 2 and 3, subjects overestimated conditions of small

uncertainty and underestimated conditions of large uncertainty. This systematic bias can be

seen in the speed x variability interaction observed in both experiment 2 and 3b shown

above in Figure 6-4. Subjects overestimated TOA uncertainty for the fast speed profiles that

possessed low actual TOA uncertainty (fast / low variability and fast / high variability) and

underestimated TOA uncertainty for the slow profiles which possessed higher actual TOA

uncertainty. Finally this can also be seen in the adjusted distance extrapolation results of

experiment 3b, reproduced here in Figure 6-5. Subjects systematically overestimated the

low actual TOA uncertainty of short distances and underestimated the higher actual TOA

uncertainty of longer distances. This was seen for each of the four speed profiles. This

systematic aversion to extremes is compatible with previous research on the estimation of

probabilistic events conducted by Tversky and Simonson (2000), which suggests that most

people tend to avoid extremes and choose an intermediate option if available.

95

Figure 6-5. TOA window estimates as a function of speed profiles for trained and untrained distances, showing subjects! systematic bias for overestimating low uncertainty and underestimating high uncertainty.

96

CHAPTER 7: SUMMARY, LIMITATIONS AND IMPLICATIONS

The ability to estimate TOA uncertainty is a critical cognitive process in supervisory control

systems such as air traffic management. It is expected to become a critical issue for the next

generation aviation system, which will rely on strict conformance and monitoring of 4-D

trajectories both in the air and on the ground. Chapter 1 discussed the introduction of new

decision support system technologies to support 4-D operations. There the need for

controllers to develop effective estimates of TOA uncertainty rather than relying on

automation for this purpose was identified.

In chapter 2, experiment 1 solicited input from ATC subject matter experts to shed light on

how TOA is estimated in the real world, together with the uncertainty associated with those

estimates. Several factors believed to impact TOA uncertainty estimates were identified.

This experiment led to the development of a preliminary model of TOA uncertainty

estimation.

In chapter 3, literature from domains such as decision making under uncertainty and

probabilistic reasoning was applied to develop a generic experimental method and metrics

for evaluating TOA uncertainty, extracted from relevant aspects of the ATC TOA estimation

task. This experimental method was then applied in experiment 2 in chapter 4, which

identified that subjects tended to over-estimate TOA uncertainty for fast speeds, and

underestimate it for slow speeds. The latter was attributed to underestimating the amount of

positive skew associated with the TOA distribution.

In chapter 5, the results of experiment 3 addressed two limitations of the experimental

paradigm and replicated the results of experiment 2. Experiment 3 also extended the

findings to better understand how subjects estimate TOA uncertainty for untrained distances

and profiles. From experiment 3, it was concluded that subjects appeared to apply the

representativeness heuristic when estimating TOA uncertainty for new speed profiles that

they have not previously encountered.

97

7.1 MODEL ASSUMPTIONS AND LIMITATIONS

The refined model of TOA uncertainty estimation presented in chapter 6, along with the

sources and causes of estimation error, are based on certain assumptions as follows.

7.1.1 Speed is Normally Distributed

First, the model and results are predicated on the assumption that speeds are normally

distributed, which was the case in the two experimental studies. Although this is a

simplification of the operational environment, data from a recent airport human-in-the-loop

taxiing simulation (Williams, Hooey & Foyle, 2006) nevertheless support that this is the case

for nominal taxi conditions. Figure 7-1 presents a histogram of a short sample of taxi speed

data taken from that study, in which the pilot was taxiing in a medium-fidelity aircraft

simulator while trying to maintain a commanded average taxi speed, unencumbered by

traffic delays. As can be seen, those taxi speeds do approximate a normal distribution in this

simple taxi condition. Of course, this would not necessarily hold in more complex scenarios

with traffic and adverse weather. In addition, it is unknown whether the subjects understood,

or assumed, that the distribution of speeds was normally distributed. It is indeed possible,

for example, that subjects envisioned a uniform distribution in which case they would not

necessarily expect the resulting TOA distribution to be positively skewed. This could have

accounted for the subjects! underestimates of positive skew.

Figure 7-1 Taxi speed data from a medium fidelity pilot-in-the-loop study, showing that

nominal taxi speed maintenance data are approximately normally distributed.

98

7.1.2 Availability of Accurate Speed Readouts

Second, the model assumes that operators have access to speed data, such as in the form

of a digital readout, and are not required to perceive changes in speed visually. In many

ways this simplifies the task and the model, but in a manner that is consistent with future

aviation operations. It is a safe assumption that both pilots and controllers will be equipped

with accurate speed-readouts to assist them in advanced 4-D trajectory environments.

However, there are potential implications for environments where this assumption may not

hold true. Estimates of TOA uncertainty may be systematically influenced by subjects!

misperception of vehicle speed that may occur as a function of the stimulus field, object size,

and the background texture.

As summarized by Ryan and Zanker (2001) and Goldstein (1989), perception of speed is

affected by the size of both the moving object being observed and the framework through

which it moves. The perceived speed of regularly spaced objects moving within apertures

decreases with larger aperture size (Brown, 1931). In particular, Brown asked observers

sitting in a dimly lit room to adjust the speed of a large dot moving across a large rectangle

to make it appear equal to the speed of a small dot moving across a small rectangle. Brown

found that, if the large rectangle was ten times larger than the small one, the large dot had to

move seven times faster than the small one for them to appear to move at the same speed.

This effect, known as speed transposition, shows that two images moving across the visual

field at different speeds can be perceived to be moving at the same speed. This could be

due to a size constancy effect by which, in the absence of veridical depth information, a

smaller stimulus is interpreted as being further away from the observer. This could have

important implications in some domains, particularly if the user is permitted to resize

windows on their display, thus making objects appear to be moving faster or slower. This

effect would be particularly problematic given the results found here, which show that

subjects tend to overestimate TOA uncertainty for faster objects and underestimate TOA

uncertainty for slower objects.

Another factor that may affect the perception of speed is the background on which the object

is displayed. This may range from objects in the real world, to icons on a display with

backgrounds that consist of simple, solid colour backgrounds, to complex grid backgrounds.

99

The threshold for perceiving movement in a homogeneous field is a speed of about 1/6 to

1/3 of a degree of visual angle per second (Goldstein, 1989). However, if vertical lines are

present in the background, the object!s movement can be perceived even at speeds as low

as one-sixtieth of a degree of visual angle per second. Thus, perception of motion is

determined by the background, which affects not only the threshold for movement but also

the perception of speed. Indeed, the grid marks placed on the vehicle!s route in experiment

3 (see chapter 5) could have caused subjects to perceive the speed to be greater than it

really was. Related to this, it is also unknown if the perception of velocity was uniform

across the display (as opposed to the possibility that higher speeds might have been

perceived at the edges of the display), and precisely how this may have affected the results.

However, it is important to note that the factors outlined above are unlikely to have

confounded the results in the current experiments, for two reasons: 1) Subjects in these

experiments relied primarily on the digital speed readouts to estimate speed and thus did so

with negligible perceptual error; and 2) Speed estimation performance in experiment 3 was

very similar to that in experiment 2, in which there were no vertical hatch marks along the

route. Nevertheless, these factors could substantially influence speed perception in

operational settings in which operators are provided displays that show vehicles moving

against a background that changes from visually simple to complex as a function of the

environment such as a road network or airport layout.

7.1.3 Time of Arrival Uncertainty Estimation Adheres to the Proposed Two-step

Process Model

The model and experiments were also predicated on the assumption that subjects first learn

speed profiles and for each speed profile they estimate an expected TOA uncertainty

window. Indeed, this assumption was generated based on results of the first study in which

air traffic controllers described how they estimated TOA and TOA uncertainty. Controllers

reported that they had mentally assigned certain airlines with a speed profile (e.g.,

“Southwest Airlines taxied fast and consistently”) and estimated TOA based on that profile.

It is for this reason that in experiment 3 subjects were trained (with feedback) on speed and

then tested on their ability to estimate TOA. If, instead, subjects had been trained to

estimate TOA by means of a training protocol that provided feedback on TOA instead of

speed, the results most likely would have yielded improved TOA uncertainty estimates.

100

Nonetheless, it is believed that operators, such as air traffic controllers, do indeed develop

models of speed profiles from which they estimate TOA, rather than developing models of

TOA directly.

Nonetheless, this is an important assumption to consider and raises implications for real-

world operational training and display needs. In the operational setting, there are four

important differences from the studies presented in this research: 1) controllers don!t always

get immediate feedback in terms of their speed and TOA estimates; 2) the on-the-job

training would not provide feedback that is as precise and unambiguous as in experiment 3;

3) controllers won!t likely experience 100 aircraft in a row of the same profile, and thus may

experience degradations of their estimates due to interruptions and integration of multiple

profiles; and 4) controllers would need to generate many more speed profiles to capture the

range of operational conditions and the differences among these profiles may be more

subtle than those tested in experiment 3. Exactly how these differences help or hinder

controllers! ability to build a library of speed profiles and apply them to estimate TOA

uncertainty is a topic for further research using operational settings and experienced

controllers.

7.2 OPERATIONAL IMPLICATIONS FOR AVIATION

A number of important implications for the operational environment have been revealed.

These implications are reported below, with the caveat that, without further operational or

field research, the degree to which these laboratory-based studies generalize to expert air

traffic controllers in the operational environment is unknown.

7.2.1. Implications for Human Error

First, it was shown that subjects underestimate TOA uncertainty in conditions where TOA

objective uncertainty is high, such as aircraft travelling long distances and aircraft that travel

at slow average speeds that are highly variable. This may have implications for actual

aviation operations, in that aircraft in these operating conditions may be sequenced too

closely because controllers will think the aircraft need less separation than they actually do.

In the example of runway sequencing, it is likely that controllers will produce smaller-than-

101

necessary runway-crossing windows (a gap between two arriving aircraft in which a third

aircraft can cross the active runway). That is, if a controller creates a runway-crossing

window that is based on an underestimated TOA uncertainty window, the resulting runway-

crossing window will be too small. It is likely that the second arriving aircraft will be required

to conduct a go-around manoeuvre because the crossing aircraft will not have cleared the

runway in time. This could be a safety hazard, leading to potential loss-of-separation, or

create system inefficiencies if aircraft need to be stopped and/or re-routed.

Second, the subjects underestimated the amount of positive skew of TOA distributions for

slow and highly variable aircraft, thus a higher number of aircraft arrived later than expected.

This implies that if actual controllers also exhibit this tendency, they will underestimate the

likelihood that an aircraft will arrive late at a destination (e.g., waypoint, runway, or gate).

This can have both safety and efficiency implications in the case of paired departure

operations (Lunsford, 2009), in which two aircraft must depart parallel runways within 30

seconds of each other to ensure the following aircraft is not subject to excessive wake

turbulence from the lead aircraft. If controllers underestimate the likelihood that the following

aircraft may arrive late at the departure runway, the following aircraft may either experience

a wake vortex incident or experience substantial delays, as it would be required to stop and

wait for the next available departure slot, resulting in system inefficiencies.

Third, subjects overestimated TOA uncertainty when the distance was short, and when

actual TOA uncertainty was low, as with aircraft that travel at a consistently fast speed. If

actual controllers were also to exhibit this tendency, the result would be larger-than-

necessary runway crossing windows. That is, for example, a controller might leave a gap of

two minutes between arriving aircraft to ensure that a third aircraft can cross the active

runway, when in fact a gap of only one minute is required. With accurate knowledge of TOA

uncertainty, the controller could have doubled runway-crossing throughput by crossing two

aircraft in the gap, instead of one. In these cases, overestimates of TOA uncertainty reduce

airport efficiency. Also, in this case, a controller may unnecessarily require an arriving

aircraft to conduct a go-around manoeuvre if he/she believes the runway will not be clear,

which creates additional risks for the arriving aircraft.

102

Finally, subjects tended to generalize unknown profiles to the closest known profile. If

controllers do this in the operational environment, and there is evidence from experiment 1

that they do, this could lead to over-generalizations, resulting in TOA windows that are either

too small or too large as described above, again resulting in either unsafe or inefficient

operations.

7.2.2 Implications for Human-Centred Automation to Support Air Traffic Control

Given the prevalence of the TOA uncertainty estimation errors noted above, it is next

reasonable to consider how the performance of air traffic controllers, particularly in next-

generation time-based environments, could be supported by automation. The results of this

research are interpreted using Parasuraman, Sheridan, and Wickens! (2000) "stages and

levels! of automation framework. The framework proposes that automation can support four

broad classes of functions: 1) Information Acquisition; 2) Information Analysis; 3) Decision

and Action Selection; and 4) Action Implementation. Within each of these functions,

automation can be applied across a continuum of levels from low to high (i.e., from fully

manual to fully automatic). Suggestions for how automation can be used to support air

traffic controllers in each stage are discussed next.

Information Acquisition refers to the process by which raw data are selectively attended or

filtered (Wickens, 2000). In the context of TOA uncertainty estimation, this includes the

controllers! ability to accurately perceive the aircraft!s current state (location, speed, distance

to travel), identify the factors that contribute to a particular aircraft!s TOA uncertainty, and to

accurately characterize the aircraft!s speed profile. The present research demonstrated

human limitations associated with the task of accurately characterizing an aircraft!s speed

profile. Specifically, it was noted that subjects were able to accurately identify a speed

profile after a short period of observation only 65 to 88% of the time for trained profiles.

Automation to support this information acquisition task may take quite different forms

ranging from simple displays of raw data (particularly speed; see section 7.1.2 regarding the

assumption of this minimum level of automation) to target cueing (Yeh, Wickens, & Seagull,

1999), intelligent information management (Hammer, 1999), and highlighting (Wickens,

Kroft, & Yeh, 2000), each of which explicitly directs the controller!s attention to certain

103

information sources at the expense of others. The automation may guide attention to

sources of environmental information that the controller may use to determine the aircraft!s

speed profile (such as information about traffic, weather, aircraft type, and pilot familiarity

with the airport) or may explicitly provide an indication of an aircraft!s speed profile, possibly

through color-coding on the ATC display.

In the second stage (Information Acquisition), data must be integrated and interpreted so as

to draw some intelligent inference regarding the state of the world (Wickens, 2000). In ATC

operations, this refers to the controllers! estimation of TOA uncertainty based on their library

of speed and TOA profiles. As was seen in the results of experiment 2 and 3, this was the

task that was the most prone to bias, and thus presumably may benefit the most from

automation. To support this task, automation could compute and display TOA uncertainty

and prioritize or rank order the aircraft in terms of TOA uncertainty. A controller could use

the display to choose the aircraft that is most likely to arrive at the departure runway on time,

or to identify aircraft that are unlikely to make their takeoff time.

The third phase (Decision and Action Selection), in the context of ATC, refers to the

controllers! task of deciding whether or not the estimated TOA uncertainty is acceptable, or if

action is required to reroute aircraft. Although not addressed in the current research, there

is evidence that controllers may typically apply a conservative bias in this decision-making

phase (see Boudes & Cellier, 2000) and thus may be more likely to determine that the TOA

uncertainty is unacceptable and reroute aircraft. Although this conservative bias is actually

quite adaptive in today!s ATC environment (as described previously in section 1.1.1), it may

be detrimental in NextGen 4-D environments, which will require aircraft to arrive "just-in-time!

(that is, neither early nor late). Automation in the form of visual or auditory alerts that

indicate when objective TOA uncertainty exceeds a pre-determined threshold could be

applied to support this stage.

Finally, the Action Execution stage in this ATC context refers to the task of issuing a revised

clearance to one or more aircraft as a result of excessively high TOA uncertainty. A lower

level of automation to support this stage could take the form of automation that generates

time-based clearances that the controller would approve or modify and then communicate to

104

the aircraft. This approach would take advantage of the automation!s ability to quickly

compute accurate TOA!s and distributions, but at the same time take advantage of the

human!s ability to improvise and use flexible procedures (see Fitts, 1951). A higher level of

automation could operate completely autonomously and issue time-based clearances

directly to the aircraft and modify them when pilots fail to meet their time requirements. The

controller could over-ride the automation to provide certain aircraft with more time and a

larger uncertainty buffer where mandated. However, it must be acknowledged that a

weakness of this approach is the operator "out-of-the-loop! problem (Endsley & Kris, 1995).

As discussed previously in Section 1.1.3, when placed in a monitoring role, it is often difficult

for the operator to maintain sufficient situation awareness to make the necessary

interventions.

While automation to support the Action-Execution stage is currently being researched (for

example, Balakrishnan & Jung, 2007; Rathinam, Montoya & Jug, 2008), many of these

prototype systems typically address TOA uncertainty in an overly simplistic manner by either

applying a constant uncertainty value for each aircraft or assuming perfect on-time

performance from each aircraft. The current research showed that the TOA uncertainty

values incorporated into the automation should consider (at least) the mean speed,

variability, and distance to travel. Experiment 1 also uncovered several other factors that

human controllers consider in estimating TOA uncertainty, such as airline, weather

conditions, traffic levels, and even whether or not the pilot spoke with an accent (suggesting

to the controllers that the pilot might not be familiar with the airport surface). While some of

these could presumably be incorporated into the automation algorithms (with varying

degrees of success) others, such as the pilot!s accent, are not easily considered by

automation. Failure to account for these important variables may result in inaccurate

uncertainty buffers around each aircraft. Uncertainty buffers that are too large will limit

system efficiency and uncertainty buffers that are too small may yield excessively high error

rates.

In summary, automation that supports controllers in the Information Acquisition task may be

the most fruitful for several reasons. First, as noted above, subjects exhibited systematic

biases in their abilities to estimate TOA uncertainty. Second, applying automation at the

earlier stages, rather than at the decision and action stages, may indeed be the best option

105

if the airport surface environment proves to be too dynamic, thus rendering the automation

"brittle" (Cohen, 1993; as discussed previously in section 1.13). This approach would

minimize the operator out-of-the-loop concern because the controller would be actively

controlling the airport traffic, albeit in a more informed manner. Third, a further potential

advantage of automation at the earlier stages that were shown to exhibit systematic bias, is

that it is likely that controllers! TOA uncertainty estimation errors would gradually correct

over time with use of the automation and their estimates would become more accurate.

Indeed, such an improvement in estimation error was observed in Experiment 3 for the task

of velocity and velocity window estimation errors. It is plausible that displaying the TOA

uncertainty information to the controller would serve as adequate feedback to support this

learning process in the actual operational environment. If this is the case, it is possible that

the controllers would rely less on the automation over time, and use it as an aid only when

presented with an aircraft with an unfamiliar profile.

The results of the current research suggest that automation, particularly when applied to

Stages 1 and 2, could help compensate for the systematic biases observed in the tasks

associated with identifying an aircraft!s speed profile and applying the TOA estimate. It is

still unclear, however, exactly how a controller would, or should, use such TOA uncertainty

information, and this remains a topic for future research.

7.3 FUTURE DIRECTIONS

This research culminated in the development of a model that explains how subjects

estimated TOA uncertainty, and the sources and causes of errors in the TOA uncertainty

estimation process. Armed with this model of the systematic biases inherent in human

operators! TOA uncertainty estimation, three future research directions have been identified.

First, the results of the current research were based on a simplification of a real-world task.

Although it is believed that the important elements of the task were represented in the

experimental tasks reported in chapters 4 and 5, future research is required to increase the

realism and operational robustness of the experimental task, to ensure that the findings

generalize to real world ATC operations. This could include modifications to the current

experimental platform, such as including variable start times and route lengths,

106

representations of traffic sequencing tasks and other environmental cues that affect

variability, and implementing variable profiles. Future analyses of actual operational data in

an ATC setting would be of value, as would modelling profiles with realistic, potentially non-

normal distributions.

Second, various forms of ATC automation were presented and discussed in section 7.2.2.

Each of these approaches requires extensive research including human-in-the-loop

simulations to fully define and evaluate. Only with relatively mature automation solutions (or

at least emulations) can the relative merits of each automation level for a particular stage be

evaluated.

Third, given that TOA uncertainty exists in many complex environments and humans

systematically misestimate this uncertainty, there is a need for further research and

development efforts geared toward the design and development of DSSs and other

advanced visual displays to support users in the TOA uncertainty estimation task (such as

Lee & Milgram, 2008). It is expected that elements of the experimental method and metrics

developed within the current research effort will be useful for this purpose.

Fourth, the current research has examined how subjects comprehend TOA uncertainty

relative to actual TOA uncertainty; however, there is still much to be learned regarding what

users will do with this information. Future research is required to better understand how this

comprehension of TOA uncertainty will affect operator situation awareness and the actual

decisions that operators make.

107

CHAPTER 8: REFERENCES

Alpert, M. & Raiffa, H. (1982). A progress report on the training of probability assessors. In

D. Kahneman, P. Slovic and A. Tversky (Eds.). Judgment Under Uncertainty:

Heuristics and Biases. Cambridge: Cambridge University Press.

Andre, A. D. & Cutler, H. A. (1998). Displaying uncertainty in advanced navigation systems.

Proceedings of the Human Factors and Ergonomics Society 42nd Annual Meeting. (pp.

31-35). Santa Monica, CA: HFES.

Balakrishnan, H., & Jung, Y. (2007). A framework for coordinated surface operations

planning at Dallas-Fort Worth International Airport (Paper # AIAA 2007-6553).

Proceedings of the AIAA Guidance, Navigation, and Control Conference and Exhibit.

Alexandria, VA: AIAA

Banbury, S., Selcon, S., Endsley, M. Gorton, T. & Tatlock, K. (1998). Being certain about

Banbury, S., Selcon, S., Endsley, M. Gorton, T. & Tatlock, K. (1998). Being certain

about uncertainty: How the representation of system reliability affects pilot decision

making. Proceedings of the Human Factors and Ergonomics Society 42nd Annual

Meeting. (pp. 36-39). Santa Monica, CA: HFES

Block, R. A. & Harper, D. R. (1991). Overconfidence in estimation: Testing the anchoring

and adjustment hypothesis. Organizational Behavior and Human Decision Processes,

49, 188-207.

Boudes, N. & Cellier, J-M. (2000). Accuracy estimations made by air traffic controllers. The

International Journal of Aviation Psychology, 10(2), 207-225.

Brown, J. F. (1931). The visual perception of velocity. Psychologische Forschung, 14, 199-

232.

Cohen, M. S. (1993). The bottom line: Naturalistic decision aiding. In G. A. Klein, J.

Orasanu, R. Calderwood, & C. E. Zsambok (Eds.). Decision Making in Action: Models

and Methods. Norwood, NJ: Ablex Publishing Corporation.

108

Endsley, M. R., Bolte, B. & Jones, D. G. (2003). Designing for Situational Awareness: An

approach to user-centered design. New York: Taylor & Francis.

Endsley, M. R., & Kris, E. O. (1995). The out-of-the-loop performance problem and level of

control. Human Factors, 37 (2), 381-394.

Erev, I., Wallsten, T.S., & Budescu, D. V. (1994). Simultaneous over- and underconfidence:

The role of error in judgment processes. Psychological Review, 101, 519-527

Fitts, P. M. (1951). Human Engineering for an Effective Air Navigation and Traffic Control

System. Washington DC: National Research Council.

Gigerenzer, G., Hoffrage, U., & Kleinbolting, H. (1991). Probabilistic mental models: A

Brunswikian theory of confidence. Psychological Review, 98, 506-528.

Goldstein, E. B. (1989). Sensation and Perception. Belmont, California: Wadsworth

Publishing Company.

Hammer, J. M. (1999). Human factors of functionality and intelligent avionics. In D. J.

Garland, J. A. Wise, & V. D. Hopkin (Eds.), Handbook of Human Factors in Aviation. (pp.

549-565). Mahwah, NJ: Lawrence Erlbaum.

Hansman, R. J. & Davison, H. J. (2000). The effect of shared information on pilot / controller

and controller / controller interactions. 3rd USA/Europe ATM R&D Seminar, Napoli, Italy.

Johnson, W.W., Battiste, V. & Holland, S. (1999) A cockpit display designed to enable

limited flight deck separation responsibility. Proceedings of the 1999 World Aviation

Conference, Anaheim, CA.

Juslin, P. (1994). The overconfidence phenomenon as a consequence of informal

experimenter-guided selection of times. Organizational Behavior and Human Decision

Processes, 57, 226-246.

Juslin, P., Wennerholm, P., & Olsson, H. (1999). Format dependence in subjective

probability calibration. Journal of Experimental Psychology: Learning Memory and

109

Cognition, 25(4), 1038-1052.

Juslin, P., Winman, A., & Olsson, H. (2003). Calibration, additivity, and source

independence of probability judgments in general knowledge and sensory discrimination

tasks. Organizational Behavior and Human Decision Processes. 92, 34-51.

Kahneman, D., Slovic, P. & Tversky, A. (1982). Judgment under Uncertainty: Heuristics and

Biases. NY: Cambridge University Press.

Kuchar, J. K. (2001). Managing uncertainty in decision-aiding and alerting systems.

Proceedings of the 6th CNS/ATM Conference. Taipei, Taiwan.

Lee, H. & Milgram. P (2008). An Integrated Air Traffic Control Display Concept for Conveying

Temporal and Probabilistic Conflict Information. Proceedings of the Human Factors and

Ergonomics Society (pp. 84-84). Santa Monica, HFES.

Lichtenstein, S., Fischhoff, B., & Phillips, L. D. (1982). Calibration of subjective probabilities:

The state of the art up to 1980. In D. Kahneman, P. Slovic, & A. Tversky (Eds.),

Judgment under Uncertainty: Heuristics and Biases (pp. 306-334). New York:

Cambridge University Press

Lunsford. C. (2009). Feasibility and availability of pairing departures from closely spaced

parallel runways for wake avoidance. Proceedings of the 2009 IEEE/AIAA Integrated

Communications, Navigation and Surveillance Conference (ICNS) conference.

Arlington, VA, May 2009.

McKenzie, C. R. M., Liersch, M. J., & Yaniv, I. (2008). Overconfidence in interval estimates:

What does expertise buy you? Organizational Behavior and Human Decision

Processes, 107, 179-191.

Nisbett, R. E., Krantz, D. H., Jepson, C. & Fong, G. T. (1982). Improving inductive

inference. In D. Kahneman, P. Slovic, & A. Tversky (Eds.). Judgment under

uncertainty: Heuristics and biases. Cambridge: Cambridge University Press

O!Connor, M., & Lawrence, M. (1989). An examination of the accuracy of judgemental

confidence intervals in time series forecasting. Journal of Forecasting, 8(2), 141-155.

110

Paese, P. W., & Sniezek, J. A. (1991). Influences on the appropriateness of confidence in

judgment: Practice, effort, information, and decision making. Organizational Behavior

and Decision Making, 48, 100-130.

Peterson, C. R. & Beach, L. R. (1967). Man as an intuitive statistician. Psychological

Bulletin, 68(1), 29-4.

Pitz, G. F. (1980). The very guide of life: The use of probabilistic information for making

decisions. In T. S. Wallsten (Ed.) Cognitive Processes in Choice and Decision.

Hillsdale, N.J: Erlbaum and Associates.

Rathinam, S., Montoya, J., & Jung, Y. (2008). An optimization model for reducing aircraft

taxi times at the Dallas Fort Worth International Airport. Proceedings of the 26th

International Congress of the Aeronautical Sciences (ICAS). Sept 14-19, 2008,

Anchorage, Alaska.

Ryan, J. & Zanker, J. M. (2001). What determines the perceived speed of dots moving

within apertures? Exp Brain Res. 141, 79-87.

Schaefer, D., Gizdavu, A., & Nicholls, D. (2004). The display of uncertainty information on

the controller working position. Proceedings of the Digital Avionics Systems Conference.

St. Louis, Mo: IEEE.

Schiff, W. & Oldak, R. (1990). Accuracy of judging time to arrival: effects of modality,

trajectory, and gender. J Exp Psychol Hum Percept Perform. 16(2). 303-16.

Schunn, C. D., Kirscehnbaum, S. S., and Trafton, J. G. (2005). The ecology of uncertainty:

Sources, indicators, and strategies for informational uncertainty. Technical Report

(Available at www.au.af.mil/au/awc/awcgate/navy/nrl_uncertainty_taxonomy.pdf)

Sheridan, T. B. (2002). Humans and Automation: System Design and Research Issues.

Santa Monica, CA: HFES / John Wiley& Sons.

Soll, J. B. & Klayman, J. (2004). Overconfidence in Interval Estimates. Journal of

Experimental Psychology: Learning, Memory, and Cognition. 30(2), 29-314.

111

Teigan, K. H., & Halberg, A., & Fostervold, K. I. (2007). More than, less than, or minimum,

maximum: How upper and lower bounds determine subjective interval estimates.

Journal of Behavioral Decision Making, 20, 179-207

Tversky, A. & Kahneman, D. (1982). Judgment under uncertainty: Heuristics and biases.

In D. Kahneman, P. Slovic, & A. Tversky (Eds.). Judgment under uncertainty:

Heuristics and biases. Cambridge: Cambridge University Press

Tversky, A. & Simonson, I. (2000). Context-dependent preferences. In D. Kahneman and

A. Tversky (Eds). Choices, Values, and Frames. Cambridge: Cambridge University

Press

van Doorn B., Bakker, B., & Meckiff, C. (2001). Evaluation of advanced conflict modelling in

the highly interactive problem solver. Proceedings of the 3rd USA/ Europe ATM R&D

Seminar, Napoli.

Xu, A., Wickens, C. D., & Rantan, E. (2004). Imperfect conflict alerting systems for the

cockpit display of traffic information. Technical Report AHFD-04-8/NASA-03-2. Savoy,

Illinois: University of Illinois at Urbana-Champaign, Aviation Human Factors Department.

Wickens, C. D. (2000). Imperfect and unreliable automation and its implications for attention

allocations, information access, and situation awareness. Technical Report ARL-00-

10/NASA-00-02. Savoy, Illinois: ARL, University of Illinois at Urbana-Champaign.

Wickens, C. D., & Hollands, J. G. (2000). Engineering Psychology and Human

Performance. London: Prentice Hall.

Wickens, C. D., Kroft, P., & Yeh, M. (2000). Database overlay in electronic map design:

Testing a computational model. Proceedings of the IEA 2000/HFES 2000 Congress.

Santa Monica, CA: HFES.

Wickens, C. D., & Morphew, E. (1997). Predictive features of a cockpit traffic display: A

workload assessment (Technical Report ARL-97-6/NASA-97-3). Savoy, IL: University

of Illinois, Institute of Aviation, Aviation Research Lab.

112

Williams, J. L., Hooey, B. L., & Foyle, D. C. (2006). 4-D taxi clearances: Pilots' usage of

time- and speed-based formats. Proceedings of the AIAA Modelling and Simulation

Technologies Conference. Paper AIAA-2006-6611.

Yeh, M., Wickens, C. D., & Seagull, F. J. (1999). Target cueing in visual search: The effects

of conformality and display location on the allocation of visual attention). Human Factors,

41, 524-542.

113

APPENDICES

APPENDIX A1: EXPERIMENT 1 PROTOCOL

As we look ahead to the future and the potential implementation of time-based taxi clearances, we acknowledge that the current ground control system is very dynamic and possesses many sources of uncertainty. The purpose of this study is to explore factors that contribute to your perceptions of certainty (or confidence) that a time-based taxi plan that you issue to pilots will be carried out as scheduled. In particular, I am interested in how a number of factors influence your perception of certainty. 1) Distance / Speed Calibration

Before we begin, I will start by providing you with some information to calibrate you to the time required to taxi different routes at two different speeds. You can safely assume that typical taxi speeds range from 16 to 24 knots, with an average speed being 20 knots. This shows two routes running east/west across the airport. One is 6,000! and the other is 12,000!. These are intended to give you a rough idea of how long each taxi route would take to complete at average speeds of 16 kts, 20 kts, and 24 kts. You can see that the distance from one inside runway to the other is about 6,000! and a 12,000! route stretches across most of the airport. Here are the same length routes, this time running north/south on the airport. 2) Scenario Descriptions

In this study, you will see several static pictures of taxi plans in progress. Each picture will show you the taxi plan and an aircraft!s progress in carrying out the plan. Your task will be to assess the information and provide a rating of how certain you are that the plan will succeed (that is, that the aircraft will arrive at the runway threshold within the provided window of time – neither too early, nor too late). Each scenario will contain the following features:

Magenta taxi route: In all trials, you see a map of DFW airport with one taxi route shown in magenta.

Aircraft Icon: The aircraft icon indicates the position of the taxiing aircraft along the route of the aircraft. The aircraft will either be positioned at the beginning of the route or part way through. If it is at the beginning, you can assume that the aircraft has spooled up to a nominal taxi speed.

Data tag: Associated with each aircraft is a data tag with a TIME. This refers to the elapsed time since the pilot started taxiing.

Runway Arrival Time Window: All taxi routes will end at a runway. A runway arrival time window will be provided in the yellow box beside the runway. This time window indicates the acceptable window of time in which the aircraft may arrive at the runway. It DOES NOT include time required to cross the runway.

114

Block 1

You have seen the full range of the factors for this experiment. When you are ready,

we!ll continue with more trials.

Remember to consider all of the factors together and to use the entire scale from 1 (if you have LOW certainty that the plan will succeed) to 7 (if you have HIGH certainty that the plan will succeed). For the remaining trials, use the numbers 1 to 7 on the keyboard to enter your answers. Do you have any questions before we begin? Things to Remember…

1) Please rate how certain you are that the aircraft will arrive at the runway threshold within the cleared window of time using the following scale.

Very Low Certainty Very High Certainty 1 2 3 4 5 6 7

2) Try to use the entire range of the seven-point scale. 3) Runway time windows refer to the arrival time at the runway, NOT the time to cross the runway. 4) The aircraft should arrive at the runway within the crossing window and neither late, nor

early. - Arriving early will require the aircraft to stop – this doubles the time required for an aircraft to cross a runway as compared. - Arriving late will mean the aircraft has missed the crossing window – this will inevitably cause a delay.

5) Assume that it is a clear day at Dallas Forth Worth and there are no traffic conflicts. 6) Assume that each aircraft has navigation and time-management technology that will enable them to arrive at the runway threshold within a 30 second time window about 50% of the time. 7) Assume that an average taxi speed is 20 knots and that the aircraft is a B-757. 8) Remember that every pilot and aircraft is different and any number of factors may influence their ability to comply with the precise time window. For example:

- A pilot may be slow to start taxi or spool up. - A pilot may slow down more for turns than another. - A pilot may be unfamiliar with the airport. - A pilot may have to stop because a passenger is standing up. - There may be equipment problems that need investigating during taxi.

115

APPENDIX A2: EXPERIMENT DEBRIEF INTERVIEW QUESTIONS

1. Please rank the importance of the following six factors in determining your level of certainty (or confidence) that a time-based taxi clearance will be carried out successfully. _____ Route Length _____ Route Complexity _____ Aircraft Position Along Route _____ Duration of Runway Arrival Window 2. What other factors contribute to your certainty that a pilot will arrive at the cleared runway within a specified time window? 3. Please rank these factors relative to the factors in question 1. 4) What strategy did you use for assigning certainty ratings? - At what rating would you cancel a clearance or re-route due to lack of confidence? 5) How did you use the runway threshold window.

- i.e., consider if they would make the middle of the window, the beginning, or the end? 6). What percentage, if any, of your clearances today have some sort of time element in them (i.e. expedite etc.). 7) How do you manage uncertainty in today!s surface environment 8. If you are uncertain that an aircraft will comply with its clearance, how does this impact your task of controlling multiple aircraft at the same time. 9. If you have two aircraft, one that you are certain will comply with its clearance, and one that you are less certain, do you treat monitor them differently? If so how? 10. How will time-based clearances change the way you monitor airport traffic? 11. Do you have any suggestions for managing uncertainty in future time-based environments? - changes to communication, information available, technology?

116

APPENDIX B1: EXPERIMENT 2 PROTOCOL

Welcome and thank you for agreeing to participate in this study. The purpose of this study, which will also form a portion of a PhD research project at the University of Toronto, is to understand how people estimate Time-of-Arrival Uncertainty. Please take a few minutes and read this and the attached informed consent form. It describes the study you are about to participate in, and explains your rights as a subject. If you have any questions at any time, please ask the experimenter. Overview

In this study, I am interested in how you estimate time-of-arrival uncertainty. To start with, I would like you to consider the following scenarios: Scenario A: You travel 10 miles on the freeway to get to work every day for 100 days. Since you work the night shift you don!t have much traffic to worry about, and you always try to stick to the speed limit. Sometimes you go slightly slower or faster when you encounter other traffic along the way, but usually only by 1 or 2 miles per hour. On average, across the 100 days, it takes you 10 minutes to get to work, but of course it is not always exactly 10 minutes because your speed varies slightly, depending on traffic. Scenario B: You travel 10 miles on city streets to get to work every day for 100 days. Since you are always running late, you like to travel as fast as you can, but of course always stopping for red lights. On average, across the 100 days, it takes you 20 minutes to get to work, but sometimes you can make it much faster if you get green lights all the way, and sometimes you take much longer if you get a lot of red lights. Now, imagine your boss is trying to create a very tightly coordinated schedule, and he asks you to predict what time you would be at work. For which scenario would it be easier to estimate a time-of-arrival for your boss? Scenario A might be pretty easy because your time of arrival would be close to the same time every day. Similarly, Scenario B might be more difficult, because your time of arrival will depend on how many red lights you encounter. Instead then, your boss asks you to provide a window of time within which you are very sure (i.e. 95% sure) that you can make it. Of course you!d want to make it as small a window as possible so as to be useful for your boss – but large enough so that you are never considered late. Consider the two scenarios. Would you assign a larger window for Scenario A or Scenario B? Probably Scenario B, because you!ll never know exactly how many traffic lights you might encounter along the way. The Experimental Task

Throughout the study, I am going to show you a screen with 20 "aircraft!, which all move with the same average velocity, and their velocities fluctuate about the same amount across the route. While the aircraft are traversing their routes, you will be able to see a digital read-out of the velocity of each aircraft. As the trial progresses, you will note that, even though the aircraft are all travelling at about the same speed, they each arrive at different times. Sometimes the arrival times will be close together, sometimes they will be more spread out. An elapsed time counter is available at the bottom of the screen.

117

Once all 20 aircraft have completed their routes, a query screen will appear. It will show another aircraft that is about to begin traversing a route. The screen will present the following question: This aircraft!s speed profile is similar in nature to the aircraft you just observed and it

is about to traverse the same length route.

a) Please estimate this aircraft!s time of arrival.

b) Please provide the smallest TOA window that you are 95% sure will contain the

aircraft!s actual Time of Arrival

When you answer these questions, please consider the aircraft that you just observed and remember that you can assume that this aircraft is representative of the aircraft you observed. Also, you should know that there are no 'right' or 'wrong' answers – we are merely interested in your estimate. The first question “Please estimate this aircraft!s time of arrival” is asking you to make your best guess as to when the aircraft will arrive at its destination. For example, if you were planning to meet the aircraft at the destination point, what time would you expect it to arrive? Please provide your answers in seconds, with accuracy to one decimal point. For example type a number like 60.5 in the box on the screen. Press Enter when you are satisfied with your answer. You will not be able to change it once you have pressed enter. For the second question “Please provide the smallest TOA window that you are 95%

certain will contain the aircraft!s actual TOA,” recall that, even though all aircraft were trying to arrive at the destination at the same time, there was some variability around the arrival time. The window you select should represent the smallest possible window of time that you are 95% sure will contain this aircraft!s actual time of arrival. Use the mouse to move the left slider to represent the lower bound of the time window, and the right slider to represent the upper bound of the time window. Again, please provide answers, in seconds, with precision to one decimal point. The actual numbers are shown above the slider scale. Today!s agenda: We will repeat the sequence described above several times throughout the experiment today. The experiment is divided into four blocks, and you can take a short rest break between each block. If you need to stop for any reason before the break, please let me know, but I will not be able to answer any questions once the experiment has started. Whom to contact for more information:

This study is being conducted jointly by NASA Ames Research Center and the University of Toronto. You may seek further information from the following points of contact:

• Becky Hooey, Principal Investigator (650-604-2399) NASA Ames Research Center, Human System Integration Division University of Toronto, Dept. of Mechanical and Industrial Engineering

• David C. Foyle, Co-Investigator (650-604-3053) NASA Ames Research Center, Human System Integration Division,

• Paul Milgram, Co-Investigator (416-978-3662) University of Toronto, Dept. of Mechanical and Industrial Engineering,

If you have read and understood the above information, please sign the attached informed consent form.

118

APPENDIX B2: EXPERIMENT 2 DEMOGRAPHIC QUESTIONNAIRE

1. Name: ________________________________________________________

2. Age: _________________________

3. Gender: Male _________ Female ______________

4. Do you normally wear glasses or contact lenses for any of the following activities? (please

check all that apply)

5. Reading _____ Watching TV _____ Working on computer _____ Driving _____

Other tasks, please specify _____________________________________________

6. Are you wearing either glasses or contact lenses today? No _____ Yes _____

7. Are you currently employed? No _____ Yes _____

If yes, are you employed part-time _____, full-time _____, contract/casual ______

8. What is your job title: __________________________________________________

9. Are you currently a student? No _____ Yes _____

If yes, please state program, institution, and your START date (i.e. A.A / De Anza /

2005)

10. What is the highest level of education that you have completed?

_____ High School / GED

_____Community College (2 year degree) Please specify degree: ___AA or ___AS

_____University (4 year degree) Please specify Degree / University (i.e. B.A. Psych / SJSU)

_____ Graduate School. Please specify Degree / University (i.e., Ph.D. Chemistry /

Stanford)

______Other Education / Diploma / Certificate program (please specify)

11. Do you have (or have you ever had) a Pilot!s license? No _____ Yes _____

If yes, Please specify type (e.g., private / commercial; fixed-wing, rotorcraft)

12. Have you ever taken a course in statistics? No _____ Yes _____

If yes, how many statistics courses have you taken? ____________________

13. Please list the Department / Institution (i.e. Psychology / SJSU) for each course.

119

APPENDIX B3: EXPERIMENT 2 DEBRIEF QUESTIONS

1) Did you use the scratch paper. If so – what did you write down?

2) What strategy did you use to assess the mean velocity?

3) What strategy did you use to assess the velocity window?

4) Were your velocity windows symmetrical or asymmetrical? How did you decide?

5) What strategy did you use to assess TOA

6) What strategy did you use to assess TOA window?

7) Were your TOA windows symmetrical or asymmetrical? How did you decide?

8) Did your strategy change over the course of the experiment?

9) Do you think your windows got smaller with time?

10) Did you find estimating velocity or TOA easier?

120

APPENDIX B4: METHOD FOR DETERMINING ACTUAL SPEED AND TOA SCORES

Actual Speed Scores

To determine the Actual Mean Speed and the Actual Speed Interval, the following process was used to develop a population distribution comprised of 25,000 aircraft, and calculate the mean and 95% CI. The following steps were conducted 9 times, to create actual values for each of 9 combinations of Speed and Variability.

• Randomly drew a series of 10 !segment speeds" for 25,000 aircraft (one speed for each of 10 segments along the route). The segment speeds were drawn from a normal distribution, with a given mean and standard deviation that mapped to the experimental IV conditions.

• Calculated the mean !route speed" for each of the 25,000 aircraft (by averaging each aircraft"s 10 segment speeds)

• Sorted the list of 25,000 !route speeds" in ascending order

Actual Speed – Calculated the mean !route speed" by averaging the mean route speed of the 25,000 aircraft. Actual Speed Interval - Selected the speed from the sorted list of route speeds that represented the 2.5 and the 97.5 percentile. Note, this is equivalent to calculating the 95% CIs using the following formula: CI = M +/- 1.96 * sd / sqrt n Where M = the mean of the normal distribution from which the 250,000 velocities were selected, sd = standard deviation of that normal distribution, and n = 10. Actual TOA Scores

For TOA, the CIs could not be calculated using the CI formula above given that the TOA distribution was non-normal as discussed previously. The following steps were conducted:

• A segment elapsed time was calculated for each of the 10 segment speeds for each of the 25,000 aircraft (as determined above).

• The 10 segment elapsed times of each aircraft were summed to determine the elapsed time for each of the 25,000 aircraft.

• The 25,000 elapsed times were sorted in ascending order

Actual TOA - Calculated the mean of the 25,000 elapsed times

Actual TOA Interval - Selected the elapsed time from the sorted list of times that represented the 2.5 and the 97.5 percentile. Since the TOA distribution is not normally distributed (it is positively skewed), the equation for 95 % CI reported above for Actual Speed Interval was deemed to be inappropriate. Instead, the actual intervals were determined by selecting the 2.5th percentile and the 97.5th percentile from the distribution of route times.

121

APPENDIX C1: EXPERIMENT 3 PROTOCOL

Welcome and thank you for agreeing to participate in this study. The purpose of this study, which will also form a portion of a PhD research project at the University of Toronto, is to understand how people estimate Time-of-Arrival Uncertainty. Please take a few minutes and read this and the attached informed consent form. It describes the study you are about to participate in, and explains your rights as a subject. If you have any questions at any time, please ask the experimenter. Overview

In this study, I am interested in how you estimate time-of-arrival uncertainty. To start with, I!d like you to consider the following scenarios: Scenario A: You just got a new job working 6:00 p.m. to midnight at the community hospital which is 30 miles away on a six-lane freeway that is pretty straight, but hilly. Since you work the night shift you travel against the commuting traffic, so you don!t have much traffic to worry about, and you always try to stick to the speed limit. You set your cruise control for 65 mph and, although you maintain a pretty constant speed, you notice that you go a few mph slower up the hills and a few mph faster down the hills and occasionally you have to turn off cruise control to pass a car on the road. On average, across 100 days, it takes you about 30 minutes to get to work, but of course it is not always exactly 30 minutes since your speed varies slightly because of the hills and traffic. Scenario B: Instead of travelling the freeway, you decide to take the rural road that has a slower speed limit of 30 mph, but is only 20 miles. Since there are a lot of curves you don!t use cruise control. On this road, you encounter a lot of traffic, particularly at 6:00 when most people are returning home from their day jobs, and this means you have to slow down and speed up frequently. On average, across 100 days, it takes you 25 minutes to get to work, but sometimes you can make it much faster if traffic is light, and sometimes it takes you much longer if traffic is heavy. Now, imagine your boss is trying to create a very tightly coordinated schedule, and he asks you to predict what time you would be at work. For which scenario would it be easier to estimate a time-of-arrival for your boss? Scenario A might be pretty easy because your time of arrival would be close to the same time every day. Similarly, Scenario B might be more difficult, because your time of arrival will depend on how much traffic you encounter. Instead then, your boss asks you to provide a window of time within which you are very sure (i.e. 95% sure) that you can make it. That is, if you drove the route 100 times, you would arrive within the time window 95 times. Of course you!d want to make it as small a window as possible so as to be useful for your boss. Consider the two scenarios. Would you assign a larger window for Scenario A or Scenario B? Probably Scenario B, because you!ll never know exactly how much traffic you might encounter along the way. Today!s agenda:

The experiment is divided into three parts, with a short rest break between each block. PART 1: LEARNING SPEED PROFILES

First, you will be shown 20 "vehicles!, each traversing a route that is .5 miles long, which all move with about the same average velocity, and their velocities fluctuate about the same

122

amount across the route. Each vehicle!s speed will change 10 times as it traverses the route (e.g., it will slow down and speed up as if it were encountering varying headwinds). The 20 vehicles in each trial will share the same speed profile – that is, their average velocity is about the same, and their velocities fluctuate about the same amount across the route. There will be four different profiles and they will be identified by colour (Red, Yellow, Blue and Green). Every vehicle with the same colour icon will always share the same speed profile. Your task is to "learn! each of those 4 speed profiles and provide estimates of the average speed and the amount of variability among the speeds. As the trial progresses, you will note that, even though the vehicles are all travelling at about the same speed, each arrives at the destination at a different time. Sometimes the arrival times will be close together, sometimes they will be more spread out. Pay attention to this distribution of arrival times, as it will be important later in the experiment.

PART 2: TIME OF ARRIVAL (TOA) ESTIMATION

In the second part of the experiment, you will be shown only one vehicle at a time. Its speed profile will always be one of the four you saw in Part 1, and will be identified with the same colour code. You will watch the vehicle as it traverses the route to remind you what the speed profile looked like. Part way through the route, the vehicle!s movement will be paused. A query screen will appear that provides the estimated time of arrival (in elapsed seconds) for this vehicle to complete the route, and asks you to estimate the smallest TOA window that you are 95% certain will contain the vehicle!s actual TOA. When answering, you should consider that the vehicle shares the same profile as all of the vehicles with the same colour icon from the first part of the study, and use your knowledge of that speed profile in generating estimates. PART 3: SPEED PROFILE IDENTIFICATION

In the third part of the experiment, you will be shown one vehicle as it traverses the route. Its speed profile colour code will NOT be identified. Rather you will be asked to identify the speed profile from those you observed in PART 1 and 2 of the study, and then estimate the smallest TOA window that you are 95% certain will contain the vehicle!s actual TOA. This study is being conducted jointly by NASA Ames Research Center and the University of Toronto. You may seek further information from the following points of contact for each institution:

• Becky Hooey, Principal Investigator NASA Ames Research Center, Human System Integration Division University of Toronto, Dept. of Mechanical and Industrial Engineering 650-604-2399

• David C. Foyle, Co-Investigator NASA Ames Research Center, Human System Integration Division, 650-604-3053

• Paul Milgram, Co-Investigator University of Toronto, Dept. of Mechanical and Industrial Engineering, 416-978-3662

Detailed instructions will be provided before beginning each part of the experiment. If you have read and understood the above information, please sign the attached informed consent form and tell the experimenter when you are ready to continue.

123

PART I INSTRUCTIONS

You will be shown a screen with 20 !vehicles", each traversing a route that is .5 miles long

(as shown in the picture below), which all share the same speed profile – that is, they move

with about the same average velocity, and their velocities fluctuate about the same amount

across the route. Each vehicle"s speed will change 10 times as it traverses the route (e.g., it

will slow down and speed up as if it were encountering varying headwinds). As the trial

progresses, you will note that, even though the vehicles share the same speed profile, they

each arrive at the destination at different times. Sometimes the arrival times will be close

together, sometimes they will be more spread out. Pay attention to this distribution of arrival

times, as it will be important later in the experiment.

Once all 20 vehicles have completed their routes, a query screen will appear (as shown

below). You will be asked to enter the identification number of the vehicle that finished the

route first, and the vehicle that finished last. You must enter a vehicle number - even if you

aren"t sure, please provide your best guess. Next, you will be shown a new vehicle that is

about to begin traversing a route and will ask you to predict this vehicle"s average velocity

and then to provide the smallest range that you are 95% sure will contain this vehicle"s

124

actual average velocity. When you answer these questions, you can assume that this new

vehicle shares the same speed profile of those vehicles you just observed.

The first question “Please estimate this vehicle!s average velocity” is asking you to make

your best guess as to the average speed that this new vehicle will travel based on your

knowledge of the speed profile that it belongs to. Please provide your answers by typing a

number in the box on the screen – you can enter a number with precision up to one decimal

place (i.e. 52.6). Press “Enter” on the keyboard when you are satisfied with your answer.

For the second question, “Please provide the smallest window that you are 95% certain will

contain the vehicle!s average velocity”, use the mouse to move the left slider to represent

the lower bound of the window, and the right slider to represent the upper bound of the

window. The actual numbers of the window that you have selected are shown below the

slider scale. The window you select should represent the smallest possible range of

velocities that you are 95% sure will contain this vehicle!s average velocity. When you are

satisfied with the window you have selected, click “Continue” with the mouse. Once you

press “Continue” you will receive feedback as to whether each of your answers was correct,

too high, or too low. Answers within 1.0 of the actual answer will be considered correct.

Please note this feedback, and consider it in your subsequent answers. You should strive to

provide the most accurate answers that you can.

125

There will be four different speed profiles – each with a different colour code: Blue, Green,

Yellow, and Red. You will complete five trials of each speed profile. After the fifth trial, you

will be asked to pause and answer a short questionnaire to characterize the profile you just

observed. You will be asked the following three questions for each profile:

1. Based on the average speed, are the vehicles with this profile more likely travelling on a six-lane freeway (i.e. FAST) or a two-lane rural road? (i.e., SLOW) (please circle one) 1 2 3 4 5 6 7

Definitely Definitely two-lane unsure six-lane rural road freeway (i.e., Slow) (i.e., Fast)

2. Based on the speed fluctuations, are the vehicles with this profile more likely traveling with cruise control ON or OFF? (please circle one) 1 2 3 4 5 6 7

Definitely Definitely cruise control unsure cruise control ON OFF 3. Please characterize the time of arrival of vehicles that share this speed profile. Use this to make any notes about the time of arrival that will help you remember later on. Ask the experimenter if you have any questions. We will begin with 4 practice trials, so that you can see one of each of the four speed profiles. Then the experiment will begin.

126

PART 2 INSTRUCTIONS: TIME OF ARRIVAL ESTIMATION

In the second part of the experiment, you will be shown only one vehicle at a time. Its speed

profile will always be one of the four you saw in Part 1, and will be identified with the same

color code (Blue, Green, Red, or Yellow). You will watch the vehicle as it traverses a route

to remind you what the speed profile looked like. Then a new route will be displayed along

with an estimated time of arrival (in elapsed seconds) specific for the distance of route

shown. This estimated time of arrival is based on the average speed, but as you saw in part

1, the actual TOAs will vary. You will be asked to estimate the smallest window that you are

95% certain will contain the vehicle!s actual TOA”. When answering this question, you

should consider that the vehicle shares the same profile as all of the vehicles with

the same color icon from the first part of the study and use your knowledge of the

speed profile to answer the questions. Recall from Part 1, that even though all vehicles

were trying to arrive at the destination at the same time, there was some variability around

the arrival time. The window you select should represent the smallest possible window of

time that you are 95% sure will contain this vehicle!s actual time of arrival -- that is, that

would capture 95 of 100 vehicles. Use the mouse to move the left slider to represent the

lower bound of the time window, and the right slider to represent the upper bound of the time

window. Again, please provide answers, in seconds, with precision to one decimal point.

The actual numbers of the window you select will be shown below the slider scale. You will

not receive any feedback in this portion of the study.

We!ll start with two trials to let you practice, and then we!ll continue with the experiment.

Please ask the experimenter if you have any questions before commencing.

127

PART 3 INSTRUCTIONS: SPEED PROFILE IDENTIFICATION

This task will be identical to Part 2, except, this time the vehicle!s speed profile will not be

identified for you – that is, the vehicle!s color code will not be displayed (all vehicles will be

black). Some vehicles will match one of the speed profiles you experienced in the earlier

portions of the study, but some will not. Your first task will be to determine if it matches one

of the color-coded speed profiles, and if so, which one. Then you will continue to provide the

smallest possible window of time that you are 95% sure will contain this vehicle!s actual time

of arrival. Again, you should use your knowledge of the speed profiles that you learned in

Phase 1 to answer this question.

For each trial, you will be asked to answer the following questions:

1. To which profile is this vehicle most similar?

_____ Red _____ Yellow _____ Green _____ Blue

2. Please rate the degree of similarity between the vehicle you just observed and the profile

you selected above:

Very Low Low Moderate High Very High

1 2 3 4 5

Please ask the experimenter if you have any questions before commencing.

128

APPENDIX C2: EXPERIMENT 3 DEMOGRAPHIC QUESTIONNAIRE

1. Name: ________________________________________________________ 2. Age: _________________________ 3. Gender: Male _________ Female ______________

4. Do you normally wear glasses or contact lenses for any of the following activities?

(please check all that apply) Reading _____ Watching TV _____ Working on the computer _____ Driving _____ Other tasks, please specify _____________________________________________

5. Are you wearing either glasses or contact lenses today? No _____ Yes _____ 6. Are you currently employed? No _____ Yes _____ If yes, are you employed part-time _____, full-time _____, contract or casual hours

______ What is your job title:

________________________________________________________ 7. Are you currently a student? No _____ Yes _____ If yes, please state program, institution, and your START date (e.g. A.A/De Anza/Sept,

2005) ____________________________________________________________________________

8. What is the highest level of education that you have completed? _____ High School / GED _____Community College (2 year degree) Please specify degree: _______AA or

_______AS _____University (4 year degree). Please specify Degree / University (e.g. B.A. Sociology /

SJSU) ________________________________________________________________ ______Graduate School. Please specify Degree / University (i.e Ph.D. Chemistry / Stanford) ________________________________________________________________ ______Other Education / Diploma / Certificate program (please Specify) ________________________________________________________________ 9. Do you have (or have you ever had) a Pilot!s license? No _____ Yes _____

If yes, Please specify type (e.g., private / commercial; fixed-wing, rotorcraft) ___________________________________________________________________

10. Have you ever taken a course in statistics? No _____ Yes _____ If yes, how many statistics courses have you taken? ____________________ Please list the Department / Institution (e.g. Psychology / SJSU) for each course. ____________________________________________________________

129

APPENDIX C3: EXPERIMENT 3 DEBRIEF QUESTIONS

Consider a new speed profile with an average speed of 45 mph. If the vehicles traveled a

route that was 4 miles long at exactly 45 mph, they would arrive in 320 seconds (5 minutes

and 20 seconds). Since their speeds fluctuate due to traffic, 95% of the vehicles arrive

within an 80 second window around the expected TOA.

a) If the vehicles were then to travel a route that was only 2 miles long, the expected TOA

would be 160 seconds (2 minutes and 40 seconds). How big would the TOA window

have to be to include 95% of the vehicles! arrival times?

b) If the vehicles travel an 8 mile route, the expected TOA would be 640 seconds (10

minutes and 40 seconds). How big would the TOA window have to be to include 95% of

the vehicles! arrival times?

50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230

Expected TOA Expected TOA = 160

Seconds TOA Window =

Expected TOA Expected TOA = 320

Seconds TOA Window =

210 220 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390

530 540 550 560 570 580 590 600 610 620 630 640 650 660 670 680 690 700 710

Expected TOA Expected TOA = 640 Seconds TOA Window = _____ seconds

130

1. With respect to the second block of trials – What percent of vehicle TOAs do you

think you captured within your TOA windows?

2. What strategy did you use to assess the TOA windows?

3. Were your TOA windows symmetrical or asymmetrical? How did you decide?

4. Did your strategy change over the course of the experiment? / Do you think your

TOA windows got smaller with time?

5. How did you factor in the distance of the route?

6. Did you factor in knowledge of the speed profile, i.e. other aircraft with the same

color – or just the one aircraft shown in each trial?

7. With respect to the last block of trials – when a vehicle did not match one of the

previously learned speed profiles – how did you determine the TOA windows?

Documents

The Development and Evaluation of a Model of Time-of ... · scientific discussions about time of arrival uncertainty, experimental methods, and data analyses; facilitating the development