Upload
neil-logan
View
223
Download
2
Tags:
Embed Size (px)
Citation preview
Cascaded inference
A chain of probabilistic inferences ‘Inference upon inference’
Pervasive in many domains Legal Medical Criminal Advertising Everyday
Cascaded inference
Horse race example
WEATHER
(Sun, rain, drizzle, frost…)
TRACK CONDITION
(Heavy, soft, good, firm…)
WINNER
(‘Waltzing Along’, ‘Persian Weaver’, ‘Ride the Storm’…)
Cascaded inference
Legal example
TESTIMONY
Witness testifies that accused was at crime scene
EVENT
Accused was at crime scene
CLAIM
Accused is guilty of crime
Cascaded inference
How do people do it? Very little recent research
Most done in 1971-73 Using abstract problems No comprehensive normative model
Current psychological models also seem inadequate
Normative model for cascaded inference?
Modified Bayes theorem (Dodson), Jeffrey’s rule, Chain rule
Weighted sum across all possible paths
A B C
Given evidence A:
P(C|A) = P(C|B).P(B|A) + P(C|~B).P(~B|A)
A
~B
C
B
~C
E.g., A = heavy rain; B = muddy track; C = ‘Ride the Storm’ wins
P(C|B)
P(C|~B)P(B|A)
P(~B|A)
P(~C|~B)
P(~C|B)
Simplifying heuristics
Various studies claim that people use simplifying strategies (as-if, best guess) Gettys et al., 1973; Schum et al., 1973; Steiger &
Gettys, 1972
Treat inference from first stage as-if it is true Focus on most probable path, ignore alternative
less probable paths Such strategies lead to overestimation of final
probability judgments
As-if or Best guess model
A
~B
C
B
~C
.7
.3
.7
.3
.1
.9
Select the most probable outcome at first stage of inference (from A to B)
E.g., A = heavy rain; B = muddy track; C = ‘Ride the Storm’ wins
As-if or Best guess model Select the most probable outcome at first stage of
inference (from A to B) Assume this is true for the second stage inference
(from B to C) Judged probability of C given A = .7 This overestimates the correct value = .52
A
~B
C
B
~C
.7
E.g., A = heavy rain; B = muddy track; C = ‘Ride the Storm’ wins
Current study
Looking at both abstract and applied settings Some studies claim that overestimation only occurs in
abstract settings
Vary presentation of probability information to see if this improves inferences Frequency tables Pie charts Network diagrams?
Shortcomings with previous research
Neglects structural assumptions Causal models Conditional independencies etc
These determine appropriate normative model, and likely to influence people’s reasoning
Current psychological models of probabilistic reasoning do not take these factors into account Belief activation (neural networks) Belief updating (rule-based) Heuristic models
Influence of causal model
How are cascaded inferences affected by assumptions about causal structure?
Different causal models can imply different conditional independencies
A B C
A and C are (unconditionally) dependent
BUT A and C are conditionally independent given B
A B C
A and C are (unconditionally) independent
BUT A and C are conditionally dependent given B
Causal models and cascaded inference
Measles Spots Itchy
Evidence = Measles
Infer that spots are probable
Infer that itchiness is probable
Measles Spots Chicken pox
Evidence = Measles
Infer that spots are probable
Do not infer that Chicken pox is probable
Evidence does not alter P(Chicken pox)
Causal structure influences permissible inferences
Causal models and cascaded inference
Schum – ‘conditional non-independence’
A B C
E.g., A = Weather, B = Track condition, C = Winner
Some horses may run better/worse in the rain, independent of the track condition
Future research
Construct cognitive model of cascaded inference that allows for causal and structural assumptions
How well does this conform to normative models? Various studies suggest that people are poor at probability
estimation/computation, but are quite good at qualitative causal reasoning
Decision aids To support inference when there are many variables,
complex computations etc. To correct for any systematic biases Better presentation of probability information E.g., graphical representations …
Interdisciplinary aid
What are the appropriate normative models (statistics)?
What are the plausible computational algorithms?
Are there economical heuristic procedures? What are the cognitive mechanisms that people
use (psychology, neuroscience)? Are there naturally occurring inference problems
of this kind (epidemiology, forensic, legal, history)?
Some earlier studies on hierarchical inference
Cascaded inferenceHierarchically structured informationLearning and judgment
Learning and judgment with category hierarchies
Hierarchical structure Pervasive feature of how we represent the world Organizes knowledge and structures inference
FLU
Type BType A Type C
Inference using a hierarchy
One powerful feature of a category hierarchy is that given information about categories at one level, you can make inferences about categories at another level.
This allows you to exclude alternatives, or reduce the
number you need to consider
Tabloid Broadsheet
TimesGuardianMirrorSun
Probabilistic inference using a hierarchy
In many real-world situations we must base our initial category judgments on imperfect cues, degraded stimuli, or statistical data.
What effect do such probabilistic contexts have on the hierarchical inferences that we are licensed to make?
Tabloid Broadsheet
TimesGuardianMirrorSun
Commitment heuristic
Reason ‘as-if’ probable info is true (but reduce overall confidence)
Commitment heuristic - When people select the most probable category at the superordinate level, they assume that it contains the most probable subordinate category (and vice-versa)
This leads to the neglect of subordinates from the less probable superordinate
Tabloid Broadsheet
TimesGuardianMirrorSun
How effective is such a heuristic?
Depends on the structure of the environment In certain environments it is advantageous
increases inferential power by focus on appropriate subcategories
reduces computational demands by avoiding complex Bayesian calculations
But in some environments it leads to anomalous judgments and choices
Non-aligned hierarchy
The most frequently read type of paper is a Tabloid, but the most frequently read paper is the Guardian (a Broadsheet).
Non-aligned hierarchy: the most probable superordinate category does not contain the most probable subordinate
category.
Tabloid 60 Broadsheet 40
Times 5 Guardian 35Mirror 30Sun 30
Cascaded inference
Inference from hierarchy to a related category Different levels can support opposing predictions
Tabloid 60 Broadsheet 40
Times 5 Guardian 35Mirror 30Sun 30
Party A Party B
Experiments 1 & 2
What is effect of manipulating level of representation on subsequent probability judgment?
Learning phase - participants exposed to a non-aligned hierarchical environment in which they learn to predict voting behavior from newspaper readership.
100 trials ‘reading/voting profiles’
Screen during learning phase
Broadsheet
Sun
Tabloid
Mirror Guardian Times
○ Party A
○ Party B
Reading profile for J. K.
Screen during learning phase
Broadsheet
Sun
Tabloid
Mirror Guardian Times
○ Party A
○ Party B
Reading profile for J. K.
Outcome feedback
Structure of environment
Tabloid 60 Broadsheet 40
Times
5
Guardian
35
Mirror
30
Sun
30
Party A Party B
50 50
NB Overall each party equally frequent
Judgment phase
Which paper is X most likely to read?
X is selected at random
What is the probability that X votes for Party B rather than A?
Which kind of paper is X most likely to read?
Baseline
General
Specific
Predictions
Which paper is X most likely to read?
X is selected at random
What is the probability that X votes for Party B rather than A?
Which kind of paper is X most likely to read?
Baseline
General
Specific
General choice Tabloid -> Party A
Specific choice Guardian -> Party B
Based on commitment heuristic
Results
Which paper is X most likely to read?
X is selected at random
What is the probability that X votes for Party B rather than A?
Which kind of paper is X most likely to read?
Baseline
General
Specific
General choice Tabloid -> Party A
Specific choice Guardian -> Party B
47%
28%
75%
Mean probability rating
Summary
People allow their initial categorization to shift their inferences, even though all judgments are based on the same statistical data.
Simplifying heuristic that assumes that environment is aligned
Empowers inference when hierarchical structure is aligned, otherwise can lead to error
Suggests tendency to reason as if a probable conclusion is true
Applied to Medical choices
In medical settings treatment options (and survival rates) often grouped to facilitate understanding and communication
Can this lead to errors?
SURGERY DRUGS
Type2
Type1
Type2
Type1
Medical choices
Suppose success rates are non-aligned Will grouping affect people’s treatment choices?
SURGERY 60%
DRUGS 40%
Type2
10%
Type1
70%
Type2
60%
Type1
60%
Medical choices experiment
Participants learn about success rates trial-by-trial
SURGERY 60%
DRUGS 40%
Type2
10%
Type1
70%
Type2
60%
Type1
60%
Medical choices experiment
Ungrouped control – 75% chose most effective treatment
Drug4
10%
Drug3
70%
Drug2
60%
Drug1
60%
Medical choices experiment
Grouped condition – 25% chose most effective treatment (first asked about best superordinate treatment)
SURGERY 60%
DRUGS 40%
Type2
10%
Type1
70%
Type2
60%
Type1
60%
Summary
Grouping with a non-aligned hierarchy can lead to poorer choices
How generalizable is this? Does it depend on memory processes? Will it apply to summary presentations?
Consider situations where people must use large databases with various levels of hierarchy (e.g., NHS statistics)
Useful biases?
Systematic biases reveal something about judgment and learning processes
Need not be indictment of human reasoning Heuristic strategies might be well adapted to the
inferential tasks that we commonly confront