10
Ecological Momentary Assessment to Evaluate Cognitive-Behavioral Treatment for Binge Eating Disorder Simone Munsch, PhD 1 * Andrea H. Meyer, PhD 1,2 Natasa Milenkovic, MSc 1 Barbara Schlup, MSc 1 Juergen Margraf, PhD 1 Frank H. Wilhelm, PhD 1 ABSTRACT Objective: Cognitive-behavioral treat- ment (CBT) for binge eating disorder (BED) is traditionally evaluated using clinical interviews and questionnaires. These retrospective assessment methods are discussed to be problematic due to memory recall error. Ecological momen- tary assessment (EMA) might be promis- ing for gathering ecologically valid and reliable data. Method: We assessed the feasibility of and reactivity to EMA and compared the treatment efficacy measured by tradi- tional vs. EMA-based instruments in 28 BED individuals participating in short- term CBT. Results: Patients were highly compliant and we found no reactivity to EMA. Esti- mated treatment effects for binge eating based on EMA were comparable to ques- tionnaire-based methods. The overall con- cordance between methods was moderate. Discussion: Results suggest that binge eating over 1 week can be equally accu- rately assessed by EMA or by self-report questionnaires in BED treatment trials. EMA contributes to a detailed knowledge of binge eating in daily live and helps to advance treatment options. V V C 2009 by Wiley Periodicals, Inc. Keywords: ecological momentary assess- ment; binge eating disorder; treatment efficacy; ambulatory assessment; com- puterized assessment (Int J Eat Disord 2009; 00:000–000) Introduction Binge eating disorder (BED) is a common problem within the obese population with up to 30% suffer- ing from this eating disorder. In community-based studies, prevalence rates range from 0.7 to 3.3%. 1–3 BED imposes significant psychosocial burden and is often associated with impaired quality of life, obesity, and increased morbidity and mortality. 4 According to the guidelines of the American Psy- chiatric Association and the National Institute of Clinical Excellence for the treatment of eating dis- orders, cognitive-behavioral treatment (CBT) is the best-established treatment for BED. 5–7 The primary source of data in clinical trials eval- uating treatment efficacy in BED are clinical ques- tionnaires and standardized clinical interviews, both relying on retrospective memory recall. These traditional data recording methods are known to have several shortcomings including the forgetting of information when the recall period is long and/ or memory retrieval occurs in a different setting from memory storage (e.g., recall of binge eating episodes during the past month in a therapist room, see for e.g. 8 ). Further, the most salient and most recent events at the time of recall will influ- ence the ratings of behavior or mood from the pe- riod that is being recalled. Most importantly, recent studies indicate that retrospective recall favors mental representations fitting into the self-concept, social expectations, and ideals of individuals rather than representing actual experiences or the target behavior examined. 9 In contrast, ambulatory psychological assess- ment such as the experience sampling method, 10 later also called ecological momentary assessment (EMA) 11,12 allows getting closer to the real-life sit- uations under study, both in space and time, thereby reducing retrospective recall problems. EMA has been applied to self-assessment of behavior and psychological state in a range of Parts of the manuscript have been presented at the World Congress of Psychology, Berlin, July 2008. *Correspondence to: Simone Munsch, PhD, Faculty of Psychol- ogy, University of Basel, Missionsstrasse 60/62a, 4055 Basel, Switzerland. E-mail: [email protected] Accepted 20 December 2008 1 Faculty of Psychology, Department of Clinical Psychology and Psychotherapy, University of Basel, Basel, Switzerland 2 Faculty of Psychology, Department of Clinical Psychology and Psychotherapy, Division of Applied Statistics in Life Sciences, University of Basel, Basel, Switzerland Published online in Wiley InterScience (www.interscience.wiley.com). DOI: 10.1002/eat.20657 V V C 2009 Wiley Periodicals, Inc. International Journal of Eating Disorders 00:0 000–000 2009 1 REGULAR ARTICLE

Ecological momentary assessment to evaluate cognitive-behavioral treatment for binge eating disorder

Embed Size (px)

Citation preview

Ecological Momentary Assessment to EvaluateCognitive-Behavioral Treatment for Binge

Eating Disorder

Simone Munsch, PhD1*Andrea H. Meyer, PhD1,2

Natasa Milenkovic, MSc1

Barbara Schlup, MSc1

Juergen Margraf, PhD1

Frank H. Wilhelm, PhD1

ABSTRACTObjective: Cognitive-behavioral treat-ment (CBT) for binge eating disorder(BED) is traditionally evaluated usingclinical interviews and questionnaires.These retrospective assessment methodsare discussed to be problematic due tomemory recall error. Ecological momen-tary assessment (EMA) might be promis-ing for gathering ecologically valid andreliable data.

Method: We assessed the feasibility ofand reactivity to EMA and compared thetreatment efficacy measured by tradi-tional vs. EMA-based instruments in 28BED individuals participating in short-term CBT.

Results: Patients were highly compliantand we found no reactivity to EMA. Esti-

mated treatment effects for binge eatingbased on EMA were comparable to ques-tionnaire-based methods. The overall con-cordance between methods was moderate.

Discussion: Results suggest that bingeeating over 1 week can be equally accu-rately assessed by EMA or by self-reportquestionnaires in BED treatment trials.EMA contributes to a detailed knowledgeof binge eating in daily live and helps toadvance treatment options. VVC 2009 byWiley Periodicals, Inc.

Keywords: ecological momentary assess-ment; binge eating disorder; treatmentefficacy; ambulatory assessment; com-puterized assessment

(Int J Eat Disord 2009; 00:000–000)

Introduction

Binge eating disorder (BED) is a common problemwithin the obese population with up to 30% suffer-ing from this eating disorder. In community-basedstudies, prevalence rates range from 0.7 to 3.3%.1–3

BED imposes significant psychosocial burden andis often associated with impaired quality of life,obesity, and increased morbidity and mortality.4

According to the guidelines of the American Psy-chiatric Association and the National Institute ofClinical Excellence for the treatment of eating dis-orders, cognitive-behavioral treatment (CBT) is thebest-established treatment for BED.5–7

The primary source of data in clinical trials eval-uating treatment efficacy in BED are clinical ques-tionnaires and standardized clinical interviews,both relying on retrospective memory recall. Thesetraditional data recording methods are known tohave several shortcomings including the forgettingof information when the recall period is long and/or memory retrieval occurs in a different settingfrom memory storage (e.g., recall of binge eatingepisodes during the past month in a therapistroom, see for e.g.8). Further, the most salient andmost recent events at the time of recall will influ-ence the ratings of behavior or mood from the pe-riod that is being recalled. Most importantly, recentstudies indicate that retrospective recall favorsmental representations fitting into the self-concept,social expectations, and ideals of individuals ratherthan representing actual experiences or the targetbehavior examined.9

In contrast, ambulatory psychological assess-ment such as the experience sampling method,10

later also called ecological momentary assessment(EMA)11,12 allows getting closer to the real-life sit-uations under study, both in space and time,thereby reducing retrospective recall problems.

EMA has been applied to self-assessment ofbehavior and psychological state in a range of

Parts of the manuscript have been presented at the World

Congress of Psychology, Berlin, July 2008.

*Correspondence to: Simone Munsch, PhD, Faculty of Psychol-

ogy, University of Basel, Missionsstrasse 60/62a, 4055 Basel,

Switzerland. E-mail: [email protected]

Accepted 20 December 2008

1 Faculty of Psychology, Department of Clinical Psychology and

Psychotherapy, University of Basel, Basel, Switzerland2 Faculty of Psychology, Department of Clinical Psychology and

Psychotherapy, Division of Applied Statistics in Life Sciences,

University of Basel, Basel, Switzerland

Published online in Wiley InterScience(www.interscience.wiley.com). DOI: 10.1002/eat.20657

VVC 2009 Wiley Periodicals, Inc.

International Journal of Eating Disorders 00:0 000–000 2009 1

REGULAR ARTICLE

disorders and natural settings (for an overview seeEngel et al.13), including eating disorders.13–15 Pre-vious studies on traditional questionnaire-basedself-monitoring have shown that systematic record-ing of behaviors can lead to temporary decrease ofproblematic or socially undesirable behaviors.16

Nevertheless, current findings of studies in alcoholabuse and eating disorders have shown no reactiv-ity effects after multiple electronic assessment ofthe target behavior.14,17,18 Several recent diary stud-ies evaluated concordance of EMA-data with datafrom retrospective questionnaires or interviewsand reported notable discrepancies between near-real-time reporting of experiences and later recallof the same events. In most studies with pain disor-der patients, higher pain ratings were found by ret-rospective assessment.19,20

In eating disorders, a study with subthreshold buli-mia and anorexia nervosa patients examined theconcordance between EMA and the eating disorderexamination (EDE),18,21 the common clinical inter-view for assessing eating disorders. Again the resultsindicated that binge eating and excessive exerciseover the time span of 28 days were lower when meas-ured by EMA compared to the interview. The identifi-cation of a binge eating episode refers to a complexrecollection process for verifying that the quantity offood was unusually large, consumed within a 2-h pe-riod of time, and was associated with feelings of lossof control.18 It is highly likely that this detailed infor-mation was not remembered accurately for eachbinge-like episode when the interviewer asked for itseveral weeks later. The observed discrepancies mat-ter because both researchers and clinicians base theirinterventions on patient reported data, and thusmight be misleading. But it remains open whetherthe suggested retrospective memory recall bias of anespecially complex behavior such as binge eatingalso occurs when binge eating is assessed during ashorter time-span of several days.

Studies investigating the lack of concordancebetween EMA-based and traditional questionnaire-or interview-based data have considered personal-ity and clinical traits such as negative affect, sever-ity, and stability of symptoms and characteristics ofgeneral psychopathology as possible factors thatmight influence lack of concordance between thetwo sources of data and yielded mixed results.Stone et al.22 found an overall low correspondencebetween retrospectively and momentarily assessedcoping behavior in substance abuse, especially forcomplex behavior, but revealed no associationbetween the discrepancy rates and factors such asstress level, anxiety state, depression, neuroticism,or marital satisfaction.

Despite the apparent advantages of the elec-tronic diary method, only few studies have utilizedEMA for assessment of treatment outcome in psy-chotherapy research.23 The results from these stud-ies, mostly conducted with pain disorder patients,indicate that EMA is highly feasible, leads to littlereactivity, and increases the accuracy of treatmentoutcome assessment in this patient group.

Thus, EMA appears to have the potential toincrease the knowledge about patient experienceand symptoms and furthermore, may reduce errorvariance, and thus help to reduce the number ofparticipants required in psychotherapy research.However, the reliability and validity of this methodfor assessing complex behavior such as binge eat-ing have not been addressed adequately.24 In BED,only preliminary data about the application of EMAin treatment trials have been collected. These stud-ies focused on evaluating the effects of EMA-basedself-monitoring of behavior as an additional thera-peutic component and a comparison of the eatingdisorder characteristics of BED and obese patientsduring the treatment course,12,25 and did not sys-tematically assess the feasibility and concordanceof EMA vs. traditional instruments.

The aims of the present study were threefold. We,first, systematically investigated the feasibility ofEMA in terms of patient acceptability, practicabil-ity, and compliance during a treatment trial forBED. We also accounted for the problem of reactiv-ity, and assessed whether binge eating was altereddue to the repeated assessment using EMA. Sec-ond, we compared EMA-based treatment outcomeswith outcomes based on traditional self-reportquestionnaires and with outcomes based onEDE,26,27 using effect sizes. Third, we analyzed theconcordance between questionnaire- or interview-based measures and EMA-based measures usingindices of inter-instrument agreement, therebyaccounting for the moderating influence of severityof eating disorder pathology, general psychopathol-ogy, specific mental disorders, and Body MassIndex (BMI).28,29 We expected that BED patientswith higher BMI, more severe eating disorder, morecomorbid mental disorders, and more negativeaffect would be more likely to exhibit discordancebetween traditional and EMA-based reports.

Method

Participants

Overweight to obese individuals with BED were

recruited through newspaper advertisements for a

MUNSCH ET AL.

2 International Journal of Eating Disorders 00:0 000–000 2009

treatment trial at the Department of Clinical Psychology

and Psychotherapy, University of Basel, Switzerland. Ini-

tially 136 individuals contacted the department and under-

went telephone screening for determining preliminary eli-

gibility. Study inclusion criteria required that participants

were between 18 and 70 years old, had a BMI (kg/m2)

between 27 and 40, and met full DSM-IV-TR criteria for

BED30 according to a specialized eating disorder interview

(see diagnostic assessment). All patients were free from

unstable medical conditions such as diabetes, heart dis-

ease, or endocrine disorders. Participants were excluded if

they met DSM-IV-TR criteria assessed by standardized

interviews (see diagnostic assessment) for mental disor-

ders warranting immediate treatment (IT) such as suicidal

tendency, psychosis, mania, organic dementia, or sub-

stance use disorder. Further exclusion criteria were preg-

nancy, participation in a diet program or other psychother-

apy, treatment with weight loss medication (current or dur-

ing the last 3 months), or previous surgical treatment of

obesity. The surprisingly small number of males (N 5 1)

resulted in their exclusion from data analysis. The final

sample consisted of 28 overweight to obese females with

BED (see Table 1 for sample characteristics).

Procedure

The study was approved by the local ethics committee

for medical research. All subjects were offered free treat-

ment for participating in the study. Prior to initial assess-

ments, the study was explained to the participants and

they gave written informed consent. BED individuals

were randomly assigned to either IT or a waitlist (WL)

condition using a permutated block design.33 Each IT or

WL group started whenever five to eight participants had

been recruited and randomized. Electronic diaries were

applied twice in the IT group, at pre-treatment (one week

before treatment started) and at post-treatment (last

week of treatment). Electronic diaries were applied three

times in the WL group, one week before the 8-weeks

waiting period prior to treatment, at pre-treatment, and

at post-treatment.

Ecological Momentary Assessment (EMA) with ElectronicDiaries. A menu-driven computerized questionnaire

was developed to repeatedly assess binge eating and

attributes of psychological well-being (unpublished elec-

tronic questionnaire available from the authors). Ques-

tions consisted either of a dichotomous ‘‘yes/no’’ format,

for example, item 1: ‘‘Did you experience binge eating

since your last entry?’’ or used a computerized Likert-

type scale ranging from 0 (‘‘not at all’’) to 4 (‘‘very

much’’), for example, item 11: ‘‘How do you currently

feel: depressed’’. Palm Tungsten E handheld computers

were used as recording devices. Questionnaires were pro-

grammed and displayed using Pendragon Forms 5.0 soft-

ware (Pendragon Software Corporation, Libertyville, IL).

We assessed participant responses using time- and

event-contingent signaling11 during an entire week. All

software applications on the handheld computer other

than the electronic diary were blocked.

During a 30-min standardized session, master students

of psychology instructed the participants on how to use

the handheld computers and how to fill in the electronic

diary. For time-contingent monitoring, five assessment

points per day were preset at individually defined times,

that is, each participant could specify his/her typical

awakening time on weekdays and weekends. The first

alarm was preset individually at one and a half hours af-

ter awakening, the second alarm 5 h after the first, the

third alarm 4 h after the second, the fourth alarm 3 h af-

ter the third and the fifth alarm 2 h after the fourth alarm.

The decreasing sampling interval during the day was

chosen in accordance with previous studies that demon-

strated an increased likelihood of binge eating later in

the day. To reduce the subject burden, individual devia-

tions from these times of up to 1 h were allowed in order

to adapt the signaling to participants’ daily routines (e.g.,

to avoid alarming during classes and scheduled meet-

ings). Participants were asked to fill in the questionnaire

no later than 30 min after each signaling and were

reminded within this interval by repeated signals. Data

recordings not corresponding to this predefined time

frame were only used for calculation of compliances but

removed for further analysis. For event-contingent moni-

toring, participants were instructed to fill in the question-

naire whenever they felt that an episode of overeating

associated with a sense of loss of control occurred, within

a 30-min interval. After answering the entry question

(‘‘did you experience a binge episode’’) with yes, they

were asked questions relating to the criteria of binge eat-

ing according to the EDE. After each assessment week,

study participants were asked to fill in an exit question-

naire (EXQ), measuring adherence and reaction to and

acceptability of the EMA procedure.

TABLE 1. Sample characteristics

Binge EatingGroup (n5 28)

Mean age in years (SD) 45.1 (11.1)Mean BMI (SD) 32.6 (6.4)No. (%) of participants with comorbid diagnosisCurrent comorbidity axis Ia 14 (50)Depression 2 (7.1)Anxiety disorders 12 (42.9)

Lifetime comorbidity axis Ia 9 (32.1)Depression 6 (21.4)Anxiety disorders 3 (10.7)

Comorbidity axis IIb 1 (3.6)

Notes: Current and lifetime diagnoses were assessed according to DSM-IV-TR, using structured clinical interviews.

a Mini-DIPS31.b SKID-II32.

ECOLOGICAL MOMENTARY ASSESSMENT

International Journal of Eating Disorders 00:0 000–000 2009 3

SM
Hervorheben

Diagnostic Assessment. BED diagnosis relied on the

structured clinical interview EDE26,27 and was in accord-

ance with DSM-IV-TR research criteria.30 The German

language screenings for mental disorders on axis-I (Diag-

nostisches Kurzinterview bei psychischen Storungen,

Mini-DIPS)31 and axis-II (Strukturiertes Klinisches Inter-

view fur DSM-IV, Achse-II, Personlichkeitsstorungen,

SKID-II)32 were administered to assess the current and

lifetime mental disorders. Interviewers were trained in

administering the EDE, Mini-DIPS and SKID-II, and were

supervised weekly by two of the authors (B.S. and S.M.).

In cases of uncertainties about the diagnoses of patients,

supervision of the diagnostic process was supported by

videotapes taken during the assessment.

Body Weight

Body Mass Index (BMI). Weight and height were meas-

ured on an electronic balance scale (Seca, Vogel and

Halke, Germany) and by a stadiometer. BMI was calcu-

lated as weight in kilograms divided by the square of

height in meters.

Feasibility of and Reactivity to EMA. Feasibility of EMA,

that is, practicability, acceptability, representativeness,

and compliances, were assessed in two ways, with the

EXQ and using EMA data. The EXQ assessed several per-

ceptions of participants relating to the electronic diary

assessment according to an 11-point Likert-type scale

from 0 (‘‘not at all’’) to 10 (‘‘yes, exactly’’). The following

items of the EXQ assessed practicability (‘‘Did you experi-

ence any difficulties in filling in the electronic diary?’’),

acceptability (‘‘How did you feel during the week with

the electronic diary entries? Did the electronic diary

alarm go off too often? Was your daily routine dis-

turbed?’’ (highly correlated items: r 5 .47–.72) and repre-

sentativeness (‘‘Did the previous week correspond to

your usual weekly routine?’’). Signal-compliance, defined

as the filling in of the electronic questionnaire within a

predefined time span after being alarmed (30 min)

according to Fahrenberg34 was assessed by asking: ‘‘Was

it possible for you to fill in the electronic diary 30 min af-

ter the signal?’’ (EXQ-based signal-compliance). Time

stamps for the start and end of each data entry were

automatically recorded in the electronic diary. EMA-

based signal-compliance was assessed by computing the

proportion of the weekly number of recordings that were

started within 30 min after the signal divided by the total

weekly number of recordings. To assess whether patients

responded to the signaling at all, we further determined

recording compliance denoting the rate of overall

responses to time-contingent signaling.

Reactivity was assessed according to the EXQ (‘‘Did the

frequency of binge eating change during the diary pe-

riod? Did you focus more on your psychological well-

being? Did you benefit from filling in the diary? Did the

previous week correspond to your usual weekly routine?,’’

highly correlated items: r 5 .48–.70). We further tested

whether using EMA for one week while waiting for treat-

ment had an effect on the number of daily binge eating

episodes over the course of the week, for both groups

combined, and whether the number of weekly binge eat-

ing episodes changed between the two weeks prior to

treatment in the WL group.

EMA-Based Treatment Outcomes. Patients were asked

to monitor binge eating according to time-contingent

and event-contingent assessment modes. The electronic

diary contained questions that were based on EDE ques-

tions and focused on identifying binge eating (EMA

weekly binges). The number of weekly binges was

obtained by summing up all recorded binge eating epi-

sodes, from both time-contingent and event-contingent

monitoring, across the 1-week assessment period. Sum-

ming up binge eating episodes across a specific time

interval can be problematic if compliance is not 100%, as

is usually the case. Disregarding missing values may lead

to an underestimation of number of binge eating epi-

sodes if they are assumed to have a value of 0 (i.e., miss-

ing value 5 no binge). Replacing missing values by the

mean across the defined time interval, in our study, led

to slightly higher values but otherwise comparable results,

so unadjusted values are reported. Summing up all days for

which the patient did not report any binge eating episodes

assessed the number of binge-free days. The maximum

possible number was six as this was the number of days for

which data for a complete 24 h day were available.

Abstainers were defined as having zero binge eating epi-

sodes during a 1-week EMA period of investigation.

Interview and Questionnaire-Based TreatmentOutcomes. Patients recorded their number of weekly

binges (self-reported weekly binges; number of ‘‘episodes

of overeating during which you felt out of control’’ during

the past week) according to DSM-IV-TR criteria during

the active treatment phase.35 Using Hilbert et al.’s27 Ger-

man version of the EDE,21 we further assessed the num-

ber of objective binge eating episodes (OBE, i.e., binge

eating defined as consuming unusually large quantities

of food with a subjective sense of loss of control) and

abstainer rates (proportion of patients not experiencing

objective binge days) during the last 28 days.

Concordance Between EMA- and Interview-Based Esti-mates of Signal-Compliance and Outcome Measures. Tokeep the subject burden low in our treatment trial we

could commit our patients to fill in the electronic diary

only during seven days before and after treatment.

Although this procedure allowed us to directly compare

the objective binge eating and abstainer rates according

to EMA and according to the traditional self-report ques-

tionnaire (both covering the 7-days time span), we could

MUNSCH ET AL.

4 International Journal of Eating Disorders 00:0 000–000 2009

not directly assess the concordance between EMA and

EDE-based measures of objective binge eating and absti-

nence from binge eating (7-days versus 28-days time

span). Thus, OBEs according to EDE were divided by four

to make values more comparable. Concordance of binge

eating symptomatology assessed by traditional question-

naire/interview or EMA was estimated as follows: EMA

weekly binges versus self-reported weekly binges; EMA

weekly binges versus OBE according to EDE; abstainer

rates based on EMA versus abstainer rates according to

self-reported weekly binges and abstainer rates based on

EMA versus abstainer rates according to EDE.

Factors Influencing Concordance of Traditional andEMA-Based Measures. We analyzed to what degree eat-

ing disorder pathology (General Score of the EDE, EDE-

Q), negative affect according to the BDI,36 mental disor-

ders according to the DIPS, and BMI influenced concor-

dances defined above.

Treatment. The CBT protocol for BED consisted of

eight weekly 90-min group sessions in the active treat-

ment phase.37 The manual in our study was a shortened

version of the 16-sessions group CBT program for BED

developed by Munsch,38 which has proven to be success-

ful for patients with BED.35

Statistical Analyses

Because IT and WL group were not significantly differ-

ent in the changes from pre- to post-treatment (no treat-

ment 3 time interaction) we pooled the two groups to

enhance power for all pre- versus post-treatment analyses.

Statistical Models. Continuously distributed measures

were analyzed using a linear mixed model.39 For the

analysis of the number of weekly binges (EMA-based,

based on traditional self-report measures and according

to EDE), we applied a generalized linear mixed model

(GLMM) for Poisson-distributed data.40 To compare the

abstainer rates at post-treatment/end of waiting period

we used a v2 test. To assess the changes in abstainer rates

between pre- and post-treatment for pooled groups, we

used a logistic-normal model that is a special case of

GLMM in which the random effects structure is reduced

to a random intercept only.40

Effect sizes between pre- and post-treatment are

reported as Cohen’s d (for continuously distributed varia-

bles), relative risks (RR, used for rates like number of

weekly binges per person and week) or odds ratios (OR,

for abstainer rates).

Concordance Between EMA- and Traditional Question-naire/Interview-Based Outcome Measures and Estimatesof Signal-Compliance. We used the Pearson correlation

coefficient (r) and both intraclass correlation coefficients

ICC(C,1) (for consistency) and ICC(A,1) (for absolute

agreement; McGraw and Wong41) to estimate the concord-

ance between questionnaire/interview- and EMA-based

measures of signal-compliance and binge eating before

and after treatment, except for abstainers rates. To esti-

mate the concordance regarding abstainer rates, we used

Cohen’s kappa. Data regarding signal-compliance were

standardized prior to computing concordance statistics.

Factors Influencing Concordance Between Traditionaland EMA-Based Measures. Using a linear regression

model we analyzed whether concordance between tradi-

tional and EMA-based measures was influenced by BMI,

EDE total score, BDI, and comorbidity status (presence/

absence). For these analyses, concordances between the

assessment methods regarding weekly binge eating epi-

sodes were computed using the difference between the

z-scores of the corresponding methods. As abstainer

rates are based on dichotomous data, differences

between corresponding methods were analyzed using a

logistic regression model. In total, we tested 32 models

comprised of four dependent variables (weekly binges

and abstainer rates in each of two method comparisons)

3 two time points (pre- and post-treatment) 3 four pre-

dictor variables.

Results

Dropouts

Of the 26 participants starting treatment, seven(26.9%) dropped out during treatment, three(11.5%) in the IT, and four (15.4%) in the WL group(v2(1) 5 1.5, p 5 .22). We checked whether dropoutwas completely at random (i.e., MCAR patternsensu Little and Rubin42), using t-tests for continu-ously distributed data, or Pearson’s v2 tests for di-chotomous variables for the broad range of varia-bles. After correction for multiple comparisons,only employment status had increased the risk ofdropping out of the study. Patients who were still ineducation or unemployed were more likely to dropout (v2(4)5 17.7, p 5 .001).

Feasibility of and Reactivity to EMA

Acceptability of the electronic diary according tothe EXQ was high and increased significantlybetween pre- and post-treatment (from 7.17 to7.99, p 5 .016, d 5 0.34). EXQ-based estimates ofpracticability and representativeness were high andremained stable (practicability: 2.00 and 2.55, p 5.58, d 5 0.19; representativeness: 7.17 and 6.85, p 5.78, d 5 20.11). EXQ-based signal-compliance sig-nificantly decreased between pre- and post-treat-ment (from 7.07 to 5.08; p 5 .037, d 5 20.33)whereas EMA-based signal-compliance remainedconstant during that period: mean proportions of

ECOLOGICAL MOMENTARY ASSESSMENT

International Journal of Eating Disorders 00:0 000–000 2009 5

the number of recordings starting within 30 minafter the signal to the total number of recordingswere 0.87 for both pre- and post-treatment. Partici-pants’ mean absolute deviations of entry from thescheduled alarms (in min) were 16.0 (median 51.0, SD 5 34.8, N 5 722) before and 15.3 (median 51.0, SD 5 33.2, N 5 427) after treatment, respec-tively. With altogether 28 patients with BED fillingin the diary five times a day during 2 or 3 weeks atotal of 2,485 time-contingent data entries werepossible. Participants completed 1,667 of thesetime-contingent entries, corresponding to an over-all recording compliance of 68%.

EXQ-based reactivity was moderate andremained stable (5.77 and 5.73; p 5 .95, d 520.01). No indications of reactivity due to EMAemerged as the number of daily binges remainedstable whereas waiting during one week beforetreatment start (non-significant linear temporaltrend, p 5 .62, both groups pooled). There was alsono change in the number of weekly binges betweenthe two weeks prior to treatment in the WL group(p 5 .11).

Comparison of Treatment Outcomes BetweenEMA- and Interview-Based Outcome Measures

The total number of weekly binge eating epi-sodes during the 7-days period decreased betweenpre- and post-treatment, from 2.74 to 0.53 forEMA-based weekly binges (p \ .001, RR 5 0.19)and from 2.78 to 0.85 (p\ .001, RR 5 0.31) for self-reported weekly binges. These two pre- to post-treatment effects did not differ between the two

methods (p 5 .37). There was further a decrease inOBE (values obtained for 28 days divided by 4)from 3.92 to 0.44 (p \ .001, RR 5 0.11), this effectnot being significantly higher than the oneobtained for EMA-based weekly binges (p 5 .29).

Abstainer rates based on EMA significantlyincreased between pre- and post-treatment from39% to 70% (p 5 .033, OR 5 3.7). Abstainer ratesbased on self-reported weekly binges also signifi-cantly increased from 21 to 63% (p \ .01, OR 56.4). These two pre- to post-treatment effects didnot differ between the two methods (p 5 .33).Finally, abstainer rate based on EDE was 0% (bydefinition) at pre-treatment and increased to 54%at post-treatment (p \ .001, OR not computablebecause baseline percentage was 0).

The number of EMA-based binge-free dayssignificantly increased from 4.38 (60.26) days atpre-treatment to 5.12 (60.28) days at post-treat-ment (p 5 .013, d5 20.70).

Concordance Between EMA- and TraditionalQuestionnaire/Interview-Based OutcomeMeasures and Estimates of Signal-Compliance

Concordances between EMA-based and EXQ-based signal-compliance were significantly differ-ent from zero for pre- but not for post-treatment(Table 2). Highest concordance rates for the com-parison of the core symptomatology of BED wereobtained between EMA-based and self-reportedweekly binge eating episodes and the correspond-ing abstainer rates, especially at post-treatment(Table 2). Concordance between OBE assessed by

TABLE 2. Concordance between measures based on traditional interviews and questionnaires and EMA-basedmeasures for signal compliance and objective binge eating

Pre-treatment Post-treatment

n r ICC(C,1) ICC (A,1) n r ICC (C,1) ICC (A,1)

Signal-compliance: EMA-based versusEXQ-based

17 .52* 0.52* 0.53* 12 .36 0.36 0.38

Weekly number of binges: EMA-basedversus self-report questionnaire

24 .46* 0.44* 0.45* 18 .80* 0.64* 0.64*

Weekly number of binges: EMA-basedversus OBEa according to EDE

20 .13 0.12 0.12 17, 15b .78,* .42b 0.78, 0.40b 0.79,* 0.42b

Weekly abstainer rate: EMA-based versusself-report questionnaire

24 0.22c 18 0.50b,*

Weekly abstainer rate: EMA-based versusaccording to EDE

25 —d 20 0.48b,*

ICC, intraclass correlation coefficient for consistency (C,1) and absolute agreement (A,1) according to McGraw and Wong41; r, Pearson correlation coeffi-cient; OBE: objective binge eating episodes according to Eating Disorder Examination, EDE; EXQ, exit questionnaire (unpublished questionnaire, availableby the authors).

a Values were obtained by dividing the number of OBEs by four to make them comparable to the number of EMA-based and self-reported weekly binges.b Values after removal of two extreme values.c Value denotes Cohen’s kappa.d Cohen’s kappa could not be computed as abstainer rate based on EDE at pre-treatment was equal to zero.* p\ 0.05.

MUNSCH ET AL.

6 International Journal of Eating Disorders 00:0 000–000 2009

EDE and EMA-based weekly binges was low at pre-but high at post-treatment. However, the lattervalue was largely due to two extreme values. Bothconcordances regarding abstainer rates were signif-icantly higher than 0 at post-treatment but not atpre-treatment.

Factors Influencing Concordance BetweenTraditional and EMA-Based Measures

We found no significant relationship betweenconcordance of EMA-based versus questionnaire/interview-based measures of binge eating and eitherBMI, severity of the eating disorder, comorbiditywith mental disorders, or severity of negative affectwith two exceptions. There was a negative relation-ship for the pre-treatment concordance betweenthe EMA weekly binges and self-reported weeklybinges and baseline BDI (without outlier: p 5 .004, r5 2.59, N 5 21; including outlier: p 5 .69, r 5 2.09,N 5 22): although the number of EMA weeklybinges increased with increasing baseline BDI nosuch trend could be observed for self-reportedweekly binges. There was another negative relation-ship for the pre-treatment concordance betweenEMA-based and self-reported abstainer rates andbaseline BMI (p 5 .048, OR 5 1.29, N 5 24): self-reported abstainer rates increased with higher BMIwhereas EMA-based abstainer rates did not.

Discussion

The present study examined the feasibility of andreactivity to EMA in the course of a clinical trialwith BED patients. It also compared the treatmenteffects based on EMA, traditional self-report ques-tionnaires, and interviews with each other and ana-lyzed whether outcomes of these different assess-ment methods were concordant.

We obtained high values for acceptability, prac-ticability, and representativeness according to aself-report EXQ both at pre- and post-treatment.These results corroborate previous studies in otherclinical populations, which rarely reported any dif-ficulties with the electronic diary for differentlengths of assessment periods.19,43,44 EMA-basedsignal-compliance was high and stable at 87%between before and after treatment. This is in linewith the estimated compliances of 86–92% fromprevious studies with different patient populationsusing different frequencies of time-contingentmeasurements (every 30 min to six times daily) andoverall assessment time (2–14 days).14,15,17,19,43–45

Patients with BED reported significantly lower sig-

nal-compliance on the questionnaire after treat-ment, yet our EMA data indicate that their objec-tive signal compliance behavior was still high andunchanged after the treatment. We found moderateconcordance between the two compliance assess-ment methods before and low concordanceafter treatment, probably due to the spuriouslydecreased compliance self-estimates after treat-ment. Because electronic diaries objectively moni-tor compliance behavior, the observed discrepancybetween the momentary and retrospective datamight indicate inferiority in the accuracy of retro-spective reports in this regard.

We examined both retrospective and momentarymeasures of reactivity, that is, to what extent fillingin the electronic questionnaire influenced the pat-terns of binge eating itself. Reactivity was retro-spectively reported as being moderate, and thisperception remained stable between pre- and post-treatment. Further, we did not find indications ofreactivity in terms of a possible weekly trend in theEMA-based number of binge eating episodes priorto treatment. This result is in line with the findingsfrom eating disorder and other clinical popula-tions,14,18,46 and indicates that the problem of be-havioral reactivity to self-monitoring should not beoverestimated.

From a theoretical point of view, an importantquestion addressed in our study is how electronicdiary assessment of pre- to post-treatment effectscompares to the traditional questionnaire andinterview assessment. The numbers of weeklybinge eating episodes assessed by EMA or tradi-tional self-report questionnaire were strongly andsimilarly reduced between pre- and post-treat-ment, resulting in comparable effect sizes. Withrespect to abstainer rates between pre- and post-treatment we further found only small differencesbetween EMA-based (increase from 39 to 70%) andself-report questionnaire-based (increase from 21to 63%) effects. It seems likely that the two meth-ods, retrospectively reporting binge eating over thepast seven days and immediate assessment byEMA, both result in comparable data accuracy as ithas been found in a study with pain patients byJamison et al.47 It needs to be investigated whetherthe observed small differences in outcomesbetween the two assessment methods becomemore pronounced when self-report questionnairescover longer retrospective time spans than sevendays. EDE-based effects of abstainer rates weresomewhat lower (increase from 0 to 54%), possiblydue to retrospective recall error.48,49 The ability toretain the characteristics of a complex behaviorsuch as binge eating in memory for more than a

ECOLOGICAL MOMENTARY ASSESSMENT

International Journal of Eating Disorders 00:0 000–000 2009 7

few days is limited, and therefore patients are likelyto forget or distort some of the information neces-sary to evaluate a binge eating episode correctly.This limitation might especially apply to the EDE,demanding a recall period of 28 days.

Measures of concordance ranged between lowand fair to good50 and varied considerably betweenpre- and post-treatment and also among pairs ofoutcome measures. The correlation coefficientswere lower when compared with other studies22

and also lower than in the Stein and Corte18 studywith patients with subthreshold anorexia and buli-mia nervosa. As the treatment turned out to behighly efficacious, the low rates of binge eating atpost-treatment did not allow us to accurately iden-tify concordance of the different outcome meas-ures (floor effect). Thus, these findings must beinterpreted with caution.

Contrary to our expectations, the concordancesbetween EMA-based and traditional questionnaire/interview-based measures hardly depended onpatients’ comorbidity status or eating disorder pa-thology. Yet our findings indicate that higher base-line depressiveness is associated with a higher fre-quency of EMA-based binge eating episodes butnot of binge eating episodes assessed by traditionalself-report questionnaires. Further, we showed thata higher BMI leads to higher frequencies of self-report questionnaire based binge eating but doesnot influence the frequency of binge eatingassessed by EMA. These results should be dis-cussed with caution as only two out of 32 statisticaltests addressing this topic were statistically signifi-cant. Thus, we cannot rule out that these tworesults were merely chance findings.

In interpreting our findings one should bear inmind that our study is subject to several limita-tions. First, although we found no indication ofreactivity effects to EMA, to definitely evaluate thispossible problem, future studies should control forthe effects of waiting and EMA separately. A secondmajor limitation concerns the differences in theduration of data assessment between EMA andEDE. As estimates based on EDE refer to a 28-daysperiod we had to divide measures indicating thenumbers of binge eating episodes by four to makethem comparable to our measures based on a 1-week period. A presupposition for this is that bingeeating episodes across the previous 28 and 7 days’periods are reasonably highly correlated, whichmay or may not be true. Even though patient bur-den will be high, future studies should compare theconcordance and treatment effects of these twomethods covering the required 28 days according

to EDE in order to better account for the knownfluctuating nature of binge eating in BED. It wouldbe beneficial to include direct observational dataon behavior, rather than comparing two self-reported data sources. This would allow establish-ing the superiority of one against the other methodmore definitively. In a recently developed ambula-tory assessment method, the electronically acti-vated recorder51 may provide direct acoustic evi-dence of binge eating in real life. Third, as is oftenthe case in clinical trials, our study included only amoderate sample size thereby limiting the statisti-cal power of the hypotheses tested.

In summary, this study in patients with BEDshowed the electronic diary assessment to behighly acceptable and practicable, leading to highcompliance rates. We found comparable treatmenteffects for binge eating using EMA when comparedwith traditional self-report questionnaires but onlymoderate concordance rates. Though preliminary,our results point to possible limitations of retro-spective recall of complex behavior such as bingeeating demonstrated previously52 over a sustainedtime period as required in the EDE.

Most importantly, our data demonstrate that theEMA method allows gathering precise and detailedinformation about binge eating episodes in the nat-uralistic environment during a treatment trial. Theadvantages of this assessment method in psycho-therapy research include the assessment of mo-mentary and time-stamped data, the potential toask branching questions, the possibility to collectmultiple data in a short time period, and theinstant accessibility for data analysis. Awarenessabout the specific characteristics of BED in dailylive can be utilized in the planning of individual-ized interventions for specific patients. However,further efforts are needed to establish the feasibilityof the EMA method outside specialized researchunits.

References

1. Spitzer RL, Devlin MJ, Walsh BT, Hasin D, Wing RR, Marcus MD,et al. Binge eating disorder: A multisite field trial of the diag-nostic criteria. Int J Eat Disord 1992;11:191–203.

2. Spitzer RL, Yanovski S, Wadden T, Wing R, Marcus MD, StunkardA, et al. Binge eating disorder: Its further validation in a multi-site study. Int J Eat Disord 1993;13:137–153.

3. Munsch S, Becker E, Meyer A, Schneider S, Margraf J. Recurrentbinge eating (RBE) and its characteristics in a sample of youngwomen in Germany. Eur Eat Disord Rev 2003;15:385–399.

4. Rieger E, Wilfley DE, Stein RI, Marino V, Crow SJ. A comparisonof quality of life in obese individuals with and without bingeeating disorder. Int J Eat Disord 2005;37:234–240.

MUNSCH ET AL.

8 International Journal of Eating Disorders 00:0 000–000 2009

5. Chambless DL, Ollendick TH. Empirically supported psychologi-cal interventions: Controversies and evidence. Annu Rev Psy-chol 2001;52:685–716.

6. APA. Treating Eating Disorders–A Quick Reference Guide, 2nded. American Psychiatric Association, Washington, 2006.

7. National Institute for Clinical Excellence (NICE).Eating disor-ders–Core interventions in the treatment and management ofanorexia nervosa, bulimia nervosa, and related eating disor-ders. National Clinical Practice Guideline Number CG9. TheBritish Psychological Society and Gaskell, 2004.

8. Fahrenberg J, Myrtek M, Pawlik K, Perrez M. Ambulatory assess-ment–Monitoring behavior in daily life settings: A behavioral-scientific challenge for psychology. Eur J Psychol Assess 2007;23:206–213.

9. Pohl RF. Cognitive Illusions. A Handbook on Fallacies andBiases in Thinking, Judgement, and Memory. New York:Psychology Press, 2004.

10. Csikszentmihalyi M, Larson R, editors. Validity and Reliabilityof the Experience Sampling Method. New York, NY, USA: Cam-bridge University Press, 1992.

11. Stone AA, Shiffman S. Ecological momentary assessment (EMA)in behavorial medicine. Ann Behav Med 1994;16:199–202.

12. le Grange D, Gorin A, Catley D, Stone AA. Does momentaryassessment detect binge eating in overweight women that isdenied at interview? Eur Eat Disord Rev 2001;9:309–324.

13. Engel SG, Boseck JJ, Crosby RD, Wonderlich SA, Mitchell JE,Smyth J, et al. The relationship of momentary anger and impul-sivity to bulimic behavior. Behav Res Ther 2007;45:437–447.

14. Smyth JM, Wonderlich SA, Heron KE, Sliwinski MJ, Crosby RD,Mitchell JE, et al. Daily and momentary mood and stress areassociated with binge eating and vomiting in bulimia nervosapatients in the natural environment. J Consult Clin Psychol2007;75:629–638.

15. Stein RI, Kenardy J, Wiseman CV, Dounchis JZ, Arnow BA, Wilf-ley DE. What’s driving the binge in binge eating disorder? A pro-spective examination of precursors and consequences. Int J EatDisord 2007;40:195–203.

16. Kazdin AE. Reactive self-monitoring: The effects of responsedesirability, goal setting, and feedback. J Consult Clin Psychol1974;42:704–716.

17. Hufford MR, Shields AL, Shiffman S, Paty J, Balabanis M. Reac-tivity to ecological momentary assessment: An example usingundergraduate problem drinkers. Psychol Addict Behav 2002;16:205–211.

18. Stein KF, Corte CM. Ecologic momentary assessment of eating-disordered behaviors. Int J Eat Disord 2003;34:349–360.

19. Stone AA, Broderick JE, Shiffman SS, Schwartz JE. Understandingrecall of weekly pain from a momentary assessment perspec-tive: Absolute agreement, between- and within-person consis-tency, and judged change in weekly pain. Pain 2004;107:61–69.

20. Williams DA, Gendreau M, Hufford MR, Groner K, Gracely RH,Clauw DJ. Pain assessment in patients with fibromyalgia syn-drome: A consideration of methods for clinical trials. ClinJ Pain 2004;20:348–356.

21. Cooper Z, Fairburn CG. The eating disorder examination: Asemi-structured interview for the assessment of the spe-cific psychopathology of eating disorders. Int J Eat Disord1987;6:1–8.

22. Stone AA, Schwartz JE, Neale JM, Shiffman S, Marco CA, HickcoxM, et al. A comparison of coping assessed by ecological mo-mentary assessment and retrospective recall. J Pers Soc Psychol1998;74:1670–1680.

23. Burton C, Weller D, Sharpe M. Are electronic diaries useful forsymptoms research? A systematic review. J Psychosom Res2007;62:553–561.

24. Ebner Priemer UW, Kuo J, Welch SS, Thielgen T, Witte S, BohusM, et al. A valence-dependent group-specific recall bias of ret-rospective self-reports: A study of borderline personality disor-der in everyday life. J Nerv Ment Dis 2006;194:774–779.

25. Wild B, Eichler M, Feiler S, Friederich HC, Hartmann M, HerzogW, et al. Dynamic analysis of electronic diary data of obesepatients with and without binge eating disorder. PsychotherPsychosom 2007;76:250–252.

26. Fairburn CG, Wilson GT. Binge eating: Nature, Assessment, andTreatment, 2nd ed. New York: Guilford Press, 1993.

27. Hilbert A, Tuschen Caffier B, Ohms M. Eating disorders exami-nation: A German version of the structured eating disorderinterviews eating disorder examination: Deutschsprachige Ver-sion des strukturierten Essstorungsinterviews. Diagnostica 2004;50:98–106.

28. Visscher TL, Viet AL, Kroesbergen IH, Seidell JC. Underreportingof BMI in adults and its effect on obesity prevalence estima-tions in the period 1998 to 2001. Obesity (Silver Spring) 2006;14:2054–2063.

29. Wilfley DE, Welch RR, Stein RI, Spurrell EB, Cohen LR, SaelensBE, et al. A randomized comparison of group cognitive-behav-ioral therapy and group interpersonal psychotherapy for thetreatment of overweight individuals with binge-eating disorder.Arch Gen Psychiatry 2002;59:713–721.

30. APA. Diagnostic and statistical manual of mental disordersDSM-IV-TR, American Psychiatric Association Task Force onDSM-IV, 4th ed. Washington (DC): American Psychiatric Associa-tion, 2000.

31. Margraf J. Mini-DIPS. Berlin: Springer Verlag, 1994.32. Wittchen H, Zaudig M, Fydrich T. Strukturiertes Interview fur

DSM-IV. Bern: Hogrefe Verlag Testzentrale, 1997.33. Kracht M. Randomisierung. In: Adam J, editor. Statistisches

Know-How in der medizinischen Forschung. Berlin: UllsteinMosby GmbH, 1992, pp.172–210.

34. Fahrenberg J. Assessment in daily life. A review of computer-assisted methodologies and applications in psychology andpsychophysiology, years 2000–2005. Available at: http://www.ambulatory-assessment.org/.

35. Munsch S, Biedert E, Meyer A, Michael T, Schlup B, Tuch A,et al. A randomized comparison of cognitive behavioral therapyand behavioral weight loss treatment for overweight individu-als with binge eating disorder. Int J Eat Disord 2007;40:102–113.

36. Hautzinger M. The Beck depression inventory in clinical prac-tice. Nervenarzt 1991;62:689–696.

37. Schlup B, Munsch S, Meyer AH, Wilhelm FH. The efficacy of a shortversion of a cognitive-behavioral treatment followed by boostersessions for binge eating disorder. Behav Res Ther. Submitted.

38. Munsch S. Binge eating. Kognitive Verhaltenstherapie beiEssanfallen. Weinheim: Beltz, 2003.

39. Fitzgmaurice GM, Laird NM, Ware JH. Applied LongitudinalAnalysis. Hoboken, NJ: Wiley, 2004.

40. Agresti A. Categorical Data Analysis, 2nd ed. Hoboken, New Jer-sey: Wiley, 2002.

41. McGraw KO, Wong SP. Forming inferences about some intra-class correlation coefficients. Psychol Methods 1996;1:30–46.

42. Little RJA, Rubin DB. Statistical Analyses with Missing Data, 2nded. Hoboken, New Jersey: Wiley, 2002.

43. Hilbert A, Tuschen CB. Maintenance of binge eating throughnegative mood: A naturalistic comparison of binge eatingdisorder and bulimia nervosa. Int J Eat Disord 2007;40:521–530.

44. Engel SG, Wonderlich SA, Crosby RD, Wright TL, Mitchell JE,Crow SJ, et al. A study of patients with anorexia nervosa usingecologic momentary assessment. Int J Eat Disord 2005;38:335–339.

ECOLOGICAL MOMENTARY ASSESSMENT

International Journal of Eating Disorders 00:0 000–000 2009 9

45. Boseck JJ, Engel SG, Allison KC, Crosby RD, Mitchell JE, de ZwaanM. The application of Ecological Momentary Assessment to thestudy of the night eating. Int J Eat Disord 2007;40:271–276.

46. le Grange D, Gorin A, Dymek M, Stone A. Does ecological mo-mentary assessment improve cognitive behavioural therapy forbinge eating disorder? A pilot study. Eur Eat Disord Rev 2002;10:316–328.

47. Jamison RN, Raymond SA, Slawsby EA, McHugo GJ, Baird JC. Painassessment in patients with low back pain: Comparison of weeklyrecall and momentary electronic data. J Pain 2006;7:192–199.

48. Friedman WJ, deWinstanley PA. Changes in the subjective prop-erties of autobiographical memories with the passage of time.Memory 1998;6:367–381.

49. Wilhelm P, Schoebi D, Perrez M. Frequency estimates of emo-tions in everyday life from diary method’s perspective: A com-ment on Scherer et al.’s survey-study ‘‘Emotions in everydaylife.’’ Trends Dev Res Emot 2004;43:647–665.

50. Fleiss JL. The Design and Analysis of Clinical Experiments. NewYork: Wiley, 1986.

51. Mehl MR, Pennebaker JW, Crow DM, Dabbs J, Price JH. The elec-tronically activated recorder (EAR): A device for sampling natu-ralistic daily activities and conversations. Behav Res MethodsInstrum Comput 2001;33:517–523.

52. Ortega DF, Waranch HR, Maldonado AJ, Hubbard FA. A compar-ative analysis of self-report measures of bulimia. Int J Eat Dis-ord 1987;6:301–311.

MUNSCH ET AL.

10 International Journal of Eating Disorders 00:0 000–000 2009