11
Comparison of 2 Extended Activities of Daily Living Scales With the Barthel Index and Predictors of Their Outcomes Cohort Study Within the South London Stroke Register (SLSR) Shah-Jalal Sarker, PhD; Anthony G. Rudd, MD; Abdel Douiri, PhD; Charles D.A. Wolfe, MD Background and Purpose—Basic activities of daily living measures are often supplemented by extended activities of daily living. We compared the Frenchay Activities Index (FAI) and Nottingham Extended Activities of Daily Living (NEADL) with the Barthel Index (BI) in terms of distribution of scores, concurrent validity, reliability, and their agreement and investigated the predictors of scales outcomes. Methods—Two hundred thirty-eight patients from the population-based South London Stroke Register were assessed with the BI, FAI, and NEADL 3 months after a first-ever stroke. The pairwise relationship was studied using correlations, fractional polynomial regression, and Bland and Altman plot; the baseline predictors, for example, sociodemography, case severity: National Institutes of Health Stroke Scale, and 7-day Abbreviated Memory Test, comorbidities, and acute treatments by negative binomial regression. Results—The BI was highly affected by a ceiling effect (33% had the highest score), FAI was only affected by floor effect (19%), but NEADL was symmetrical with only 4% highest and lowest score. Despite high concurrent validity of the scales (r 0.80, P0.001), they agreed poorly only for the highest and the lowest level of activities. The association and agreement of NEADL with BI was higher than that of FAI with BI. Severe stroke patients (National Institutes of Health Stroke Scale 13) had 28% lower BI (79% lower FAI and 62% lower NEADL) score than nonsevere patients (P0.001). Cognitively intact patients (Abbreviated Memory Test: 8 –10) had 2.3 times greater FAI values (65% higher NEADL) compared with impaired patients (P0.001). Conclusions—The NEADL scale was symmetrical, concurrently valid with no floor and ceiling effects. It corresponded better with BI than FAI did confirming its basic activities of daily living properties, yet it is a more sensitive tool for extended activities of daily living without the floor and ceiling effects. Future functional status could be predicted by the acute stage National Institutes of Health Stroke Scale score, whereas only extended activities of daily living status could be predicted by the Abbreviated Memory Test score. Predicting future functional status at the acute stage may decrease unnecessary length of stay in acute care settings. (Stroke. 2012;43:1362-1369.) Key Words: agreement Barthel Index Frenchay Activities Index Nottingham Extended Activities of Daily Living NIH Stroke Scale stroke register T he Barthel Index (BI) is a commonly used measure of activities of daily living (ADL) for patients with stroke. 1,2 It covers basic self-care activities and is known to have a marked ceiling effect (the percentage of subjects with the maximum possible score) and also a floor effect (the percentage of subjects with the minimum possible score) depending on the time of poststroke measurement. 1 The use of an instrument that assesses only basic ADL might give a restricted view of a patient’s functional status. 3 It is often necessary to supplement basic ADL measurements with extended activities of daily living (EADL) measures to also assess more complex activities in the home environment. 3,4 There are no universally ac- knowledged measures regarding EADL. The Frenchay Activities Index (FAI) is a frequently used EADL scale for measuring stroke outcome. 5 Another proposed EADL scale Received November 27, 2011; final revision received February 14, 2012; accepted February 23, 2012. From the Division of Health and Social Care Research (S.-J.S., A.G.R., A.D., C.D.A.W.), King’s College London, London, UK; National Institute for Health Research Comprehensive Biomedical Research Centre (A.G.R., A.D., C.D.A.W.), Guy’s & St. Thomas’ National Health Service Foundation Trust and King’s College London, London, UK; and the Centre for Experimental Cancer Medicine (S.-J.S.), Barts Cancer Institute—a CR-UK Centre of Excellence, Queen Mary, University of London, London, UK. This article presents independent research commissioned by the National Institute for Health Research (NIHR) under the Biomedical Research Centre award. The views expressed in this article are those of the authors and not necessarily those of the National Health Service, the NIHR, or the Department of Health. The online-only Data Supplement is available with this article at http://stroke.ahajournals.org/lookup/suppl/doi:10.1161/STROKEAHA.111. 645234/-/DC1. Correspondence to Shah-Jalal Sarker, PhD, Lecturer in Biostatistics, Old Anatomy Building, Charterhouse Square, London EC1M 6BQ, UK. E-mail [email protected] © 2012 American Heart Association, Inc. Stroke is available at http://stroke.ahajournals.org DOI: 10.1161/STROKEAHA.111.645234 1362 by guest on June 17, 2018 http://stroke.ahajournals.org/ Downloaded from by guest on June 17, 2018 http://stroke.ahajournals.org/ Downloaded from by guest on June 17, 2018 http://stroke.ahajournals.org/ Downloaded from by guest on June 17, 2018 http://stroke.ahajournals.org/ Downloaded from

Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

  • Upload
    lebao

  • View
    214

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

Comparison of 2 Extended Activities of Daily Living ScalesWith the Barthel Index and Predictors of Their Outcomes

Cohort Study Within the South London Stroke Register (SLSR)

Shah-Jalal Sarker, PhD; Anthony G. Rudd, MD; Abdel Douiri, PhD; Charles D.A. Wolfe, MD

Background and Purpose—Basic activities of daily living measures are often supplemented by extended activities of dailyliving. We compared the Frenchay Activities Index (FAI) and Nottingham Extended Activities of Daily Living(NEADL) with the Barthel Index (BI) in terms of distribution of scores, concurrent validity, reliability, and theiragreement and investigated the predictors of scales outcomes.

Methods—Two hundred thirty-eight patients from the population-based South London Stroke Register were assessed withthe BI, FAI, and NEADL 3 months after a first-ever stroke. The pairwise relationship was studied using correlations,fractional polynomial regression, and Bland and Altman plot; the baseline predictors, for example, sociodemography,case severity: National Institutes of Health Stroke Scale, and 7-day Abbreviated Memory Test, comorbidities, and acutetreatments by negative binomial regression.

Results—The BI was highly affected by a ceiling effect (33% had the highest score), FAI was only affected by floor effect(19%), but NEADL was symmetrical with only 4% highest and lowest score. Despite high concurrent validity of thescales (r �0.80, P�0.001), they agreed poorly only for the highest and the lowest level of activities. The associationand agreement of NEADL with BI was higher than that of FAI with BI. Severe stroke patients (National Institutes ofHealth Stroke Scale �13) had 28% lower BI (79% lower FAI and 62% lower NEADL) score than nonsevere patients(P�0.001). Cognitively intact patients (Abbreviated Memory Test: 8–10) had 2.3 times greater FAI values (65% higherNEADL) compared with impaired patients (P�0.001).

Conclusions—The NEADL scale was symmetrical, concurrently valid with no floor and ceiling effects. It correspondedbetter with BI than FAI did confirming its basic activities of daily living properties, yet it is a more sensitive tool forextended activities of daily living without the floor and ceiling effects. Future functional status could be predicted bythe acute stage National Institutes of Health Stroke Scale score, whereas only extended activities of daily living statuscould be predicted by the Abbreviated Memory Test score. Predicting future functional status at the acute stage maydecrease unnecessary length of stay in acute care settings. (Stroke. 2012;43:1362-1369.)

Key Words: agreement � Barthel Index � Frenchay Activities Index � Nottingham Extended Activities of Daily Living� NIH Stroke Scale � stroke register

The Barthel Index (BI) is a commonly used measure ofactivities of daily living (ADL) for patients with

stroke.1,2 It covers basic self-care activities and is known tohave a marked ceiling effect (the percentage of subjectswith the maximum possible score) and also a floor effect(the percentage of subjects with the minimum possiblescore) depending on the time of poststroke measurement.1

The use of an instrument that assesses only basic ADL

might give a restricted view of a patient’s functionalstatus.3 It is often necessary to supplement basic ADLmeasurements with extended activities of daily living(EADL) measures to also assess more complex activities inthe home environment.3,4 There are no universally ac-knowledged measures regarding EADL. The FrenchayActivities Index (FAI) is a frequently used EADL scale formeasuring stroke outcome.5 Another proposed EADL scale

Received November 27, 2011; final revision received February 14, 2012; accepted February 23, 2012.From the Division of Health and Social Care Research (S.-J.S., A.G.R., A.D., C.D.A.W.), King’s College London, London, UK; National Institute for

Health Research Comprehensive Biomedical Research Centre (A.G.R., A.D., C.D.A.W.), Guy’s & St. Thomas’ National Health Service Foundation Trustand King’s College London, London, UK; and the Centre for Experimental Cancer Medicine (S.-J.S.), Barts Cancer Institute—a CR-UK Centre ofExcellence, Queen Mary, University of London, London, UK.

This article presents independent research commissioned by the National Institute for Health Research (NIHR) under the Biomedical Research Centreaward. The views expressed in this article are those of the authors and not necessarily those of the National Health Service, the NIHR, or the Departmentof Health.

The online-only Data Supplement is available with this article at http://stroke.ahajournals.org/lookup/suppl/doi:10.1161/STROKEAHA.111.645234/-/DC1.

Correspondence to Shah-Jalal Sarker, PhD, Lecturer in Biostatistics, Old Anatomy Building, Charterhouse Square, London EC1M 6BQ, UK. [email protected]

© 2012 American Heart Association, Inc.

Stroke is available at http://stroke.ahajournals.org DOI: 10.1161/STROKEAHA.111.645234

1362

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 2: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

is the Nottingham Extended Activities of Daily Living(NEADL) scale.5,6

A useful measure should be reliable, valid, and responsiveand should not contain floor or ceiling effects.1 Only fewcomparisons of psychometric properties (mainly responsive-ness) of the FAI have been made with those of the BI in theliterature.3,7,8 It has been reported that the FAI supplementsthe BI with minimal overlap in content.3 The use of the BI inthe subacute phase of stroke was recommended, whereas theFAI was recommended in the longer term.8 However, most ofthe previous studies were performed on selected groups ofpatients such as recruited from a hospital or rehabilitationsetting, possibly limiting the external validity of theirfindings.3,8 Unlike the FAI, the properties of the NEADLscale have not been studied and compared with those of theBI in the literature and to the best of our knowledge, nostudies have been reported assessing the agreement of FAIand NEADL with BI. Additionally, little is known aboutbaseline variables affecting poststroke BI, FAI, andNEADL scores.7

The aims of this present study are to (1) compare theFAI and NEADL scales with the BI in terms of distributionof scores, concurrent validity, reliability, and their agree-ment; and (2) investigate for common baseline predictorsof poststroke scales outcomes. Data were used from thepopulation-based multiethnic South London Stroke Regis-ter (SLSR).

MethodsA population-based stroke register, recording first-ever strokes inpatients of all age groups for a defined area of South London, wasstarted in January 1995 with follow-up interviews at 3 months, 1year, and annually thereafter.9,10 To maximize case ascertainment,16 different overlapping sources were used and standardizedmethods for ensuring completeness of case ascertainment wereestablished. The detailed methods of notification of patients anddata collection have been described elsewhere.11 For this study,consecutive patients who had their stroke from August 2002 toOctober 2004 were selected because data on NEADL wereavailable only for this period. Follow-up information up to 3months was used in this study.

The World Health Organization definition of stroke was used.Classification of the pathological subtype (cerebral infarction, pri-mary intracerebral hemorrhage, and subarachnoid hemorrhage) wasbased on results from at least 1 of the following: brain imagingwithin 30 days of stroke onset (CT or MRI), cerebrospinal fluidanalysis, or postmortem examination; 94% of the patients receivedtheir CT scan within 7 days of their stroke.

Information collected at initial assessment included self-definitionof ethnic origin (1991 census question). Ethnic origin was stratifiedinto black, white, and other. Stroke severity (time of maximumimpairment) was assessed at acute stage/baseline with the NationalInstitutes of Health Stroke Scale (NIHSS; categorized as: severe,�13; moderate/mild, �13).12 Comorbidities included atrial fibrilla-tion, diabetes, and hypertension (�140/90 mm Hg).13 Prestrokedisability was measured by prior BI (categorized as: independent,BI�20; dependent, BI �20). Cognitive impairment was measuredusing 7-days Abbreviated Memory Test (AMT; categorized as:cognitively intact, 8–10 and cognitively impaired, 0–7).14 Theservice provision variable categorized patients into those who weretreated in a stroke unit at any time point, in a general medical wardonly, and those who were not admitted to a hospital.15 Three monthsafter the stroke, survivors were interviewed by trained researchers toassess the BI, FAI, and NEADL scales.

Statistical MethodsThe BI, FAI, and NEADL have different ranges: BI from 0 to 20,FAI from 0 to 45, and NEADL from 0 to 66 hampering theircomparison. Therefore, the scores from each scale were linearlytransformed to a 0 to 100 range using the formula: 100�(observedscore�minimum possible score)/score range. Distributions of totaloutcome scale scores were displayed using notched box-and-whiskerplot. Concurrent validity was measured using Spearman rank corre-lation coefficient. The intraclass correlation coefficient (ICC), themost suitable and most commonly used reliability parameter forcontinuous measures, was used (based on 2-way random effectsmodel with absolute agreement) to measure reliability.16 Concor-dance correlation coefficient, a methodologically better measure thanICC because it takes into account of the precision (how far eachobservation deviates from the best-fit line) and accuracy (how far thebest-fit line deviates from the 45° line through the origin), was alsoused for comparison purposes.17 Another useful measure coefficientof variation for duplicate measurements was used to compare thescales pairwise variation.18 The relationship between each pair ofscales was studied using scatterplot and fractional polynomial with95% CI.19 Fractional polynomials of degree 2, 3, and 1 were fittedfor the pairs FAI and BI, NEADL and BI, and NEADL and FAI,respectively, based on the results of deviance tests and relativeprobability values were used to decide the final degree. Theagreement between each pair of scales was studied using Bland andAltman plots.20 Also a folded empirical cumulative distributionplot/mountain plot (created by computing a percentile for eachranked difference between a new method and a reference method)was drawn as a useful complementary plot to the Bland and Altmanplot to compare different distributions easily.21

Due to overdispersion, multivariable negative binomial regres-sion models were used for identifying predictors of the 3 outcomescales based on age, ethnicity, comorbidities, prestroke BI,subtype, NIHSS, AMT, and stroke unit care using untransformed(original) outcome scales scores as regression is dependent onchange of scale.22 Sex was left out of the model because it did nothave any significant effect in this analysis and also in ourprevious analysis of the register data.15,23 Also, for some predic-tors, categories with few cases were merged together to increasethe validity of the model.

Characteristics of dropouts were compared with the includedpatients to assess the generalizability of our results and alsosensitivity analysis was done to assess selection bias in multivariableanalysis results due to complete case analysis. Probability values�0.05 were defined as significant based on a 2-tailed test. Allcalculations were carried out using the statistical software packageIntercooled STATA 10.1 (Stata Corp, College Station, TX) forWindows except mountain plot, notched box-and-whisker plot,coefficient of variation, and correlations that were done usingMedCalc software for Windows Version 12 (MedCalc Software,Mariakerke, Belgium).

EthicsEthical approval was obtained from the ethics committee of Guy’sand St Thomas’ Hospital Trust, King’s College Hospital, andWestminster Hospital (London).

ResultsAmong the 395 patients with stroke with a first-ever stroke,64 patients died before a 3-month poststroke interview. Finally,a total of 238 patients were considered for the analysis afterfurther excluding 93 patients who had missing information onFAI, NEADL, or BI mainly due to loss to follow-up.

Characteristics of Analyzed PatientsOf the 238 patients, 205 had ischemic strokes, 20 primaryintracerebral hemorrhage, 9 subarachnoid hemorrhage, and 4unclassified. The sociodemographic and prestroke risk factors

Sarker et al Comparison of EADL Scales With BI 1363

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 3: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

of the sample are shown in Table 1. Overall 162 (69%) of thepatients were white, 55 (23%) were black, and 21 (8%) werefrom other ethnic groups or missing ethnicity. The mean agewas 69 years (SD, 14 years). Seventy-four percent of thepatients had at least 1 present comorbid disease, whereas 26%did not have any. Eighty-one percent of the patients had aprestroke BI of 20 and only 13% patients had a severe stroke(NIHSS �13). A total of 74% patients had no abnormalitiesin their cognitive tests (AMT: 8–10). Seventy-seven percentof the patients were treated on a stroke unit, whereas 16%were treated on general medical wards and 7% patients werenot admitted to hospitals.

Distribution of Outcome Scale ScoresAt the 3-month follow-up, a significantly large number (19%)of patients had a minimum value in FAI (floor effect) ascompared with only 2% in BI and 4% in NEADL (P�0.001),whereas significantly more patients (33%) had maximum BIvalues (ceiling effect) compared with the proportion ofpatients with maximum NEADL (4%) or FAI (0%; P�0.001;Figure 1). Figure 1 also shows that median transformed totalscores (interquartile range) for FAI, BI, and NEADL were,respectively 24 (4–53), 90 (65–100), and 49 (16–81) with100 the maximum score achievable. Hence, the distributionof FAI was positively skewed (skewness�0.54), whereas thedistribution of BI was negatively skewed (skewness��1.38)and the distribution of NEADL was symmetrical.

Pairwise Association Between the OutcomeScale ScoresThe Spearman rank correlation coefficient showed that the 3scales were highly significantly concurrently valid withrespect to each other (rs �0.80, P�0.001; Table 2). However,ICC showed lower pair wise correlations (reliability) andonly correlation between NEADL and FAI (ri�0.75; CI,0.06–0.91) was significant in this case. Concordance corre-lation coefficient gave almost identical values as of ICC butmore precise. The correlations were also significantly differ-ent from each other like the Spearman rank correlation. All 3measures showed that the correlation between NEADL andBI was higher than that between FAI and BI. This isconsistent with the lower coefficient of variation betweenNEADL and BI than that of between FAI and BI (Table 2).As expected, the correlation between the 2 EADL scales, FAIand NEADL, was the highest.

Agreement Between the Outcome Scale ScoresResults of the deviance test including model R2 for the fittedfractional polynomial models in Figures 2 and 3 are given inonline-only Data Supplement Table I (http://stroke.ahajournals.org). There was a nonlinear relationship between FAI and BI(Figure 2A) as well as between NEADL and BI (Figure 2C).Only very few points lied along the line of equality suggest-ing a poor agreement of FAI and NEADL with the BI.Bland-Altman plot was V-shaped in both pairs suggestingthat the best agreement occurred for patients with a low orhigh level of ADL. However, scatterplots showed that forthose measured with high BI, both FAI and NEADL had hugeranges; similarly for low FAI or NEADL, there were hugeranges of BI. BI measured higher ADL compared with theFAI and NEADL, particularly for average active patients.The mean difference between FAI and BI was �47 (Figure2B) and between NEADL and BI was �29 (Figure 2D)suggesting that the agreement between NEADL and BI washigher than that between FAI and BI.

The relation between NEADL and FAI appeared to be quadratic(Figure 3A). The agreement between these 2 scales was fair.NEADL estimated a higher activity than FAI, particularly foraverage active patients (Figure 3B). The mean difference(NEADL–FAI) was 18 and the 95% limits of agreement (�11 and

Figure 1. Notched box-and-whisker plot with dots for distribu-tion of scores for the 3 scales: Barthel Index (BI), FrenchayActivities Index (FAI), and Nottingham Extended Activities ofDaily Living (NEADL).

Table 1. Description of Sample Characteristics at Baseline

CharacteristicPatients (n�238)

No./Total (%)

Age, mean y (SD) 68.6 (14.2)

Female sex 114/238 (47.9)

Ethnicity

White 162/236 (68.6)

Black 55/236 (23.3)

Other (including Asian) 19/236 (8.1)

Comorbidities

Diabetes 45/231 (19.5)

Arial fibrillation 34/198 (14.7)

Hypertension (�140/90 mm Hg) 155/233 (66.5)

Prestroke Barthel Index (20) 189/234 (80.8)

Subtype

Infarction 205/238 (86.1)

PICH 20/238 (8.4)

SAH 9/238 (3.8)

Unclassified 4/238 (1.7)

NIHSS (�13) 23/183 (12.6)

Cognitively intact (AMT: 8–10) 130/177 (73.5)

Service provision

Hospital (nonstroke unit) 39/238 (16.4)

Stroke unit 183/238 (76.9)

Nonhospital (community) 16/238 (6.7)

PICH indicates primary intracerebral hemorrhage; SAH, subarachnoid hem-orrhage; NIHSS, National Institutes of Health Stroke Scale; AMT, AbbreviatedMemory Test.

1364 Stroke May 2012

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 4: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

48) were the smallest of all the comparisons indicating the highestagreement between these 2 scales. The inverse V-shaped Bland-Altman plot (Figure 3B) suggested the best agreement occurred forpatients with a low or high level of ADL.

Figure 3C clearly shows that NEADL agrees more with the BIthan FAI does because the fitted fractional polynomial line for theNEADL is closer to the line of equality. In the folded empiricalcumulative distribution plot (Figure 3D), the mountains are notcentered over 0 and hence the 2 scales, NEADL and FAI, were not

unbiased with respect to BI. The differences of BI with NEADLtend to be smaller (median difference, 29) than the differences of BIwith FAI (median difference, 49). Therefore, NEADL correspondsbetter with the BI than does FAI.

Predictors for the Functional OutcomesWhen looking at predictors for the 3 scales outcomes, theNIHSS score was a significant predictor for all 3 scales at the3-month follow-up (Table 3). A patient with severe stroke

Figure 2. Pairwise comparison of activities of daily living (ADL) scores for the scales: Barthel Index (BI), Frenchay Activities Index (FAI),and Nottingham Extended Activities of Daily Living (NEADL). Left side (A and C): Scatterplot with fractional polynomial regression lineand right side (B and D): Bland and Altman plot.

Table 2. Pairwise Association Among the Scores of the 3 Scales: Barthel Index, FrenchayActivities Index, and Nottingham Extended Activities of Daily Living

Spearman RankCorrelation (CI)

IntraclassCorrelation (CI)

ConcordanceCorrelation (CI)

Coefficient ofVariation*

BI and FAI 0.80 (0.74–0.84) 0.27 (�0.09 to 0.60) 0.27 (0.22–0.32) 69.21

BI and NEADL 0.88 (0.84–0.90) 0.55 (�0.07 to 0.80) 0.54 (0.48–0.60) 39.94

NEADL and FAI 0.90 (0.88–0.92) 0.75 (0.06 to 0.91) 0.75 (0.70–0.79) 41.85

BI indicates Barthel Index; FAI, Frenchay Activities Index; NEADL, Nottingham Extended Activities of Daily Living.*Coefficient of variation for duplicate measurements.

Sarker et al Comparison of EADL Scales With BI 1365

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 5: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

(NIHSS �13) had a 28% lower BI (79% lower FAI and 62%lower NEADL) score compared with a patient with mild tomoderate stroke (P�0.001). The cognition score was asignificant predictor for the EADL scales: FAI and NEADLbut not for BI. Cognitively intact patients (AMT: 8–10) hadFAI values 2.31 times greater (65% higher NEADL) com-pared with cognitively impaired patients (P�0.001). Theprestroke BI score was a predictor for BI and NEADL only.These multivariable results were based on only 140 (exclud-ing 98 from 238) patients due to missing values in baselinevariables. However, sensitivity analysis assured that therewas no bias in the prediction of future functional status basedon the NIHSS score (see the online-only Data Supplement fordetails).

Relationship of NIHSS Score With theFunctional OutcomesBecause NIHSS turned out to be a universal predictor forthe future functional status, fractional polynomial modelsof degree 2 were fitted to the see the relationship between

NIHSS and outcome scales in online-only Data Supple-ment Figure I. An inverse relationship between NIHSSscores and scales outcomes was found, which was ex-pected. Although NIHSS scores increased from 0 to 24, thepredicted scores for the BI, FAI, and NEADL decreasedroughly from 95% to 35%, 58% to 5%, and 75% to 18%,respectively, indicating the ceiling effect of BI and flooreffect of FAI influenced by the acute stage NIHSS scores.It also showed that NEADL was more sensitive (to NIHSSscores) than FAI.

Representativeness of the Analyzed Patient DataWe analyzed 238 patients after excluding 93 because theyhad missing values at least in 1 of the outcome scales due toloss to follow-up. The mean age, prior stroke disability (BI�20), severe stroke, and cognitive impairment for the ex-cluded versus included patients were, respectively 71 yearsversus 69 years (P�0.15), 24% versus 20% (P�0.29), 15%versus 13% (P�0.61), and 33% versus 26% (P�0.29). Itclearly shows that the loss to follow-up patients were not

Figure 3. Comparison of activities of daily living (ADL) scores for the 3 scales: Barthel Index (BI), Frenchay Activities Index (FAI), andNottingham Extended Activities of Daily Living (NEADL). A, C, Scatterplots with fractional polynomial regression lines; (B) Bland andAltman plot; (D) mountain plot (folded empirical cumulative distribution plot). EADL indicates extended activities of daily living.

1366 Stroke May 2012

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 6: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

significantly different from the analyzed patients, whichreassures the generalizability of our results.

DiscussionThis study has shown that BI had a more pronounced ceilingeffect compared with EADL scales, the FAI and NEADL.However, the FAI was highly affected by a floor effect,whereas the NEADL scale was symmetrical with no floor andceiling effects underlying its usefulness for stroke research.Although the scales were highly significantly concurrentlyvalid with respect to each other, the agreements between themwere poor because the NEADL and FAI were not unbiasedwith respect to BI. The FAI and NEADL agreed with BI onlyfor the highest and lowest level of activities. The NEADLcorresponded better with BI than FAI. Prediction of BI, FAI,and NEADL was possible by only 1 universal predictor, theNIHSS, suggesting its predictive power for future functionalstatus. Cognition was a significant predictor only for theEADL scales: FAI and NEADL.

One third of the patients with first-ever stroke whosurvived the first 3 months after a stroke were independentin their ADL. The distributional findings regarding theceiling effect of BI (33%) and floor effect of FAI (19%)are in line with previous research.7,24 Several authorsconcluded that floor and ceiling effects are considered tobe present if �15% of respondents achieve the lowest orhighest possible score, respectively.16,25 Therefore, al-though BI suffered from a ceiling effect and FAI sufferedfrom a floor effect, NEADL was free from both of theseeffects. The consequence of presence of either floor orceiling effect is that the patients with lowest or highestpossible score cannot be distinguished from each other;thus, reliability is reduced and also responsiveness islimited because changes cannot be measured in thesepatients.16 Another criticism of the FAI was that it does notrecognize some activities such as voluntary work ortelephoning.26 Moreover, it does not take into account

newer activities such as computer work. In contrast, theNEADL takes into account telephoning. Interestingly, thedistribution of NEADL scores was symmetrical furthersuggesting it to be a more useful outcome measure forEADL than FAI.

Both ICC and concordance correlation coefficient areparametric in nature and hence CIs may not be valid due tofloor and ceiling effects of the 2 outcome scales. Logtransformation to make the distribution of scores normal wasnot appropriate because the scores included 0. CI under theSpearman rank correlation coefficient seems more acceptablethan others because it is based on ranks rather than actualvalues. All 3 scales had high concurrent validity (rs �0.80).27

However, reliability exists between the 2 EADL scales only(ri�0.75) because ICC�0.70 is often recommended as aminimum standard for reliability.16 It makes sense, becausereliability measures the degree to which patients can bedistinguished (eg, with more or less severe disease). Irrespec-tive of methods, the concurrent validity and reliability (if any)between NEADL and BI were higher than those of betweenFAI and BI.

Agreement concerns how close the scores of the 2 scalesare to one another. The higher agreement of NEADL withBI than that of FAI with BI might be justified due to thelowest coefficient of variation between NEADL and BIfound in this study and minimal overlap between FAI andBI reported in the literature.3 This suggests that NEADLmay add information about ADL lacking in the BI. Someauthors combined scales in stroke trials to derive a globalstatistics to better define the effect of acute interventions.28

Others showed that such combined score had a muchimproved distribution without strong ceiling or floor ef-fects.3,29 In this case, FAI may be combined with the BI tocover the whole spectrum of activities due to the highestcoefficient of variation and minimal overlap in terms ofcorrelation and agreement.

Table 3. Factors Predicting Barthel Index, Frenchay Activities Index, or Nottingham Extended Activities of Daily LivingScore (Negative Binomial Regression Analysis)*

Variables

BIModel LR �2�46.05, P�0.001

FAIModel LR �2�53.37, P�0.001

NEADLModel LR �2�66.25, P�0.001

Relative Change† P Value Relative Change† P Value Relative Change† P Value

Age 0.99 0.67 0.99 0.21 0.99 0.21

White ethnicity 0.99 0.82 0.95 0.79 0.89 0.31

Diabetes 1.04 0.45 1.11 0.63 1.15 0.29

Atrial fibrillation 0.92 0.23 0.72 0.18 0.79 0.14

Hypertension 0.97 0.58 0.87 0.42 0.92 0.49

Infarct 0.97 0.66 0.71 0.17 0.76 0.08

Prestroke (BI�20) 1.28 �0.001 1.42 0.15 1.56 0.003

NIHSS (�13) 0.72 0.001 0.21 �0.001 0.38 �0.001

AMT (8–10) 1.08 0.14 2.31 �0.001 1.65 �0.001

Stroke unit care 0.91 0.08 0.82 0.31 0.86 0.21

BI indicates Barthel Index; FAI, Frenchay Activities Index; NEADL, Nottingham Extended Activities of Daily Living; LR, likelihood ratio; NIHSS, NationalInstitutes of Health Stroke Scale; AMT, Abbreviated Memory Test.

*Analysis was restricted to patients without missing values (n�140).†(Adjusted) relative change�exponential (estimated negative binomial regression coefficients for the model).

Sarker et al Comparison of EADL Scales With BI 1367

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 7: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

Appelros7 found that age, NIHSS, and cognition weresignificantly related with FAI at 1-year poststroke. In ouranalysis, NIHSS was a universal predictor for all 3 scalesoutcomes suggesting its predictive power for future func-tional status. Kasner28 also concluded that NIHSS was usefulfor early prognostication. In our study, cognition was asignificant predictor for FAI and NEADL but not for BI. Thismight be due to the requirement of preserved mental skills formany items of EADL that are obtained by FAI and NEADLsuch as reading newspapers or books. Unlike the findings ofAppelros, age was not related to FAI. This may be due to aconfounding effect in Appelros univariable analysis. Ourresult was based on multivariable-adjusted models and henceshould be more reliable. Prestroke BI was not only asignificant predictor for poststroke BI, but also forNEADL emphasizing the general correspondence betweenBI and NEADL.

Schlegel et al12 showed in a retrospective analysis thatNIHSS predicted postacute care disposition among patientswith stroke. Our prospective study showed that patients withsevere stroke had 28% lower BI (79% lower FAI and 62%lower NEADL) score than nonsevere patients. On the otherhand, cognitively intact patients had more than double FAIvalues (65% higher NEADL) compared with impaired pa-tients. This means that the acute stage scores of the NIHSScan be successfully used for prediction/prognostication offuture functional status of patients with stroke, whereasAMT score can only be used for EADL. Predicting futurefunctional status at the acute stage may decrease unneces-sary length of stay in the acute care setting and perhaps myalso facilitate the planning of poststroke rehabilitationprograms.

There are strengths and limitations of our study. We hadonly complete information for 72% of all stroke survivors;however, they were not significantly different from thedropouts. The fact is that more severe patients are likely toproduce missing values than less severe patients, but sensi-tivity analysis did not warrant any significant bias to applymissing data treatment. Our data were derived from apopulation-based sample, which increases the external valid-ity of the findings. All data were collected by speciallytrained researchers using standardized follow-up methods.However, we did not formally check interrater reliability ofour data collection, although the reliability of FAI andNEADL was reported to be lower than that of BI in theliterature.

ConclusionsThe NEADL scale was symmetrical with no floor and ceilingeffects in a general stroke population. Despite high con-current validity of the scales, they agreed poorly only forthe highest and the lowest level of activities. The NEADLcorresponded better with BI than FAI did confirming itsbasic ADL properties, yet it is a more sensitive tool forEADL without the floor and ceiling effects. Future func-tional status could be predicted by the universal predictorNIHSS, whereas only EADL status could be predicted bythe AMT score. Predicting future functional status at the

acute stage may decrease unnecessary length of stay inacute care settings.

AcknowledgmentsWe thank all the patients and their families and the healthcareprofessionals involved. We are also very thankful for the anonymousclinical and statistical reviewers for their very helpful comments.

Sources of FundingFunding for the Register has been provided through the Northern &Yorkshire National Health Service (NHS) R & D Programme inCardiovascular Disease and Stroke, Guy’s and St Thomas’ HospitalCharity, Stanley Thomas Johnson Foundation, The Stroke Associa-tion, Department of Health HQIP grant, and the National Institute forHealth Research Programme Grant (RP-PG-0407-10184). The au-thors acknowledge financial support from the Department of Healththrough the National Institute for Health Research (NIHR) Biomed-ical Research Centre award to Guy’s & St Thomas’ NHS FoundationTrust in partnership with King’s College London. Charles Wolfe isan NIHR Senior Investigator.

DisclosuresNone.

References1. Martinsson L, Eksborg S. Activity index—a complementary ADL scale

to the Barthel Index in the acute stage in patients with severe stroke.Cerebrovasc Dis. 2006;22:231–239.

2. Hsueh IP, Wang CH, Sheu CF, Hsieh CL. Comparison of psychometricproperties of three mobility measures for patients with stroke. Stroke.2003;34:1741–1745.

3. Pedersen PM, Jorgensen HS, Nakayama H, Raaschou HO, Olsen TS.Comprehensive assessment of activities of daily living in stroke. TheCopenhagen Stroke Study. Arch Phys Med Rehabil. 1997;78:161–165.

4. Carod-Artal FJ, Gonzalez-Gutierrez JL, Herrero JA, Horan T, De SeijasEV. Functional recovery and instrumental activities of daily living:follow-up 1-year after treatment in a stroke unit. Brain Inj. 2002;16:207–216.

5. Wu CY, Chuang LL, Lin KC, Horng YS. Responsiveness and validity oftwo outcome measures of instrumental activities of daily living in strokesurvivors receiving rehabilitative therapies. Clin Rehabil. 2010;25:175–183.

6. Hsueh IP, Huang SL, Chen MH, Jush SD, Hsieh CL. Evaluation of strokepatients with the extended activities of daily living scale in Taiwan.Disabil Rehabil. 2000;22:495–500.

7. Appelros P. Characteristics of the Frenchay Activities Index one yearafter a stroke: a population-based study. Disabil Rehabil. 2007;29:785–790.

8. Schepers VP, Ketelaar M, Visser-Meily JM, Dekker J, Lindeman E.Responsiveness of functional health status measures frequently used instroke research. Disabil Rehabil. 2006;28:1035–1040.

9. Wolfe CD, Rudd AG, Howard R, Coshall C, Stewart J, Lawrence E, et al.Incidence and case fatality rates of stroke subtypes in a multiethnicpopulation: the South London Stroke Register. J Neurol Neurosurg Psy-chiatry. 2002;72:211–216.

10. Smeeton NC, Heuschmann PU, Rudd AG, McEvoy AW, Kitchen ND,Sarker SJ, et al. Incidence of hemorrhagic stroke in black Caribbean,black African, and white populations: the South London Stroke Register,1995–2004. Stroke. 2007;38:3133–3138.

11. Stewart JA, Dundas R, Howard RS, Rudd AG, Wolfe CD. Ethnic dif-ferences in incidence of stroke: prospective study with stroke register.BMJ. 1999;318:967–971.

12. Schlegel D, Kolb SJ, Luciano JM, Tovar JM, Cucchiara BL, LiebeskindDS, et al. Utility of the NIH Stroke Scale as a predictor of hospitaldisposition. Stroke. 2003;34:134–137.

13. Hajat C, Tilling K, Stewart JA, Lemic-Stojcevic N, Wolfe CD. Ethnicdifferences in risk factors for ischemic stroke: a European case–controlstudy. Stroke. 2004;35:1562–1567.

14. Jitapunkul S, Pillay I, Ebrahim S. The Abbreviated Mental Test: its useand validity. Age Ageing. 1991;20:332–336.

15. Sarker SJ, Heuschmann PU, Burger I, Wolfe CD, Rudd AG, SmeetonNC, et al. Predictors of survival after haemorrhagic stroke in a multi-

1368 Stroke May 2012

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 8: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

ethnic population: the South London Stroke Register (SLSR). J NeurolNeurosurg Psychiatry. 2008;79:260–265.

16. Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, DekkerJ, et al. Quality criteria were proposed for measurement properties ofhealth status questionnaires. J Clin Epidemiol. 2007;60:34–42.

17. Lin LI-K. A note on the concordance correlation coefficient. Biometrics.2000;56:324–325.

18. Jones RG, Payne RB. Clinical Investigation and Statistics in LaboratoryMedicine. ABC Venture Publications London; 1997.

19. Royston P, Altman DG. Approximating statistical functions by usingfractional polynomial regression. Statistician. 1997;46:411–422.

20. Bland JM, Altman DG. Statistical methods for assessing agreementbetween two methods of clinical measurement. Lancet. 1986;1:307–310.

21. Krouwer JS, Monti KL. A simple, graphical method to evaluate labo-ratory assays. Eur J Clin Chem Clin Biochem. 1995;33:525–527.

22. Hilbe JM. Negative Binomial Regression. Cambridge, UK: CambridgeUniversity Press; 2007.

23. Wolfe CDASN, Coshall C, Tilling K. Survival differences after stroke ina multiethnic population: follow-up study with the South London StrokeRegister. BMJ. 2005.

24. Quinn TJ, Langhorne P, Stott DJ. Barthel Index for stroke trials: devel-opment, properties, and application. Stroke. 2011;42:1146–1151.

25. McHorney CA, Tarlov AR. Individual-patient monitoring in clinicalpractice: are available health status surveys adequate? Qual Life Res.1995;4:293–307.

26. Bond MJ, Clark MS, Smith DS, Harris RD. Lifestyle activities of theelderly: composition and determinants. DisabilRehabil. 1995;17:63–69.

27. Munro B. Statistical Methods for Health Care Research. Philadelphia,PA: Lippincott Williams & Wilkins; 2005.

28. Kasner SE. Clinical interpretation and use of stroke scales. Lancet Neurol.2006;5:603–612.

29. Hsieh CL, Hsueh IP. A cross-validation of the comprehensive assessmentof activities of daily living after stroke. Scand J Rehabil Med. 1999;31:83–88.

Sarker et al Comparison of EADL Scales With BI 1369

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 9: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

Shah-Jalal Sarker, Anthony G. Rudd, Abdel Douiri and Charles D.A. Wolfe(SLSR)

Predictors of Their Outcomes: Cohort Study Within the South London Stroke Register Comparison of 2 Extended Activities of Daily Living Scales With the Barthel Index and

Print ISSN: 0039-2499. Online ISSN: 1524-4628 Copyright © 2012 American Heart Association, Inc. All rights reserved.

is published by the American Heart Association, 7272 Greenville Avenue, Dallas, TX 75231Stroke doi: 10.1161/STROKEAHA.111.645234

2012;43:1362-1369; originally published online March 29, 2012;Stroke. 

http://stroke.ahajournals.org/content/43/5/1362World Wide Web at:

The online version of this article, along with updated information and services, is located on the

http://stroke.ahajournals.org/content/suppl/2012/03/29/STROKEAHA.111.645234.DC1Data Supplement (unedited) at:

  http://stroke.ahajournals.org//subscriptions/

is online at: Stroke Information about subscribing to Subscriptions: 

http://www.lww.com/reprints Information about reprints can be found online at: Reprints:

  document. Permissions and Rights Question and Answer process is available in the

Request Permissions in the middle column of the Web page under Services. Further information about thisOnce the online version of the published article for which permission is being requested is located, click

can be obtained via RightsLink, a service of the Copyright Clearance Center, not the Editorial Office.Strokein Requests for permissions to reproduce figures, tables, or portions of articles originally publishedPermissions:

by guest on June 17, 2018http://stroke.ahajournals.org/

Dow

nloaded from

Page 10: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

ONLINE SUPPLEMENT

Comparison of two Extended Activities of Daily Living scales with the Barthel Index

and predictors of their outcomes– Cohort study within the South London Stroke

Register (SLSR)

Shah-Jalal Sarker, PhD1,3

; Anthony G Rudd, MD1,2

; Abdel Douiri, PhD1,2

and Charles DA

Wolfe, MD1,2

Address for correspondence

Shah-Jalal Sarker

Lecturer in Biostatistics

Old Anatomy Building, Charterhouse Square

London EC1M 6BQ

Tel: + 44 (0)20 7882 8495

Fax: + 44 (0)20 7882 5958

Email: [email protected]

Division of Health and Social Care Research, King’s College London, London, UK.1

NIHR Comprehensive Biomedical Research Centre, Guy’s & St. Thomas’ NHS Foundation

Trust and King’s College London, London, UK.2

Centre for Experimental Cancer Medicine, Barts Cancer Institute - a CR-UK Centre of

Excellence, Queen Mary, University of London, London, UK.3

Page 11: Comparison of 2 Extended Activities of Daily Living Scales ...stroke.ahajournals.org/content/strokeaha/43/5/1362.full.pdf · With the Barthel Index and Predictors of Their Outcomes

ONLINE SUPPLEMENT

Sensitivity of the multivariable analysis results The multivariable negative binomial regression analysis in Table 3 was based on 140 patients

excluding 98 patients who had at least one baseline variable missing. The median scores for

the excluded versus included patients were (85 vs 92.5, p=0.21), (15.6 vs 26.7, p=0.02) and

(38.7 vs 52.7, p=0.11) for the BI, FAI and NEADL respectively. The median NIHSS score

for the excluded patients (n=43, median=9) was significantly higher than that of included

patients (n=140, median=5), p<0.01. This shows that analysed patients had slightly less sever

stroke and hence higher functional status. To check the bias in NIHSS score based prediction,

univariable negative binomial regression models were fitted. The relative change in the BI,

FAI and NEADL scores for the excluded (n=43) versus included (n=140) patients were

respectively (0.69, p=0.03 vs 0.69, p<0.001), (0.56, p=0.19 vs 0.14, p<0.001) and (0.61,

p=0.13 vs 0.29, p<0.001). The prediction for ADL was clearly unbiased. However, it was not

possible to assess the bias for EADL prediction as the model fitted for the excluded patients

were not significant.

Table S1. Results of the deviance tests for fitted fractional polynomial models: BI, FAI and

NEADL

Fractional

Polynomial

(FP) Model†

Degree of

FP

Deviance Residual

Standard

Deviation

Deviance

difference

*P-value

Model

R2

FPFAI versus BI 2 2051 18 5.39 0.07 0.59

FPNEADL versus BI 3 1988 16 0.07 0.96 0.77

FPNEADL versus FAI 1 1925 14 4.09 0.13 0.83

*P-value from deviance difference comparing reported model with the respective next higher degree model

†BI=Barthel Index, FAI=Frenchay Activities Index and NEADL=Nottingham Extended Activities of Daily

Living.

Figure S1. The relationship between NIHSS and outcome scales: Barthel Index (BI),

Frenchay Activities Index (FAI) and Nottingham Extended Activities of Daily Living

(NEADL) using fractional polynomial regression.

020

40

60

80

10

0

Score

(%

) a

t 3-m

onth

s

0 5 10 15 20 25NIHSS at acute stage

Predicted BI (%) at 3-months

Predicted NEADL (%) at 3-months

Predicted FAI (%) at 3-months