19
Vol. 102, No. 4, 2012 381 Analytical and Theoretical Plant Pathology The Area Under the Disease Progress Stairs: Calculation, Advantage, and Application Ivan Simko and Hans-Peter Piepho First author: U.S. Department of Agriculture-Agricultural Research Service, Crop Improvement and Protection Research Unit, 1636 East Alisal Street, Salinas, CA 93905; and second author: Institut für Kulturpflanzenwissenschaften, Universität Hohenheim, Fruwirthstrasse 23, D-70593 Stuttgart, Germany. Accepted for publication 16 November 2011. ABSTRACT Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application. Phytopathology 102:381- 389. The area under the disease progress curve (AUDPC) is frequently used to combine multiple observations of disease progress into a single value. However, our analysis shows that this approach severely underestimates the effect of the first and last observation. To get a better estimate of disease progress, we have developed a new formula termed the area under the disease progress stairs (AUDPS). The AUDPS approach improves the estimation of disease progress by giving a weight closer to optimal to the first and last observations. Analysis of real data indicates that AUDPS outperforms AUDPC in most of the tested trials and may be less precise than AUDPC only when assessments in the first or last observations have a comparatively large variance. We propose using AUDPS and its stan- dardized (sAUDPS) and relative (rAUDPS) forms when combining multi- ple observations from disease progress experiments into a single value. Two types of resistance to pathogens have been described and used in plant breeding. One type is a qualitative resistance that is conferred by a series of major dominant genes that render the host incompatible with the pathogen. The other type is a quantitative resistance that is usually conferred by the effect of multiple genes of minor effects (1,9). In quantitative resistance, where differ- ences in level of resistance are usually less distinct, measuring disease progress is important for understanding plant–pathogen interaction. Disease typically starts at a low level, gradually in- creasing in incidence and/or severity over time. Progress of dis- ease on plants is usually observed several times during pathogen epidemics. Extent of disease is assessed at each observation using scales that are based on disease incidence, severity, or a combi- nation of both. To combine these repeated observations into a single value, Van der Plank (chapter 12 of literature citation 13) proposed calculating the area under the disease progress curve (AUDPC). If progress of disease fits one of the characterized models (3,4), parameters estimated from this model can be used for calculating AUDPC. However, if there is no credible evidence that one of the standard models fits the observed disease progress data or if the disease progress data varies substantially in shape for different factor levels, then the empirical determination of AUDPC is necessary (5). Our analysis shows that the AUDPC approach systematically undervalues the effects of the first and last observations. The main objectives of this study were (i) to present a new method for com- bining disease progress data that would give an appropriate weight to these two observations, and (ii) to compare the performance of this new method with AUDPC. THEORY AND APPROACHES Formulas for AUDPC and AUDPS. A common approach to determine AUDPC is through a simple midpoint (trapezoidal) rule (Fig. 1A) that breaks up a disease progress curve into a series of trapezoids, calculating the area of each, and then adding up the areas (5): ) ( 2 1 1 1 1 i i n i i i t t y y AUDPC × + = + = + (1) where y i is an assessment of a disease (percentage, proportion, ordinal score, etc.) at the ith observation, t i is time (in days, hours, etc.) at the ith observation, and n is the total number of observations. Rearrangement of this formula shows that AUDPC is calculated by multiplying each assessment value by its respective weight (Fig. 1B) × + × + × = = + 2 2 2 1 1 2 1 1 1 2 1 n n n n i i i i t t y t t y t t y AUDPC (2) where y 1 and y n are assessments at the first and last observations, respectively, and t 1 , t 2 , t n-1 , and t n are the times of the first, second, penultimate, and last observations. The weight for each assessment is the duration between the midpoint of the previous time interval to the midpoint of the subsequent time interval ( 2 2 2 1 1 1 1 + + + = = i i i i i i i t t t t t t v ) Thus, all weights are calculated by extrapolating in two direc- tions, with the exception of the first and the last assessments for which weight is extrapolated in one direction only [ 2 / ) ( 1 2 1 t t v = for y 1 , and 2 / ) ( 1 = n n n t t v for y n ]. For example, if observations are performed every tenth day, then each assessment is multiplied by the weight of 10: 10/2 + 10/2 = 10, with the exception of the first and last assessments that are multiplied only by the weight of five: 10/2 = 5. This leads to a substantial undervaluation of the first and the last assessments in comparison to all other assessments. It is possible that in some Corresponding author: I. Simko; E-mail address: [email protected] * The e -Xtra logo stands for “electronic extra” and indicates that the online version contains two supplemental tables and one supplemental material. http://dx.doi.org/10.1094/ PHYTO-07-11-0216 This article is in the public domain and not copyrightable. It may be freely re- printed with customary crediting of the source. The American Phytopathological Society, 2012. e - Xt ra *

The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

  • Upload
    others

  • View
    42

  • Download
    0

Embed Size (px)

Citation preview

Page 1: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

Vol. 102, No. 4, 2012 381

Analytical and Theoretical Plant Pathology

The Area Under the Disease Progress Stairs: Calculation, Advantage, and Application

Ivan Simko and Hans-Peter Piepho

First author: U.S. Department of Agriculture-Agricultural Research Service, Crop Improvement and Protection Research Unit, 1636 East Alisal Street, Salinas, CA 93905; and second author: Institut für Kulturpflanzenwissenschaften, Universität Hohenheim, Fruwirthstrasse 23, D-70593 Stuttgart, Germany.

Accepted for publication 16 November 2011.

ABSTRACT

Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application. Phytopathology 102:381-389.

The area under the disease progress curve (AUDPC) is frequently used to combine multiple observations of disease progress into a single value. However, our analysis shows that this approach severely underestimates the effect of the first and last observation. To get a better estimate of disease progress, we have developed a new formula termed the area under

the disease progress stairs (AUDPS). The AUDPS approach improves the estimation of disease progress by giving a weight closer to optimal to the first and last observations. Analysis of real data indicates that AUDPS outperforms AUDPC in most of the tested trials and may be less precise than AUDPC only when assessments in the first or last observations have a comparatively large variance. We propose using AUDPS and its stan-dardized (sAUDPS) and relative (rAUDPS) forms when combining multi-ple observations from disease progress experiments into a single value.

Two types of resistance to pathogens have been described and

used in plant breeding. One type is a qualitative resistance that is conferred by a series of major dominant genes that render the host incompatible with the pathogen. The other type is a quantitative resistance that is usually conferred by the effect of multiple genes of minor effects (1,9). In quantitative resistance, where differ-ences in level of resistance are usually less distinct, measuring disease progress is important for understanding plant–pathogen interaction. Disease typically starts at a low level, gradually in-creasing in incidence and/or severity over time. Progress of dis-ease on plants is usually observed several times during pathogen epidemics. Extent of disease is assessed at each observation using scales that are based on disease incidence, severity, or a combi-nation of both. To combine these repeated observations into a single value, Van der Plank (chapter 12 of literature citation 13) proposed calculating the area under the disease progress curve (AUDPC). If progress of disease fits one of the characterized models (3,4), parameters estimated from this model can be used for calculating AUDPC. However, if there is no credible evidence that one of the standard models fits the observed disease progress data or if the disease progress data varies substantially in shape for different factor levels, then the empirical determination of AUDPC is necessary (5).

Our analysis shows that the AUDPC approach systematically undervalues the effects of the first and last observations. The main objectives of this study were (i) to present a new method for com-bining disease progress data that would give an appropriate weight to these two observations, and (ii) to compare the performance of this new method with AUDPC.

THEORY AND APPROACHES

Formulas for AUDPC and AUDPS. A common approach to determine AUDPC is through a simple midpoint (trapezoidal) rule (Fig. 1A) that breaks up a disease progress curve into a series of trapezoids, calculating the area of each, and then adding up the areas (5):

)(2 1

1

1

1ii

n

i

ii ttyy

AUDPC −×+

= +

=

+∑ (1)

where yi is an assessment of a disease (percentage, proportion, ordinal score, etc.) at the ith observation, ti is time (in days, hours, etc.) at the ith observation, and n is the total number of observations. Rearrangement of this formula shows that AUDPC is calculated by multiplying each assessment value by its respective weight (Fig. 1B)

⎥⎦

⎤⎢⎣

⎡ −×+⎥

⎤⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛ −×+⎥

⎤⎢⎣

⎡ −×= −

=

−+∑222

11

2

11121

nnn

n

i

iii

tty

tty

ttyAUDPC (2)

where y1 and yn are assessments at the first and last observations, respectively, and t1, t2, tn-1, and tn are the times of the first, second, penultimate, and last observations.

The weight for each assessment is the duration between the midpoint of the previous time interval to the midpoint of the subsequent time interval

(222

1111 −+−+ −+

−=

−= iiiiii

itttttt

v )

Thus, all weights are calculated by extrapolating in two direc-tions, with the exception of the first and the last assessments for which weight is extrapolated in one direction only [ 2/)( 121 ttv −= for y1, and 2/)( 1−−= nnn ttv for yn]. For example, if observations are performed every tenth day, then each assessment is multiplied by the weight of 10: 10/2 + 10/2 = 10, with the exception of the first and last assessments that are multiplied only by the weight of five: 10/2 = 5. This leads to a substantial undervaluation of the first and the last assessments in comparison to all other assessments. It is possible that in some

Corresponding author: I. Simko; E-mail address: [email protected]

* The e-Xtra logo stands for “electronic extra” and indicates that the online versioncontains two supplemental tables and one supplemental material.

http://dx.doi.org/10.1094 / PHYTO-07-11-0216 This article is in the public domain and not copyrightable. It may be freely re-printed with customary crediting of the source. The American PhytopathologicalSociety, 2012.

e-Xtra*

Page 2: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

382 PHYTOPATHOLOGY

special cases this lower relative weight may be desirable, but for the majority of evaluations there is no added benefit in giving only half a weight to assessments at these two observations. To give the appropriate importance to the first and the last assess-ments, weights for these assessments should also be extrapolated in both directions. We suggest extrapolating a weight in the missing direction using half of the average interval duration between observations ( )]1(2/[ −× nD ), where 1ttD n −= . This

new method that is used to calculate total area (including adjusted weight for the first and the last assessments) is termed area under the disease progress stairs (AUDPS) (Fig. 1C). The term “stairs” was chosen to reflect the fact that AUDPS corresponds to the area under the step function shown in Figure 1C, but not to the area under a curve or polygon obtained by connecting dots repre-senting the assessed values for individual observations (Fig. 1A and B). The mathematical formula for AUDPS is

When all observations are performed at regular intervals, the combined weight of all assessments is equal to )1/( −nDn and the AUDPS formula becomes

1−×=

n

DnyAUDPS (4)

where y is the arithmetic mean of all assessments. When ob-servations are performed at every time unit (e.g., hourly, daily, weekly, etc.), 1−= nD and the formula can be simplified as

nyAUDPS ×= (5)

Standardization of AUDPC and AUDPS and their statistical properties. AUDPC and AUDPS cover different time spans. While the time span for AUDPC is D, for AUDPS it is )1/( −nDn . To make the two measures more directly comparable, both should be standardized. A standardized version of AUDPC, denoted here as sAUDPC (5), can be written as

D

AUDPCsAUDPC = (6)

In the same way a standardized form of AUDPS is written as

Dn

nAUDPSsAUDPS

)1( −×= (7)

The standardized versions of AUDPC and AUDPS have the general form

∑=

=n

iii ywL

1 (8)

where wi indicates weights, subject to the constraint

11

=∑=

n

iiw

and iy indicates assessments at observations ni ,...,1= . For ex-ample, with AUDPC we have

Dvvvw i

n

jjii //

1== ∑

=

where ( ) 2121 ttv −= , ( ) 211 −+ −= iii ttv for 1,...,2 −= ni , and ( ) 21−−= nnn ttv . The measures sAUDPC and sAUDPS differ in

the definition of weights. The definition of weights follows slightly different concepts of measuring disease progress, and the choice of measure should be primarily based on a choice of con-cept. However, it is also useful to study the statistical properties of alternative measures. One possible approach is to ask what choice of weights maximizes the precision.

With ( )Tnwwww ,..,, 21= and ( )Tnyyyy ,..,, 21= , where the super-scripted T denotes a transpose of a matrix or vector, our measures can be written as ywL T= . This has a variance of VwwL T=)var( ,

Fig. 1. Graphical interpretation of area under the disease progress curve(AUDPC) and area under the disease progress stairs (AUDPS) calculations. Black line indicates a disease progress curve. A, Gray line indicates rectanglesthat are used to calculate AUDPC based on the trapezoidal rule. B, Re-arrangement of the formula shows that the same AUDPC can be calculated bymultiplying each assessment value by its weight. In this example each value ismultiplied by 10, with the exception of the first and last assessments that aremultiplied by 5. C, Calculation of AUDPS. Gray bars show the area that isadded to AUDPC after adjusting weights for the first and last observations.

( ) ( )

⎥⎦

⎤⎢⎣

⎡−

×+

+=⎥⎦

⎤⎢⎣

⎡−

×+

+⎥⎦

⎤⎢⎣

⎡ −×+

=

⎥⎥⎦

⎢⎢⎣

⎭⎬⎫

⎩⎨⎧

−+

−×+⎥

⎤⎢⎣

⎡⎟⎠

⎞⎜⎝

⎛ −×+

⎥⎥⎦

⎢⎢⎣

⎭⎬⎫

⎩⎨⎧

−+

−×=

+

=

+

−−

=

−+

1212)(

2

1222122

111

1

1

1

11

2

11121

n

DyyAUDPC

n

Dyytt

yy

n

Dtty

tty

n

DttyAUDPS

nnii

n

i

ii

nnn

n

i

iii

(3)

Page 3: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

Vol. 102, No. 4, 2012 383

where V is the variance-covariance matrix of y. The optimal choice of w will minimize var(L). If we impose the constraint

11

=∑=

n

iiw

then using the method of Lagrange multipliers, the optimal weights are found to be

( ) ( ) 11121 111,..,, −−−== VVwwww TTT

n (9)

where 1 is a vector of ones. Some of these optimal weights may become negative, depending on the form of V. We may impose the further constraint that 0≥iw for all i. In this case, there is no easily derived simple expression for the optimal weights, but we may use numerical optimization routines such as the NLPTR function in SAS/IML 9.2 (SAS Institute, Cary, NC). A theoretical investigation of optimal weights for special forms of V is given in Appendix A.

To further study the properties of AUDPC and AUDPS, we modi-fied weights of the first and last observation to minimize the over-all variance of L for empirical data sets. In other words, we assigned the same relative weights as in AUDPC and AUDPS to all obser-vations from 2 to n – 1, and then tested weights for the first and last observation to achieve a minimum variance. These optimal weights were determined by the same method as described above, using triplet y1, yn, and a weighted mean of y2 to yn-1 computed as

∑−

=

==2

2

1

2n

ii

n

iii

v

yvm , where ( ) 211 −+ −= iii ttv for 1,...,2 −= ni .

We then optimized weights w1, c, and wn in

nn ywcmywL ++= 11 (10)

subject to the constraints 11 =++ nwcw and 0,,1 ≥nwcw . Note that weights for intermediate observations are

∑−

=

=2

2

n

ii

ii

v

cvw

for 1,...,2 −= ni . Further note that when all vi are the same, we have )2/( −= ncwi . A justification of why we use the weighted mean m in equation 10 is given in Appendix B.

Tests of performance. Performance of AUDPC/sAUDPC and AUDPS/sAUDPS formulas was evaluated on a set of 50 trials from different plant–pathogen (or plant–fungicide interaction) sys-tems that are illustrative both of monocyclic and polycyclic dis-eases (Supplemental Table 1). The LB trials (1,8) represent interaction of potato (Solanum tuberosum L. and Solanum ber-thaultii L.) with Phytophthora infestans (Mont.) de Bary, while DM, PL, LD, Di, SL, and Tr trials represent interaction of lettuce (Lactuca sativa L. and Lactuca serriola L.) with Bremia lactucae Regel (DM), Erysiphe cichoracearum DC ex Mérat (PL), Sclerotinia minor Jaggar (LD), Tomato bushy stunt virus and Lettuce necrotic stunt virus (Di) (11,12), a combination of biotic and abiotic factors causing salad deterioration (SL), and Triforine-based fungicides (Tr) (10). The CS, PS, and PM data originate from trials testing resistance of sugar beet (Beta vulgaris L.) to Cercospora beticola Sacc. (CS), Erysiphe polygoni DC. (syn. E. betae (Vanha) Weltzien) (PS), and melon (Cucumis melo L.) to Podosphaera xanthii (Castagne) Braun & Shishkoff (PM). To find out which of the two methods performs better, we compared a number of different statistics. The F values, root of mean square error (RMSE), and coefficient of variation (CV) were obtained from one-way analysis of variance (ANOVA) calculated with JMP 6.0.3 (SAS Institute). Optimal weights were estimated as de-

scribed above using the nonlinear optimization subroutine NLPTR in the IML procedure of SAS 9.2 (SAS Institute). The variance-covariance matrix V was estimated by the MIXED procedure specifying an unstructured model. The Euclidean distance (ED) between relative optimal weights and relative actual weights used for calculation of AUDPC and AUDPS was calculated as

2)()(2)()(2)(1

)(1 )()()(),( a

no

naoao wwccwwaoED −+−+−=

where superscripts (o) and (a) indicate optimal and actual weights, respectively.

Statistics used to test performance. To compare performance of AUDPC/sAUDPC and AUDPS/sAUDPS methods, we used F value, RMSE, and CV from ANOVA. However, some of these statistics might have noticeable weaknesses. For example, if either AUDPC or AUDPS overestimates the difference between two accessions because of the shape of the disease progress curve, a higher F value will result from bias in the estimate, rather than improved precision. CV avoids this kind of bias because it does not depend on differences between accessions. However, CV is not invariant to shift transformations (addition of a constant). The RMSE of standardized values is probably the most suitable of the three statistics for our evaluations because it does not depend on differences between accessions and is unaffected if some shift-scale transformation is applied to sAUDPC or sAUDPS. Never-theless, for comparison purposes we show all three values to-gether with ED in Table 1. Relative performance of the two models (sAUDPC versus sAUDPS) was evaluated by comparing frequency of higher precision calculated for F value, RMSE, CV, and ED from 50 trials. The significance of difference between two frequencies was tested using a two-sample paired sign test.

RESULTS

Trials used for testing of the two formulas represent a range of different plant–pathogen systems and evaluation methods. The number of tested accessions range from 3 to 479, the number of replications within a trial is (on average) from less than 2 to 12, the duration from the first to last observation is from 3 to 120 days, the number of observations is from 3 to 18, average length of interval is from 1 to 16.5 days, and evaluations are based on six different rating scales. There may be some occasional hetero-geneity of variance between accessions at specific time points be-cause of differences in pathogen infection levels. This was ignored in ANOVA and optimal weight calculations because we could not find a simple approach to account for heteroscedasticity in the same way for all data sets. While some improvement would be possible by more sophisticated modeling, we believe that for the purpose of the present analyses it is satisfactory to work with the assumption of the constant variance.

Results from one-way ANOVA are summarized in Table 1. Comparison of F values indicates that sAUDPS outperforms sAUDPC in 35 of 50 trials. RMSE is smaller for sAUDPS in 46 trials; while 42 trials have smaller CV when sAUDPS was used for combining data from disease progress observations.

Theoretical optimal weights were calculated for the first and last observations, and the combination of all other ‘central’ obser-vations. The optimal weights were based on the assumption that smaller overall variance is favorable when assessments from all observations are combined into a single value. For comparison across data sets, it may be preferable to use relative weights. The actual relative weights used by AUDPC and AUDPS can be com-puted as follows:

∑=

=n

ii

ii

v

vw

1

and ∑

=

==n

ii

n

ii

v

vc

1

1

2

Page 4: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

384 PHYTOPATHOLOGY

When weights used for calculation of sAUDPC and sAUDPS were compared with the theoretical optimal weights (Supple-mental Table 2), sAUDPS outperformed sAUDPC in 41 trials for weight for the first observation (w1) and in 27 trials for weight for the last observation (wn). Based on their similarity to the optimal weights, 50 trials can be categorized into four groups: a group of 18 trials that have real weights of sAUDPS closer to the optimal weights for w1, c, and wn (Fig. 2, example CS5); a group of nine trials that have real weights of sAUDPS closer to the optimal weights for c and wn (Fig. 2, example Di); a group of 22 trials that have real weights of sAUDPS closer to the optimal weight for w1 and c (Fig. 2, example LD4); and a single trial that have real

weights of sAUDPS closer to the optimal weight for w1 only (Fig. 2, example PL1). When the Euclidean metric was used to calcu-late the overall distance between optimal and actual weights, sAUDPS outperformed sAUDPC in 49 trials, the single exception being the trial PL1 (Table 1).

Each of the four metrics (F value, RMSE, CV, and ED) showed a statistically significant frequency of higher precision for sAUDPS based on a sign test with P values ranging from 0.0066 to less than 0.0001 (Table 1). From 50 analyzed trials, PL1 is the only one in which all four statistics indicate that sAUDPC outperforms sAUDPS. In contrast, all four statistics imply that sAUDPS outperforms sAUDPC in 34 of 50 trials.

TABLE 1. Results for standardized area under the disease progress curve (sAUDPC) and standardized area under the disease progress stairs (sAUDPS) from one-way analysis of variance and Euclidean distances to optimal weightsa

Trial Formula F value RMSEb CVc EDd Trial Formula F value RMSEb CVc EDd

LB1 sAUDPC 32.514 3.344 0.751 1.278 LD7 sAUDPC 2.551 17.068 28.460 0.791 sAUDPS 30.553 3.378 0.724 1.178 sAUDPS 2.573 16.200 28.254 0.596 LB2 sAUDPC 3.017 15.569 0.463 1.019 LD8 sAUDPC 7.888 8.706 16.788 1.288 sAUDPS 3.136 14.082 0.422 0.831 sAUDPS 9.041 8.174 16.033 1.189 LB3 sAUDPC 21.070 5.808 0.376 1.323 LD9 sAUDPC 79.219 14.610 40.260 1.000 sAUDPS 22.646 5.638 0.353 1.239 sAUDPS 80.234 14.511 40.269 0.866 LB4 sAUDPC 3.936 13.065 21.422 1.070 LD10 sAUDPC 38.131 15.207 40.614 0.613 sAUDPS 3.884 12.624 20.763 0.895 sAUDPS 37.241 14.969 41.049 0.408 LB5 sAUDPC 1.506 9.098 22.530 1.092 LD11 sAUDPC 7.892 18.045 83.822 0.966 sAUDPS 1.518 7.510 17.810 0.943 sAUDPS 7.861 17.767 83.679 0.839 LB6 sAUDPC 2.993 8.357 16.709 1.123 LD12 sAUDPC 0.702 12.609 147.546 1.027 sAUDPS 3.309 6.800 13.665 0.993 sAUDPS 0.700 12.084 147.975 0.885 LB7 sAUDPC 9.494 4.233 8.407 0.950 LD13 sAUDPC 2.717 15.153 51.514 0.787 sAUDPS 11.717 3.564 7.086 0.792 sAUDPS 2.741 14.645 50.595 0.602 LB8 sAUDPC 4.193 8.547 14.174 1.059 LD14 sAUDPC 0.608 22.231 71.241 0.966 sAUDPS 4.921 6.943 11.962 0.918 sAUDPS 0.576 20.252 69.506 0.839 LB9 sAUDPC 1.399 7.801 21.523 1.066 LD15 sAUDPC 2.626 19.189 38.392 0.911 sAUDPS 1.475 6.952 17.535 0.913 sAUDPS 2.508 19.122 39.144 0.788 SL1 sAUDPC 15.262 1.008 0.154 1.185 LD16 sAUDPC 4.144 9.995 21.496 0.938 sAUDPS 17.712 0.912 0.143 1.071 sAUDPS 4.320 9.355 20.531 0.767 SL2 sAUDPC 12.365 0.903 0.118 1.318 Di sAUDPC 21.143 8.909 0.345 0.493 sAUDPS 13.576 0.843 0.112 1.240 sAUDPS 24.763 8.189 0.317 0.337 SL3 sAUDPC 54.128 0.572 0.067 1.327 Tr1 sAUDPC 243.269 4.685 10.877 1.330 sAUDPS 55.419 0.551 0.066 1.254 sAUDPS 233.352 4.692 11.062 1.271 SL4 sAUDPC 17.094 0.669 0.088 1.318 Tr2 sAUDPC 20.650 8.811 15.007 1.004 sAUDPS 16.939 0.621 0.084 1.241 sAUDPS 21.344 8.159 14.077 0.862 SL5 sAUDPC 20.222 1.366 18.469 1.102 PL1 sAUDPC 8.381 0.569 29.116 0.306 sAUDPS 21.177 1.229 17.084 0.968 sAUDPS 8.039 0.599 30.170 0.445 SL6 sAUDPC 2.393 1.027 15.144 1.352 PL2 sAUDPC 8.426 0.328 14.063 1.137 sAUDPS 2.403 0.971 14.512 1.298 sAUDPS 13.965 0.251 10.658 0.996 SL7 sAUDPC 9.920 0.849 9.783 1.319 CS1 sAUDPC 5.569 1.459 41.387 0.346 sAUDPS 10.299 0.825 9.663 1.242 sAUDPS 5.305 1.448 40.585 0.184 SL8 sAUDPC 13.656 0.737 9.883 1.242 CS2 sAUDPC 18.492 0.457 6.312 0.176 sAUDPS 13.664 0.670 9.797 1.154 sAUDPS 18.154 0.455 6.244 0.176 SL9 sAUDPC 8.677 0.843 18.361 1.075 CS3 sAUDPC 11.915 0.481 13.158 0.823 sAUDPS 8.605 0.700 15.040 0.923 sAUDPS 12.190 0.460 12.705 0.656 SL10 sAUDPC 19.576 0.779 15.254 0.928 CS4 sAUDPC 1.610 0.532 23.672 0.646 sAUDPS 20.689 0.652 12.811 0.746 sAUDPS 1.611 0.514 22.647 0.458 DM sAUDPC 6.496 0.461 0.251 0.838 CS5 sAUDPC 2.567 0.519 12.177 0.727 sAUDPS 6.602 0.415 0.230 0.641 sAUDPS 2.598 0.481 11.644 0.552 LD1 sAUDPC 9.286 11.865 0.291 0.795 PS1 sAUDPC 14.048 0.506 68.901 0.713 sAUDPS 9.540 11.557 0.281 0.617 sAUDPS 14.679 0.502 64.886 0.609 LD2 sAUDPC 2.035 10.176 44.290 1.196 PS2 sAUDPC 5.033 0.332 177.917 1.010 sAUDPS 2.127 9.610 41.580 1.053 sAUDPS 4.573 0.379 179.374 0.873 LD3 sAUDPC 2.812 20.020 50.078 1.014 PS3 sAUDPC 58.732 0.625 15.812 0.428 sAUDPS 2.981 19.600 49.186 0.800 sAUDPS 60.302 0.597 15.537 0.234 LD4 sAUDPC 1.132 17.855 32.144 0.602 PM sAUDPC 34.609 0.933 14.930 1.071 sAUDPS 1.109 17.574 32.166 0.435 sAUDPS 42.074 0.744 13.296 0.917 LD5 sAUDPC 56.028 8.395 18.308 0.899 Totale sAUDPC 15 4 8 1 sAUDPS 61.869 7.453 16.738 0.736 sAUDPS 35 46 42 49 LD6 sAUDPC 5.675 8.773 47.374 1.168 Sign test P value 0.0066 <0.0001 <0.0001 <0.0001 sAUDPS 6.228 8.504 45.347 1.026 a Values for sAUDPC or sAUDPS in each trial that indicate higher precision (higher F value, and lower RMSE, CV, and ED) are in bold type. b RMSE = root of means square error. c CV = coefficient of variation. d ED = Euclidean distance. This metric shows a distance between relative optimal weights and relative actual weights used for calculation of sAUDPC and

sAUDPS. The optimal and actual weights are shown in Supplemental material 1. e Total number of sAUDPC and sAUDPS values with higher precision. The P values are from a two-sample paired sign test.

Page 5: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

Vol. 102, No. 4, 2012 385

DISCUSSION

An alternative approach for determining optimal weights. Consideration of optimal weights leads to an alternative, self-weighting analysis. The idea is to estimate V from the data and use it to determine optimal weights. Then compute

∑=

=n

iii ywL

1

for each plot using optimal weights. It may be conjectured that this approach maximizes power to separate treatments. Estimation of V can be done by a linear model for the repeated measures taken on the same plot (6). In estimating V, all effects except plot error can be taken as fixed. If V were known, the resulting pro-cedure would be exact. In practice, an estimator needs to be used. Provided the estimator is consistent and the number of plots is large, the procedure should be approximately valid. The small-sample behavior could be investigated by simulation.

A possible criticism of this procedure is that the definition of L would be data-dependent and thus comparison across different studies would be difficult. Moreover, when combining different trials, optimal weights would differ between trials, meaning that in each trial a different definition of L is used. Therefore, it may be better to use a fixed definition of L, as in AUDPC and AUDPS.

Alternative methods of extrapolating the weight in the miss-ing direction. To extrapolate weight in a missing direction for the

first and last observation we suggested using a half of the average interval duration between observations. There are, of course, other methods that could be used instead. For example, using the full length of the first interval as weight for the first observation and similarly using a length of the last interval as weight for the last observation. To choose the most appropriate weighting for prac-tical application we analyzed 50 trials for which AUDPC was published (a list is available from authors). These data indicate that approximately 60% of trials use intervals with equal length. Another 20% of trials have intervals with a CV less than 0.2. The remaining 20% of trials have a CV between 0.2 and 0.4. Because majority of trials have a very low variability in the length of intervals, using the average interval duration appears to be a reasonable way of estimating missing weight in a majority of trials.

AUDPC versus AUDPS performance. Results that are sum-marized in Table 1 and Figure 2 indicate that most of the tested trials have more favorable values when AUDPS is applied to the disease progress data. To find out why all trails do not perform the same way, we plotted mean values and RMSE from individual ob-servations (Fig. 3). These graphs show that sAUDPC outperforms sAUDPS in the trials where variance of assessments is still sub-stantially growing at the last observation. This finding confirms our theoretical expectations that when assessments in the first or last observations have a comparatively large variance, sAUDPS may be less precise than sAUDPC. Our tests imply that a relative

Fig. 2. Ternary plots of the relative optimal and relative real weights used for calculation of standardized area under the disease progress curve (sAUDPC) and standardized area under the disease progress stairs (sAUDPS). CS5 represents a group of 18 trials where real weights of sAUDPS are closer to the optimal weights for w1, c, and wn; Di represents a group of nine trials where real weights of sAUDPS are closer to the optimal weights for c and wn; LD4 represents a group of 22 trials where real weights of sAUDPS are closer to the optimal weights for w1 and c; and PL1 is a single trial where real weights of sAUDPS are closer to theoptimal weights for w1 only. All trials with the exception of PL1 have the Euclidean distance between optimal and actual weights smaller for sAUDPS than forsAUDPC. Each side of the triangle represents a value of 0, while the point of the triangle opposite to that side representing a value of 1. Values of relative weightsare indicated on the plot by circles of different color: black = optimal, white = sAUDPC, and gray = sAUDPS. Values used for ternary plots are provided in Supplemental Table 2.

Page 6: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

386 PHYTOPATHOLOGY

performance of the two methods is independent from the mean of assessments or the shape of disease progress curve, but it is affected by variance of assessments in individual observations and correlation between observations.

For a special case, we can theoretically predict when sAUDPS outperforms sAUDPC. Assuming equally spaced observations, and the perfect positive correlation between assessments in each accession, the relative performance of sAUDPC and sAUDPS de-

Fig. 3. Disease progress curves for three plant–pathogen and one plant–fungicide systems: LB = late blight of potato, Tr = triforine effect on lettuce, PL = powdery mildew of lettuce, and PS = powdery mildew of sugar beet. The trials shown in the left column have root of mean square error (RMSE) for standardized area under the disease progress curve (sAUDPC) higher than for standardized area under the disease progress stairs (sAUDPS), while the trials shown in the right column have RMSE for sAUDPC smaller than for sAUDPS. Full line indicates mean values of assessments, and dashed line shows RMSE for the assessments in eachobservation. For simplicity and uniformity of graphs, the x axis for each trial starts at day 1, though the actual evaluations started at different days.

Page 7: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

Vol. 102, No. 4, 2012 387

pends on the standard deviation of individual observations ( iσ ). If the average standard deviation of the first and last observation is smaller than the average standard deviation of all other central observations

( cn σ<σ ,1 , where 2

1,1

nn

σ+σ=σ and

2

1

2

σ=σ∑−

=

n

n

ii

c )

then sAUDPS will outperform sAUDPC (Appendix C). Of course this limiting case itself will not occur in real data, but it is relevant when correlations are high and observations are performed regu-larly (or almost regularly). To test how this theoretical prediction works on real data, we compared n,1σ and cσ from the 50 analyzed trials. In 44 trials a prediction based on comparison of

n,1σ and cσ matched RMSE values. Though these analyzed trials do not strictly meet both assumptions (perfect correlation and equal intervals), results correspond well with those ob- tained by comparing RMSE of sAUDPC and sAUDPS. Thus it is possible to predict a relative performance of the two formulas by comparing iσ values from individual observations. However, if assumptions of the perfect correlation and equal distribution of observations are not met, accuracy of predictions decreases.

Differences in values calculated by the AUDPC and AUDPS formulas. Because only half the weight is given to the first and last observation compared to intermediate observations in the AUDPC formula, this can lead to results that are not intuitively expected. Figure 4 shows assessment scores for two accessions. Accession A has a somewhat higher score than accession B in ob-servations 2 to 5, a somewhat smaller score in the first obser-vation, but a substantially smaller score for the last observation. AUDPC for accession A is 14, while for accession B it is 13.5. On the contrary, the AUDPS score for accession A is smaller than for accession B (16 versus 19, respectively), the result that would be expected from the disease progress curve. In the special case, when assessments for all tested individuals in a trial have the same value for the first and last observations (e.g., all starting at 0%, and ending at 100%), a difference between AUDPS and AUDPC will be constant for all individuals and equals

121

−×

+n

Dyy n (11)

There is also a distinction between the AUDPC and AUDPS in the value of standardized scores. When observations are performed at regular intervals, sAUDPS is equal to the arithmetic mean of all assessments, whereas sAUDPC values are affected by a smaller weight given to the first and last observation. For example, if values of 2, 2, and 5 are assessed on days 1, 2, and 3, the sAUDPC score is 2.75. The sAUDPS score is 3, the same as the arithmetic mean of the three assessments (and in the same units).

Impact of disease duration on the AUDPC and AUDPS values. Van der Plank (chapter 12 in citation 13) suggested two hypotheses when describing the relationship between loss of yield and AUDPC. On these hypotheses, injury is proportional to the AUDPC.

“First, injury is proportional to the amount of disease. Two pustules do twice as much damage as one pustule, other things being equal.”

“Second, injury is proportional to the duration of the disease. A pustule developed two weeks before the crop ripens does twice as much damage as a pustule developed one week before, other things being equal.”

Both AUDPC and AUDPS give equivalent results regarding the first hypothesis; the values of AUDPC and AUDPS are always proportional to the disease score. For example, if accession A has disease scores 1, 2, and 5 in three consecutive days, and accession B has the disease scores of 2, 4, and 10 (double that of the acces-

sion A scores), the AUDPC for the accession B is twice as high as for the accession A (10 versus 5). Similarly, AUDPS score for the accession B is twice the AUDPS score for the accession A (16 versus 8). However, in some cases there may be a difference be-tween the two formulas when calculating values related to the second hypothesis. For example, if a disease is evaluated on accessions C, D, and E on four consecutive days and the acces-sion C shows a disease only on the last observation (recorded as 0, 0, 0, 1), the accession D shows the same disease score a day earlier (0, 0, 1, 1), and the accession E develops a disease already on the first day (1, 1, 1, 1). The AUDPC scores for the three accessions are C = 0.5, D = 1.5, E = 3. A disease on the accession E was observed as lasting twice as long as on the accession D (4 days versus 2 days), and the AUDPC score for the accession E is indeed twice as high (3 versus 1.5). But the ratio is not the same when the accessions D and C are compared. Though a disease on the accession D lasted twice as long as on the accession C (two days versus a single day), the AUDPC score for the accessions D is three times larger then for the accession C (1.5 versus 0.5). The respective AUDPS values for the three accessions are C = 1, D = 2, and E = 4, corresponding to the duration of a disease. Evalua-tion of the AUDPC and AUDPS formulas shows that though both of them have the same conceptual basis; their results are not always equivalent. When the evaluations are performed at regular intervals, AUDPS provides estimates that are always linearly pro-portional to the duration of a disease, while estimates based on AUDPC are not always linearly proportional to the duration of a disease.

Relative AUDPC and AUDPS. Relative AUDPC is calculated as a ratio between actual AUDPC and maximum potential AUDPC (2). The maximum potential AUDPC is the value that would be reached if assessments at every observation were at maximum (ymax). Similarly as for AUDPC, relative AUDPS (rAUDPS) is calculated as a ratio of actual and maximum potential AUDPS. However, because time span for AUDPS is different than for AUDPC, the rAUDPS formula is

maxmax

)1(

y

sAUDPS

ynD

nAUDPSrAUDPS =

××−×

= (12)

When a minimum potential assessment value is larger than zero (ymin > 0), the rAUDPC and rAUDPS formulas need to be ad-justed (e.g., for 1 to 9 rating scale). Theoretically, this adjustment should be carried out for any ymin ≠ 0, but in practice negative values are not used in disease progress assessments. The formula for relative AUDPS including adjustment for a nonzero minimum assessment is

Fig. 4. Differences between area under the disease progress curve (AUDPC)and area under the disease progress stairs (AUDPS) scores for two hypo-thetical accessions. Accession A (gray line) has a higher AUDPC score, butlower AUDPS score than accession B (black line) because AUDPC formula gives only half a weight to the first and last observations.

Page 8: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

388 PHYTOPATHOLOGY

minmax

min

yy

ysAUDPSrAUDPS

−−

= (13)

In addition to relative AUDPC as described by Fry (2), other types of AUDPC were termed relative. These are based on a ratio of actual AUDPC, and AUDPC of reference accessions (14) or an accession with the highest AUDPC score in a trial (8). These sta-tistics can also be calculated in the same way as rAUDPS values, but again, a proper adjustment is needed if ymin > 0.

Because relative AUDPC or AUDPS values are related to stan-dardized sAUDPC and sAUDPS by a linear transformation, there seems little practical difference between standardized and relative versions of both measures. We preferred the standardized versions in the present paper because these allow for studying relative weights.

Conclusions. We developed a new formula to combine data from a disease progress curve into a single observation. AUDPS has the same conceptual basis and seeks to estimate the same quantity as AUDPC; it only uses a different estimation approach that is based on the slight extension of the limits of integration. This new formula gives a weight closer to optimal for the first and last observations that are undervalued by the AUDPC formula. Tests of precision on 50 trials indicate that sAUDPS outperforms sAUDPC in the trials where the average standard deviation of assessments from the first and last observation is smaller than the average standard deviation of assessments from all other (central) observations. We propose using this new formula for combining disease progress data assessed at multiple observations.

Since disease progress is frequently observed and data are recorded from the onset of disease until disease reaches a plateau or it is close to it, it is possible to assume that variance among replications within each accession in those early and late obser-vations is smaller than the variance during midperiod of disease progress. If this assumption is correct and our formula for predicting relative performance of sAUDPC and sAUDPS applies, then sAUDPS outperforms sAUDPC frequently. Of course, if only a specific time period is studied that starts later than onset of a disease and/or finishes earlier than plateau of a disease, then sAUDPC might outperform sAUDPS. SAS codes for calculating AUDPS (equation 3) and sAUDPS (equation 7) are provided in Supplemental material 1.

APPENDIX A

To study the properties of the optimal weights, it is instructive to use the simple weights in equation 9 and assume for simplicity that all resulting weights are positive. Since y values are usually repeated measures taken on the same plot, V can entail a serial correlation structure. To gain some insight, we may consider the following special cases (assuming equally spaced data).

(i) Data are identically and independently distributed: 2σ= IV . The optimal weights are 1−= nwi . These are the weights used in AUDPS, which is therefore optimal in this case.

(ii) Data have a compound symmetry structure 2σ+φ= IJV , where TJ 11= . Again, the optimal weights are 1−= nwi .

(iii) V is diagonal with elements 2iσ , in other words, obser-

vations are independent, but variances differ between observa-tions. The optimal weights are

1

1

22−

=⎟⎠⎞

⎜⎝⎛ σσ= ∑

n

iiiiw

Thus, observations with large variance receive a smaller weight. In sAUDPC the first and last observation carry half the weight of intermediate observations. Thus, if the first or last observations have a comparatively large variance, sAUPDS may be less precise than sAUDPC.

(iv) V has an autoregressive first order structure AR(1) struc-ture, i.e.,

⎟⎟⎟⎟⎟⎟⎟

⎜⎜⎜⎜⎜⎜⎜

ρρρ

ρρρρρρρρρ

σ=

−−−

1

1

1

1

321

32

2

12

2

L

MOMMM

L

L

L

nnn

n

n

n

V (14)

where 10 <ρ< is the autocorrelation. Then,

( )⎟⎟⎟⎟⎟⎟⎟

⎜⎜⎜⎜⎜⎜⎜

ρ+

ρ+ρ−ρ−ρ+ρ−

ρ−ρ+

ρ−σ=−

2

2

2

2

221

1000

010

01

001

1

1

L

MOMMM

L

L

L

V (15)

(page 160 in literature citation 7). It follows that weights are

⎟⎟⎠

⎞⎜⎜⎝

ρ−ρ

+ρ+

×σ

=22 11

11iw

for the first and last observations and

ρ+×

σ=

1

112iw

for all other observations. Thus, the first and last observations have larger optimal weight than the other observations. In this situation, AUDPS is closer to the optimal weights than AUDPC, which has half the weight for the first and last observation compared with the other observations.

APPENDIX B

AUDPC and AUDPS use weights proportional to ( ) 2/11 +− += iii ttv for 1,...,2 −= ni . Denote the factor of pro-

portionality as b, such that ii bvw = for 1,...,2 −= ni . Now b needs to be chosen such that the restriction

11

=∑=

n

iiw

is obeyed. Thus, we must have

12

21

1=++= ∑∑

==n

n

ii

n

ii wvbww

If we set

∑−

==

2

2

n

iivbc

the constraint becomes 11 =++ cww n . The estimator can be written as

nn

n

iii ywywywL ++= ∑

=

1

211

where the last term can be reexpressed as

cmmvbyvbywn

ii

n

iii

n

iii =⎟

⎠⎞

⎜⎝⎛== ∑∑∑

=

=

=

1

2

1

2

1

2 with

∑−

=

==1

2

1

2n

ii

n

iii

v

yvm

Thus, the estimator can be written as nn ywcmywL ++= 11 subject to the constraint 11 =++ nwcw (equation 10).

APPENDIX C

Let ( ) SRSyV == var , where S is a diagonal matrix holding the standard deviations ( iσ ) of yi (i = 1,…, n) and R is the correlation

Page 9: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

Vol. 102, No. 4, 2012 389

matrix. We consider the case of perfectly correlated observations, corresponding to TR 11= , where 1 is a vector of ones. In this case we find 2)1()var( SwVwwL TT == .

Thus, to evaluate precision of L, we only need to consider

∑=

• σ=σ==n

iii

T wSwL1

1)var(

the standard deviation of L. In other words, the standard deviation of L is equal to the weighted mean of standard deviations of observations, with weights equal to those used in L itself. Now assume equal spacing, meaning that aww n ==1 and

bwww n ==== −132 ... . For sAUDPS we have 1−== nba , while for sAUDPC 1)]1(2[2/ −−== nba . The standard deviation of L can be expressed as cn bnaL σ−+σ= )2(2)var( ,1 , Using the defi-nitions of a and b, we find that the standard deviation of sAUDPS is larger than that of sAUDPC if and only if

( ) ( )( )1

222 ,1,1

−σ−+σ

>σ−+σ

n

n

n

n cncn

which after some algebraic rearrangement yields cn σ>σ ,1 . This confirms our proposition that the variance (standard deviation) of sAUDPS is larger than that of sAUDPC if and only if cn σ>σ ,1 for the special case of equally spaced data and perfect positive correlation.

ACKNOWLEDGMENTS

We thank A. Atallah for technical assistance with field experiments and manuscript preparation, and L. V. Madden for critically reviewing the manuscript. R. Hayes, R. Lewellen, M. McGrath, and J. McCreight kindly provided the trial data.

LITERATURE CITED

1. Ewing, E. E., Simko, I., Smart, C. D., Bonierbale, M. W., Mizubuti, E. S. G., May, G. D., and Fry, W. E. 2000. Genetic mapping from field tests of

qualitative and quantitative resistance to Phytophthora infestans in a population derived from Solanum tuberosum and Solanum berthaultii. Mol. Breed. 6:25-36.

2. Fry, W. E. 1978. Quantification of general resistance of potato cultivars and fungicide effects for integrated control of potato late blight. Phytopathology 68:1650-1655.

3. Haynes, K. G., and Weingartner, D. P. 2004. The use of area under the disease progress curve to assess resistance to late blight in potato germplasm. Am. J. Potato Res. 81:137-141.

4. Jeger, M. J., and Viljanen-Rollinson, S. L. H. 2001. The use of the area under the disease-progress curve (AUDPC) to assess quantitative disease resistance in crop cultivars. Theor. Appl. Genet. 102:32-40.

5. Madden, L. V., Hughes, G., and van den Bosch, F. 2007. The Study of Plant Disease Epidemics. American Phytopathological Society, St. Paul, MN.

6. Piepho, H. P., Büchse, A., and Richter, C. 2004. A mixed modelling approach to randomized experiments with repeated measures. J. Agron. Crop Sci. 190:230-247.

7. Rao, C. R., Toutenburg, Shalabh, H., and Heumann, C. 2008. Linear Models and Generalizations. 3rd extended ed. Springer, Berlin.

8. Rauscher, G., Simko, I., Mayton, H., Bonierbale, M., Smart, C. D., Grünwald, N. J., Greenland, A., and Fry, W. E. 2010. Quantitative resistance to late blight from Solanum berthaultii cosegregates with RPi-ber: Insights in stability through isolates and environment. Theor. Appl. Genet. 121:1553-1567.

9. Simko, I. 2002. Comparative analysis of quantitative trait loci for foliage resistance to Phytophthora infestans in tuber-bearing Solanum species. Am. J. Potato Res. 79:125-132.

10. Simko I., Hayes, R. J., Truco, M. J., and Michelmore, R. W. 2011. Mapping a dominant negative mutation for triforine sensitivity in lettuce and its use as a selectable marker for detecting hybrids. Euphytica 182:157-166.

11. Simko, I., and Hu, J. 2008. Population structure in cultivated lettuce and its impact on association mapping. J. Am. Soc. Hort. Sci. 133:61-68.

12. Simko, I., Pechenick, D. A., McHale, L. K., Truco, M. J., Ochoa, O. E., Michelmore, R. W., and Scheffler, B. E. 2009. Association mapping and marker-assisted selection of the lettuce dieback resistance gene Tvr1. BMC Plant Biol. 9:135.

13. Van der Plank, J. E. 1963. Plant Diseases: Epidemics and Control. Academic Press, New York.

14. Yuen, J. E., and Forbes, G. A. 2009. Estimating the level of susceptibility to Phytophthora infestans in potato genotypes. Phytopathology 99:782-786.

Page 10: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

  1

Supplemental Table 1. Description of datasets used for analysis of AUDPC/sAUDPC and AUDPS/sAUDPS performance 

Trial  Crop  Evaluation  Number of 

accessions 

Replications 

± SE 

Da  Nb  Average interval 

length ± SE 

Rangec  

ymin ‐ ymax 

LB1  Potato  Late blight  82  2.30±0.05  73  9  9.00±1.46  0 ‐ 100 

LB2  Potato  Late blight  112  2.22±0.16  28  5  6.75±1.31  0 ‐ 100 

LB3  Potato  Late blight  140  2.00±0.00  36  11  3.50±0.17  0 ‐ 100 

LB4  Potato  Late blight  240  2.00±0.00  11  4  3.33±0.67  0 – 100 

LB5  Potato  Late blight  100  11.63±0.32  29  5  7.00±0.00  0 – 100 

LB6  Potato  Late blight  185  2.79±0.04  15  5  3.50±0.29  0 – 100 

LB7  Potato  Late blight  233  2.85±0.03  19  6  3.60±0.40  0 – 100 

LB8  Potato  Late blight  313  2.86±0.03  22  5  5.25±0.48  0 – 100 

LB9  Potato  Late blight  100  11.50±0.32  13  5  3.00±0.00  0 ‐ 100 

SL1  Lettuce  Salad deterioration  9  5.00±1.00  50  8  7.00±0.00  0 ‐ 10 

SL2  Lettuce  Salad deterioration  54  5.41±0.37  78  12  7.00±0.00  0 ‐ 10 

SL3  Lettuce  Salad deterioration  92  7.98±0.14  85  13  7.00±0.00  0 ‐ 10 

Page 11: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

  2

SL4  Lettuce  Salad deterioration  152  4.81±0.12  78  12  7.00±0.00  0 ‐ 10 

SL5  Lettuce  Salad deterioration  6  6.83±0.70  43  7  7.00±0.00  0 – 10 

SL6  Lettuce  Salad deterioration  16  2.00±0.00  120 18  7.00±0.00  0 ‐ 10 

SL7  Lettuce  Salad deterioration  109  6.19±0.19  78  12  7.00±0.00  0 – 10 

SL8  Lettuce  Salad deterioration  74  8.51±0.14  71  11  7.00±0.00  0 – 10 

SL9  Lettuce  Salad deterioration  67  6.67±0.27  29  5  7.00±0.00  0 – 10 

SL10  Lettuce  Salad deterioration  53  5.00±0.47  29  5  7.00±0.00  0 – 10 

DM  Lettuce  Downy mildew  95  2.00±0.00  22  4  7.00±0.00  0 – 5 

LD1  Lettuce  Lettuce drop  300  2.00±0.00  32  5  7.75±0.75  0 – 100 

LD2  Lettuce  Lettuce drop  60  1.73±0.11  35  6  6.80±1.07  0 – 100 

LD3  Lettuce  Lettuce drop  12  9.00±0.00  24  4  7.67±2.33  0 – 100 

LD4  Lettuce  Lettuce drop  54  2.98±0.02  25  4  8.00±1.53  0 – 100 

LD5  Lettuce  Lettuce drop  3  12.00±2.00  30  5  7.25±0.25  0 – 100 

LD6  Lettuce  Lettuce drop  50  4.20±0.20  30  5  7.25±0.25  0 – 100 

LD7  Lettuce  Lettuce drop  36  3.00±0.00  8  4  2.33±0.33  0 – 100 

Page 12: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

  3

LD8  Lettuce  Lettuce drop  4  7.00±0.00  10  9  1.13±0.13  0 – 100 

LD9  Lettuce  Lettuce drop  4  12.00±0.00  11  7  1.67±0.42  0 – 100 

LD10  Lettuce  Lettuce drop  3  12.00±0.00  10  3  4.50±0.50  0 – 100 

LD11  Lettuce  Lettuce drop  18  9.00±0.73  10  3  4.50±0.50  0 – 100 

LD12  Lettuce  Lettuce drop  12  6.00±0.00  16  3  7.50±2.50  0 – 100 

LD13  Lettuce  Lettuce drop  12  6.00±0.00  8  3  3.50±1.50  0 – 100 

LD14  Lettuce  Lettuce drop  12  6.00±0.00  10  3  4.50±0.50  0 – 100 

LD15  Lettuce  Lettuce drop  12  6.00±0.00  16  5  3.75±1.80  0 – 100 

LD16  Lettuce  Lettuce drop  41  3.07±0.77  22  4  7.00±0.00  0 – 100 

Di  Lettuce  Dieback  251  2.18±0.06  27  4  8.67±1.33  0 – 100 

Tr1  Lettuce  Triforine  12  3.00±0.00  17  17  1.00±0.00  0 – 100 

Tr2  Lettuce  Triforine  8  3.00±0.00  18  7  2.83±0.91  0 ‐ 100 

PL1  Lettuce  Powdery mildew  11  2.00±0.00  3  3  1.00±0.00  0 – 5 

PL2  Lettuce  Powdery mildew  11  2.00±0.00  7  4  2.00±1.00  0 – 5 

CS1  Sugar beet  Cercospora leaf spot 24  4.00±0.00  22  3  10.50±3.50  0 – 9 

Page 13: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

  4

CS2  Sugar beet  Cercospora leaf spot 100  4.00±0.00  28  4  9.00±2.52  1 – 10 

CS3  Sugar beet  Cercospora leaf spot 479  4.08±0.13  22  4  7.00±0.00  0 – 9 

CS4  Sugar beet  Cercospora leaf spot 170  2.02±0.05  22  4  7.00±0.00  0 – 9 

CS5  Sugar beet  Cercospora leaf spot 191  2.80±0.06  29  5  7.00±0.00  0 – 9 

PS1  Sugar beet  Powdery mildew  64  6.00±0.00  58  7  9.67±2.14  0 – 9 

PS2  Sugar beet  Powdery mildew  24  4.00±0.00  34  3  16.50±4.50  0 – 9 

PS3  Sugar beet  Powdery mildew  48  6.00±0.00  28  4  9.00±2.00  0 ‐ 9 

PM  Melon  Powdery mildew  13  5.77±0.57  32  3  15.50±7.50  1 ‐ 9 

a D is the duration in days from the first to the last observation. 

b N is the number of observations. 

c ymin and ymax are minimum and maximum potential assessment values. 

Page 14: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

Supplemental  Table  2:  Values  of  weights  used  for  ternary  plots  in  Figure  2.

Trial Formula Euclidean  distancew1 c wn

LB1 optimal 0.983 0.017 0AUDPC 0.056 0.896 0.049 1.278AUDPS 0.105 0.796 0.099 1.178

LB2 optimal 0.664 0 0.336AUDPC   0.056 0.796 0.148 1.019AUDPS 0.144 0.637 0.219 0.831

LB3 optimal 0.999 0 0.001AUDPC 0.043 0.914 0.043 1.323AUDPS 0.084 0.831 0.084 1.239

LB4 optimal 0.097 0 0.903AUDPC 0.2 0.7 0.1 1.070AUDPS 0.275 0.525 0.2 0.895

LB5 optimal 0.918 0 0.082AUDPC 0.125 0.75 0.125 1.092AUDPS 0.2 0.6 0.2 0.943

LB6 optimal 0.998 0 0.002AUDPC 0.143 0.714 0.143 1.123AUDPS 0.214 0.571 0.214 0.993

LB7 optimal 0.449 0.002 0.548AUDPC 0.083 0.778 0.139 0.950AUDPS 0.153 0.648 0.199 0.792

LB8 optimal 0.905 0.095 0AUDPC 0.095 0.762 0.143 1.059AUDPS 0.176 0.61 0.214 0.918

LB9 optimal 0.883 0 0.117AUDPC 0.125 0.75 0.125 1.066AUDPS 0.2 0.6 0.2 0.913

SL1 optimal 0.101 0.001 0.89AUDPC   0.071 0.857 0.071 1.185

Weights

Page 15: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

AUDPS 0.125 0.75 0.125 1.071

SL2 optimal 0.002 0 0.998AUDPC   0.045 0.909 0.045 1.318AUDPS   0.083 0.833 0.083 1.240

SL3 optimal 0 0 1AUDPC 0.042 0.917 0.042 1.327AUDPS   0.077 0.846 0.077 1.254

SL4 optimal 0.001 0 0.999AUDPC   0.045 0.909 0.045 1.318AUDPS 0.083 0.833 0.083 1.241

SL5 optimal 0.207 0 0.793AUDPC   0.083 0.833 0.083 1.102AUDPS 0.143 0.714 0.142 0.968

SL6 optimal 0 0 1AUDPC   0.029 0.941 0.029 1.352AUDPS 0.056 0.889 0.056 1.298

SL7 optimal 0 0 1AUDPC   0.045 0.909 0.045 1.319AUDPS 0.083 0.833 0.083 1.242

SL8 optimal 0.095 0 0.905AUDPC   0.05 0.9 0.05 1.242AUDPS 0.091 0.818 0.091 1.154

SL9 optimal 0.895 0 0.105AUDPC   0.125 0.75 0.125 1.075AUDPS 0.2 0.6 0.2 0.923

SL10 optimal 0.592 0 0.408AUDPC   0.125 0.75 0.125 0.928AUDPS 0.2 0.6 0.2 0.746

DM optimal 0.634 0 0.366AUDPC 0.167 0.667 0.167 0.838AUDPS 0.25 0.5 0.25 0.641

LD1 optimal 0.492 0.077 0.431

Page 16: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

AUDPC   0.161 0.726 0.113 0.795AUDPS   0.229 0.581 0.19 0.617

LD2 optimal 1 0 0AUDPC   0.083 0.75 0.167 1.196AUDPS 0.167 0.6 0.233 1.053

LD3 optimal 0.25 0 0.75AUDPC   0.152 0.761 0.087 1.014AUDPS 0.239 0.571 0.19 0.800

LD4 optimal 0.566 0.194 0.24AUDPC   0.146 0.625 0.229 0.602AUDPS 0.234 0.469 0.297 0.435

LD5 optimal 0.727 0.097 0.175AUDPC   0.121 0.759 0.121 0.899AUDPS 0.197 0.607 0.197 0.736

LD6 optimal 1 0 0AUDPC   0.121 0.759 0.121 1.168AUDPS 0.197 0.607 0.197 1.026

LD7 optimal 0.413 0 0.587AUDPC   0.143 0.643 0.214 0.791AUDPS 0.232 0.482 0.286 0.596

LD8 optimal 0.993 0.007 0AUDPC   0.056 0.889 0.056 1.288AUDPS 0.105 0.79 0.105 1.189

LD9 optimal 0.308 0 0.691AUDPC   0.05 0.8 0.15 1.000AUDPS 0.114 0.686 0.2 0.866

LD10 optimal 0.481 0 0.519AUDPC   0.222 0.5 0.278 0.613AUDPS 0.315 0.333 0.352 0.408

LD11 optimal 1 0 0AUDPC   0.222 0.5 0.278 0.966AUDPS 0.315 0.333 0.352 0.839

Page 17: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

LD12 optimal 1 0 0AUDPC   0.167 0.5 0.333 1.027AUDPS 0.278 0.333 0.389 0.885

LD13 optimal 0.742 0 0.258AUDPC   0.143 0.5 0.357 0.787AUDPS 0.262 0.333 0.405 0.602

LD14 optimal 1 0 0AUDPC   0.222 0.5 0.278 0.966AUDPS 0.315 0.333 0.352 0.839

LD15 optimal 0.081 0 0.919AUDPC   0.033 0.667 0.3 0.911AUDPS 0.127 0.533 0.34 0.788

LD16 optimal 0.826 0 0.174AUDPC   0.167 0.667 0.167 0.938AUDPS 0.25 0.5 0.25 0.767

Di optimal 0.211 0.258 0.531AUDPC   0.192 0.615 0.192 0.493AUDPS 0.269 0.462 0.269 0.337

Tr1 optimal 0.974 0 0.025AUDPC   0.031 0.938 0.031 1.330AUDPS 0.059 0.882 0.059 1.271

Tr2 optimal 0.697 0.137 0.166AUDPC   0.029 0.882 0.088 1.004AUDPS 0.097 0.756 0.147 0.862

PL1 optimal 0.375 0.625 0AUDPC   0.25 0.5 0.25 0.306AUDPS 0.333 0.333 0.333 0.445

PL2 optimal 1 0 0AUDPC   0.083 0.583 0.333 1.137AUDPS 0.188 0.438 0.375 0.996

CS1 optimal 0.299 0.274 0.427AUDPC   0.333 0.5 0.167 0.346AUDPS 0.389 0.333 0.278 0.184

Page 18: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

CS2 optimal 0.054 0.537 0.409AUDPC   0.096 0.635 0.269 0.176AUDPS 0.197 0.476 0.327 0.176

CS3 optimal 0.752 0.088 0.16AUDPC   0.167 0.667 0.167 0.823AUDPS 0.25 0.5 0.25 0.656

CS4 optimal 0.566 0.169 0.266AUDPC   0.167 0.667 0.167 0.646AUDPS 0.25 0.5 0.25 0.458

CS5 optimal 0.544 0.176 0.28AUDPC   0.125 0.75 0.125 0.727AUDPS 0.2 0.6 0.2 0.552

PS1 optimal 0.679 0.321 0AUDPC   0.158 0.807 0.035 0.713AUDPS 0.207 0.692 0.102 0.609

PS2 optimal 1 0 0AUDPC   0.182 0.5 0.318 1.010AUDPS 0.288 0.333 0.379 0.873

PS3 optimal 0.348 0.284 0.367AUDPC   0.13 0.63 0.241 0.428AUDPS 0.222 0.472 0.306 0.234

PM optimal 1 0 0AUDPC   0.129 0.5 0.371 1.071AUDPS 0.253 0.333 0.414 0.917

Values  for  AUDPC  or  AUDPS  in  each  trial  that  indicate  better  match  with  optimal  weight  (or  smaller  Euclidean  distance)  are  boldfaced.

Page 19: The Area Under the Disease Progress Stairs: Calculation ...€¦ · Simko, I., and Piepho, H.-P. 2012. The area under the disease progress stairs: Calculation, advantage, and application

Supplemental Material 1. SAS code for AUDPS (equation 3) and sAUDPS (equation 7). ------------------------------------ ----------------------------------------- /*This example of 4 individuals assessed at 5 times is used to calculate AUDPC, AUDPS, sAUDPC, and sAUDPS. Input data are needed for the number of observations, the times of observations, and the observed scores for individuals.*/ %let n=5; /*Insert here the number of time points (observations).*/ /*This dataset contains the actual times of observations of the n time points as a single row. Insert here the times when observations were performed.*/ data t; input t1-t&n; datalines; 3 5 8 10 13 ; /*This dataset contains the observed scores y1 to yn for the 4 individuals to be analyzed. Insert here the actual observed scores for each individual. Add (or delete) rows and columns for more (or fewer) observations or individuals as needed.*/ data y; input y1-y&n; datalines; 10 12 14 19 22 9 19 19 19 19 12 16 17 20 30 11 17 18 21 26 ; /*Merge two datasets t and y and compute AUDPC, AUDPS, sAUDPC, and sAUDPS.*/ data yt; array t t1-t&n; array y y1-y&n; set y; if _N_=1 then set t; audpc=y1*(t2-t1)/2 + y&n*(t&n-t[&n-1])/2; do i=2 to &n-1; audpc=audpc+y[i]*(t[i+1]-t[i-1])/2; end; D=t&n-t1; AUDPS=AUDPC+(y1+y&n)/2*D/(&n-1); s_AUDPC=AUDPC/D; s_AUDPS=AUDPS*(&n-1)/D/&n; /*Note that the fourth individual has higher AUDPC and sAUDPC scores than the third individual, but lower AUDPS and sAUDPS scores.*/ proc print data=yt; var audpc audps s_audpc s_audps; run;