3
Commentary What can variations in cJisease outcome teii us about risk? Firoze MarijT and Nico Nagelkerke^ 'International Development Research Center, Healtb Sciences Division, Regional Office for Eastern and Souttiern Africa, PO Box 62084, ^Kenya Medical Researcti Institute, Medical Research Centre, PO Box 20752, Nairobi, Kenya In recent years there has been a growing interest in risk assessment in dental dis- eases, with several major international conferences held to address the issue. The identification of risk factors and/or risk groups is of interest for two principal reasons: first, knowledge of risk factors can shed light on the etiology of the dis- ease; secondly, knowledge of risk factors may provide guidance as to priorities in the allocation of resources. In the search for predictors of disease, it has often been assumed that considerable information about potential risk factors may be gained by examining the distribution of the outcome variable (disease) in a pop- ulation. In this paper we consider some of the problems of doing so. It may be useful to begin by saying that risk should be distinguished from outeome. Risk is the probability of an outcome rather than the outcome itself When tossing a coin, the probability of heads or tails is fixed (e.g. 0.5), but the outcome varies. The reason for this is that we cannot make any better predic- tion of the outcotne of the toss than the "mean". It is important to note here that it is of no concern whether the outcome is the result of intrinsically random events (e.g. as is, according to quantum theory, the outcome of atomic events), or wheth- er the influences on the outcome are many and unknown, turning a theoreti- cally deterministic event into an effectiye- ly random one. Risk is not something that exists inde- pendent of the knowledge of the person who assesses it. For exatnple, consider two tumors, one benign and the other malignant, occurring, say, with the same frequency. If we were ignorant as to the distinction between them, we would con- sider both as one clinical entity, and we would therefore "know" that individuals with such tutnors have a 50% probability of survival. Once, however, we know how to distinguish between the lesions (say, as a consequence of the development of a new technique) then we could state that individuals with the malignant type have, if untreated, a near zero chance of survi- val, whereas those with the benign type have a near 100% chance (i.e. the risk is now qualitatively different merely as a result of a change in our knowledge). Insurance companies are constantly try- ing to improve their capability of making accurate predictions (i.e. their knowledge of outcome); the irony is that were they to be completely successful they would soon be out of business! For many diseases, including both dental caries and periodontal disease, it is possible to identify two "sources" of ignorance which may lead to random (or unpredictable) outcome: first, ignorance of the exact etiological mechanisms in- volved in the disease (e.g. genetic, immu- nological, microbial, etc.), i.e. the less we know about the exact mechanisms, the less accurately we can predict the out- come; secondly, ignorance about exposi- tion to etiological factors between time of risk assessment and time of outcome, which may also lead to unpredictable outcomes: for example, supposing we knew everything about etiological mech- anisms involved in dental caries, we would still have no idea how each of the factors involved (e.g. sugar consumption habits, salivary flow rates, salivary buffer capacity of an individual) will vary over the period between risk assessment and outcome. The combined effect of these two sources of ignorance can, perhaps, best be illustrated by the example of the decline in caries prevalence in industrial- ized countries over the last two decades. Had we endeavored to predict in the 1960s the likely trends in dental caries prevalence in the 1980s, we would have predicted considerably higher levels than in fact occurred. Part of the change in the prevalence of dental caries over this period has occurred as a result of unpre- dictable changes in known risk factors (e.g. use of topical fluorides), and part due to changes in parameters that are hitherto poorly understood (i.e. about which we are ignorant). From the above it should be apparent that variations in outcome (levels, distri- butions, etc.) will say little about varia- tions in risk. Recently, there has been sotne interest in trying to estimate the potential for assessing risk from the shape of the distri- bution of the outcome variable (e.g. loss of attachment, DMF scores, etc.), the argument being that if the distribution of the outcome variable is not nortnal (i.e. it has a long "tail", or is skewed), then it cannot merely be the outcome of a tnyri- ad of stnall unidentifiable effects but, in- stead, must be evidence for the existence of at least one major etiological factor which merely needs to be identified. The underlying statistical idea behind this is, perhaps, the central limit theorem which states that the sum of the effects of many small random variables will be approxi- mately normally distributed. However, if effects work multiplicatiyely, then the re- sult of many small effects will certaitily not be normal. Conversely, an outcome variable which is linearly related to an (itnportant) risk factor will nevertheless be normally distributed if the risk factor is itself normally distributed. In short, the shape of the distribution of the out- come variable is completely uninforma- tive (unless, of course everyone has exact- ly the same outcome!) - and the existence of long "tails" or skewness is thus not a priori evidence for the existence of risk groups. The only evidence for the existence of risk factors or groups is by actually iden- tifying them lotigitudinally. If we find an association between a variable x and the outcome variably y, t years later, then we can try to predict the state of y, t years later, from x. If this is successful, we have identified a risk factor - and those in whom we find a high value for this factor are the "risk group". In many instances, x

What can variations in disease outcome tell us about risk?

Embed Size (px)

Citation preview

Page 1: What can variations in disease outcome tell us about risk?

Commentary

What can variations in cJiseaseoutcome teii us about risk?

Firoze MarijT and Nico Nagelkerke^'International Development Research Center,Healtb Sciences Division, Regional Office forEastern and Souttiern Africa, PO Box 62084,^Kenya Medical Researcti Institute, MedicalResearch Centre, PO Box 20752, Nairobi,Kenya

In recent years there has been a growinginterest in risk assessment in dental dis-eases, with several major internationalconferences held to address the issue. Theidentification of risk factors and/or riskgroups is of interest for two principalreasons: first, knowledge of risk factorscan shed light on the etiology of the dis-ease; secondly, knowledge of risk factorsmay provide guidance as to priorities inthe allocation of resources. In the searchfor predictors of disease, it has often beenassumed that considerable informationabout potential risk factors may begained by examining the distribution ofthe outcome variable (disease) in a pop-ulation. In this paper we consider someof the problems of doing so.

It may be useful to begin by sayingthat risk should be distinguished fromouteome. Risk is the probability of anoutcome rather than the outcome itselfWhen tossing a coin, the probability ofheads or tails is fixed (e.g. 0.5), but theoutcome varies. The reason for this isthat we cannot make any better predic-tion of the outcotne of the toss than the"mean". It is important to note here thatit is of no concern whether the outcome isthe result of intrinsically random events(e.g. as is, according to quantum theory,the outcome of atomic events), or wheth-er the influences on the outcome aremany and unknown, turning a theoreti-cally deterministic event into an effectiye-ly random one.

Risk is not something that exists inde-pendent of the knowledge of the personwho assesses it. For exatnple, considertwo tumors, one benign and the othermalignant, occurring, say, with the samefrequency. If we were ignorant as to thedistinction between them, we would con-sider both as one clinical entity, and wewould therefore "know" that individualswith such tutnors have a 50% probabilityof survival. Once, however, we know howto distinguish between the lesions (say,as a consequence of the development of

a new technique) then we could state thatindividuals with the malignant type have,if untreated, a near zero chance of survi-val, whereas those with the benign typehave a near 100% chance (i.e. the risk isnow qualitatively different merely as aresult of a change in our knowledge).Insurance companies are constantly try-ing to improve their capability of makingaccurate predictions (i.e. their knowledgeof outcome); the irony is that were theyto be completely successful they wouldsoon be out of business!

For many diseases, including bothdental caries and periodontal disease, itis possible to identify two "sources" ofignorance which may lead to random (orunpredictable) outcome: first, ignoranceof the exact etiological mechanisms in-volved in the disease (e.g. genetic, immu-nological, microbial, etc.), i.e. the less weknow about the exact mechanisms, theless accurately we can predict the out-come; secondly, ignorance about exposi-tion to etiological factors between timeof risk assessment and time of outcome,which may also lead to unpredictableoutcomes: for example, supposing weknew everything about etiological mech-anisms involved in dental caries, wewould still have no idea how each of thefactors involved (e.g. sugar consumptionhabits, salivary flow rates, salivary buffercapacity of an individual) will vary overthe period between risk assessment andoutcome. The combined effect of thesetwo sources of ignorance can, perhaps,best be illustrated by the example of thedecline in caries prevalence in industrial-ized countries over the last two decades.Had we endeavored to predict in the1960s the likely trends in dental cariesprevalence in the 1980s, we would havepredicted considerably higher levels thanin fact occurred. Part of the change inthe prevalence of dental caries over thisperiod has occurred as a result of unpre-dictable changes in known risk factors(e.g. use of topical fluorides), and part

due to changes in parameters that arehitherto poorly understood (i.e. aboutwhich we are ignorant).

From the above it should be apparentthat variations in outcome (levels, distri-butions, etc.) will say little about varia-tions in risk.

Recently, there has been sotne interestin trying to estimate the potential forassessing risk from the shape of the distri-bution of the outcome variable (e.g. lossof attachment, DMF scores, etc.), theargument being that if the distribution ofthe outcome variable is not nortnal (i.e.it has a long "tail", or is skewed), then itcannot merely be the outcome of a tnyri-ad of stnall unidentifiable effects but, in-stead, must be evidence for the existenceof at least one major etiological factorwhich merely needs to be identified. Theunderlying statistical idea behind this is,perhaps, the central limit theorem whichstates that the sum of the effects of manysmall random variables will be approxi-mately normally distributed. However, ifeffects work multiplicatiyely, then the re-sult of many small effects will certaitilynot be normal. Conversely, an outcomevariable which is linearly related to an(itnportant) risk factor will neverthelessbe normally distributed if the risk factoris itself normally distributed. In short,the shape of the distribution of the out-come variable is completely uninforma-tive (unless, of course everyone has exact-ly the same outcome!) - and the existenceof long "tails" or skewness is thus not apriori evidence for the existence of riskgroups.

The only evidence for the existence ofrisk factors or groups is by actually iden-tifying them lotigitudinally. If we find anassociation between a variable x and theoutcome variably y, t years later, then wecan try to predict the state of y, t yearslater, from x. If this is successful, we haveidentified a risk factor - and those inwhom we find a high value for this factorare the "risk group". In many instances, x

Page 2: What can variations in disease outcome tell us about risk?

Commentary 107

is identical to y, t years earlier, e.g. caries The search for risk factors in dental to the intrinsic randomness or unpredict-at age 6 yr predicting (being a risk factor) diseases is clearly itnportant. Many have ability of the inputs. There is, however,for caries at age 15 yr (i.e. previous disease been found, and no doubt many others no way to tell a priori where the searchexperience being a good predictor of sub- will be found. However, there is clearly for risk factors will be successful andsequent disease development). a limit to what can be predicted owing where it won't.

Page 3: What can variations in disease outcome tell us about risk?