Download pdf - Journal of Process Control - MIT - Massachusetts Institute ...web.mit.edu/braatzgroup/Jiang_JPrcoCon_2015.pdf · Journal of Process Control 32 (2015) ... connective structure information,

Cs

Ba

b

a

ARRAA

KFCCDC

1

litaiaocat

vtcti

M

h0

Journal of Process Control 32 (2015) 109–116

Contents lists available at ScienceDirect

Journal of Process Control

j ourna l ho me pa ge: www.elsev ier .com/ locate / jprocont

anonical variate analysis-based monitoring of process correlationtructure using causal feature representation

enben Jianga,b, Xiaoxiang Zhub, Dexian Huanga, Richard D. Braatzb,∗

Department of Automation, Tsinghua University and Tsinghua National Laboratory for Information Science and Technology, Beijing 100084, ChinaDepartment of Chemical Engineering, Massachusetts Institute of Technology, Cambridge, MA 02139, USA

r t i c l e i n f o

rticle history:eceived 3 February 2015eceived in revised form 21 May 2015ccepted 26 May 2015vailable online 17 June 2015

eywords:ault monitoringorrelation structural fault

a b s t r a c t

Although the monitoring of process variables has been extensively studied, techniques for monitoringfaults in the process correlation structures have not yet been fully investigated. The typical methodsbased on the covariance matrix of the process data for process monitoring have limited capability toeffectively monitor underlying structural changes. This paper proposes a canonical variate analysis (CVA)approach based on the feature representation of causal dependency (CD) for the monitoring of faults asso-ciated with changes in process structures, which employs CD for pretreating the data and subsequentlyutilizes CVA for quantifying dissimilarity. Apart from the improved performance of capturing the under-lying connective structure information, the utilization of the CD feature in the first step provides more

ausal mapimensionality reduction techniqueanonical variate analysis

application-dependent representations compared with the original data, as well as decreased degree ofredundancy in the feature space by incorporating causal information. The effectiveness of the proposedCD-based approach for the monitoring of structural changes is demonstrated for both single faults andmultiple faults in simulation studies of a networked system. In the simulation results, the CD-basedmethod performs the best, followed by correlation-based and then variable-based methods.

© 2015 Elsevier Ltd. All rights reserved.

. Introduction

As manufacturing facilities become increasingly integrated andarge scale, the potential for faults to dynamically propagate in non-ntuitive ways to produce significant harm to equipment, life, andhe environment has increased. The rapid detection of faults andssociated identification of their causes are, therefore, of outmostmportance, which has attracted the interest of many researchersnd practitioners. However, most of the process monitoring meth-ds developed so far, including the multivariate statistical controlhart approaches [1–3], dimensionality reduction techniques [4–8],nd time-series or state-space methods [9,10], focus on the moni-oring of process variable faults [11,12].

Compared with the achievements in the monitoring of processariable faults, limited exploration has been done on the impor-ant complementary problem of monitoring the faults of process
orrelation structures [13]. With the emerging big data concept,he monitoring of correlation structures has become of increasednterest [14]. The explosive growth of process data along with the
∗ Corresponding author at: 77 Massachusetts Avenue, Room E19-551, Cambridge,A 02139, USA. Tel.: +1 617 253 3112; fax: +1 617 258 0546.

E-mail address: [email protected] (R.D. Braatz).

ttp://dx.doi.org/10.1016/j.jprocont.2015.05.004959-1524/© 2015 Elsevier Ltd. All rights reserved.

increasingly large-scale industries provides a valuable resource forthe analysis and operations of industrial processes. This paper aimsto develop a new approach for monitoring changes in the processstructure.

Typical approaches adopted for monitoring the process cor-relations are based on the process information described by thecovariance matrix. Example approaches include the generalizedvariance [15,16,18], the conditional entropy [17], and the likelihoodratio test (LRT) [19,20]. The covariance-based methods experienceinherent limitations in the detection of local anomalies as well asthe subsequent identification of the fault origins. In other words,the inherent structure of the process is not explicitly taken intoaccount in the covariance-based methods. For instance, two vari-ables, x and y, can be connected in several different ways such as (i)direct connection x → y, (ii) indirect connection x → z → y or (iii)co-regulated by a third variable z (z → x and z → y). In all thesecases, the variability between x and y may be similar, and a devi-ation on their underlying causal relationship may pass undetectedand undiscerned by only monitoring the covariance. In order toaccess and use the underlying information of the process corre-
lation structure, alternative measures of structural relationshipsshould be adopted in the process monitoring procedures.
An efficient way to explore the relationship for the under-lying structure amongst variables is using causal information. A

dx.doi.org/10.1016/j.jprocont.2015.05.004

http://www.sciencedirect.com/science/journal/09591524

http://www.elsevier.com/locate/jprocont

http://crossmark.crossref.org/dialog/?doi=10.1016/j.jprocont.2015.05.004&domain=pdf

mailto:[email protected]

dx.doi.org/10.1016/j.jprocont.2015.05.004

110 B. Jiang et al. / Journal of Process C

catsmwpttltdc

cmcttaTtm

onCsc[tdtafCsfirDogtCsp

rScis

2

tr[

using the CVA states as

T −1/2

Fig. 1. Causal linkage of the variables x, y, and z.

ausal map can clearly reflect the directional linkages betweenny two variables. In addition, the information of causal connec-ivity can be used to reduce the correlated degree for the featurepace of correlation, thereby enhancing the performance of faultonitoring. It was shown in [21] that the monitoring performanceas more sensitive in a space of uncorrelated features. For exam-le, in Fig. 1, without considering the causal connectivity amonghe three variables, the feature representation of correlation con-ains three components, i.e., rx,y, ry,z, and rx,z. From the causalinks in Fig. 1, rx,z is redundant with (rx,y, ry,z). In other words,he inclusion of rx,z in the feature representation of correlationoes not provide more information but increases the degree oforrelation.

It is important to note that the directional connectivity of theomponents within the plant can play a significant role in faultonitoring [22–24]. Several papers describe the construction of the

ausal map from a process flow diagram or piping and instrumen-ation diagram [23,25,40]. Their results showed that data-drivenechniques in combination with cause-and-effect relationshipsmong process variables can lead to efficient monitoring of faults.he linkage of data-driven analysis with a causal relationship ofhe process has been identified as a promising approach to fault

onitoring [22,23,25].Previously, the authors introduced a feature representation

f causal dependency (CD) to incorporate the data-driven tech-iques in conjunction with the causal information [26–28]. TheD is utilized to quantify the similarity between the relation-hip of two causally related variables under current operatingonditions and historical operating conditions. It was reported in29] that better fault monitoring results are achieved for featureshat are more application-dependent. In terms of application-ependence in process facilities, the CD is an appropriate featureo represent deviations in the process structure. In this article,n approach based on the CD feature representation is proposedor the monitoring of process structural faults. The proposedD-based approach can potentially facilitate to detect processtructural faults and to identify the root causes, owing to thener description of the underlying structure and the less cor-elated feature space of the CD by utilizing causal information.ifferent from the previous work [27] and [28] which focusedn the monitoring of process variable faults, this article investi-ates the monitoring of process correlation structures. In addition,his work incorporates a dimensionality reduction step based onanonical Variate Analysis (CVA), which generates dynamic state-pace models from the process data to improve the monitoringroficiency.

The rest of this paper is organized as follows. Section 2 brieflyeviews the traditional CVA-based data-driven monitoring method.ection 3 describes the CD-based approach for monitoring pro-ess structural faults. The effectiveness of the proposed methods demonstrated by a gene network system in Section 4. Section 5ummarizes the conclusions.

. Canonical variate analysis

The Canonical Variate Analysis (CVA) method, which serves as
he technique in the step of dissimilarity quantification, is brieflyeviewed in this section. More details of CVA can be found in30,31].
ontrol 32 (2015) 109–116

2.1. CVA theorem

CVA is a dimensionality reduction technique in multivariatestatistical analysis which maximizes the correlation between twoselected sets of variables. Consider vectors of process variablesx ∈ Rm and y ∈ Rn with covariance matrices ˙xx and ˙yy and cross-covariance matrix ˙xy . The orthogonal basis is chosen as thoselinear combinations of a variable set x that are more correlatedwith the linear combinations of another variable set y in the CVAapproach, where these linear combinations are named canonicalvariables and are represented as c and d, namely

c = Jx (1)

d = Ly (2)

where J and L are projection matrices that can be calculated bysolving the SVD [31]:

˙−1/2xx ˙xy˙−1/2

yy = U˙VT (3)

where is the diagonal matrix of nonnegative singular values indescending order, and U and V are matrices of the right and leftsingular vectors. Then the projection matrices J and L are obtainedby

J = UT˙−1/2xx (4)

L = VT˙−1/2yy (5)

The matrices UT and VT guarantee that the canonical variablesare only pairwise correlated, and the matrices ˙−1/2

xx and ˙−1/2yy

scale the canonical variables so that c and d have unit variance.

2.2. CVA algorithm

Hotelling initially proposed the CVA concept for multivariatestatistical analysis, which was not employed to system identifica-tion until Akaike’s work on ARMA models [30]. The CVA methodwas further developed for identifying state-space models by Lari-more [30]. Given time series output data y(t) ∈ Rmy and input datau(t) ∈ Rmu , the linear state-space model is given by

x(t + 1) = Ax(t) + Bu(t) + v(t) (6)

y(t) = Cx(t) + Du(t) + Ev(t) + w(t) (7)

where x(t) is a state vector, v(t) and w(t) are independent whitenoise processes, and A, B, C, D and E are coefficient matrices.

Important to the CVA algorithm is the concept of past and futurevectors. At a particular time instant t ∈ (1, · · ·, n), the past vectorp(t) containing the past outputs and inputs is

p(t) =[

yT(t − 1), yT(t − 2), · · ·, yT(t − l), uT(t − 1), uT(t − 2), · · ·, uT(t − l)]T

(8)

and the future vector f (t) comprising of the outputs in the presentand future is

f (t) =[yT(t), yT(t + 1), · · ·, yT(t + h)

]T(9)

where l and h are the numbers of lags in vectors p(t) and f(t), respec-tively.

By substituting the matrix ˙xy with ˙pf, ˙xx with ˙pp, and ˙yy

with ˙ff , the matrices J, L, and D can be calculated via the SVD as(3). The states x(t) in (6) and (7) can be obtained from the data

xd(t) = Jdp(t) = Ud˙pp p(t) (10)

where Jd = UTd−1/2

pp and Ud contains the first d columns of U in(3) [31].

ocess C

im

2

fw

T

a

T

w

x

a[

cr

3m

3

vtttdlavs

toptomctrit

ritbcdncv

ttap

B. Jiang et al. / Journal of Pr

The numbers of lags l and h and the state order d is not knownn advance, which can be determined by the lags and the order that

inimize the Akaike information criterion [31].

.3. Process monitoring measures

In the CVA method, two types of statistics Ts and Tr are computedrom the process model in normal operating conditions (NOC),hich are defined as

s(t) = xTd(t)xd(t) (11)

nd

r(t) = xTe (t)xe(t) (12)

here

e(t) = Jep(t) = UTe−1/2

pp p(t) (13)

nd Ue contains the last e = l(mu + my) − d columns of U in (3)31,32].

The Ts statistic measures the variations inside the canoni-al states, while the Tr statistic measures the variations in theesiduals.

. The proposed causal dependence-based approach foronitoring process structural faults

.1. Feature representation of causal dependence

Feature representation, which is a step that generates derivedalues (features) from an initial set of data to facilitate the desiredask, is crucial for fault monitoring. Sarfraz et al. [29] reportedhat better fault monitoring results are achieved for featureshat are more application-dependent. In terms of application-ependence of the feature generation, features that measure the

inkages between process variables (e.g., correlation coefficient)re more appropriate than features directly using the processariables themselves for the monitoring of process correlationtructures.

In addition, the performance of process monitoring is reportedo be more sensitive in a space of uncorrelated features [21]. Inther words, less redundancy in the feature space results in betterrocess monitoring results. Noticeably, the cause-and-effect rela-ionships among process variables can be used to reduce the degreef correlation in the feature representation. In the example of Fig. 1,onitoring based on traditional correlation features require the

orrelation between every two variables (rx,y, ry,z, and rx,z), evenhough the correlation coefficient rx,z is redundant with the cor-elation coefficients (rx,y, ry,z). Such redundancy can be avoided byncorporating the information on the causal connectivity amonghe variables.

In this regard, Chiang and Braatz [26,27] proposed a featureepresentation that incorporates multivariate statistics with causalnformation, named the Causal Dependence (CD), to detect devia-ions in the relationship between causally related variables. Theasic idea of the CD-based feature measures for process structuralhanges is that the causal dependencies (CDs) under abnormal con-itions show some deviation from the distribution of the CDs underormal conditions. In the CD-based feature representation, the pro-ess behavior is measured by the CDs instead of by the processariables themselves.

The CD, which is derived based on the multivariate T2 statis-
ic, is used to quantify the similarity between the relationship ofwo causally related variables under current operating conditionsnd historical operating conditions. When the CD is larger than theredefined threshold, the causal dependency of the two variables
ontrol 32 (2015) 109–116 111

is violated. The CD requires a causal map containing the causalrelationship between all of the measured variables, which can bederived either by a priori knowledge of the process or by the tech-niques of causality, as for example [33].

To formally define the causal dependency, let c refer to the causevariable and e as the effect variable. The CD is based on the multi-variable T2 statistic, which is defined by

CDc,e = T2c,e = (y − y)TS−1

c,e (y − y) (14)

where y = [c e]T, y is the mean of y, and Sc,e is the sample covari-ance for variables c and e. Based on the training data under NOC,the mean of the variable y = [c e]T and the sample covariance Sc,e

are computed for each causal dependency.For the example in Fig. 1, only two components (CDx,y and CDy,z)

are required in the feature representation of the CD. Accordingto the application-dependent and uncorrelated aspects of the fea-ture representations, the CD-based feature is expected to performthe best for the monitoring of process structures, followed by thecorrelation-based and then the variable-based features (which isdemonstrated for case studies in Section 4).

Remark 1. The CD feature can be extended to handle serial cor-relations in the data by augmenting the observation vectors with

time lags, i.e., y =[

c(t) c(t + 1) · · ·e(t) e(t + 1) · · ·

].

Remark 2. The feature representation of the CD is computa-tionally efficient for the feature generation step, compared to thecorrelation feature representation, especially when the numberof process variables n is very large. This difference in computa-tional cost is a result of the typically high number of correlationcoefficients calculated in the latter (n(n − 1)/2, for every pairing oftwo variables), while only the coefficient calculations between thecausally related variables are required for the CD.

3.2. Dissimilarity quantification and fault detection

Once the calculation of CDs for normal conditions (training data)is completed in (14), quantification of dissimilarity among the CDsis performed, as well as the determination of thresholds for normaloperating limits. Distance- or angle-based metrics are typically uti-lized for the dissimilarity quantification [34]. The distance-basedmetrics are more often encountered for fault detection methods,including the principal component analysis (PCA)-, partial least-squares (PLS)-, and CVA-based methods.

This article uses the CVA method to assess the dissimilarityamong the CDs obtained from normal operating data. In otherwords, CVA is performed on the training CDs to determine the uppercontrol limits. Two detection indices termed as Ds and Dr, similarto the CVA’s Ts and Tr ((11) and (12)) respectively, are defined as

Ds(t) = xTd(t)xd(t) (15)

and

Dr(t) = xTe (t)xe(t) (16)

where the past vector p(t) and the future vector f (t) are

p(t) =[CDT(t − 1), CDT(t − 2), · · ·

]T(17)

and

f (t) =[CDT(t), CDT(t + 1), · · ·

]T(18)

The correlation structures of the process is considered normal
if the dissimilarity statistics are below the thresholds, i.e., Ds ≤ T2
˛and Dr ≤ ı2

˛, where T2˛ and ı2

˛ denote the upper control limits forthe dissimilarity index in the canonical states and residuals witha significance level ˛, respectively. CVA is one way to determine

112 B. Jiang et al. / Journal of Process Control 32 (2015) 109–116

Table 1Definition of the faults and respective variables involved for correlation structures. ı is a multiplicative factor that changes the model parameters (ı = 1 for NOC).

Fault Correlation structure changed

0u1(t

1(t − 10x1(t

tcs

atttbtcval

4

CicvaCatesos

c[a

GN1 u1 → x1 (x1(t) = 1.2ı (u1(t) + 0.60u1(t − 1) + 0.3

GN2

{u1 → y6

x1 → x2

({y6(t) = 1 + 0.40ı (u1(t) + 0.60ux2(t) = 0.05 + 0.22ı (x1(t) − 0.4

he dissimilarities among different samples; other approachesan be utilized to derive distance-based or angle-based similaritytatistics.

Similar to the standard PCA and CVA method, the CDs gener-ted from the training and testing data are first normalized beforehe SVD is implemented. Additionally, T2

˛ and ı2˛ in the CD con-

ext can be derived in two ways. One way is the analytical methodhat computes the thresholds based on assumed Gaussian distri-utions [35]. The other way is the empirical approach based onhe calibration samples under NOC [32,36]. For instance, a 99%onfidence upper control limit can be obtained as the Ds or Dr

alue below which 99% of the calibration data are located. Thisrticle uses the latter method to determine the upper controlimits.

. Case studies for a gene network

In this section, a gene network is used to evaluate the proposedD-based approach for the monitoring of process structural faults,

n comparison with correlation-based (i.e., perform CVA on theorrelation coefficients where the two statistics that measure theariations inside the canonical states, and the residuals are termeds Cs and Cr, respectively) and variable-based (i.e., directly performVA on the process variables where the two statistics are termeds Ts and Tr) methods. In each case study, the normal data set forraining contains 2500 observations; and the testing data set forach fault also consists of 2500 observations. Each faulty data settarted with no faults, and the faults were triggered after the 1000thbservation into the run. The data have a sampling time of oneecond.

The proposed CD-based approach for the monitoring of pro-
ess structural faults is evaluated for a gene network adapted from37], which is composed by 16 nodes (or variables) causally relatedccording to the description provided in Fig. 2.
Fig. 2. Causal description of a gene network [37].

− 2)) + 0.8u2(t) + v1(t))) − 0.30u1(t − 2)) + v8(t)− 1) − 0.20x1(t − 2)) + v3(t)

)

In the case studies, the original variable relationships are lin-earized according to⎧⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎨⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎪⎩

x1(t) = 1.2 (u1(t) + 0.60u1(t − 1) + 0.30u1(t − 2)) + 0.8u2(t) + v1(t)

y1(t) = 0.60 (x1(t) + 0.50x1(t − 1) + 0.20x1(t − 2)) + v2(t)

x2(t) = 0.05 + 0.22 (x1(t) − 0.40x1(t − 1) − 0.20x1(t − 2)) + v3(t)

y2(t) = 1 + 0.40 (x1(t) − 0.20x1(t − 1) − 0.10x1(t − 2)) + v4(t)

y3(t) = 0.062 + 0.16 (x1(t) + 0.40x1(t − 1) + 0.60x1(t − 2)) + v5(t)

y4(t) = 0.60 (x1(t) + 0.80x1(t − 1) + 0.10x1(t − 2)) + v6(t)

y5(t) = 0.70 (x1(t) + 0.40x1(t − 1) + 0.20x1(t − 2)) + v7(t)

y6(t) = 1 + 0.40 (u1(t) + 0.60u1(t − 1) − 0.30u1(t − 2)) + v8(t)

y7(t) = 0.56 + 0.15 (u1(t) + 0.40u1(t − 1) + 0.60u1(t − 2)) + v9(t)

y8(t) = 0.80 (u3(t) + 0.60u3(t − 1) + 0.30u3(t − 2)) + 0.51x2(t) + v10(t)

y9(t) = 1.30 (x2(t) + 0.50x2(t − 1) + 0.50x2(t − 2)) + v11(t)

y10(t) = 1 + 0.40 (x2(t) + 0.40x2(t) + 0.60x2(t)) + v12(t)

y11(t) = 0.028 + 1.30 (x2(t) + 0.60x2(t) − 0.30x2(t)) + v13(t)

(19)

where u1, u2, and u3 are the input variables, x1 and x2 are the states,and the rest are outputs. Each vi is a white-noise sequence witha signal-to-noise ratio of 10%. The system was subject to the twofaults (see Table 1), where Fault GN1 is a single fault and Fault GN2 isa multiple fault. The statistics for detecting a single fault are directlyapplicable for detecting multiple faults since the thresholds for (15)and (16) depend only on the data from the normal operating con-ditions. For the faults, ı denotes a multiplicative factor that causesa change in a model parameter.

4.1. Single-fault case study

A single fault (Fault GN1) is examined in this scenario, whichoccurs in the process structure from input variable u1 to state vari-able x1. In order to provide a sound comparative assessment ofthe detection performances, the false alarm rates are maintainedat the same level ( = 1%) for comparing the missed detection ratesin different circumstances throughout this study. The same proce-dure was repeated 1000 times in order to estimate the confidencelevels of the missed detection rates at different fault magnitudes.The mean missed detection rate as a function of the fault magni-tude ı for Fault GN1 by using the CD-based, correlation-based, andvariable-based methods are plotted in Fig. 3.

As the fault magnitude increases, the mean missed detectionrate decreases in all of the methods. Moreover, the mean misseddetection rate in all three methods approach zero as the fault mag-nitude increases, indicating the enhanced detectability for largefaults. In addition, the gap in the missed detection rates betweenthe CD-based and correlation-based methods enlarges as the mag-nitude of fault increases from 1.1 to 1.5, while the gap between theCD-based and variable-based methods enlarges as fault magnitudeincreases from 1.1 to 1.8.

For the same fault magnitude ı, the performance of fault detec-tion using the correlation-based method always outperformed that
using the variable-based method (Fig. 3), which confirms thatthe correlation-based feature representation was more appropri-ate for the monitoring of the correlation structural faults thanvariable-based feature representation. In addition, the CD-based

B. Jiang et al. / Journal of Process Control 32 (2015) 109–116 113

1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 20

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Fault magnitude δ

Th

e m

ea

n m

isse

d d

ete

ctio

n r

ate

CD

Variable

Correlation

Fig. 3. The mean missed detection rate as a function of the fault magnitude for FaultGN1 (ı = 1 under NOC). The methods in the legend are: CD (i.e., perform CVA on theCdw

maeocivCTatatt

tpt

rpa

dsriodlatfFastc

4

ti

At the fault magnitude ı = 1.3, the mean missed detection rate forthe CD-based algorithm is more than a factor of 2.5 lower (31.3%vs. 81.1%) than for the variable-based algorithm, and nearly a fac-tor of 2.3 for the correlation-based algorithm (31.3% vs. 74.5%).

Ds), correlation (i.e., perform CVA on the correlation coefficients), and Variable (i.e.,irectly perform CVA on the process variables) methods. For each method, a faultas indicated if either the state or residual statistic violated threshold.

ethod always performed better than both the correlation-basednd variable-based methods for the same fault magnitude ı. Forxample, at the fault magnitude ı = 1.6, the missed detection ratesf the CD-based algorithm is 34.4%, compared to 50.1% to theorrelation-based algorithm, which is more than a factor of 1.4mproved detection for the CD-based method. Compared to theariable-based algorithm, the mean missed detection rate for theD-based algorithm is nearly a factor of 1.7 lower (34.4% vs. 57.6%).he main reasons for the superior performance of the CD-basedpproach are that (i) the CD-based feature representation can effec-ively capture the changes in the process correlation structures (thepplication-dependent aspect); and (ii) the lower redundancy ofhe CD-based feature representation leads to the improved moni-oring performance by incorporating the causal information.

Overall, the results are consistent with the above discussionshat the CD-based feature performs the best for the monitoring ofrocess structural faults, followed by the correlation-based, andhen the variable-based features.

Once a fault has been detected, a propagation path of faulty cor-elations can be derived by using causal linkages. Such informationlays an important role in the determination of the root cause ofbnormal events.

Contribution plots [31] are the commonly used technique foretermining which components are most likely related to thetatistics no longer being within normal operation (aka “faulty cor-elation”). Here the CVA-based contributions [38] are utilized todentify the faulty correlations of Fault GN1. The contribution plotf the structural change with the case of fault’s magnitude ı = 1.5 isisplayed in Fig. 4 as a color map [39], which indicates that corre-

ations u1 → x1, u2 → x1, x1 → x2, x1 → y4, x1 → y5, x2 → y8, x2 → y9,nd x2 → y11 are faulty for Fault GN1. Further, considering the direc-ional links among these faulty correlations, a propagation path ofaulty correlations for Fault GN1 can be obtained, as shown in Fig. 5.ig. 5 indicates that the faulty correlation u1 → x1 and/or u2 → x1 aressociated with the source of the propagation path and are mosttrongly associated with the origin of Fault GN1. This identifica-ion is accurate, as Fault GN1 is associated with a change in theorrelation u1 → x1 (Table 1).

.2. Multiple faults case study

Multiple faults of correlation structures occurring at the sameime are likely to happen for many industrial processes. Fornstance, this situation can be caused by a change in a physical

Fig. 4. CVA-based contributions for Fault GN1 with fault magnitude ı = 1.5.

coefficient that affects at least two different correlations. The taskof identifying multiple faults is rather challenging. A multiple fault(Fault GN2) in the gene network system is used to further inves-tigate the performance of the proposed approach for monitoringstructural changes in this part. Fault GN2 occurs in the correlationsof u1 → y6 and x1 → x2. The mean missed detection rate as a func-tion of the fault magnitude ı for Fault GN2 by using the CD-based,correlation-based, and variable-based methods are plotted in Fig. 6.

Again, as the fault magnitude ı increases, the mean misseddetection rate decreases in all of the methods (Fig. 6). The meanmissed detection rates in all three methods also approach zero asthe fault magnitude increases, indicating the improved detectabil-ity for larger faults. The mean missed detection rate is sensitive tothe fault magnitude for small to intermediate values of ı, with thefault magnitude of highest sensitivity dependent on the method.

For the same fault magnitude ı, the CD-based algorithm per-forms the best for the monitoring of Fault GN2, followed by thecorrelation-based and then the variable-based algorithms (Fig. 6).

Fig. 5. A propagation path of faulty correlations for Fault GN1 (faulty correlationsare shown as vectors between nodes with red color).

114 B. Jiang et al. / Journal of Process C

1.1 1.2 1.3 1.4 1.5 1.6 1.7 1.8 1.9 20

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

Fault mag nitude δ

The

me

an

mis

sed

de

tecti

on r

ate

CD

Variable

Correlation

Fig. 6. The mean missed detection rate as a function of the fault magnitude for FaultGN2 (ı = 1 under NOC). The methods in the legend are: CD (i.e., perform CVA on theCdw

T(e(

Ds), correlation (i.e., perform CVA on the correlation coefficients), and Variable (i.e.,irectly perform CVA on the process variables) methods. For each method, a faultas indicated if either the state or residual statistic violated threshold.

he main reasons are similar as discussed above, which are thati) the deviations in the process correlation structures can beffectively captured by the CD-based feature representation; andii) the improved monitoring performance results from the less

(a)

(c)

500 1000 1500 2000 25000

20

40

60

Ts

500 1000 1500 2000 25000

20

40

60

80

Time

Tr

C

500 1000 0

500

1000

1500

Ds

500 1000 0

500

1000

1500

2000

2500

Time

Dr

Fig. 7. Multivariate statistics for the detection for Fault GN2 with the fault magnitude

ontrol 32 (2015) 109–116

redundancy of the CD-based feature representation by incorporat-ing causal information.

The objective of fault detection is to be sensitive enough to detectall possible faults, while being robust to data that are independent ofthe training set. The sensitivity was quantified by the mean misseddetection rate (aka type II error [31,32]) for faults in the testingset. The robustness was quantified by the false alarm rate (type Ierror [31,32]) for the testing set under NOC. The extent to whichthe methods (with statistics Ds, Dr, Cs, Cr, Ts, and Tr) are sensitive toFault GN2 is displayed in Fig. 7 with the type I error maintained atthe same level ( = 1%) in all methods to enable a fair comparisonof the type II errors. Both the state and residual statistics Ds, Dr forthe CD method much more persistently indicate a fault comparedto the other methods.

The contributions of the structural changes with the fault mag-nitude of ı = 1.5 for Fault GN2 are plotted in Fig. 8. The faultycorrelations are u1 → y6, x1 → x2, x2 → y8, x2 → y9, and x2 → y11. Thepropagation paths of faulty correlations for Fault GN2 are displayedin Fig. 9, which indicates that there are two separate propagationpaths, i.e., u1 → y6 and x1 → x2 → y8(x2 → y9, x2 → y11). Fig. 8 indi-cates that Fault GN2 corresponds to multiple faults, with the faultycorrelations u1 → y6 and x1 → x2 being the sources of the two prop-
agation paths, that is, are most likely related to the root causesof Fault GN2. The faulty correlations u1 → y6 and x1 → x2 indicatethat the structural faults occur in the relationships between vari-able u1 and variable y6 and between variable x1 and variable x2:
(b)

0 500 1000 1500 2000 25000

50

100

150

200

Cs

0 500 1000 1500 2000 25000

500

1000

1500

2000

Time

r

1500 2000 2500

1500 2000 2500

ı = 1.6 for (a) variable-based, (b) correlation-based, and (c) CD-based methods.

B. Jiang et al. / Journal of Process C

Fig. 8. CVA-based contributions for Fault GN2 with fault magnitude ı = 1.5.

Fig. 9. A propagation path of faulty correlations for Fault GN2 (faulty correlationsaet

t(

af

5

faefiutubscis

[

[

[

[

[

[

[

[

[

[

[

[

[

[

[

[

re shown as vectors between nodes with red color). (For interpretation of the ref-rences to color in this figure legend, the reader is referred to the web version ofhis article.)

he proposed approach identified the correct causes of Fault GN2Table 1).

The results demonstrate effectiveness of the proposed CD-basedpproach not only for identifying the case of single faults but alsoor multiple faults.

. Conclusions

This article presents a causal dependency (CD)-based approachor the monitoring of process correlation structures. Two stepsre involved in the proposed monitoring approach: (1) the gen-ration of CD features and (2) dissimilarity quantification. Therst step provides means to construct application-dependent andncorrelated features that facilitate the subsequent step for quan-ifying similarity or dissimilarity. In addition to handling thenderlying connective structure information in the data, the CD-ased approach can also bring the benefits for detecting process
tructural faults and for identifying the origins of such structuralhanges, owing to the less correlated feature space of the CDs byncorporating causal information. The simulation results demon-trate effectiveness of the proposed CD-based method for the cases
[

[

ontrol 32 (2015) 109–116 115

of both single faults and multiple faults for a gene network model.The proposed approach also applies to systems with material andenergy recycles and feedback control loops. The causal map is suf-ficiently general to handle these more complicated relationshipsbetween variables, with the causal dependencies and canoni-cal variate analysis computed the same way as described in thearticle

Acknowledgements

This work is supported by the National Basic Research Programof China (2012CB720505) and the National Natural Science Founda-tion of China (21276137). The first author is grateful for the financialsupport from China Scholarship Council.

References

[1] W.A. Shewhart, Economic Control of Quality of Manufactured Product, D. VanNostrand Company, Inc., New York, 1931.

[2] H. Hotelling, The generalization of student’s ratio, Ann. Math Stat. 2 (1931)360–378.

[3] R.B. Crosier, Multivariate generalizations of cumulative sum quality-controlschemes, Technometrics 30 (1988) 291–303.

[4] A. Raich, A. Cinar, Statistical process monitoring and disturbance diagnosis inmultivariable continuous processes, AIChE J. 42 (1996) 995–1009.

[5] J.F. MacGregor, C. Jaechle, C. Kiparissides, M. Koutoudi, Process monitoring anddiagnosis by multiblock PLS methods, AIChE J. 40 (1994) 826–838.

[6] B.M. Wise, N.B. Gallagher, The process chemometrics approach to process mon-itoring and fault detection, J. Process Control. 6 (1996) 329–348.

[7] W. Li, H.H. Yue, S. Valle-Cervantes, S.J. Qin, Recursive PCA for adaptive processmonitoring, J. Process Control 10 (2000) 471–486.

[8] W. Ku, R.H. Storer, C. Georgakis, Disturbance detection and isolation by dynamicprincipal component analysis, Chemom. Intell. Lab. Syst. 30 (1995) 179–196.

[9] A. Negiz, A. C inar, PLS, balanced, and canonical variate realization techniquesfor identifying VARMA models in state space, Chemom. Intell. Lab. Syst. 38(1997) 209–221.

10] A. Negiz, A. C inar, Statistical monitoring of multivariable dynamic processeswith state-space models, AIChE J. 43 (1997) 2002–2020.

11] B. Abbasi, S. Niaki, M. Abdollahian, S. Hosseinifard, A transformation based mul-tivariate chart to monitor process dispersion, Int. J. Adv. Manuf. Technol. 44(2009) 748–756.

12] V.B. Ghute, D.T. Shirke, A multivariate synthetic control chart for process dis-persion, Qual. Technol. Quant. Manag. 5 (2008) 271–288.

13] L. Liu, J. Zhong, Y. Ma, A multivariate synthetic control chart for monitoringcovariance matrix based on conditional entropy, in: E. Qi, J. Shen, R. Dou (Eds.),Proceedings of the 19th International Conference on Industrial Engineering andEngineering Management, Springer, Berlin/Heidelberg, 2013, pp. 99–107.

14] V. Mayer-Schonberger, K. Cukier, Big Data: A Revolution That Will TransformHow We Live, Work, and Think, John Murray Publishers, London, 2013.

15] M.A. Djauhari, Improved monitoring of multivariate process variability, J. Qual.Technol. 37 (2005) 32–39.

16] F.B. Alt, Multivariate quality control, in: N.L. Johnson, S. Kotz, C.R. Read (Eds.),The Encyclopedia of Statistical Sciences, Wiley, New York, 1984, pp. 110–122.

17] J.L. Guerrero-Cusumano, Testing variability in multivariate quality control: aconditional entropy measure approach, Inf. Sci. 86 (1995) 179–202.

18] M.A. Djauhari, M. Mashuri, D.E. Herwindiati, Multivariate process variabil-ity monitoring communications in statistics, Theory Methods 37 (2008)1742–1754.

19] F.B. Alt, N.D. Smith, Multivariate process control, in: P.R. Krishnaiah, C.R. Rao(Eds.), Handbook of Statistics, Elsevier, 1988, pp. 333–351.

20] C.L. Yen, J.J.H. Shiau, A multivariate control chart for detecting increases inprocess dispersion, Stat. Sin. 20 (2010) 1683–1707.

21] T.J. Rato, M.S. Reis, Sensitivity enhancing transformations for monitoring theprocess correlation structure, J. Process Control. 24 (2014) 905–915.

22] N.F. Thornhill, A. Horch, Advances and new directions in plant-wide disturb-ance detection and diagnosis, Control Eng. Pract. 15 (2007) 1196–1206.

23] J. Thambirajah, L. Benabbas, M. Bauer, N.F. Thornhill, Cause-and-effect anal-ysis in chemical processes utilizing XML, plant connectivity and quantitativeprocess history, Comput. Chem. Eng. 33 (2009) 503–512.

24] H. Jiang, R. Patwardhan, S.L. Shah, Root case diagnosis of plant-wide oscilla-tions using the concept of the adjacency matrix, J. Process Control 19 (2009)1347–1354.

25] N.F. Thornhill, J.W. Cox, M.A. Paulonis, Diagnosis of plant-wide oscillationthrough data-driven analysis and process understanding, Control Eng. Pract.11 (2003) 1481–1490.

26] L.H. Chiang, Fault Detection and Diagnosis for Large-scale Systems, Universityof Illinois, Urbana, Illinois, 2001, Ph.D. Thesis.

27] L.H. Chiang, R.D. Braatz, Process monitoring using causal map and multivariatestatistics: fault detection and identification, Chemom. Intell. Lab. Syst. 65 (2003)159–178.

http://refhub.elsevier.com/S0959-1524(15)00126-2/sbref0205















































































































































































































































































































































































































































































































































































































































1 ocess C

[

[

[

[

[

[

[

[

[

[

[

17–25.

16 B. Jiang et al. / Journal of Pr

28] L.H. Chiang, B. Jiang, X. Zhu, D. Huang, R.D. Braatz, Diagnosis of multiple andunknown faults using the causal map and multivariate statistics, J. ProcessControl 28 (2015) 27–39.

29] M.S. Sarfraz, O. Hellwich, Z. Riaz, Feature extraction and representation for facerecognition, in: M. Oravec (Ed.), Face Recognition, InTech, Croatia, 2010, pp.1–20.

30] W.E. Larimore, Canonical variate analysis in control and signal processing,in: T. Katayama, S. Sugimoto (Eds.), Statistical Methods in Control and SignalProcessing, Marcel Dekker Inc., New York, 1997, pp. 83–120.

31] L.H. Chiang, E.L. Russell, R.D. Braatz, Fault Detection and Diagnosis in IndustrialSystems, Springer Verlag, London, UK, 2001.

32] E.L. Russell, L.H. Chiang, R.D. Braatz, Fault detection in industrial processesusing canonical variate analysis and dynamic principal component analysis,
Chemom. Intell. Lab. Syst. 51 (2000) 81–93.
33] C.W.J. Granger, Investigating causal relationships by econometric models andcross-spectral methods, Econometrica 37 (1969) 424–438.

34] A. Raich, A. C inar, Diagnosis of process disturbances by statistical distance andangle measures, Comput. Chem. Eng. 21 (1997) 661–673.

[

[

ontrol 32 (2015) 109–116

35] S.J. Qin, Statistical process monitoring: basics and beyond, J. Chemom. 17 (2003)480–502.

36] B.M. Wise, N.B. Gallagher, S.W. Butler, D.D. White, G.G. Barna, A compari-son of principal component analysis, multiway principal component analysis,trilinear decomposition and parallel factor analysis for fault detection in asemiconductor etch process, J. Chemom. 13 (1999) 379–396.

37] Y. Tamada, S. Kim, H. Bannai, S. Imoto, K. Tashiro, S. Kuhara, S. Miyano, Estimat-ing gene networks from gene expression data by combining Bayesian networkmodel with promoter element detection, Bioinformatics 19 (2003) ii227–ii236.

38] B. Jiang, D. Huang, X. Zhu, F. Yang, R.D. Braatz, Canonical variate analysis-based contributions for fault identification, J. Process Control 26 (2015)

39] X. Zhu, R.D. Braatz, Two-dimensional contribution map for fault identification,IEEE Control Syst. 34 (2014) 72–77.

40] G. Lee, S. Song, E.S. Yoon, Multiple-fault diagnosis based on system decompo-sition and dynamic PLS, Ind. Eng. Chem. Res. 42 (2003) 6145–6154.