13
This article was downloaded by: [University of Western Ontario] On: 12 November 2014, At: 06:41 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK Journal of Business & Economic Statistics Publication details, including instructions for authors and subscription information: http://www.tandfonline.com/loi/ubes20 A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations Francesco Audrino a & Fabio Trojani b a Department of Economics, Institute of Mathematics and Statistics, University of St. Gallen, CH-9000 St. Gallen, Switzerland b University of Lugano and Swiss Finance Institute, CH-6900 Lugano, Switzerland Published online: 01 Jan 2012. To cite this article: Francesco Audrino & Fabio Trojani (2011) A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations, Journal of Business & Economic Statistics, 29:1, 138-149, DOI: 10.1198/ jbes.2010.08117 To link to this article: http://dx.doi.org/10.1198/jbes.2010.08117 PLEASE SCROLL DOWN FOR ARTICLE Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and should be independently verified with primary sources of information. Taylor and Francis shall not be liable for any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of the Content. This article may be used for research, teaching, and private study purposes. Any substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http:// www.tandfonline.com/page/terms-and-conditions

A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

  • Upload
    fabio

  • View
    216

  • Download
    0

Embed Size (px)

Citation preview

Page 1: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

This article was downloaded by: [University of Western Ontario]On: 12 November 2014, At: 06:41Publisher: Taylor & FrancisInforma Ltd Registered in England and Wales Registered Number: 1072954 Registered office: MortimerHouse, 37-41 Mortimer Street, London W1T 3JH, UK

Journal of Business & Economic StatisticsPublication details, including instructions for authors and subscription information:http://www.tandfonline.com/loi/ubes20

A General Multivariate Threshold GARCH Model WithDynamic Conditional CorrelationsFrancesco Audrinoa & Fabio Trojaniba Department of Economics, Institute of Mathematics and Statistics, University of St.Gallen, CH-9000 St. Gallen, Switzerlandb University of Lugano and Swiss Finance Institute, CH-6900 Lugano, SwitzerlandPublished online: 01 Jan 2012.

To cite this article: Francesco Audrino & Fabio Trojani (2011) A General Multivariate Threshold GARCH ModelWith Dynamic Conditional Correlations, Journal of Business & Economic Statistics, 29:1, 138-149, DOI: 10.1198/jbes.2010.08117

To link to this article: http://dx.doi.org/10.1198/jbes.2010.08117

PLEASE SCROLL DOWN FOR ARTICLE

Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) containedin the publications on our platform. However, Taylor & Francis, our agents, and our licensors make norepresentations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose ofthe Content. Any opinions and views expressed in this publication are the opinions and views of the authors,and are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be reliedupon and should be independently verified with primary sources of information. Taylor and Francis shallnot be liable for any losses, actions, claims, proceedings, demands, costs, expenses, damages, and otherliabilities whatsoever or howsoever caused arising directly or indirectly in connection with, in relation to orarising out of the use of the Content.

This article may be used for research, teaching, and private study purposes. Any substantial or systematicreproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in anyform to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://www.tandfonline.com/page/terms-and-conditions

Page 2: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

A General Multivariate Threshold GARCH ModelWith Dynamic Conditional Correlations

Francesco AUDRINODepartment of Economics, Institute of Mathematics and Statistics, University of St. Gallen,CH-9000 St. Gallen, Switzerland ([email protected])

Fabio TROJANIUniversity of Lugano and Swiss Finance Institute, CH-6900 Lugano, Switzerland ([email protected])

We introduce a new multivariate GARCH model with multivariate thresholds in conditional correlationsand develop a two-step estimation procedure that is feasible in large dimensional applications. Optimalthreshold functions are estimated endogenously from the data and the model conditional covariance ma-trix is ensured to be positive definite. We study the empirical performance of our model in two applica-tions using U.S. stock and bond market data. In both applications our model has, in terms of statisticaland economic significance, higher forecasting power than several other multivariate GARCH models forconditional correlations.

KEY WORDS: Dynamic conditional correlations; Multivariate GARCH models; Tree-structuredGARCH models.

1. INTRODUCTION

In this article, we present a new multivariate GARCH modelwith dynamic conditional correlations (DCC) that extends pre-vious approaches by admitting multivariate thresholds in theconditional volatilities and correlations of multivariate time se-ries. This extension allows us to account for rich asymmetriceffects and dependencies of conditional volatilities and correla-tions that are often encountered, for instance, in financial realdata applications. As in the classical Engle (2002) DCC model,our model estimation is numerically feasible in large dimen-sions. Moreover, the positive definiteness of the conditional co-variance matrix is ensured in a natural way by the structure ofthe model. Finally, thresholds in volatilities and correlations ofour model are not fixed ex ante, but are estimated from the datatogether with all other parameters in the model.

To define the threshold function in our model, we extend thetree-structured state space partition in Audrino and Bühlmann(2001) to a setting with multivariate thresholds in volatilitiesand correlations. As shown in Audrino and Trojani (2006) andAudrino (2006), the tree-structured threshold construction canincorporate a potentially large number of multivariate regimesin univariate settings in a parsimonious way. In this article,we study a multivariate model with a potentially large num-ber of tree-structured thresholds in volatilities and correlationsand we develop a feasible estimation strategy that can be ap-plied to estimate the model in large dimensional applicationsas well. The threshold construction is obtained using a binarytree in which each terminal node defines local GARCH-typedynamics for volatilities and correlations over a partition cell ofthe multivariate state space. The estimation is performed by asimple two-step procedure that estimates the number and struc-ture of the underlying thresholds together with the parametersof the local GARCH dynamics for volatilities and correlations.The optimal threshold structure is identified by solving a high-dimensional model selection problem based on the SchwarzBayesian information criterion (BIC).

We estimate our model in two distinct applications to U.S.stock and bond market data and focus on the explanatory powerfor future conditional correlations, comparing the results with aset of competing models in the literature: Engle’s (2002) DCCmodel, Ledoit, Santa-Clara, and Wolf’s (2003) flexible multi-variate GARCH model, and Pelletier’s (2006) regime switch-ing dynamic correlations (RSDC) model. As in our model, theDCC and the RSDC models can be estimated by a two-step pro-cedure that separates the estimation of the conditional volatilityand correlation dynamics. In order to measure, where possi-ble, the additional forecasting power for correlations, we es-timate these models using a set of univariate tree-structuredGARCH dynamics for volatility identical to the one in ourmodel. The flexible multivariate GARCH model cannot be es-timated by a two-step estimation procedure. Therefore, the es-timated volatility dynamics are different from the dynamics ofour model. Our model differs from the others in the way it spec-ifies the correlation dynamics. The DCC and the flexible mul-tivariate GARCH models are single-regime models for correla-tions and the RSDC model specifies a very simple regime struc-ture for conditional correlations. By contrast, our setting canaccount parsimoniously for GARCH-type dynamics and multi-variate conditional correlation thresholds without ex ante fixingthe structure and the number of thresholds.

Using our tree-structured GARCH-DCC model, we empiri-cally study the relative importance of GARCH and threshold ef-fects in the conditional correlation dynamics of U.S. stock andbond returns. The conditional volatility functions of U.S. stockreturns exhibit GARCH and threshold features, but conditionalcorrelations depend on a piecewise constant threshold function.By contrast, we find both threshold and GARCH effects for thecorrelation process of stock and government bond returns, with

© 2011 American Statistical AssociationJournal of Business & Economic Statistics

January 2011, Vol. 29, No. 1DOI: 10.1198/jbes.2010.08117

138

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 3: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

Audrino and Trojani: General Multivariate Threshold GARCH Model 139

estimated thresholds that are functions of lagged stock and bondreturns. In all applications, the estimated tree-structured parti-tion of the state space improves the forecasting power for con-ditional correlations relative to the other multivariate GARCHmodels. Out-of-sample improvements are, in most cases, sta-tistically and economically significant based on different cri-teria recently proposed in the literature including the test forsuperior predictive ability (SPA) introduced by Hansen (2005),the model confidence set approach of Hansen, Lunde, and Na-son (2003), and the economic measure of volatility and correla-tion timing ability for portfolio allocation in Engle and Colac-ito (2006) and Bandi, Russell, and Zhu (2008). These findingshighlight the importance of flexible multivariate threshold andGARCH structures for forecasting the conditional correlationof stock and bond markets.

In this article, we develop a multivariate GARCH modelfor variances and correlations, which has good forecastingpower and which can be estimated exclusively from informa-tion on multivariate returns. A completely different approachcan instead formulate a dynamic model for realized volatili-ties and correlations, using information from intraday returnswhen available, in order to produce even more accurate fore-casts; see, for example, Andersen et al. (2003) or Audrino andCorsi (2009). This empirical strategy produces very good re-sults in univariate settings. However, its extension to the multi-variate context is not straightforward because of the difficulty ofaccurately estimating realized correlations under nonsynchro-nous intraday returns. It was shown that this problem can pro-duce high efficiency losses in estimating high-dimensional real-ized variance covariance matrices; see, for example, Barndorff-Nielsen et al. (2008) or Chiriac and Voev (2009). Solving thisimportant issue is a crucial topic of ongoing research.

Section 2 presents our tree-structured GARCH-DCC modeland Section 3 describes the two-step estimation procedure thatcan be applied to estimate it. Section 4 presents our empiricalstudy of the conditional correlation dynamics of U.S. stock andbond returns. Section 5 summarizes the main results and con-cludes.

2. THE MODEL

We consider a multivariate stochastic process (Xt)t∈Z withvalues in R

d:

Xt = Dtεt, (2.1)

where Dt := diag[σ1,t, . . . , σd,t] and σi,t is the conditional stan-dard deviation of the ith component of Xt at time t − 1. (εt)t∈Z

is a zero-mean process in Rd with components having a unit

conditional standard deviation by construction. To simplify thenotation, conditional means of Xt were set to zero in Equa-tion (2.1). The conditional covariance matrix of εt at time t − 1is denoted by Rt. Therefore, we obtain the following standardfactorization of the conditional covariance matrix of Xt:

covt−1(Xt) = DtRtDt. (2.2)

Our tree-structured DCC-GARCH model parameterizes theconditional volatility matrix Dt and the conditional correla-tion matrix Rt by means of two parametric threshold functions.

Each diagonal element of Dt is modeled as a univariate tree-structured threshold GARCH(1,1)-model, as in Audrino andBühlmann (2001) and Audrino and Trojani (2006). The condi-tional correlation matrix Rt is modeled according to a thresholdDCC-type model described in more detail in the following.

2.1 Tree-Structured Model for Dt

Let Xt,j be the jth component of Xt. In principle, the thresh-olds in the volatility dynamics of Xt,j may depend on allcomponents of Xt−1. For simplicity of exposition, let us as-sume that they are functions of (Xt−1,1,Xt−1,j). Let Pj ={R1,j, . . . , Rkj,j} be a partition of the state space G := R

2 × R+

of (Xt−1,1,Xt−1,j, σ2t−1,j):

Pj = {R1,j, . . . , Rkj,j

},

kj⋃s=1

Rs,j = G, Ri,j ∩ Rs,j = ∅ (i �= s).

Given a partition cell Ri,j, we specify the local conditional vari-ance dynamics of Xt,j on Ri,j as a GARCH(1,1) model. There-fore, the threshold function σ 2

t,j takes the form

σ 2t,j =

kj∑i=1

(αij + βijX2t−1,j + γijσ

2t−1,j)

× I[(Xt−1,1,Xt−1,j,σ2t−1,j)∈Ri,j], (2.3)

where I[·] is the indicator function and θ1,j is the parameter vec-tor

θ1,j = {αij, βij, γij; i = 1, . . . , kj}.To completely specify the conditional variance function inEquation (2.3), we have to define the class of partitions Pj

that are admissible in our tree-structured model. The first as-sumption is that Pj is composed of rectangular cells Ri,j, i =1, . . . , kj, so that they can be easily parameterized by a set ofmultivariate thresholds for (Xt−1,1,Xt−1,j, σ

2t−1,j). The second

assumption is that the potential partition cells satisfy a hierar-chical structure that can be mapped one-to-one on a so-calledbinary tree by means of an iterative statistical procedure. In thisway, the multivariate threshold function in the model followsthe structure of a binary tree Tj in which every terminal noderepresents a particular regime.

In constructing the binary tree, it is first tested whether athreshold d1 can split the whole state space into two rectangu-lar partition cells representing two different volatility regimes.In this case, d1 defines the first node on the binary tree and thefirst two partition cells represent the two stems associated withthe first node. This tree-structure is preferred with respect toa model with no variance–covariance thresholds if the impliedimprovement in goodness of fit is large enough. In the secondstep, it is checked whether one of the two cells identified fromthe first step can be further split into two rectangular subcellsby an additional threshold d2. The decision on which subcellcan be further split is again based on a comparison of the im-provement in the implied goodness of fit. Threshold d2 definesa second node on the binary tree. Two subcells are associatedwith this node that define two new stems of the tree and two

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 4: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

140 Journal of Business & Economic Statistics, January 2011

further volatility subregimes. Such a procedure is iterated untila maximal tree and a maximal number of variance–covarianceregimes are obtained. See the following for additional detailson the model construction and the estimation procedure. Detailson the interpretation of binary trees in the context of volatilitymodels are provided in Audrino and Trojani (2006).

Consider, for example, a model with three regimes, i.e., twothresholds, and partitioning cells R1,j, R2,j, R3,j of the form:

R1,j = {Xt−1,j ≤ d1},R2,j = {Xt−1,j > d1 and Xt−1,1 ≤ d2},R3,j = {Xt−1,j > d1 and Xt−1,1 > d2},

where the parameters d1,d2 define the two multivariate thresh-olds in the model. In this case, R1,j is associated with aregime of low conditioning values Xt−1,j. R2,j correspondsto a regime with higher conditioning values of Xt−1,j, butlow values of Xt−1,1. Finally, R3,j implies a regime in whichboth conditioning values Xt−1,1 and Xt−1,j are large. For eachcomponent Xt,j, estimation of Equation (2.3) is achieved by ahigh-dimensional model selection problem that determines theoptimal number and the structure of the relevant thresholds (andhence the partition cells) in Pj. Details of this estimation pro-cedure for univariate tree structured GARCH(1,1) models aregiven in Audrino and Bühlmann (2001) and Audrino and Tro-jani (2006, section 2.3).

2.2 Tree-Structured Model for Rt

Given θ1 = {θ1,j : j = 1, . . . ,d}, let

εt = Dt(θ1)−1Xt

and

Rt = corrt−1(Xt) = covt−1(εt).

We model Rt by means of a tree-structured model in which con-ditional correlations satisfy an Engle (2002)-type local DCCmodel across several multivariate thresholds. In order to keepthe model tractable, we assume that thresholds in the Rt dy-namics depend on εt−1 only via the average

ρt−1 = 1

d(d − 1)

∑u�=v

εt−1,uεt−1,v,

of the cross products of the component of εt−1. Intuitively, thischoice allows us to account for asymmetric effects in condi-tional correlations as a function of particular lagged process re-alizations Xt−1 and specific movements in average lagged con-ditional correlation shocks ρt−1.

To define the parametric threshold function Rt in our model,let P = {R1, . . . , Rw} be a partition of the state space G :=R

d+1 of (Xt−1, ρt−1). We consider the following family offunctional forms for Rt

Rt =w∑

i=1

ciRitI{(Xt−1,ρt−1)∈Ri}

+(

1 −w∑

i=1

ciI[(Xt−1,ρt−1)∈Ri]

)Id, (2.4)

where c1, . . . , cw ∈ [0,1], Id is the d-dimensional identity ma-trix and the parametric processes for Rit, i = 1, . . . ,w, are givenby

Rit = diag[Qit]−1/2Qit diag[Qit]−1/2, (2.5)

with

Qit = (1 − φi − λi)Q + φiεt−1ε′t−1 + λiQit−1, (2.6)

parameters φi, λi ≥ 0 such that φi + λi < 1 for all i = 1, . . . ,w,and Q is, as in the classical Engle (2002) DCC model, the un-conditional covariance matrix of the residuals εt. Given a fixedpartition P , the parameter vector

θ2 = {ci, φi, λi,vech(Q); i = 1, . . . ,w}, (2.7)

completely parameterizes the threshold function defining theconditional correlation function in Equation (2.4).

Since for any i = 1, . . . ,w, the local model for Qit satisfies anEngle (2002) DCC-type dynamics, positive definiteness of theresulting threshold model for Rt is easily implied by the modelstructure under the above conditions on the model parameters.When P = {G}, i.e., the partition is trivial, we obtain the Engle(2002) DCC model by setting c1 = · · · = cw = 1. Therefore,this model is nested in our model. Moreover, by setting φi =λi = 0 for i = 1, . . . ,w, we can write Rt as

Rt =w∑

i=1

ciRI[(Xt−1,ρt−1)∈Ri]

+(

1 −w∑

i=1

ciI[(Xt−1,ρt−1)∈Ri]

)Id, (2.8)

where R is a fixed d-dimensional correlation matrix. In thiscase, we obtain a piecewise constant correlation matrix definedby a multivariate threshold function over the partition P . Incontrast to the RSDC model in Pelletier (2006), this particularsubcase of our model can account for a flexible description ofmultiple multivariate regimes in correlations because the num-ber and the structure of the regimes in the estimated model doesnot have to be fixed from the beginning. Finally, when φi > 0or λi > 0 for i = 1, . . . ,w and P is not a trivial partition, by set-ting c1 = · · · = cw = 1 we obtain a tree-structured DCC modellocally satisfying Engle’s DCC dynamics over the distinct par-titioning cells Ri.

As for the univariate tree-structured volatility dynamics ofthe last section, we need to define the class of admissible parti-tions P for our correlation function. Again, the only restrictionwe put on P is that it is composed of rectangular partition cells.Consistent with our assumptions, these partition cells are delim-ited by a set of multivariate thresholds for (Xt−1, ρt−1). In orderto construct such rectangular partition cells, we make use of abinary tree in which every terminal node represents a cell Ri.Estimation of the threshold function in the correlation dynam-ics in Equation (2.4) is achieved by a high-dimensional modelselection procedure that determines the optimal number and thestructure of the relevant thresholds in the underlying partition.This model selection scheme is not computationally feasible ifapplied directly to the multivariate time series (εtε

′t)t∈Z. A nat-

ural way to reduce estimation complexity is to notice that thepartition P is identical to the one implied by a corresponding

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 5: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

Audrino and Trojani: General Multivariate Threshold GARCH Model 141

tree-structured univariate model for the time series (ρt)t∈Z. In-deed, since Rt = Et−1(εtε

′t) it follows:

Et−1(ρt) = 1

d(d − 1)

∑u�=v

Ruvt

=w∑

i=1

ci

(1

d(d − 1)

∑u�=v

Ruvit

)I[(Xt−1,ρt−1)∈Ri], (2.9)

where Ruvt (Ruv

it ) denotes the uvth component of the matrix Rt(Rit). Therefore, the tree-structured model

ρt = Et−1(ρt) + ηt, (2.10)

where (ηt)t∈Z is a martingale difference process and Et−1(ρt)

is given by Equation (2.9), defines a univariate tree-structuredprocess for ρt based on the same partition P as in the correla-tion dynamics in Equation (2.4). It follows that we can exploitthe univariate model in Equation (2.10) to estimate the thresh-old structure in Equation (2.4). In particular, we can develop amodel selection procedure for selecting the optimal thresholdstructure in the correlation dynamics. The simplest dynamicsarise in the piecewise constant case

Et−1(ρt) =w∑

i=1

ci

(1

d(d − 1)

∑u�=v

Ruv

)I[(Xt−1,ρt−1)∈Ri], (2.11)

where Ruv

is the uv component of the correlation matrix R in thepiecewise-constant dynamics in Equation (2.8). This piecewise-constant function is the optimal one that was estimated in ourapplications to the U.S. equity market in Section 4.1. Moregenerally, for c1 = · · · = cw = 1 and λi, φi > 0, i = 1, . . . ,w,we can also encompass the univariate dynamics of ρt that areconsistent with a tree-structured DCC model of the form inEquation (2.4) for correlations. This threshold structure is theone we estimate in our application of Section 4.2 where wemodel the correlation between Treasury bond and stock returns.Model selection across this class of potential threshold func-tions for Et−1[ρt] is performed using the BIC information cri-terion. Once the partition P in Equation (2.10) is estimatedthe parameter in Equation (2.7) of the multivariate correla-tion dynamics can be estimated using a multivariate conditionalpseudo-likelihood for εt in which the selected partition P isheld fixed. The next section provides additional details on theestimation procedure used to estimate our tree-structured DCCmodel.

3. ESTIMATION OF THE TREE–STRUCTUREDDCC MODEL

Estimation of our tree-structured model is accomplishedin two steps. In the first step, an estimate of the volatil-ity process Dt is obtained by performing d estimations ofthe univariate tree-structured conditional volatility dynamicsσt,1(θ1,1), . . . , σt,d(θ1,d) implied by the specification in Equa-tion (2.3). The resulting point estimate Dt := Dt (θ1) is used tocompute the estimated scaled residuals

εt := D−1t Xt. (3.1)

The scaled residuals εt are used in the second step of our proce-dure to estimate the tree-structured conditional correlation dy-namics in Equation (2.4).

3.1 Estimation of Tree-Structured UnivariateGARCH-Dynamics

Estimation of the d tree-structured univariate volatility func-tions in Equation (2.3) is achieved by a high-dimensional modelselection problem that determines the optimal structure of therelevant thresholds in any partition Pj of the univariate volatil-ity dynamics in Equation (2.3), j = 1, . . . ,d.

In the first step, the largest univariate tree-structured GARCHmodel is estimated for any j = 1, . . . ,d, given a fixed maxi-mal number Mj of possible thresholds in Equation (2.3). Thisfirst step delivers a maximal possible partition P max

j of the rele-vant state space G in the univariate volatility dynamics in Equa-tion (2.3).

In the second step, a tree-structured model selection proce-dure for nonnested models is applied that selects the optimalsubpartition Pj ⊂ P max

j out of the maximal one. Model se-lection is performed according to the BIC information crite-rion implied by a conditionally Gaussian log-likelihood for anyprocess coordinate Xt,j, j = 1, . . . ,d. The resulting optimal tree-structured volatility model minimizes the BIC information cri-terion across all tree-structured subpartitions of P max

j . The com-plete algorithm used to estimate univariate tree-structuredGARCH(1,1) models is given in Audrino and Bühlmann(2001) and Audrino and Trojani (2006).

The construction of the largest partition P maxj proceeds as

follows: We first fix a maximal number Mj + 1 of partition cellsin the tree. Because of the tree-structured construction of P max

j ,this first step implies a maximal number Mj + 1 of conditionalvolatility regimes (i.e., the number of terminal nodes in the bi-nary tree). A parsimonious specification of the maximal num-ber Mj of thresholds ensures a statistically and computationallytractable model dimension. Moreover, it avoids (over) fittingan overly flexible model dynamics, which will result in a poorout-of-sample forecasting power. For any coordinate axis of themultivariate state space that has to be split, we search for mul-tivariate thresholds over grid points that are empirical α quan-tiles of the data along the relevant coordinate axis. We fix theempirical quantiles as α = i/mesh, i = 1, . . . ,mesh − 1, wheremesh determines the fineness of the grid on which we search formultivariate thresholds. We choose mesh = 8 because, as wasshown in different empirical studies in the literature, this valueleads to reliable forecasting results. The partition of the statespace G = R

d × R+ into a maximal number of Mj + 1 cells

is performed as follows. A first threshold d1 ∈ R or R+ in one

coordinate indexed by a component index ι1 ∈ {1,2, . . . ,d + 1}partitions G as

G = Rleft ∪ Rright,

where Rleft = {(Xt−1, σ2t−1) ∈ R

d × R+; (Xt−1, σ

2t−1)ι1 ≤ d1}

and (Xt−1, σ2t−1)ι1 denotes the ι1 component of the tuple

(Xt−1, σ2t−1). Rright is defined analogously using the relation

“>” instead of “≤.” In the second step, one of the partitioncells Rleft, Rright is further partitioned with a second thresh-old d2 and a second component index ι2, in the same way aspreviously. We then iterate this procedure. For the mth itera-tion step, we specify a new pair (dm, ιm), which defines a newthreshold dm for the coordinate indexed by ιm and an existingpartition cell that is going to be split into two subcells. For a

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 6: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

142 Journal of Business & Economic Statistics, January 2011

new pair (d, ι) ∈ R × {1, . . . ,d + 1} refinement of an existingpartition P (old) is obtained by picking Rj∗ ∈ P (old) and splittingit as

Rj∗ = Rj∗,left ∪ Rj∗,right. (3.2)

This procedure produces a new (finer) partition of G, given by

P (new) = {Rj, Rj∗,left, Rj∗,right, j �= j∗}. (3.3)

In this partition, the tuple (d, ι) describes a threshold d anda component index ι such that Rj∗,left = {(Xt−1, σ

2t−1) ∈ Rj∗ ;

(Xt−1, σ2t−1)ι ≤ d}. Rj∗,right is defined analogously, with the

relation “>” instead of “≤.” The whole procedure finally de-termines a partition P max

j = {R1,j, . . . , RMj+1,j}. This partitioncan be represented and summarized by a binary tree in whichevery terminal node represents a partition cell of P max

j . To se-lect the specific threshold and component index (d, ι) in each it-eration step of the above procedure we optimize the correspond-ing conditional negative (pseudo) log-likelihood in the model.

3.2 Estimation of Tree-Structured DCC-Dynamics

In the first step, we estimate the optimal partition P using thetree-structured model in Equation (2.10) for ρt and the scaledestimated residuals εt. In the second step, we fix the partitionP estimated for the univariate model in Equation (2.10), andestimate the parameter θ2 in Equation (2.7) by a multivariatepseudo-maximum likelihood estimator.

(i) Estimation of the univariate tree-structured model inEquation (2.10). Let

ρt =∑u�=v

εt−1,uεt−1,v/[d(d − 1)]. (3.4)

The following tree–structured model for ρt is estimated; com-pare with Equation (2.10)

ρt = Et−1(ρt) + ηt, (3.5)

where (ηt)t∈Z is a martingale difference sequence and

Et−1(ρt) =w∑

i=1

ci

(1

d(d − 1)

∑u�=v

Ruvit

)I[(Xt−1 ,ρt−1)∈Ri]. (3.6)

In this equation, Rit denotes for i = 1, . . . ,w a constant corre-lation matrix when the tree-structured model for correlationsimplies a piecewise-constant correlation matrix. It then fol-lows in this case that the conditional mean of ρt is simplya piecewise-constant threshold function. More generally, forc1 = · · · = cw = 1 and φi > 0 or λi > 0, where i = 1, . . . ,w,the local conditional correlation matrix Rit is simply defined inthe same way as Rit in Equations (2.5) and (2.6), but with εt−1replacing εt−1 in Equation (2.6).

We apply to the series ρt the same estimation proceduregiven in the last section for individual conditional variances.First, we estimate a largest univariate tree-structured model forρt, given a fixed maximal number M of possible thresholds inEquation (2.10). In all our empirical applications, we fix themaximal number of candidate thresholds in Equation (2.10)at M = 4. A tree-structured model selection procedure for

nonnested models is then applied that selects the optimal sub-partition P out of the maximal one. Model selection is per-formed according to the BIC criterion implied by a condition-ally Gaussian pseudo log-likelihood for ρt; see again Audrinoand Trojani (2006, section 2.3), for details on this estima-tion procedure. In our empirical study, we find that this pro-cedure offers a simple and effective way to reduce the com-putational costs implied by the estimation of our multivariatetree-structured model. In particular, in the applications of Sec-tion 4 a piecewise-constant conditional correlation function isestimated for equity returns. However, local DCC-type struc-tures are found to better model the conditional correlations be-tween equity and bond returns.

(ii) Estimation of the tree-structured conditional correlationfunction Rt. In the second step of our estimation procedure, wefix the partition P estimated in step (i) and we estimate the pa-rameter vector θ2 in Equation (2.7) by a pseudo-maximum like-lihood estimator θ2 for θ2, under a Gaussian multivariate condi-tional pseudo-likelihood for εt. If in step (i) the optimal thresh-old function does not imply piecewise-constant correlations, weestimate the matrix Q in the dynamics in Equation (2.6) by do-ing correlation targeting, as proposed by Engle and Sheppard(2001) and Pelletier (2006). If in step (i) a piecewise-constantcorrelation structure is selected, in the second step we estimatea piecewise constant correlation process of the form in Equa-tion (2.8). In such a case, we estimate the constant matrix R bydoing correlation targeting in a rolling window of one year ofdata. The piecewise-constant correlation structure significantlyreduces the number of parameters over which the likelihoodfunction has to be maximized.

3.3 Consistency

Proofs of the consistency of our model selection procedureare very difficult to obtain for the case where the true model is inthe class of tree-structured GARCH-DCC models. Analogouslyto the standard classification and regression trees (CART) pro-cedure introduced by Breiman et al. (1984), it is possible toprove theorems that study the behavior of the prevailing para-meter estimators when growing the tree. However, such resultsdo not imply model selection consistency. Furthermore, it isquite unlikely that the “correct” generating process in our ex-ample and other similar ones with real data is indeed exactly atree-structured model for volatilities and correlations. For thisreason, it is more important to prove consistency of the estima-tors for the parameters of a tree-structured model under a modelmisspecification than it is to prove consistency of the model se-lection strategy under the assumption of a correctly specified,tree-structured model. Such consistency results can be found inAudrino and Bühlmann (2001). Based on these results, consis-tency of the two-step estimates (θ1, θ2) in the tree-structuredDCC-GARCH model under a possible model misspecificationcan be derived in the standard way under mild regularity con-ditions; see, for instance, Newey and McFadden (1994). More-over, efficient estimates can be obtained by performing a furtherone-step Newton–Raphson estimation of the full likelihood, us-ing as starting values the parameter estimates obtained from thetwo-step procedure (see, e.g., Pagan 1986).

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 7: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

Audrino and Trojani: General Multivariate Threshold GARCH Model 143

4. RESULTS

In this section we test the in-sample and out-of-sample ex-planatory power of our tree-structured GARCH-DCC(TreeDCC) model in two different applications involving theeconometric analysis of U.S. stock and bond returns. We com-pare our model with several multivariate GARCH models thatwere recently proposed in the literature. Some of these modelsare nested in ours and can be estimated by a two-step estimationprocedure:

• The CCC-GARCH model, as proposed by Bollerslev(1990); this model is nested in our model.

• The DCC-GARCH model, as proposed by Engle (2002);this model is nested in our model.

• The RSDC-GARCH model with switching regimes inconditional correlations, as proposed in Pelletier (2006).This model is not formally nested in ours.

Since the individual volatility processes are estimated sep-arately from the correlation dynamics in these models, inour empirical study we can easily focus on the additionalexplanatory power for conditional correlations, which is themain topic of this article. To achieve this goal, we estimatevolatility processes identical to those in our tree-structuredDCC-GARCH setting. The volatility processes are all speci-fied as univariate tree-structured GARCH(1,1) processes. Wealso study the performance of our model relative to the flexiblemultivariate GARCH setting in Ledoit, Santa-Clara, and Wolf(2003), which does not include thresholds in volatilities or cor-relations and is based on a more general correlation dynamicsthan the one implied by the Engle (2002) DCC model. There-fore, this model is not nested in our setting. Flexible multivari-ate GARCH models were shown by Ledoit, Santa-Clara, andWolf (2003) to describe the dynamics of stock returns quite ac-curately. Therefore, they are further natural competitors of ourapproach, especially in applications that study the multivariatedynamics of stock markets, as in our first empirical example.

To quantify and compare the in-sample and out-of-sample fitof the different models, we compute several goodness-of-fit sta-tistics for conditional covariances. Since the individual volatil-ity processes are identical for all but the flexible multivariateGARCH model, this comparison allows us to investigate theadditional explanatory power of our model for explaining thecorrelation dynamics. We consider the following goodness-of-fit measures:

• The multivariate negative log-likelihood statistic (NL),• The multivariate version of the classical mean absolute er-

ror statistic (MAE), and• The multivariate version of the classical mean squared er-

ror statistic (MSE).

The last two performance measures require the specification ofsensible values for the unknown true conditional covariancematrix. A powerful way of computing good proxies for thismatrix is by means of the so-called realized covariance ap-proach, which is the natural multivariate version of the real-ized volatility approach proposed, among others, by Andersenet al. (2001, 2003) and Barndorff-Nielsen and Shephard (2001,2002). We follow this approach in our two real data applica-tions, in which we collect tick-by-tick return data to compute

the realized covariance between returns using the methodologyproposed in Corsi and Audrino (2007). Using such an accu-rate proxy for the unobservable conditional covariance matrixallows us to avoid possible misleading results implied by an un-fortunate choice of the loss function used to quantify the good-ness of fit, and therefore, a wrong ranking of the different mod-els under investigation; see Laurent, Rombouts, and Violante(2009) for more details. All estimated models also include alinear autoregressive conditional mean function modeled by asimple diagonal VAR(1) process.

The different statistics used to quantify the in-sample andout-of-sample goodness of fit in our empirical analysis are de-fined as follows (IS denotes in-sample and OS denotes out-of-sample):

IS-NL: −log-likelihood(Xn1; θ1, θ2),

OS-NL: −log-likelihood(Ynout1 ; θ1, θ2),

IS-MAE:1

d2

d∑i,j=1

1

n

n∑t=1

|vt,ij − vt,ij|,

OS-MAE:1

d2

d∑i,j=1

1

nout

nout∑t=1

|vt,ij − vt,ij(Yt−11 )|,

IS-MSE:1

d2

d∑i,j=1

1

n

n∑t=1

|vt,ij − vt,ij|2,

OS-MSE:1

d2

d∑i,j=1

1

nout

nout∑t=1

|vt,ij − vt,ij(Yt−11 )|2,

where in the OS performance measures the expressionvt,ij(Y

t−11 ) is the ijth covariance prediction implied by our

out-of-sample data Ynout1 = {Y1, . . . ,Ynout} at time t under the

parameter estimates obtained from the in-sample data Xn1 =

{X1, . . . ,Xn}. vt,ij is the realized covariance between the returnseries i and j at time t. In all cases, a lower goodness-of-fitmeasure indicates a higher forecasting power of a model forconditional correlations.

To evaluate whether differences in performance among themodels considered are statistically and/or economically signif-icant, we perform a series of tests recently proposed in the lit-erature. We investigate the statistical relevance of the improve-ments in forecasting accuracy of the TreeDCC model relativeto the other approaches:

• We perform the superior predictive ability (SPA) test intro-duced by Hansen (2005). This allows us to verify whethereach of the models considered is significantly outper-formed by one (or more) of the alternatives.

• We construct a model confidence set (MCS) at the 5% and10% confidence levels as proposed by Hansen, Lunde, andNason (2003). We introduce the MCS approach with thegoal of characterizing the best subset of models out of aset of competing ones; see the Appendix for more detailsabout the MCS construction.

We also evaluate the alternative model specifications using aneconomic criterion. We apply the methodology suggested byWest, Edison, and Cho (1993) and Fleming, Kirby, and Ostdiek

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 8: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

144 Journal of Business & Economic Statistics, January 2011

(2001, 2003) to quantify the economic benefit of different cor-relation forecasts in the context of a portfolio strategy based onvolatility timing. Similarly to Engle and Colacito (2006) andBandi, Russell, and Zhu (2008), we employ the variance com-ponent of an investor’s long-run mean–variance utility as a met-ric to quantify the economic differences between the alternativecorrelation (covariance) forecasts

AU = λ

2

1

nout

nout∑t=1

(Rpt − R

p)2,

where the portfolio return at time t is given by

Rpt = Rf + w′

t−1(Yt − Rf Id), t = 1, . . . ,nout.

Rp

is the sample mean of the portfolio returns across the out-of-sample period and λ is a coefficient of risk aversion. Asin Bandi, Russell, and Zhu (2008), we use three values ofλ = 2,7,10. In the computation of the portfolio returns, Rf

is the risk-free rate, Id is a d × 1 unit vector, and wt−1 is ad vector of portfolio weights obtained by solving the classi-cal mean–variance optimization problem at time t − 1, givena fixed target return on the portfolio. In our real data applica-tions, we set Rf equal to the average value of the U.S. three-month rate in the out-of-sample period. In the mean–varianceoptimization, we use the one-step-ahead conditional covarianceforecasts vt,ij(Y

t−11 ) obtained from the alternative multivari-

ate GARCH models under investigation. We can interpret thedifference between the quantity AU implied by the TreeDCCmodel and each other model as the fee that an investor will bewilling to pay to switch from correlation forecasts generated byeach alternative model to those of the TreeDCC model. We alsouse the generalization of the Diebold–Mariano test for pairwiseequal predictive ability (EPA) and the joint test introduced inEngle and Colacito (2006) to study the statistical significanceof the differences in estimated economic gains. For details, seeagain Engle and Colacito (2006) and Bandi, Russell, and Zhu(2008).

4.1 First Real Data Application: U.S. Equity Returns

We consider a multivariate time series of (annualized) dailylog-returns for 10 U.S. stocks: Alcoa, Citigroup, Hasbro, HarleyDavidson, Intel, Microsoft, Nike, Pfizer, Tektronix, and Exxon.Data are for the sample period between January 2, 2001 and De-cember 30, 2005, amounting to 1256 trading days. The sourceof the data is Tick Data, a division of Nexa Technologies,Inc. (see the webpage http://www.tickdata.com). Using thesetick-by-tick data, we construct realized covariances using themethod in Corsi and Audrino (2007) and obtain the quantitiesvt,ij needed to compute our goodness-of-fit measures.

We split the sample into two subperiods. The first one con-sists of n = 752 trading days, from January 2, 2001 to Decem-ber 31, 2003. Data from this subperiod are used for in-sampleestimation and performance evaluation. The second subperiodconsists of the remaining nout = 504 observations, up to De-cember 30, 2005, and is used for out-of-sample performanceevaluation.

We focus on differences in goodness of fit implied by theconditional correlation matrix dynamics under the differentmodel settings. We estimate our model in two steps as follows.

First, we separately estimate the univariate conditional volatil-ity dynamics for each single return series and include as possi-ble conditioning variables in the threshold definition (i) its es-timated conditional volatility and (ii) the first lag of all compo-nents in the multivariate return series. This threshold volatilitystructure proved to produce good empirical results in applica-tions of tree-structured GARCH models to financial data; see,for example, Audrino and Trojani (2006). This first step of theestimation procedure is kept identical for all models in whichvolatilities can be estimated separately from correlations: theCCC, the DCC, the RSDC, and our tree-structured DCC model.In this way, we ensure that differences in the goodness of fit ofthese models with respect to the estimated conditional covari-ance matrix dynamics are exclusively due to differences in theexplanatory power with respect to conditional correlations. Inthe second step of our estimation procedure, we estimate pos-sible tree-structured thresholds and GARCH-type dynamics inconditional correlations. We include as possible conditioningvariables for the definition of the threshold structure of con-ditional correlations (i) the first lag of the average conditionalcorrelation shocks across returns and (ii) the first lag of all com-ponents of our multivariate return series; see again Section 2.2for details.

4.1.1 Estimation Results. The estimation results of ourTreeDCC-GARCH model for the 10-dimensional time series ofU.S. stock returns introduced earlier are summarized in Table 1.

Table 1, Panel A, highlights that, at most, two regimes arenecessary to model the individual conditional variance dynam-ics accurately. The most important predictor variables impact-ing on the corresponding threshold structures are the lagged re-turns of Microsoft and Harley Davidson. Microsoft and HarleyDavidson are the largest stocks in our empirical example. Thus,the apparent influence of their lagged returns on estimated vari-ance covariance regimes is likely to proxy for the latent impactof the aggregate market return on the multivariate variance co-variance dynamics. The structure of the estimated conditionalcorrelation dynamics in our model is summarized in Panel Bof Table 1. Similar to volatilities, the most important and sta-tistically significant predictor variable impacting on the thresh-old structure of conditional correlations is the lagged return ofHarley Davidson. Moreover, the complete threshold structureof conditional correlations is characterized using only two fur-ther lagged stock returns, the returns of Alcoa and Intel (in de-scending order of statistical significance), implying four cor-relation regimes overall. The first regime is associated withsimultaneously low (i.e., under the threshold values) laggedHarley Davidson and Alcoa returns. The second one arises forlagged, low Harley Davidson returns and for large Alcoa re-turns. The third regime is obtained for lagged low Intel returnsand large Harley Davidson returns. Finally, the fourth regimeis caused by contemporarily somewhat higher returns of HarleyDavidson and Intel. An important difference between the esti-mated volatility and correlation dynamics is that the local cor-relation dynamics never exhibit GARCH-type effects across thedifferent correlation regimes. In other words, conditional corre-lations are regime dependent but piecewise constant. The localaverage correlation levels in the different regimes are similarand vary from 0.897 to 0.962. However, we find that the BICcriterion increases significantly in all cases when incorporating

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 9: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

Audrino and Trojani: General Multivariate Threshold GARCH Model 145

Table 1. Estimation results for a multivariate time series of 10 daily(annualized) U.S. stock returns (in %)

Panel A: Individual conditional variance structures

Series Regimes Optimal predictors

Alcoa 1 —Citigroup 2 MicrosoftHasbro 2 Harley DavidsonHarley Davidson 2 Harley DavidsonIntel 1 —Microsoft 1 —Nike 2 ExxonPfizer 2 MicrosoftTektronix 2 Harley DavidsonExxon 1 —

Panel B: Conditional correlation structure and parameters

Cond. corr. structure Cond. corr. parametersRk ck

Xt−1,Harley Davidson ≤ −19.983 and 0.918Xt−1,Alcoa ≤ −3.553 (0.029)

Xt−1,Harley Davidson ≤ −19.983 and 0.897Xt−1,Alcoa > −3.553 (0.056)

Xt−1,Harley Davidson > −19.983 and 0.962Xt−1,Intel ≤ −15.016 (0.016)

Xt−1,Harley Davidson > −19.983 and 0.935Xt−1,Intel > −15.016 (0.023)

NOTE: Data are for the in-sample time period between January 2, 2001 and De-cember 31, 2003, consisting of 752 observations. Estimated individual conditional vari-ance structures (Panel A) and estimated conditional correlation structure and parameters(Panel B) are for the tree-structured GARCH-DCC model fit. Standard errors computedusing 1000 model-based bootstrap replications are given in parentheses.

additional correlation regimes into the model. This finding isalso supported by our out-of-sample tests on the model’s fore-casting power for correlations, which further indicate a clearsuperiority of our model over a CCC model with constant cor-relations.

4.1.2 Multivariate Performance Results. We now com-pare the accuracy of the conditional correlation predictions

implied by our model with those implied by the CCC, theDCC, the RSDC, and the flexible multivariate GARCH model.Since all models have the same individual volatility dynamics,with the sole exception being the flexible multivariate GARCHmodel, any difference in the goodness of fit for the forecasts ofthe return covariance matrix is due to a difference in the qual-ity of the forecasts for conditional correlations. In Table 2, wepresent the goodness-of-fit measures defined in Section 4 forthe real data application to our 10-dimensional stock return se-ries.

Our multivariate tree-structured model achieves the highestgoodness of fit overall across the models with identical volatil-ity dynamics (TreeDCC, CCC, DCC, and RSDC models).The RSDC model achieves a better in-sample log-likelihoodcriterion, but it implies worse MAE and MSE criteria both in-sample and out-of-sample. Since the RSDC model has morethan twice the number of parameters of the other models, weinterpret this finding as evidence of over-fitting for this partic-ular model. The TreeDCC model’s better performance relativeto CCC, DCC, and RSDC ranges between 0.5% and 17% de-pending on the goodness-of-fit measure applied. In compari-son with the flexible multivariate GARCH model, we find thatthe out-of-sample forecasting performance of the latter spec-ification is slightly better in two out of three cases, despitethis model’s having more than double the number of parame-ters of ours. The heavy parameterization of the flexible mul-tivariate GARCH model can make it impracticable for set-tings that include dozens to hundreds of individual time series.Our TreeDCC model can produce flexible conditional covari-ance specifications combined with a sufficient degree of parsi-mony. This is why the TreeDCC model, like the CCC, DCC,and RSDC models, can be used to estimate the conditionalvariance–covariance dynamics of very high-dimensional time-series settings.

4.1.3 Statistical and Economic Significance of the Improve-ments. In this section, we provide additional evidence of thestatistical and economic significance of the goodness-of-fit im-provements provided by our model. We focus first on the sta-tistical significance of improvements in conditional variance–covariance forecasts. To this end, we first compute Hansen

Table 2. Goodness of fit of different models for a multivariate time series of 10 daily (annualized) U.S. stock returns (in %)

U.S. equity returns: Goodness-of-fit results

Model # par. NL IS-MAE MSE NL OS-MAE MSE

CCC-GARCH 80 34,959 269.413 176,147 21,694 105.470 73,205.9(0.0002) (0) (0)

DCC-GARCH 82 34,942 269.525 177,848 21,652 103.259 72,731.0(0.008) (0) (0)

RSDC-GARCH 173 34,684 276.604 182,999 21,634 105.470 74,520.1(0.2203) (0) (0)

TreeDCC-GARCH 84 34,927 241.830 152,813 21,594 86.0280 68,831.5(0.857) (0.124) (0.453)

F-MGARCH 185 34,378 188.850 135,608 21,588 79.6050 69,832.7(0.603) (0.509) (0.0826)

NOTE: Data are for the time period between January 2, 2001 and December 30, 2005, for a total of 1256 observations. The in-sample estimation period goes from the beginning ofthe sample to the end of 2003 (752 observations). NL, MAE, and MSE are multivariate versions of the standard univariate negative log-likelihood, the mean absolute error, and the meansquared error statistics. # par. reports the number of parameters estimated by the different models. For the out-of-sample performance measures, p-values of superior predictive ability(SPA) tests are reported in parentheses.

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 10: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

146 Journal of Business & Economic Statistics, January 2011

(2005) tests of superior predictive ability. In Table 2, p-values ofthese tests are reported for each model and each out-of-sampleperformance measure under investigation.

Consistent with our previous results, the TreeDCC and flex-ible multivariate GARCH models are the only ones not signifi-cantly outperformed by any other model at the 5% significancelevel based on the MAE and MSE statistics. With respect to thenegative log-likelihood statistic, the RDSC model is also notsignificantly outperformed by any other model at the 20% sig-nificance level. In this case, however, the p-values implied byTreeDCC and flexible GARCH models are even higher (above85% and 60%, respectively). To formally characterize the setof models not significantly outperformed by other ones, wealso computed Hansen, Lunde, and Nason (2003) model con-fidence sets (MCS). Consistent with the previous findings forSPA tests, we find that the 5% MCS based on the OS-MAE andOS-MSE criteria using the range and semiquadratic statisticsconsist solely of the TreeDCC and flexible MGARCH mod-els. Ten percent MCS based on the OS-NL criterion include theTreeDCC, flexible MGARCH and RSDC models.

We now economically quantify the improvements impliedby the different variance–covariance forecasts and compute thedifference between the variance component of the average util-ity AU implied by TreeDCC model and the one implied by eachof the other models. Table 3 presents results for three levels ofrisk aversion parameters λ = 2,7,10 and three target expectedreturns (6%, 10%, and 15%) for the mean variance optimalportfolio. The resulting difference between average utilities isshown as an annualized fee and the (annualized) risk-free rateis 3%.

The economic gains relative to CCC and DCC models rangefrom about 1 to 180 basis points. The economically most im-portant gains arise for highly risk-averse investors (λ = 10)and high target expected returns of 15%. Economically, very

Table 3. U.S. equity returns: Annualized fees (in basis points)

Target Alternative model λ = 2 λ = 7 λ = 10

6% CCC-GARCH 2.24 7.87 11.24DCC-GARCH 1.97 6.88 9.83RSDC-GARCH 0.66 2.31 3.31F-MGARCH 0.94 3.29 4.73

10% CCC-GARCH 12.24 42.85 61.22DCC-GARCH 10.70 37.46 53.52RSDC-GARCH 3.60 12.59 17.99F-MGARCH 4.37 15.29 21.85

15% CCC-GARCH 35.98 125.92 179.88DCC-GARCH 31.46 110.09 157.28RSDC-GARCH 10.58 37.02 52.88F-MGARCH 16.17 56.60 80.85

NOTE: The table contains the annualized fees (in basis points) that a conditional mean-variance investor with absolute risk-aversion parameter λ = 2, 7, and 10 will be willingto pay to perform volatility timing using the one-step-ahead conditional correlation andcovariance forecasts from the TreeDCC model (benchmark model) versus those obtainedusing the CCC, DCC, RSDC, and the flexible MGARCH models. The portfolio weightsare obtained by minimizing the one-step-ahead conditional variance forecast of a portfoliocontaining 10 U.S. stocks and the risk-free asset for a given target expected return on theportfolio. The annual risk-free rate is set to be equal to 3%. Data are for the out-of-sampletime period beginning in January 2004 and ending in December 2005, for a total of 504daily observations.

small gains arise for low risk aversion (λ = 2) and a targetexpected return of 6%. Consistent with our previous findings,economic gains are clearly smaller relative to RSDC and flex-ible MGARCH models. In this case, for a high risk aversionλ = 10 and a target expected return of 15%, they are 52 and 81basis points, respectively. We can use a Diebold–Mariano-typetest and the joint test proposed in Engle and Colacito (2006) totest the statistical significance of these differences. Similarly toBandi, Russell, and Zhu (2008), we find that these differencesare not significant at the 5% significance level.

Overall, these results provide evidence that economically andstatistically significant differences in variance–covariance fore-casts of our TreeDCC model are likely to arise with respect tothe CCC and DCC models, whereas with respect to the RSDCand flexible GARCH models, such differences are less likely.

4.1.4 Sensitivity Analysis. To investigate the robustnessof our findings, it is useful to study the sensitivity of resultsto moderate changes in the structure of the estimated TreeDCCmodel. We perform this task along several dimensions.

First, we investigate whether moderate changes in the es-timated location and threshold parameters of the conditionalcorrelation process imply significantly different out-of-sampleperformances given a fixed tree-structure. Overall, the result-ing effect on the out-of-sample performance relative to the esti-mated TreeDCC model is economically small. The largest im-pact arises by modifying the location parameter c4 in the fourthregime (of ±1 standard error) or the first threshold parameter d1

(to the nearest possible threshold value in the positive and neg-ative directions). However, the changes in forecasting accuracyare typically smaller than the changes in MAE and MSE ob-served for the CCC, DCC, RDSC, and flexible GARCH mod-els.

Second, we investigate whether moderate changes in the tree-structure of the estimated TreeDCC model imply significantlydifferent out-of-sample performances. We consider the second,third, and fourth best (in-sample) TreeDCC models estimatedaccording to the procedure described in Section 3. Once again,the good forecasting performance of our TreeDCC modelingapproach relative to other models seems to be quite robustacross various similar choices of the parameters and thresholdstructure in the model. More detailed results of this sensitivityexercise are available from the authors upon request.

4.2 Second Real Data Application: U.S. Stock Indexand Bond Returns

We consider a two-dimensional time series of (annualized)daily log-returns for the U.S. S&P500 stock index and the U.S.30-year Treasury bond. The time period under investigationgoes from January 3, 1996 to October 30, 2003 and contains1899 trading days. The data are provided by Tick Data. As inthe previous section, we exploit the tick-by-tick data to con-struct the series of realized volatilities and covariances betweenstock index and bond returns. As before, for forecasting evalu-ation purposes we split the sample in two subperiods. The firstsubperiod consists of n = 1219 trading days and goes from Jan-uary 3, 1996 to December 29, 2000. The second subperiod con-sists of the last three years of data (nout = 680 observations).

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 11: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

Audrino and Trojani: General Multivariate Threshold GARCH Model 147

Table 4. Estimation results for a two-dimensional time series of daily(annualized) returns (in %) for the U.S. S&P500 index and

the U.S. 30-year Treasury bond

Panel A: Individual conditional variance structures

Series Regimes Optimal predictors

S&P500 3 S&P50030-year Treasury bond 2 30-year Treasury bond

Panel B: Conditional correlation structure and parameters

Cond. corr. structureRk

Cond. corr. parameters

φk λk

Xt−1,S&P500 ≤ −3.847098 0.0490 0.9129(0.001) (0.024)

Xt−1,S&P500 > −3.847098 0.0222 0.9724(0.002) (0.019)

NOTE: Data are for the in-sample time period between January 3, 1996 and Decem-ber 29, 2000, consisting of 1219 observations. Estimated individual conditional vari-ance structures (Panel A) and estimated conditional correlation structure and parameters(Panel B) are for the tree-structured GARCH-DCC model fit. Standard errors computedusing 1000 model-based bootstrap replications are given in parentheses.

4.2.1 Estimation Results. The estimation procedure fol-lows the same steps as the one described in Section 4.1. The in-dividual variance structures and correlation threshold functionsestimated for our TreeDCC model are summarized in Table 4.

The estimated threshold functions for volatility each dependonly on one lagged return of each time series of returns. Sim-ilarly to previous findings in the literature, e.g., Audrino andTrojani (2006), we find that the conditional variance of stockindex returns implies more than two regimes. The estimatedconditional variance of Treasury bond returns implies only tworegimes.

We estimate only two regimes as well for the estimatedconditional correlation function of stock index and Treasurybond returns. These regimes depend on the lagged return of theS&P500 index only. Each regime features local GARCH-typeDCC effects as in Engle (2002), in which the regime-dependentparameters φk and λk imply a different persistence of correla-tion shocks: Here the lagged S&P500 index return is below or

above the threshold value d1 = −3.847. This threshold corre-sponds to the 37.5% quantile of the distribution of S&P500 in-dex returns. The differences in the estimated parameters of thelocal DCC dynamics are statistically significant. Interestingly,we find that correlation shocks are quite substantially more per-sistent, conditional on a sufficiently negative past stock indexreturn (Xt−1,S&P500 ≤ −3.847). This might be interpreted as anindication that correlation shocks between bond and stock re-turns are likely to last longer conditional on flight-to-quality ef-fects caused by a drop in the stock market. It is also interestingto note that the correlation dynamics estimated for stock indexand bond returns are quite different from those estimated earlierin our application to a 10-dimensional stock returns time series.The flexibility of our TreeDCC setting is crucial for allowingus to take these different dynamic correlation features of someasset returns adequately into account.

4.2.2 Multivariate Performance Results. Table 5 presentsresults for the goodness-of-fit measures in Section 4, estimatedusing stock index and Treasury bond return data.

The TreeDCC model clearly has the best goodness-of-fit re-sults across all in-sample and out-of-sample measures used.The improvements in out-of-sample performance range from1% to 25%, depending on the goodness-of-fit measure applied.Constant or piecewise-constant conditional correlation dynam-ics (like the ones implied by the CCC and RSDC models) arelargely rejected by the data and lead to very inaccurate correla-tion forecasts. In contrast to the previous application to singlestock returns, GARCH-type DCC effects are now crucial in or-der to improve the model’s out-of-sample forecasting power forcorrelations.

4.2.3 Statistical and Economic Significance of the Improve-ments. In Table 5, p-values of Hansen (2005) SPA tests arereported. They show that the TreeDCC model is the only onenot significantly outperformed by any other model at the 10%confidence level and for all performance criteria used.

The Engle-DCC model yields quite good results and is notoutperformed by any model at the 5% confidence level. Allother models are clearly dominated at standard significance lev-els. MCS results consistently support these findings. Ten per-cent MCS using the range and the semiquadratic statistics con-sist only of the TreeDCC model for all performance criteriaused. Additionally, 5% MCS include the DCC model.

Table 5. U.S. index and bond returns: Goodness-of-fit results

Model # par. NL IS-MAE MSE NL OS-MAE MSE

CCC-GARCH 25 9334.7 63.6310 21,197.8 5511.5 97.2933 27,592.1(0.0003) (0) (0)

DCC-GARCH 27 9299.2 59.4418 20,458.1 5451.7 74.8853 22,357.5(0.4469) (0.0868) (0.3319)

RSDC-GARCH 30 9334.7 63.6310 21,197.8 5511.5 97.2933 27,592.0(0.0014) (0) (0)

TreeDCC-GARCH 29 9290.2 58.7296 20,413.5 5440.9 70.7387 20,544.6(0.8435) (0.2711) (0.2246)

F-MGARCH 13 9389.2 65.0761 22,752.7 5483.4 86.1455 28,271.6(0.0012) (0.0007) (0.0652)

NOTE: Goodness of fit of different models for a two-dimensional time series of daily (annualized) returns (in %) on the U.S. S&P500 index and the 30-year Treasury bond. Data arefor the time period between January 3, 1996 and October 30, 2003, for a total of 1899 observations. The in-sample estimation period goes from the beginning of the sample to the endof 2000 (1219 observations). NL, MAE, and MSE are the multivariate versions of the standard univariate negative log-likelihood, the mean absolute error, and the mean squared errorstatistics. # par. reports the number of parameters estimated in the different models. For the out-of-sample performance measures, p-values of SPA tests are reported in parentheses.

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 12: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

148 Journal of Business & Economic Statistics, January 2011

Table 6. U.S. index and bond returns: Annualized fees(in basis points)

Target Alternative model λ = 2 λ = 7 λ = 10

6% CCC-GARCH 5.54 19.38 27.68DCC-GARCH 0.54 1.87 2.68RSDC-GARCH 5.54 19.38 27.68F-MGARCH 3.68 12.89 18.41

10% CCC-GARCH 30.14 105.50 150.72DCC-GARCH 2.92 10.21 14.59RSDC-GARCH 30.14 105.50 150.72F-MGARCH 20.05 70.18 100.26

15% CCC-GARCH 88.58 310.05 442.92DCC-GARCH 8.57 29.99 42.85RSDC-GARCH 88.58 310.05 442.92F-MGARCH 58.93 206.25 294.64

NOTE: The table contains the annualized fees (in basis points) that a conditional mean-variance investor with absolute risk-aversion parameter λ = 2, 7, and 10 will be willingto pay to perform volatility timing using the one-step-ahead conditional correlation andcovariance forecasts from the TreeDCC model (benchmark model) versus those obtainedusing the CCC, DCC, RSDC, and the flexible MGARCH models. The portfolio weightsare obtained by minimizing the one-step-ahead conditional variance forecast of a portfoliocontaining the U.S. S&P500 index, the 30-year U.S. Treasury bond, and the risk-free assetfor a given target expected return on the portfolio. The annual risk-free rate is set to beequal to 3%. Data are for the out-of-sample time period beginning in January, 2001 andending in October, 2003, for a total of 680 daily observations.

We conclude this section by economically quantifying theforecast improvements of our TreeDCC model relative to thecompeting ones. Table 6 presents the annualized fees (in basispoints) that a conditional mean–variance investor will be readyto pay in order to forecast future variance–covariance matri-ces with the TreeDCC instead of the other models considered.The (annualized) risk-free rate is equal to 3%.

The largest economic gains arise with respect to the CCCand RSDC models: For risk aversion parameters λ ≥ 7 andtarget expected returns larger than 10%, they range from ap-proximately 100 to 440 basis points per year. Sizable economicgains between 70 and 294 basis points for risk aversion pa-rameters λ ≥ 7 and target expected returns larger than 10%also arise with respect to the flexible MGARCH model. Eco-nomic gains relative to the DCC model are small in almost allcases and never exceed 43 basis points per year. Using Diebold–Mariano-type tests we find that estimated economic gains of theTreeDCC are significant at the 1% significance level relative tothe CCC and RSDC model. Estimated economic gains relativeto the flexible MGARCH model are significant at the 5% signif-icance level. Improvements relative to the classical DCC modelare not statistically significant. Overall, these results provideevidence that economically and statistically significant differ-ences in variance–covariance forecasts of our TreeDCC modelare likely to arise with respect to the CCC, RSDC, and flexi-ble GARCH models, and less likely relative to the DCC model,which was clearly dominated by the TreeDCC model in the pre-vious empirical application.

5. CONCLUSION

We propose a new multivariate DCC-GARCH model thatextends previous models by admitting multivariate thresholdsin conditional volatilities and correlations. The thresholds are

modeled by a tree-structured partition of the multivariate statespace and are estimated with all other model parameters. Tworeal data applications support the overall higher forecastingpower of the TreeDCC model for return correlations, relativeto Bollerslev’s CCC model, Engle’s DCC model, Pelletier’sRSDC model, and the flexible MGARCH model. We find thatthe conditional correlations of financial data are often charac-terized by multivariate thresholds and local GARCH-type struc-tures and that the forecast improvements of our TreeDCC modelare often economically relevant. Our model can cope in a par-simonious way with such features of the data even in appli-cations with large cross sections of financial assets. An inter-esting avenue for future research is the joint empirical model-ing of the dynamic correlation of the returns of several assetclasses, which are likely to exhibit rich threshold and GARCH-type effects that can be parsimoniously taken into account byour model.

APPENDIX: TESTING STATISTICAL RELEVANCE:THE MODEL CONFIDENCE SET

We formally test for differences in the forecasting power ofthe competing models in order to select, if possible, a best one(or a best subset of models) that significantly dominates the oth-ers in our real data application. To this end, we apply the modelconfidence set (MCS) method proposed by Hansen, Lunde, andNason (2003).

Without loss of generality, let us denote by Dt,wk the differ-ences of each term in the OS-MSE statistic

Dt,wk = Ut;modelw − Ut;modelk ,

t = 1, . . . ,nout,w, k = 1, . . . ,9,w < k,

wherenout∑t=1

Ut;model = OS-MSE.

Statistics based on time averages Dwk of Dt,wk allow us to in-vestigate whether there is a systematic difference in the out-of-sample forecasting power between the different models. Testsbased on Dt,wk are t-type tests. In a similar way, one can pro-ceed by using a different out-of-sample goodness-of-fit statis-tic. In our application, we compute tests based on OS-MSE,OS-MAE, and OS-NL.

The MCS is defined as the smallest set of models, which at agiven confidence level α, cannot be significantly distinguishedbased on forecasting power. The MCS is determined after se-quentially trimming the set of candidate models, which in ourapplication consists of the five multivariate GARCH specifica-tions introduced earlier. At each step of such a trimming pro-cedure, the null-hypothesis of equal predictive ability (EPA)H0 : E[Dt,wk] = 0,∀w, k ∈ M, is tested for the relevant set ofmodels M at a confidence level α. In the first step, M consistsof all models under investigation. If, in the first step, H0 is re-jected, then the worst-performing model according to the rele-vant criterion is eliminated. The test procedure is then repeatedfor the new set M of surviving models and it is iterated un-til the first nonrejection of the EPA hypothesis occurs. The setof resulting models is called the model confidence set Mα at

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14

Page 13: A General Multivariate Threshold GARCH Model With Dynamic Conditional Correlations

Audrino and Trojani: General Multivariate Threshold GARCH Model 149

the given confidence level α. In our application, we work withα = 0.05, 0.10.

Our tests of EPA are based on the range statistic TR and theless conservative semiquadratic statistic TSQ

TR = maxk,w∈M

|Dkw|√var(Dkw)

and TSQ =∑k<w

D2kw

var(Dkw),

where the sum in TSQ is taken over the models in M, Dkw =n−1

out∑nout

t=1 Dt,kw, and var(Dkw) is an estimate of var(Dkw) ob-tained from a block-bootstrap of the series Dt,kw, t = 1, . . . ,nout.Using statistics TR or TSQ, we test the null hypothesis EPA atconfidence level α for model set M. If the hypothesis EPAis rejected for model set M, we compute a worst-performingindex in order to trim the worst-performing model from M.

The worst-performing index for Modelk is computed as themean across models w �= k of statistic Dwk. More specifically, itis defined as Dk/

√var(Dk), where Dk = meanw �=k∈M Dkw. As

earlier, our estimate of var(Dk) is based on a block-bootstrap.The model with the highest worst-performing index is finallytrimmed from M. Consistency of estimates of the (asymptotic)distributions of TR and TSQ can be proved under mild regularityconditions on the bootstrap. For details, see Hansen, Lunde, andNason (2003).

ACKNOWLEDGMENTS

The authors thank the editors, Torben Andersen and SerenaNg, the associate editor, and two anonymous referees for manyvaluable comments. Financial support by the Swiss NationalScience Foundation (grants 100012-103781, 100012-105745,and NCCR FINRISK) and by the Foundation for Researchand Development of the University of Lugano is gratefully ac-knowledged. Financial Support by the NCCR FINRISK Com-petence Center for Research of the Swiss National ScienceFoundation is gratefully acknowledged.

[Received April 2008. Revised August 2009.]

REFERENCES

Andersen, T. G., Bollerslev, T., Diebold, F. X., and Labys, P. (2001), “The Dis-tribution of Exchange Rate Volatility,” Journal of the American StatisticalAssociation, 96, 42–55. [143]

(2003), “Modeling and Forecasting Realized Volatility,” Economet-rica, 71 (2), 579–625. [139,143]

Audrino, F. (2006), “Tree-Structured Multiple Regimes in Interest Rates,” Jour-nal of Business & Economic Statistics, 24 (3), 338–353. [138]

Audrino, F., and Bühlmann, P. (2001), “Tree-Structured GARCH Models,”Journal of the Royal Statistical Society, Ser. B, 63 (4), 727–744. [138-142]

Audrino, F., and Corsi, F. (2009), “Modeling Tick-by-Tick Realized Correla-tions,” working paper, University of St. Gallen. [139]

Audrino, F., and Trojani, F. (2006), “Estimating and Predicting MultivariateVolatility Regimes in Global Stock Markets,” Journal of Applied Econo-metrics, 21 (3), 345–369. [138-142,144,147]

Bandi, F., Russell, J. R., and Zhu, Y. (2008), “Using High-Frequency Data inDynamic Portfolio Choice,” Econometric Reviews, 27, 163–198. [139,144,146]

Barndorff-Nielsen, O. E., and Shephard, N. (2001), “Non-Gaussian Ornstein–Uhlembech-Based Models and Some of Their Uses in Financial Eco-nomics,” Journal of the Royal Statistical Society, Ser. B, 63, 167–241. [143]

(2002), “Econometric Analysis of Realized Volatility and Its Use inEstimating Stochastic Volatility Models,” Journal of the Royal StatisticalSociety, Ser. B, 64, 253–280. [143]

Barndorff-Nielsen, O. E., Hansen, P. R., Lunde, A., and Shephard, N. (2008),“Multivariate Realised Kernels: Consistent Positive Semi-Definite Estima-tors of the Covariation of Equity Prices With Noise and Non-SynchronousTrading,” working paper, Stanford University. [139]

Bollerslev, T. (1990), “Modelling the Coherence in Short-Run Nominal Ex-change Rates: A Multivariate Generalized ARCH Model,” The Review ofEconomics and Statistics, 72, 498–505. [143]

Breiman, L., Friedman, J. H., Olshen, R. A., and Stone, C. J. (1984), Classifi-cation and Regression Trees, Belmont, CA: Wadsworth. [142]

Chiriac, R., and Voev, V. (2009), “Modelling and Forecasting Multivariate Re-alized Volatility,” working paper, Aarhus University. [139]

Corsi, F., and Audrino, F. (2007), “Realized Correlation Tick-by-Tick,” discus-sion paper, University of St. Gallen, Dept. of Economics. [143,144]

Engle, R. F. (2002), “Dynamic Conditional Correlation—A Simple Class ofMultivariate GARCH Models,” Journal of Business & Economic Statistics,20, 339–350. [138,140,143,147]

Engle, R., and Colacito, R. (2006), “Testing and Valuing Dynamic Correlationsfor Asset Allocation,” Journal of Business & Economic Statistics, 24, 238–253. [139,144,146]

Engle, R. F., and Sheppard, K. (2001), “Theoretical and Empirical Propertiesof Dynamic Conditional Correlation Multivariate GARCH,” mimeo, Uni-versity of California, San Diego. [142]

Fleming, J., Kirby, C., and Ostdiek, B. (2001), “The Economic Value of Volatil-ity Timing,” Journal of Finance, 56, 329–352. [143,144]

(2003), “The Economic Value of Volatility Timing Using RealizedVolatility,” Journal of Financial Economics, 67, 473–509. [144]

Hansen, P. R. (2005), “A Test for Superior Predictive Ability,” Journal of Busi-ness & Economic Statistics, 23, 365–380. [139,143,145-147]

Hansen, P. R., Lunde, A., and Nason, J. M. (2003), “Choosing the Best Volatil-ity Models: The Model Confidence Set Approach,” Oxford Bulletin of Eco-nomics and Statistics, 65, 839–861. [139,143,146,148,149]

Laurent, S., Rombouts, J. V. K., and Violante, F. (2009), “Consistent Rankingof Multivariate Volatility Models,” discussion paper, CORE. [143]

Ledoit, O., Santa-Clara, P., and Wolf, M. (2003), “Flexible MultivariateGARCH Modeling With an Application to International Stock Markets,”The Review of Economics and Statistics, 85 (3), 735–747. [138,143]

Newey, W. K., and McFadden, D. (1994), “Large Sample Estimation and Hy-pothesis Testing,” in Handbook of Econometrics, Vol. 4, Amsterdam: North-Holland, pp. 2111–2245. [142]

Pagan, A. (1986), “Two Stage and Related Estimators and Their Applications,”The Review of Economic Studies, 53 (4), 517–538. [142]

Pelletier, D. (2006), “Regime Switching for Dynamic Correlations,” Journal ofEconometrics, 131, 445–473. [138,140,142,143]

West, K. D., Edison, H. J., and Cho, D. (1993), “A Utility-Based Comparison ofSome Models of Exchange Rate Volatility,” Journal of International Eco-nomics, 35, 23–45. [143]

Dow

nloa

ded

by [

Uni

vers

ity o

f W

este

rn O

ntar

io]

at 0

6:41

12

Nov

embe

r 20

14