Received 30 November 2015; accepted 27 December 2015; published 30 December 2015
1. Introduction
Since the seminal contribution of [1] , the literature on multivariate Generalized Autoregressive Conditional Heteroskedasticity (GARCH) models has rapidly developed (see [2] and [3] , for surveys). To date, three generations of models can be counted. First generation models, likewise the VEC model of [1] and the BEKK model of [4] , are straightforward extensions of the univariate GARCH model. They allow for very general conditional variance covariance matrix dynamics, yet at the cost of a very profligate parameterization, which limits their use to small sets of time series. This drawback has been overcome by second generation models, yet at the cost of imposing either parameter restrictions on the BEKK model, as for the case of the scalar BEKK model and the exponentially weighted moving average model introduced by [5] , or on the conditional correlation matrix, assumed time-invariant in the constant conditional correlation CCC model of [6] . Alternatively, restrictions have been imposed through factor structures, likewise [7] and the orthogonal models of [8] -[10] and [11] . On the other hand, a different approach has been pursued by the most recent third generation of multivariate GARCH models, i.e. the dynamic conditional correlation models, grounded on a two-step estimation procedure, involving the estimation of univariate GARCH models for the conditional variances in the first step and then the estimation of the conditional covariances in the second step. Although inefficient, the latter sequential procedure is consistent and asymptotically normal. Moreover, by dramatically reducing the numerical optimization burden, it can be implemented also in the case of vast sets of time series. In this respect, seminal is the Dynamic Conditional Correlation models (DCC) of [12] and [13] . Further extensions are [14] , the Dynamic Conditional Equi- Correlation (DECO) model of [15] , [16] and [17] .
Dynamic conditional correlation models, in order to ensure positive definiteness of the conditional variance-covariance matrix, posit the correlation matrix to be a transformation of a latent matrix, which is a function of past devolatilized innovations. In particular, while the CCC model of [6] assumes time invariant, but pairwise specific correlations, the DECO model of [15] makes the opposite assumption, positing time varying correlations, but equal across series. Both CCC and DECO therefore rely on assumptions on conditional correlation dynamics which are unlikely to be supported by the data. On the other hand, in the alternative formulation of [13] , the correlation matrix is modeled directly and as a function of past correlations of devolatilized innovations. As a common drawback, all of the available dynamic conditional correlation models rely on the choice, neither unique nor obvious, of a long run target for the conditional variance-covariance or correlation matrix.
In the light of the above issues, the paper then contributes to the literature by introducing a new simple semiparametric estimator of the conditional variance-covariance and correlation matrix (SP-DCC). While sharing a similar sequential approach to DCC and DECO, SP-DCC has the advantage of not requiring the direct parameterization of the conditional covariance or correlation processes, therefore also avoiding any assumption on their long-run target. In the proposed framework, conditional variances are estimated by univariate GARCH models, for actual and suitable transformed series, in the first step; the latter are then nonlinearly combined in the second step, according to basic properties of the covariance and correlation operator, to yield nonparametric estimates of the corresponding conditional covariances and correlations. In contrast to available DCC methods, SP-DCC allows for straightforward estimation also for the non-symultaneous case, i.e. for the estimation of conditional cross-covariances and correlations displaced at any time horizon of interest. A simple ex-post procedure to ensure well behaved conditional covariance and correlation matrices, grounded on nonlinear shrinkage, is finally proposed. Due to its sequential implementation and scant computational burden, SP-DCC is very simple to apply and suitable for the modeling of vast sets of conditionally heteroskedastic time series. We point to [18] for an empirical application of the proposed approach.
2. Semiparametric Estimation of Dynamic Conditional Correlations
Consider a discrete time, real-valued vector stochastic process of dimension
(1)
where is the conditional mean vector, is a vector of parameter, is the sigma field, and
(2)
where is a positive definite matrix of dimension.
The random vector is of dimension and assumed to be i.i.d. with first two moments
(3)
(4)
where is the identity matrix of dimension N.
It is straightforward to show that is the conditional variance-covariance matrix; in fact
In general both and depend on the parameter vector. While, the conditional mean vector does not depend on the conditional variance parameter, apart from the GARCH-in-mean case, the conditional variance matrix depends on the conditional mean parameters through the residuals. In what follow, for simplicity, we leave out from notation and neglect the conditional mean vector, which might be modelled in various ways, i.e. by means of univariate or multivariate ARMA models.
2.1. The Conditional Variance Process
We assume the elements along the main diagonal of follow a GARCH (1, 1) process
(5)
subject to the usual restrictions required to ensure that the generic ith conditional variance process is positive almost surely at any point in time. For instance, sufficient (not necessary) conditions are, , , with stationarity condition.1
An extended specification is in principle also viable, i.e.
yet actually feasible only for small N.
2.2. The Conditional Covariance Process
Consider the identity
(6)
given that.
The off-diagonal elements of can then be defined accordingly, i.e.
(7)
By defining the new variables and, and assuming a GARCH (1, 1) specification for their conditional variance processes and
(8)
(9)
subject to the usual restrictions required to ensure a well behaved conditional variance process, (7) becomes
(10)
Moreover, if residuals are obtained from linear transformations of the original variables2, then and; hence, (8) and (9) can be written as
(11)
(12)
By means of the proposed method conditional cross-covariances and correlations can also be computed, as
(13)
2.3. Estimation
Consistent and asymptotically normal estimation is performed in two steps.
Firstly, the conditional variances, , i.e. the elements along the main diagonal of, and, , , , are estimated equation by equation by means of; this yields,
, and, , ,.
Then, in the second step the off-diagonal elements of, , , , are estimated nonparametrically by computing
(14)
By defining
the conditional correlation matrix is then estimated as
By definition the matrix is positive definite; our estimation approach does not restrict to be almost surely positive definite at any point in time. The latter property can however be checked ex-post by computing the eigenvalues of, which by being a real, square and symmetric matrix, under positive definiteness are expected to be all positive. In practice this can be performed by means of Descartes’ rule of alternating signs applied to its characteristic polynomial3, as well as by means of Sylvester’s criterion4, or by assessing the existence and uniqueness of its Cholesky decomposition.
However, the positive definiteness property might also be imposed ex-post, by means of shrinkage methods, as in Ledoit and Wolf (2004, 2012). In the latter case a compromise estimate of the conditional correlation matrix is obtained by shrinking the estimated conditional correlation matrix towards the identity matrix, i.e. by computing
where is the shrinkage intensity at time period. The compromise estimate of, i.e., can then be obtained as
which is positive definite by construction, as is positive definite and the elements of are well-defined.
2.4. Ex-Post Correction for Well-Behaved Conditional Covariances and Correlations
Alternatively, the validity of the Cauchy-Schwarz inequality and the condition of positive definiteness can be imposed sequentially, at each point in time t, following the below procedure.
Firstly, the estimated conditional correlations in, , , , are bounded to lie within the range, by applying the sign-preserving bounding transformation
(15)
where and even; the value of k can be selected optimally by solving
(16)
i.e. by setting k in such a way that the sum of Frobenious norms over the temporal sample is minimized; this yields, the transformed correlation matrix, which satisfies, by construction, the Cauchy-Schwarz inequality.
Secondly, positive definiteness is enforced by computing the eigenvalue-eigenvector decomposition of the transformed conditional correlation matrix, yielding
where is the diagonal matrix containing the N ordered eigenvalues along the main diagonal, and is the matrix containing the N associated orthogonal eigenvectors. In the case of violation of the positive definiteness condition one or more of the eigenvalues will be negative; an empirically viable strategy to impose positive definiteness ex-post consists of replacing the negative sample eigenvalues with positive values, computed for instance from their sample average value when positive or from the grand average across sample eigenvalues. The rationale guiding this practice is the well-known issue of downward biased estimation of the smallest eigenvalues (versus upward biased estimation of the largest eigenvalues). Rather than shrinking all the sample eigenvalues towards their grand average, as occurring by implementing [19] , only the negative eigenvalues are shrank towards positive average values. The latter practice is consistent with nonlinear shrinkage of the covariance matrix ([20] ), allowing in principle for different shrinkage intensities to be applied to the various eigenvalues.
The shrank matrix of eigenvalues would then be obtained, and therefore
(17)
which, by construction, is well-behaved at each point in time. The implied conditional covariance process at time period t can then be obtained as
where
as before. The implied estimated variance-covariance matrix then obeys the Cauchy-Schwarz inequality and the positive definiteness condition, at each point in time, by construction.
2.5. Asymptotic Properties
Under assumptions (1) through (5), estimation and inference for the parameters of the univariate GARCH (1, 1) processes (5), (8) and (9) can be performed by means of. The Gaussian log likelihood function for the generic process, assuming for simplicity and a GARCH (1, 1) structure
can then be written as
and numerically maximized with respect to the vector of parameters. Similarly for the other variables and.
Under fairly general conditions, the asymptotic distribution of is
where denotes the true value of the vector of parameters, and where is the Hessian and is the outer product gradient, both of which are evaluated at the true parameter values. This also establishes the consistent and asymptotically normal estimation of the conditional variance of, , as well as of the transformed variables and.
Consistent and asymptotically normal estimation of the off-diagonal elements of the conditional variance-co- variance matrix then follows directly from the consistent and asymptotically normal estimation of the conditional variances of the transformed variables in (8) and (9). In fact, considering the generic off-diagonal element of, , , one has
as the conditional covariance estimator is a linear combination of the (consitent and asymptotically normal) conditional variance estimators for the transformed variables and .
3. Conclusion
The paper introduces a new simple semiparametric estimator of the conditional variance-covariance and correlation matrix (SP-DCC). While sharing a similar sequential approach to existing dynamic conditional correlation methods, SP-DCC has the advantage of not requiring the direct parameterization of the conditional covariance or correlation processes. In the first step, conditional variances are estimated by univariate GARCH models for actual and suitably transformed series. In the second step, the estimated conditional covariances are then nonlinearly combined, according to basic properties of the covariance and correlation operator, to yield nonparametric estimates of the various conditional covariances and correlations. At this step, SP-DCC also allows for the estimation of conditional cross-covariances and correlations, displaced at any time horizon. In the third step, well behaved conditional variance-covariance and correlation matrices are obtained by means of nonlinear shrinkage. Due to its sequential implementation and scant computational burden, SP-DCC is very simple to apply and suitable for the modeling of vast sets of conditionally heteroskedastic time series.
Acknowledgements
The author is grateful to the referee, M. Rockinger and M. Dacorogna for their comments. This project has received funding from the European Union’s Seventh Framework Programme for research, technological development and demonstration under grant agreement no. 3202782013-2015. On rainy days, be in the rain/In windy days, be in the wind (Mitsuo Aida).
NOTES
1The GARCH (1, 1) model is chosen for simplicity; the approach is very flexible and can accommodate any model of the GARCH family.
2This, for instance, would occur when the conditional mean vector is specified as a vector autoregressive (VAR) process, yet not in the presence of a VARMA structure. In the latter case residuals and should be computed from time series models specified for the new variables and.
3By ordering the terms of the characteristic polynomial with real coefficients by descending variable exponent, the number of positive roots of the polynomial is then either equal to the number of sign differences between consecutive nonzero coefficients, or is less than it by an even number. Multiple roots of the same value are counted separately.
4According to Sylvester’s criterion, a real, square symmetric positive definite matrix shows all positive leading principal minors, where the kth leading principal minor of a matrix M is the determinant of its upper-left k by k sub-matrix. In practice the M matrix is reduced to an upper triangular matrix by means of row operations, as in the first part of the Gaussian elimination method, preserving the sign of its determinant during pivoting process. Since the kth leading principal minor of a triangular matrix is the product of its diagonal elements up to row k, positive definiteness can be assessed by checking whether its diagonal elements are all positive. The latter condition is then checked each time a new row k of the triangular matrix is obtained.