Abstract
This study investigates and compares the performance of center method, equal weighted convex combination and unequal-weighted convex combination methods through various GARCH and copula-based approaches for the analysis of relationship between gold and crude oil prices using interval data in Comex and Nymex tradings. The results of this study confirm that unequal-weighted convex combination method improves the estimation and it tends to perform better than both the center method and its equal-weighted variant. In addition, the marginal from the best fit GARCH model is used to measure dependence via copula function in the form of Student-t copula as selected according to the lowest AIC among all candidates. Finally, we can conclude that there exists the dependence between Comex and Nymex not only in the normal event, but also in the extreme event.
Access provided by CONRICYT-eBooks. Download conference paper PDF
Similar content being viewed by others
1 Introduction
Commodities, especially gold and crude oil, and bond are important instruments to diversify the risk. In the calculation of optimum risky weights for portfolio, gold and crude oil are the most attractive commodities to be included for hedging risks in portfolio of investors. Gold and crude oil have played the vital role in economics. In the last decade, many economists paid much attention to investigating the volatility and the relationship between gold and crude oil prices. The most effective tool that is employed to measure this volatility and relation is Copula based GARCH model. The multivariate GARCH models have demonstrated to be useful and effective for analyzing the pattern of multivariate random series and estimating the conditional linear dependence of volatility or co-volatility in different markets.
The study of the volatility and the relationship between gold and crude oil prices, using Copula based GARCH, has been intensively conducted in the last decade. However, those studies investigated their works using a closing price data series. Thus, if we consider only the closing prices, we might lack a valuable intraday information and the obtained results might not be reasonable [11]. Recently some studies have proposed to use interval data, e.g. the lowest and the highest price during each day or period of time, as an alternative to single value data. In the ideal world, we should be able to predict both the lowest daily price and the highest daily price. However, in practice, this is difficult, so we would like to predict at least some daily price between these bounds. In the past, researchers tried to predict the representative of the lowest and highest prices, for example, a Center and MinMax methods of Billard and Diday [3], Center and Range method of Neto and Carvalho [10], and model M by Blanco-Fernndez, Corral, Gonzlez-Rodrguez [4], to deal with the interval data. These methods aim to construct the model without taking the interval as a whole. However, we expect that these methods, especially center method, may not be robust enough to explain the real behavior of the interval data and this may lead to the misspecification of the model. Therefore, it will be of great benefit to relax this assumption of mid-point of center method and assign appropriate weights between intervals. Thus in this study, a convex combination method of Chanaim et al. [6] is employed to obtain the appropriate weights.
This study investigates and compares the dependence structure of crude oil and gold prices using different interval values for copula-based GARCH model estimation and prediction. The examined interval value methods include the center method, equal weighted, and unequal-weighted convex combination. The main findings will confirm the usefulness of the convex combination in copula-based GARCH approach for evaluating the relationship, joint distribution and co-movement between crude oil prices and gold prices for investors whose investment interest is in gold and crude oil.
The remainder of the paper is organized as follows: Sect. 2 provides methodology of study. Section 3 proposes the empirical results. Section 4 summarizes this paper.
2 Methodology
In this section, we brief the convex combination in GARCH, EGARCH, and GJR-GARCH models; and copula family for estimating joint density of the obtained marginal from the GARCH families.
2.1 Operation with Interval Arithmetic
Let \(p_i=[\underline{p}_i,\overline{p}_i]\) be lower and upper interval data at time i. This data can be defined for arithmetic operations as in the following:
1. Addition
2. Subtraction
3. Multiplication
4. Division , p > 0
5. Addition and Multiplication by scalar
6. logarithm function, \(p_i>0\)
2.2 Center Method
This method has been proposed by Billard and Diday [3]. The main idea is that it uses the center of the interval data \(p^c_t\) which is obtained from upper and lower values of interval, say \(\underline{p}_t\) and \(\overline{p}_t\), and can be derived by
2.3 Autoregressive Moving Average-GARCH Model
Many previous studies suggested that volatility of financial return data is not constant over time, but is rather clustered. This issue can be tackled using volatility modeling. Within a class of autoregressive processes with white noises having conditional heteroscedastic variances, this paper considers a GARCH(1,1) model to estimate the dynamic volatility. It is the workhorse model and mostly applied in many financial data. The model is able to reproduce the volatility dynamics of financial data. Thus, in this study, we consider ARMA(p,q)-GARCH(1,1) which can be written as
where \(\eta _t\) is a strong white noise which has normal distribution with mean zero and variance one. \(\sigma ^2_t\) is the conditional variance in GARCH process by Tim Bollerslev [5]. Some standard restrictions on the variance parameters are given.
Furthermore, Simon [12] presented a family of variance models in asymmetry EGARCH model and GJR-GARCH model.
2.4 EGARCH (Exponential GARCH)
From GARCH model, by introducing the parameters \(\lambda \) and \(\nu \), for \(\lambda \) = \(\nu \) =1. Then we can rewrite GARCH(1,1), Eq. (11) as
The form of the expected value terms associated with ARCH coefficients in the EGARCH equation depends on the distribution of innovation. If the innovation distribution is Gaussian, then
If the innovation distribution is Student’s t with \(\nu > 2\) degrees of freedom, then
2.5 GJR-GARCH
The GJR-GARCH model is a GARCH variant that includes leverage terms for modeling an asymmetric volatility clustering. In the GJR formulation, large negative changes are more likely to be clustered than positive changes. The GJR model is named for Glosten, Jagannathan, and Runkle [8]. The GJR-GARCH model is a recursive equation for the variance process, and the simple GJR-GARCH(1,1) can be written as
The indicator function \(I[\varepsilon _{t-1}<0]\) equals 1 if \(\varepsilon _{t-1}<0\), and 0 otherwise. Thus, the leverage coefficients are applied to negative innovations, giving negative changes additional weight. For stationarity and positivity, the GJR model has the following constraints
2.6 Convex Combination Method
The convex combination method is applied to deal with the interval return data, where the appropriate value over the range of interval can be computed by
where \(\alpha _0~and~\alpha _1\) are the weighted parameters with value between 0 and 1. In this study, we consider both fixed weighted and unequal-weighted convex combination methods. Thus, we set \(\alpha \) = 0.5 for fixed weighted convex combination while \(\alpha \) \(\varepsilon \) [0, 1] is set as the parameter to be estimated for unequal-weighted convex combination method. For example, in the case of ARMA(1,1)-GARCH(1,1), we can rewrite Eqs. (9)–(11) as
2.7 Model Selection by Akaike Information Criterion (AIC)
In this study, we compare our models using Akaike information criterion applied from Kullback Leibler Information. It is defined as:
where \(\hat{L}\) is maximized value of likelihood function, K is the number of parameters in the model.
2.8 Bivariate Copula Approach
Let X, Y be random variables, the continuous marginal distributions are F(x), G(y) then H(x, y) is a joint distribution, then 2-dimensional copulas \(C:[0,1]^2\) \(\rightarrow [0,1]\) can be defined by
Copula if property
so
where C is copula function of marginal distribution random 2 variables. If marginal has continuous distribution, the copula function is
\(\bullet \) Gaussian Copula
Lower and upper tail dependence or order parameters of Gaussian Copula is \(k_L=k_u\)=\(\frac{2}{(1+\rho )}\).\(\varPhi ^{-1}_n\) is quantile function for normal distribution function and \(x=\varPhi ^{-1}(u), y=\varPhi ^{-1} (v) \) and \(u, v~\in [0,1]\)
\(\bullet \) Student−t Copula
where \(t^{-1}_{v}(u)\) and \(t^{-1}_{v}(v)\) are quantile functions with student−t distribution, where v is degree of freedom and \(f_{t_{1(v)}}(x,y)\) is joint density function.
\(\bullet \) Frank Copula
\(\bullet \) Clayton Copula
\(\bullet \) Gumbel Copula
\(\bullet \) Joe Copula
Furthermore,this study also uses the bivariate copula family, presented by Joe [9] for asymmetric lower and upper tail dependence including,
\(\bullet \) BB1 coupla
\(\bullet \) BB2 coupla
\(\bullet \) BB3 copula
\(\bullet \) BB4 copula
\(\bullet \) BB5 copula
\(\bullet \) BB6 copula
\(\bullet \) BB7 copula
\(\bullet \) BB8 copula
3 Empirical Result
3.1 Data Description
The data set consists of the Comex and Nymex for the period from 8 May 2009 to 15 July 2016, covering 376 observations. Interval data is the most important issue in examining the interaction among these commodity prices. Therefore, we have considered the weekly minimum and maximum of these prices and they were collected from Thomson Reuters DataStream. The data description is shown in Table 1 and the interval return plot is show in Figs. 1 and 2.
3.2 Results of Optimal Weights for ARMA-GARCH, ARMA-EGARCH and ARMA-GJR GARCH Model Using Convex Combination Method
In this section, we use Comex and Nymex interval returns to estimate ARMA-GARCH(1,1), ARMA-EGARCH and ARMA-GJR-GARCH models with the convex combination method to find the appropriate weights in the model in the range of [0,1]. We conduct a grid search to find the best fit weight. Here, the AIC is used to determine the appropriate weight in the interval [0,1] and the results are shown in Table 2. Then, we compare three GARCH models using Akaike Information Criterion (AIC) and the lowest AIC is preferred. The results are also provided in Table 2 and we find that the GJR-GARCH model with Student−t distribution is appropriate for present volatility of Comex and EGARCH model with Student−t distribution is appropriate for present volatility of Nymex. Therefore, we use this GARCH specification to obtain our marginals.
Table 3 presents the results of Comex from the estimation by GJR-GARCH models. The results show that \(\omega _1+\omega _2\) = 0.89. This indicates that Comex exhibits a significantly high persistent volatility. Table 4 presents the results of Nymex from the estimation by EGARCH models. The results show that \(\omega _1+\omega _2\) = 0.96. This indicates that Nymex exhibits a significantly high persistent volatility. Moreover, we try to compare the results of the model with convex combination and the center method, we find that the AIC of convex combination is lower than center method for both ARMA(3.4)-GJR-GARCH(1,1) and ARMA(3,4)-EGARCH(1,1). This result indicates the superiority of convex combination method over the center method.
3.3 In-Sample Forecast and Volatility
Then, the best fit GARCH model is used to predict the return of intervals and volatility of Comex and Nymex as shown in Figs. 3 and 4. These figures illustrate the accuracy of the predicted return against actual interval return (upper panel) and closing price returns (bottom panel) to see the performance of GJR-GARCH and EGARCH with convex combination. In addition, the predicted volatility \(\sigma ^2_t\) is also plotted in the middle panel. From the graph, it is obvious that the performance is satisfactory. Different results of predicted volatilities are shown in the middle of Figs. 3 and 4. We observed that the volatility of Comex is high during 2013–2014 corresponding to the Greek crisis. For Nymex, we observed that the volatility is high during 2015–2016. We expected that the increasing doubts about the success of the oil producers meeting and rising production as well as the record US and global crude oil inventories have put a high pressure on crude oil prices.
3.4 Results of Copulas
In this section, copula model is employed to measure the dependency of Comex and Nymex. The obtained standardized residuals from the best fit GARCH process are used to compute the dependence in the copula model. First of all, we present the scatter plot of copula between Comex and Nymex in Fig. 5. We observed an unclear relationship between Comex and Nymex, thus the various families of copulas are proposed to capture the relationship between these two variables.
Finally, the results of copula model are presented in Table 5. We found that among 14 copula families, Student-t copula function shows the lowest value of AIC (−19.5115). Thus, we selected Student-t copula function to explain the dependence between Comex and Nymex. The result indicates that there exists a weak positive dependence between Comex and Nymex (\(\rho \) = 0.2479, degree of freedom = 8.4774). Moreover, we found the tail dependence between these two, where the upper and lower tail dependence was found to be 0.0392. We can conclude that there exists a dependence between Comex and Nymex not only in the normal event, but also in the extreme event.
4 Conclusion
This study investigates the performance of convex combination via various GARCH families and copula-based approach. In this study, we consider crude oil and gold with interval data as the application study. The results confirm that the EGARCH and GJR-GARCH with convex combination method improve the estimation. We also used the obtained standardized residuals from GARCH process with the copula model to measure the dependency of crude oil and gold. We found that among the various copula families, Student-t copula shows the lowest value of AIC, thus we used a Student-t copula function to join the marginal of crude oil and gold. The result of copula model showed that there exists a positive dependence between these two variables. Moreover, we also found the positive tail dependence which indicates that there exists a dependence in the extreme event.
References
Akaike, H.: Information theory and an extension of the maximum likelihood principle. In: Proceedings of 2nd International Symposium on Information Theory, Budapest, pp. 267–281 (1973)
Abbate, A., Marcellino, M.: Point, interval and density forecasts of exchange rates with time-varying parameter models. Discussion paper, Deutsche Bundesbank, No. 19/2016 (2016)
Billard, L., Diday, E.: Regression analysis for interval-valued data. In: Data Analysis, Classification and Related Methods, Proceedings of the 7th Conference for the IFCS, IFCS 2002, pp. 369–374. Springer, Berlin (2000)
Blanco Fernndez, A., Corral, N., Gonzlez Rodrguez, G.: Estimation of a flexible simple linear model for interval data based on set arithmetic. Comput. Stat. Data Anal. 55(9), 2568–2578 (2011). North-Holland
Bollerslev, T.: Generalized autoregressive conditional heteroskedasticity. J. Econ. 31, 307–327 (1986)
Hansen, B.E.: Reversion. Econometrics. www.ssc.wisc.edu/bhansen (2013)
Chanaim, S., Sriboonchitta, S., Rungruang, C.: A convex combination method for linear regression with interval data. In: Integrated Uncertainty in Knowledge Modelling and Decision Making, 5th International Symposium, IUKM 2016, Da Nang, Vietnam, 30 November–2 December 2016, Proceedings, pp. 469-480. Springer (2016)
Engle, R.F.: Dynamic conditional correlation - a simple class of multivariate GARCH models. J. Bus. Econ. Stat. 20(3), 339–350 (2002)
Glosten, L., Jagannathan, R., Runkle, D.: On the relation between the expected value and the volatility of the nominal excess return on stocks. J. Financ. 1993, 1779–1801 (1993)
Joe, H.: Multivariate Models and Multivariate Dependence Concepts. CRC Press, Boca Raton (1997)
Neto, L.E.A., de Carvalho, F.A.T.: Centre and range method for fitting a linear regression model to symbolic interval data. Comput. Stat. Data Anal. 52, 1500–1515 (2008)
Phochanachan, P., Pastpipatkul, P., Yamaka, W., Sriboonchitta, S.: Threshold regression for modeling symbolic interval data. Int. J. Appl. Bus. Econ. Res. 15(7), 195–207 (2017)
Hentschel, L.: All in the family nesting symmetric and asymmetric GARCH models. J. Financ. Econ. 39(1995), 71–104 (1994). Simon. Elsevier
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Teetranont, T., Chanaim, S., Yamaka, W., Sriboonchitta, S. (2018). Investigating Relationship Between Gold Price and Crude Oil Price Using Interval Data with Copula Based GARCH. In: Kreinovich, V., Sriboonchitta, S., Chakpitak, N. (eds) Predictive Econometrics and Big Data. TES 2018. Studies in Computational Intelligence, vol 753. Springer, Cham. https://doi.org/10.1007/978-3-319-70942-0_47
Download citation
DOI: https://doi.org/10.1007/978-3-319-70942-0_47
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-70941-3
Online ISBN: 978-3-319-70942-0
eBook Packages: EngineeringEngineering (R0)