1 Introduction

In multivariate statistical analysis, the covariance matrix can have various specific structures. One of these is the blocked compound symmetric (BCS) covariance structure. The BCS covariance structure for doubly multivariate observations is a multivariate generalization of the compound symmetric covariance structure for multivariate observations. The BCS covariance structure is defined as:

$$\begin{aligned} {\varvec{\varSigma }}&= {\varvec{I}}_u\otimes ({\varvec{\varSigma }}_0 - {\varvec{\varSigma }}_1) + {\varvec{J}}_u\otimes {\varvec{\varSigma }}_1 = \begin{pmatrix} {\varvec{\varSigma }}_0 &{} {\varvec{\varSigma }}_1 &{} \cdots &{} {\varvec{\varSigma }}_1 \\ {\varvec{\varSigma }}_1 &{} {\varvec{\varSigma }}_0 &{} \cdots &{} {\varvec{\varSigma }}_1 \\ \vdots &{} \vdots &{}&{}\vdots \\ {\varvec{\varSigma }}_1 &{} {\varvec{\varSigma }}_1 &{} \cdots &{} {\varvec{\varSigma }}_0 \end{pmatrix}, \end{aligned}$$

where \({\varvec{I}}_u\) is the \(u\times u\) identity matrix, \({\varvec{1}}_u\) is a \(u\times 1\) vector of ones, \({\varvec{J}}_u={\varvec{1}}_u{\varvec{1}}_u'\), and \(\otimes \) denotes the Kronecker product. We assume that \(u\ge 2\), \({\varvec{\varSigma }}_0\) is a positive-definite symmetric \(p\times p\) matrix, and \({\varvec{\varSigma }}_1\) is a symmetric \(p\times p\) matrix. We also assume that \({\varvec{\varSigma }}_0 - {\varvec{\varSigma }}_1\) and \({\varvec{\varSigma }}_0 +(u-1) {\varvec{\varSigma }}_1\) are positive-definite matrices so that \({\varvec{\varSigma }}\) is a positive-definite matrix. Arnold [2] studied this covariance structure in the general linear model when the error vectors are assumed to be exchangeable and normally distributed. Szatrowski [13] discussed the BCS covariance structure and used a model to analyze an educational testing problem. Leiva [8] derived maximum likelihood estimates (MLEs) of the BCS covariance structure, developed classification rules for doubly multivariate observations and generalized Fisher’s linear discrimination method under the BCS covariance structure.
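To fix ideas, the following minimal sketch (ours, not part of the cited works) constructs a BCS covariance matrix from \({\varvec{\varSigma }}_0\) and \({\varvec{\varSigma }}_1\) and checks the positive-definiteness conditions through \({\varvec{\varSigma }}_0-{\varvec{\varSigma }}_1\) and \({\varvec{\varSigma }}_0+(u-1){\varvec{\varSigma }}_1\); the Python/NumPy function names are illustrative assumptions.

```python
import numpy as np

def bcs_covariance(sigma0, sigma1, u):
    """Build Sigma = I_u (x) (Sigma0 - Sigma1) + J_u (x) Sigma1 (up x up)."""
    return np.kron(np.eye(u), sigma0 - sigma1) + np.kron(np.ones((u, u)), sigma1)

def bcs_is_positive_definite(sigma0, sigma1, u):
    """Sigma is positive definite iff Sigma0 - Sigma1 and Sigma0 + (u-1) Sigma1 are."""
    delta1 = sigma0 - sigma1
    delta2 = sigma0 + (u - 1) * sigma1
    return all(np.all(np.linalg.eigvalsh(d) > 0) for d in (delta1, delta2))
```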

Recently, the BCS covariance structure has been actively researched. For three-level multivariate data, Roy and Leiva [10] and Coelho and Roy [3] developed hypothesis testing frameworks for this type of covariance structure. Roy et al. [11] and Zezula et al. [15] studied hypothesis testing for the equality of mean vectors in two populations under the BCS covariance structure. Roy et al. [12] proved that the unbiased estimators of the BCS covariance structure are optimal under normality.

We consider hypothesis testing for independence under the BCS covariance structure, i.e.,

$$\begin{aligned} H_0 : {\varvec{\varSigma }}_1 = {\varvec{O}} \text{ versus } H_1 : {\varvec{\varSigma }}_1 \ne {\varvec{O}}, \end{aligned}$$

where \({\varvec{O}}\) is the \(p\times p\) zero matrix. This problem extends the independence test for a covariance matrix to an independence test for a blocked covariance matrix. We investigate the properties of the unbiased estimator of the covariance matrix and use them to derive the Wald statistic. We also derive the likelihood ratio criterion (LRC), the modified LRC based on the Bartlett correction, Rao's score statistic, and Terrell's [14] gradient statistic. The asymptotic behavior of these test statistics is similar, but the accuracy of their convergence to the significance level and the powers of the tests based on them are investigated for finite samples through numerical simulations. The simulation results show that the accuracy of convergence to the significance level differs depending on the statistic. Therefore, we also simulate the bootstrap test using these test statistics. The simulation results show that the tests using these statistics converge to the significance level for large samples, that the power of the test using the Wald statistic is the largest when the dimension is high, and that the power of the likelihood ratio test is the largest when the dimension is low.

The remainder of this article is organized as follows. The properties of the unbiased estimator are obtained in Sect. 2. In Sect. 3, the LRC, modified LRC, Wald statistic, Rao’s score statistic and gradient criterion are derived, and the process of the bootstrap test using the relevant statistics is described. Numerical simulations and an application to real data are reported in Sect. 4. Finally, Sect. 5 contains our conclusions.

2 Estimators

We assume that \({\varvec{x}}_{r,s}\) is a p-variate vector of measurements on the r-th individual at the s-th site (\(r=1, \ldots , n\), \(s=1, \ldots ,u\)). The n individuals are all independent. Let \({\varvec{x}}_r=({\varvec{x}}_{r,1}', \ldots , {\varvec{x}}_{r,u}')'\) be the up-variate vector of all measurements corresponding to the r-th individual. Finally, we assume that \({\varvec{x}}_1, {\varvec{x}}_2, \ldots , {\varvec{x}}_n\) is a random sample of size n drawn from the population \(N_{up}({\varvec{\mu }}, {\varvec{\varSigma }})\), where \({\varvec{\mu }}=({\varvec{\mu }}_1',\ldots ,{\varvec{\mu }}_u')'\) is a \(up\times 1\) vector and \({\varvec{\varSigma }}\) is a \(up\times up\) positive-definite matrix that has the BCS covariance structure (cf. Leiva [8]).

In this section, we discuss estimators under the BCS covariance structure. Roy et al. [12] derived unbiased estimators as follows:

Theorem 2.1

(Roy et al. [12]) Assume that \({\varvec{x}}_1, {\varvec{x}}_2, \ldots , {\varvec{x}}_n\) is a random sample of size n drawn from the population \(N_{up}({\varvec{\mu }}, {\varvec{\varSigma }})\). Let \(\bar{{\varvec{x}}}=(\bar{{\varvec{x}}}'_{1}, \bar{{\varvec{x}}}'_{2}, \ldots , \bar{{\varvec{x}}}'_{u})'\),

$$\begin{aligned} {\varvec{C}}_0&= \sum _{s=1}^{u} \sum _{r=1}^{n} \left( {\varvec{x}}_{r,s}-\bar{{\varvec{x}}}_{s}\right) \left( {\varvec{x}}_{r,s}-\bar{{\varvec{x}}}_{s}\right) '\mathrm{,} \\ {\varvec{C}}_1&= \mathop {\sum _{s=1}^{u} \sum _{s^*=1}^{u}}_{s\ne s^*} \sum _{r=1}^{n} \left( {\varvec{x}}_{r,s}-\bar{{\varvec{x}}}_{s}\right) \left( {\varvec{x}}_{r,s^*}-\bar{{\varvec{x}}}_{s^*}\right) '\mathrm{,} \end{aligned}$$

where \(\bar{{\varvec{x}}}_{s}=\sum _{r=1}^{n}{\varvec{x}}_{r,s}/n\) (\(s=1, \ldots , u\)). Then, \(\bar{{\varvec{x}}}\) is distributed as \(N_{up}({\varvec{\mu }}, {\varvec{\varSigma }}/n)\) and is the unbiased estimator for the mean vector \({\varvec{\mu }}\). The estimators

$$\begin{aligned} \tilde{{\varvec{\varSigma }}}_0=\frac{1}{u(n-1)}{\varvec{C}}_0\quad \text{ and }\quad \tilde{{\varvec{\varSigma }}}_1=\frac{1}{u(u-1)(n-1)}{\varvec{C}}_1 \end{aligned}$$

are unbiased estimators for \({\varvec{\varSigma }}_0\) and \({\varvec{\varSigma }}_1\), respectively.

Therefore, the unbiased estimator for \({\varvec{\varSigma }}\) is

$$\begin{aligned} \tilde{{\varvec{\varSigma }}}={\varvec{I}}_u\otimes (\tilde{{\varvec{\varSigma }}}_0 - \tilde{{\varvec{\varSigma }}}_1) + {\varvec{J}}_u\otimes \tilde{{\varvec{\varSigma }}}_1. \end{aligned}$$
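As an illustration of Theorem 2.1, the following sketch computes \({\varvec{C}}_0\), \({\varvec{C}}_1\) and the unbiased estimators from a sample; it assumes (our convention, not the paper's) that the data are stored as a NumPy array of shape \((n, u, p)\), with entry \([r, s]\) holding \({\varvec{x}}_{r+1,s+1}\).

```python
import numpy as np

def bcs_unbiased_estimates(x):
    """Unbiased estimators of Sigma0 and Sigma1 from data of shape (n, u, p)."""
    n, u, p = x.shape
    resid = x - x.mean(axis=0)                      # x_{r,s} - xbar_s
    c0 = np.einsum('rsi,rsj->ij', resid, resid)     # sum over r and s = s*
    tot = np.einsum('rsi,rtj->ij', resid, resid)    # sum over r and all pairs (s, s*)
    c1 = tot - c0                                   # keep only s != s*
    sigma0_tilde = c0 / (u * (n - 1))
    sigma1_tilde = c1 / (u * (u - 1) * (n - 1))
    return sigma0_tilde, sigma1_tilde
```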

For further inference, we derive the distributions of these estimators under some assumptions. The unbiased estimator \(\bar{{\varvec{x}}}\) for \({\varvec{\mu }}\) is distributed as \(N_{up}({\varvec{\mu }}, {\varvec{\varSigma }}/n)\), but the estimators \(\tilde{{\varvec{\varSigma }}}_0\) and \(\tilde{{\varvec{\varSigma }}}_1\) do not follow a Wishart distribution, even when the population distribution is normal. We first obtain the exact distributions of \(\tilde{{\varvec{\varSigma }}}_0\) and \(\tilde{{\varvec{\varSigma }}}_1\). Roy et al. [11] indicated that

$$\begin{aligned} {\varvec{W}}_1&\equiv (n-1)(u-1)\left( \tilde{{\varvec{\varSigma }}}_0-\tilde{{\varvec{\varSigma }}}_1\right) \sim W_{p}((n-1)(u-1), {\varvec{\varSigma }}_0-{\varvec{\varSigma }}_1), \end{aligned}$$
(2.1)
$$\begin{aligned} {\varvec{W}}_2&\equiv (n-1)\left\{ \tilde{{\varvec{\varSigma }}}_0+(u-1)\tilde{{\varvec{\varSigma }}}_1\right\} \sim W_{p}(n-1, {\varvec{\varSigma }}_0+(u-1){\varvec{\varSigma }}_1), \end{aligned}$$
(2.2)

and these estimators are independent of each other. The estimator \({\varvec{W}}_1\) is positive-definite when \((n-1)(u-1)\ge p\) and the estimator \({\varvec{W}}_2\) is positive-definite when \(n-1\ge p\). When \(n>p\), these inequalities are true for \(u\ge 2\). Since

$$\begin{aligned}&(n-1)u\tilde{{\varvec{\varSigma }}}_0 = {\varvec{W}}_1 + {\varvec{W}}_2, \\&(n-1)u(u-1)\tilde{{\varvec{\varSigma }}}_1 = (u-1){\varvec{W}}_2 - {\varvec{W}}_1, \end{aligned}$$

the exact distributions of \(\tilde{{\varvec{\varSigma }}}_0\) and \(\tilde{{\varvec{\varSigma }}}_1\) are obtained as the sum and the difference of Wishart matrices.

Lemma 2.2

Let \({\varvec{\varDelta }}_1={\varvec{\varSigma }}_0-{\varvec{\varSigma }}_1\), and \({\varvec{\varDelta }}_2={\varvec{\varSigma }}_0+(u-1){\varvec{\varSigma }}_1\). When \(u\ge 2\) and \(n>p\), the exact distribution of \(\tilde{{\varvec{\varSigma }}}_0\) is as follows:

$$\begin{aligned}&K_0\ \mathrm{etr}\left[ -\frac{(n-1)u}{2}{\varvec{\varDelta }}_1^{-1}\tilde{{\varvec{\varSigma }}}_0\right] \left| \tilde{{\varvec{\varSigma }}}_0\right| ^{\{u(n-1)-p-1\}/2}\\&\quad \times {}_1F_1\left[ \frac{1}{2}(n-1); \frac{1}{2}(n-1)u; \frac{(n-1)u}{2}\left( {\varvec{\varDelta }}_1^{-1}-{\varvec{\varDelta }}_2^{-1}\right) \tilde{{\varvec{\varSigma }}}_0\right] , \end{aligned}$$

where \(\mathrm{etr}({\varvec{H}})=\exp \left[ \mathrm{tr}({\varvec{H}})\right] \),

$$\begin{aligned} K_0= \left[ \left\{ \frac{2}{(n-1)u}\right\} ^{u(n-1)p/2} \varGamma _p\left[ \frac{1}{2}u(n-1)\right] \left| {\varvec{\varDelta }}_1\right| ^{(n-1)(u-1)/2} \left| {\varvec{\varDelta }}_2\right| ^{(n-1)/2} \right] ^{-1}, \end{aligned}$$

and \({}_1F_1\left[ a; b; {\varvec{H}}\right] \) is the hypergeometric function of a matrix argument defined by (5.1).

Proof

The details of the proof are described in “Appendix A”. \(\square \)

Lemma 2.3

When \(u\ge 2\) and \(n>p\), the exact distribution of \(\tilde{{\varvec{\varSigma }}}_1\) is as follows:

$$\begin{aligned}&K_1 \mathrm{etr}\left[ -\frac{(n-1)u}{2}{\varvec{\varDelta }}_2^{-1}\tilde{{\varvec{\varSigma }}}_1\right] \left| \tilde{{\varvec{\varSigma }}}_1\right| ^{\{u(n-1)-p-1\}/2} \\&\quad \times \Psi \left[ \frac{1}{2}(n\!-\!1)(u\!-\!1), \frac{1}{2}(n\!-\!1)u; \frac{1}{2}\left\{ (n\!-\!1)u{\varvec{\varDelta }}_2^{-1}\!+\!(n\!-\!1)u(u\!-\!1){\varvec{\varDelta }}_1^{-1}\right\} \tilde{{\varvec{\varSigma }}}_1\right] , \end{aligned}$$

where \(\Psi [a,c;{\varvec{R}}]\) is the confluent hypergeometric function defined by (5.2), and

$$\begin{aligned} K_1&= \left[ \left\{ \frac{2}{(n-1)u}\right\} ^{u(n-1)p/2} \varGamma _p\left[ \frac{1}{2}(n-1)\right] \left( \frac{1}{u-1}\right) ^{(n-1)(u-1)p/2} \right. \\&\quad \times \left. \left| {\varvec{\varDelta }}_1\right| ^{(n-1)(u-1)/2} \left| {\varvec{\varDelta }}_2\right| ^{(n-1)/2} \, \right] ^{-1}. \end{aligned}$$

Proof

The details of the proof are described in “Appendix A”. \(\square \)

The exact distributions of \(\tilde{{\varvec{\varSigma }}}_0\) and \(\tilde{{\varvec{\varSigma }}}_1\) contain hypergeometric functions of a matrix argument, which are generally difficult to calculate. We therefore also derive the asymptotic distributions of the estimators.

Since the estimators \(\text{ vec }\left( \tilde{{\varvec{\varSigma }}}_0\right) \) and \(\text{ vec }\left( \tilde{{\varvec{\varSigma }}}_1\right) \) are represented as follows:

$$\begin{aligned} \text{ vec }\left( \tilde{{\varvec{\varSigma }}}_0\right)&= \text{ vec }\left( {\varvec{\varSigma }}_0\right) +\frac{u-1}{u}\text{ vec }\left( \tilde{{\varvec{\varDelta }}}_1-{\varvec{\varDelta }}_1\right) +\frac{1}{u}\text{ vec }\left( \tilde{{\varvec{\varDelta }}}_2-{\varvec{\varDelta }}_2\right) ,\\ \text{ vec }\left( \tilde{{\varvec{\varSigma }}}_1\right)&= \text{ vec }\left( {\varvec{\varSigma }}_1\right) -\frac{1}{u}\text{ vec }\left( \tilde{{\varvec{\varDelta }}}_1-{\varvec{\varDelta }}_1\right) +\frac{1}{u}\text{ vec }\left( \tilde{{\varvec{\varDelta }}}_2-{\varvec{\varDelta }}_2\right) , \end{aligned}$$

where \(\tilde{{\varvec{\varDelta }}}_1=\tilde{{\varvec{\varSigma }}}_0-\tilde{{\varvec{\varSigma }}}_1\) and \(\tilde{{\varvec{\varDelta }}}_2=\tilde{{\varvec{\varSigma }}}_0+(u-1)\tilde{{\varvec{\varSigma }}}_1\), the following theorem can be obtained using the properties of Wishart matrices.

Theorem 2.4

Let

$$\begin{aligned} {\varvec{\varPhi }}_0&= \frac{u-1}{u^2} \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) \left( {\varvec{\varDelta }}_1\otimes {\varvec{\varDelta }}_1\right) + \frac{1}{u^2} \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) \left( {\varvec{\varDelta }}_2\otimes {\varvec{\varDelta }}_2\right) ,\\ {\varvec{\varPhi }}_1&= \frac{1}{u^2(u-1)} \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) \left( {\varvec{\varDelta }}_1\otimes {\varvec{\varDelta }}_1\right) + \frac{1}{u^2} \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) \left( {\varvec{\varDelta }}_2\otimes {\varvec{\varDelta }}_2\right) , \end{aligned}$$

where \({\varvec{K}}_{p,p}\) is the commutation matrix.

The vectors \((n-1)^{1/2}\text{ vec }\left( \tilde{{\varvec{\varSigma }}}_0-{\varvec{\varSigma }}_0\right) \) and \((n-1)^{1/2}\text{ vec }\left( \tilde{{\varvec{\varSigma }}}_1-{\varvec{\varSigma }}_1\right) \) are asymptotically distributed as a \(p(p+1)/2\)-variate normal distribution with mean vector \({\varvec{0}}\) and covariance matrices \({\varvec{\varPhi }}_0\) and \({\varvec{\varPhi }}_1\), respectively.

Proof

The details of the proof are described in “Appendix B”. \(\square \)
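The asymptotic covariance matrices \({\varvec{\varPhi }}_0\) and \({\varvec{\varPhi }}_1\) of Theorem 2.4 can be formed explicitly as below; this is a sketch of ours with an explicit commutation matrix, and the helper names are illustrative.

```python
import numpy as np

def commutation_matrix(p):
    """K_{p,p} such that K @ vec(A) = vec(A') for a p x p matrix A."""
    K = np.zeros((p * p, p * p))
    for i in range(p):
        for j in range(p):
            K[i * p + j, j * p + i] = 1.0
    return K

def bcs_asymptotic_covariances(sigma0, sigma1, u):
    """Phi0 and Phi1 as defined in Theorem 2.4."""
    p = sigma0.shape[0]
    delta1 = sigma0 - sigma1
    delta2 = sigma0 + (u - 1) * sigma1
    N = np.eye(p * p) + commutation_matrix(p)
    phi0 = (u - 1) / u**2 * N @ np.kron(delta1, delta1) + N @ np.kron(delta2, delta2) / u**2
    phi1 = N @ np.kron(delta1, delta1) / (u**2 * (u - 1)) + N @ np.kron(delta2, delta2) / u**2
    return phi0, phi1
```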

3 Test Statistics and Bootstrap Test

In general, no uniformly most powerful test exists in multivariate analysis. Thus, in this section, we derive the fundamental test statistics, i.e., the LRC, Wald statistic, Rao's score statistic, and gradient statistic, for testing the hypothesis

$$\begin{aligned} H_0 : {\varvec{\varSigma }}_1={\varvec{O}} \text{ versus } H_1 : {\varvec{\varSigma }}_1\ne {\varvec{O}}. \end{aligned}$$

Finally, we explain the process of the bootstrap test using these statistics.

3.1 Likelihood Ratio Criterion

Based on the work of Leiva [8], we derive the LRC and the moment of the LRC. Furthermore, we obtain the modified LRC using the moment of the LRC. The likelihood function is

$$\begin{aligned} L({\varvec{\mu }}, {\varvec{\varSigma }})&= (2\pi )^{-nup/2} \left| {\varvec{\varSigma }}\right| ^{-n/2}\exp \left[ -\frac{1}{2}\sum _{r=1}^{n} \left( {\varvec{x}}_r-{\varvec{\mu }}\right) '{\varvec{\varSigma }}^{-1}\left( {\varvec{x}}_r-{\varvec{\mu }}\right) \right] . \end{aligned}$$
(3.1)

Since we have assumed that the covariance matrix \({\varvec{\varSigma }}\) is BCS, the inverse matrix of the covariance matrix \({\varvec{\varSigma }}\) is

$$\begin{aligned} {\varvec{\varSigma }}^{-1}={\varvec{I}}_u\otimes {\varvec{A}}+{\varvec{J}}_u\otimes {\varvec{B}}, \end{aligned}$$

where

$$\begin{aligned} {\varvec{A}}&= \left( {\varvec{\varSigma }}_0-{\varvec{\varSigma }}_1\right) ^{-1}={\varvec{\varDelta }}_1^{-1},\\ {\varvec{B}}&= \frac{1}{u}\left[ \left\{ {\varvec{\varSigma }}_0+(u-1){\varvec{\varSigma }}_1\right\} ^{-1}-\left( {\varvec{\varSigma }}_0-{\varvec{\varSigma }}_1\right) ^{-1}\right] =\frac{1}{u}\left( {\varvec{\varDelta }}_2^{-1}-{\varvec{\varDelta }}_1^{-1}\right) . \end{aligned}$$

We denote by \(Q_n\) the sum of the quadratic forms in (3.1) and rearrange it as follows:

$$\begin{aligned} Q_n&= \sum _{r=1}^{n} \left( {\varvec{x}}_r-{\varvec{\mu }}\right) '{\varvec{\varSigma }}^{-1}\left( {\varvec{x}}_r-{\varvec{\mu }}\right) \\&= \sum _{r=1}^{n} \sum _{s=1}^{u} \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) '\left( {\varvec{A}}+{\varvec{B}}\right) \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) \\&\quad + \sum _{r=1}^{n} \mathop { \sum _{s=1}^{u} \sum _{s^*=1}^{u}}_{s\ne s^*} \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) ' {\varvec{B}} \left( {\varvec{x}}_{r, s^*}-{\varvec{\mu }}_{s^*}\right) \\&= \mathrm{tr}\left[ \left( {\varvec{A}}+{\varvec{B}}\right) \sum _{r=1}^{n} \sum _{s=1}^{u} \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) '\right] \\&\quad + \mathrm{tr}\left[ {\varvec{B}}\sum _{r=1}^{n} \mathop { \sum _{s=1}^{u} \sum _{s^*=1}^{u}}_{s\ne s^*} \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) \left( {\varvec{x}}_{r, s^*}-{\varvec{\mu }}_{s^*}\right) '\right] . \end{aligned}$$

Because \(\bar{{\varvec{x}}}_s=\sum _{r=1}^{n}{\varvec{x}}_{r, s}/n\), we have \(\sum _{r=1}^{n}\left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) ={\varvec{0}}\). Since

$$\begin{aligned}&{ \sum _{r=1}^{n} \sum _{s=1}^{u} \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) ' }\nonumber \\&\quad = \sum _{r=1}^{n} \sum _{s=1}^{u} \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) ' +n \sum _{s=1}^{u}\left( \bar{{\varvec{x}}}_s-{\varvec{\mu }}_s\right) \left( \bar{{\varvec{x}}}_s-{\varvec{\mu }}_s\right) '\nonumber \\&\quad \equiv {\varvec{C}}_0+n \sum _{s=1}^{u}\left( \bar{{\varvec{x}}}_s-{\varvec{\mu }}_s\right) \left( \bar{{\varvec{x}}}_s-{\varvec{\mu }}_s\right) ' , \end{aligned}$$
(3.2)
$$\begin{aligned}&{ \sum _{r=1}^{n} \mathop { \sum _{s=1}^{u} \sum _{s^*=1}^{u}}_{s\ne s^*} \left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) \left( {\varvec{x}}_{r, s^*}-{\varvec{\mu }}_{s^*}\right) ' }\nonumber \\&\quad = \sum _{r=1}^{n} \mathop { \sum _{s=1}^{u} \sum _{s^*=1}^{u}}_{s\ne s^*} \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) \left( {\varvec{x}}_{r, s^*}-\bar{{\varvec{x}}}_{s^*}\right) ' +n \mathop { \sum _{s=1}^{u} \sum _{s^*=1}^{u}}_{s\ne s^*} \left( \bar{{\varvec{x}}}_s-{\varvec{\mu }}_s\right) \left( \bar{{\varvec{x}}}_{s^*}-{\varvec{\mu }}_{s^*}\right) '\nonumber \\&\quad \equiv {\varvec{C}}_1+n \mathop { \sum _{s=1}^{u} \sum _{s^*=1}^{u}}_{s\ne s^*} \left( \bar{{\varvec{x}}}_s-{\varvec{\mu }}_s\right) \left( \bar{{\varvec{x}}}_{s^*}-{\varvec{\mu }}_{s^*}\right) ', \end{aligned}$$
(3.3)

letting \({\varvec{\varSigma }}_*={\varvec{I}}_n\otimes {\varvec{\varSigma }}\), we can rearrange \(Q_n\) as follows:

$$\begin{aligned} Q_n&= \sum _{r=1}^{n} \left( {\varvec{x}}_r-{\varvec{\mu }}\right) '{\varvec{\varSigma }}^{-1}\left( {\varvec{x}}_r-{\varvec{\mu }}\right) \nonumber \\&= \mathrm{tr}\left( {\varvec{\varSigma }}_*^{-1} {\varvec{C}}\right) + \left( {\varvec{1}}_n\otimes \left( \bar{{\varvec{x}}}-{\varvec{\mu }}\right) \right) ' {\varvec{\varSigma }}_*^{-1} \left( {\varvec{1}}_n\otimes \left( \bar{{\varvec{x}}}-{\varvec{\mu }}\right) \right) , \end{aligned}$$
(3.4)

where

$$\begin{aligned} {\varvec{C}} = {\varvec{I}}_{nu}\otimes \frac{1}{nu}\left( {\varvec{C}}_0-\frac{1}{u-1}{\varvec{C}}_1\right) + \left( {\varvec{I}}_n\otimes {\varvec{J}}_u\right) \otimes \frac{1}{nu(u-1)}{\varvec{C}}_1. \end{aligned}$$

Therefore, \(Q_n\) is minimized when \(\hat{{\varvec{\mu }}}=\bar{{\varvec{x}}}\), and then the log-likelihood function reduces to

$$\begin{aligned} \log L(\bar{{\varvec{x}}}, {\varvec{\varSigma }}_*) = -\frac{nup}{2}\log (2\pi )-\frac{1}{2}\log \left| {\varvec{\varSigma }}_*\right| -\frac{1}{2}\mathrm{tr}\left( {\varvec{\varSigma }}_*^{-1}{\varvec{C}}\right) . \end{aligned}$$
(3.5)

From Lemma 3.2.2 of Anderson [1], the log-likelihood function is maximized when

$$\begin{aligned} \hat{{\varvec{\varSigma }}}_*={\varvec{C}}. \end{aligned}$$

Thus, the maximum of the likelihood function is

$$\begin{aligned} L(\bar{{\varvec{x}}}, \hat{{\varvec{\varSigma }}}_*) = \frac{e^{-nup/2}}{(2\pi )^{nup/2}\left| \hat{{\varvec{\varSigma }}}\right| ^{n/2}}. \end{aligned}$$
(3.6)

From (3.4), the maximum likelihood estimators of \({\varvec{\varSigma }}_0\) and \({\varvec{\varSigma }}_1\) are

$$\begin{aligned} \hat{{\varvec{\varSigma }}}_0&= \frac{1}{nu}{\varvec{C}}_0 = \frac{1}{nu} \sum _{r=1}^{n} \sum _{s=1}^{u} \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) ', \end{aligned}$$
(3.7)
$$\begin{aligned} \hat{{\varvec{\varSigma }}}_1&= \frac{1}{nu(u-1)}{\varvec{C}}_1 = \frac{1}{nu(u-1)} \sum _{r=1}^{n} \mathop { \sum _{s=1}^{u} \sum _{s^*=1}^{u} }_{s\ne s^*} \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) \left( {\varvec{x}}_{r, s^*}-\bar{{\varvec{x}}}_{s^*}\right) '. \end{aligned}$$
(3.8)

Next, we consider the maximum of the likelihood function under the null hypothesis \(H_0 : {\varvec{\varSigma }}_1={\varvec{O}}\). Under \(H_0\), we have

$$\begin{aligned} {\varvec{\varSigma }}={\varvec{I}}_u\otimes {\varvec{\varSigma }}_0, \quad {\varvec{\varSigma }}^{-1}={\varvec{I}}_u\otimes {\varvec{\varSigma }}_0^{-1}, \quad \left| {\varvec{\varSigma }}\right| =\left| {\varvec{\varSigma }}_0\right| ^u. \end{aligned}$$

Thus, the likelihood function is

$$\begin{aligned} L({\varvec{\mu }}, {\varvec{\varSigma }}_0)&= (2\pi )^{-nup/2}\left| {\varvec{\varSigma }}\right| ^{-n/2} \exp \left[ -\frac{1}{2}\sum _{r=1}^{n}\sum _{s=1}^{u}\left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) '{\varvec{\varSigma }}_0^{-1}\left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) \right] . \end{aligned}$$
(3.9)

We denote the sum of the quadratic forms in (3.9) as Q, and arrange this as follows:

$$\begin{aligned} Q&= \sum _{r=1}^{n}\sum _{s=1}^{u}\left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) '{\varvec{\varSigma }}_0^{-1}\left( {\varvec{x}}_{r, s}-{\varvec{\mu }}_s\right) \\&= \mathrm{tr}\left[ {\varvec{\varSigma }}_0^{-1} \sum _{r=1}^{n}\sum _{s=1}^{u}\left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) ' + n {\varvec{\varSigma }}_0^{-1} \sum _{s=1}^{u}\left( \bar{{\varvec{x}}}_{s}-{\varvec{\mu }}_s\right) \left( \bar{{\varvec{x}}}_{s}-{\varvec{\mu }}_s\right) ' \right] \\&= \mathrm{tr}\left[ {\varvec{\varSigma }}_0^{-1} \sum _{r=1}^{n}\sum _{s=1}^{u}\left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) '\right] + n \left( \bar{{\varvec{x}}}-{\varvec{\mu }}\right) '{\varvec{\varSigma }}^{-1}\left( \bar{{\varvec{x}}}-{\varvec{\mu }}\right) . \end{aligned}$$

When \(\hat{{\varvec{\mu }}}=\bar{{\varvec{x}}}\), Q is minimized. Then, the log-likelihood function reduces to

$$\begin{aligned} \log L(\bar{{\varvec{x}}}, {\varvec{\varSigma }}_0)&= -\frac{nup}{2}\log (2\pi ) -\frac{nu}{2}\log \left| {\varvec{\varSigma }}_0\right| \\&\quad -\frac{1}{2} \mathrm{tr}\left[ {\varvec{\varSigma }}_0^{-1} \sum _{r=1}^{n}\sum _{s=1}^{u}\left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) \left( {\varvec{x}}_{r, s}-\bar{{\varvec{x}}}_s\right) '\right] . \end{aligned}$$

From Lemma 3.2.2 of Anderson [1], the log-likelihood function is maximized when \(\hat{{\varvec{\varSigma }}}_0={\varvec{C}}_0/(nu)\), and the maximum of the likelihood function is

$$\begin{aligned} L(\hat{{\varvec{\mu }}}, \hat{{\varvec{\varSigma }}}_0)&= \frac{e^{-nup/2}}{(2\pi )^{nup/2}\left| \hat{{\varvec{\varSigma }}}_0\right| ^{nu/2}}. \end{aligned}$$
(3.10)

From the maxima (3.6) and (3.10), the LRC \(\varLambda \) is

$$\begin{aligned} \varLambda&= \frac{\max _{H_0}L({\varvec{\mu }}, {\varvec{\varSigma }})}{\max L({\varvec{\mu }}, {\varvec{\varSigma }})} = \frac{ \left| \hat{{\varvec{\varSigma }}}_0-\hat{{\varvec{\varSigma }}}_1\right| ^{n(u-1)/2} \left| \hat{{\varvec{\varSigma }}}_0+(u-1)\hat{{\varvec{\varSigma }}}_1\right| ^{n/2} }{\left| \hat{{\varvec{\varSigma }}}_0\right| ^{nu/2}}. \end{aligned}$$
(3.11)

Therefore, we have

$$\begin{aligned} -2\log \varLambda&= nu\log \left| \hat{{\varvec{\varSigma }}}_0\right| -n(u-1)\log \left| \hat{{\varvec{\varSigma }}}_0-\hat{{\varvec{\varSigma }}}_1\right| -n\log \left| \hat{{\varvec{\varSigma }}}_0+(u-1)\hat{{\varvec{\varSigma }}}_1\right| . \end{aligned}$$
(3.12)
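For reference, a minimal sketch of the criterion \(-2\log \varLambda \) in (3.12), computed via the maximum likelihood estimators (3.7) and (3.8); as before, the \((n, u, p)\) data layout and the function name are our illustrative assumptions.

```python
import numpy as np

def minus2_log_lrc(x):
    """-2 log Lambda of (3.12) from data of shape (n, u, p)."""
    n, u, p = x.shape
    resid = x - x.mean(axis=0)
    c0 = np.einsum('rsi,rsj->ij', resid, resid)
    c1 = np.einsum('rsi,rtj->ij', resid, resid) - c0
    sigma0_hat = c0 / (n * u)                        # MLE (3.7)
    sigma1_hat = c1 / (n * u * (u - 1))              # MLE (3.8)
    logdet = lambda a: np.linalg.slogdet(a)[1]
    return (n * u * logdet(sigma0_hat)
            - n * (u - 1) * logdet(sigma0_hat - sigma1_hat)
            - n * logdet(sigma0_hat + (u - 1) * sigma1_hat))
```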

Next, we obtain the h-th moment of \(\varLambda \) to derive the modified LRC. We express the LRC using \({\varvec{W}}_1\) and \({\varvec{W}}_2\) as follows:

$$\begin{aligned} \varLambda&= \frac{(nu)^{nup/2}}{\left\{ n(u-1)\right\} ^{n(u-1)p/2}n^{np/2}} \cdot \frac{ \left| {\varvec{W}}_1 \right| ^{n(u-1)/2} \left| {\varvec{W}}_2 \right| ^{n/2} }{\left| {\varvec{W}}_1 +{\varvec{W}}_2 \right| ^{nu/2}}. \end{aligned}$$
(3.13)

Letting

$$\begin{aligned} \lambda = \frac{ \left| {\varvec{W}}_1 \right| ^{n(u-1)/2} \left| {\varvec{W}}_2 \right| ^{n/2} }{\left| {\varvec{W}}_1 +{\varvec{W}}_2 \right| ^{nu/2}}, \end{aligned}$$

the h-th moment of \(\lambda \) is

$$\begin{aligned} E[\lambda ^h]= & {} \frac{\varGamma _p\left[ \frac{1}{2}(n-1)u\right] }{\varGamma _p\left[ \frac{1}{2}(n-1)u(1+h)\right] } \cdot \frac{\varGamma _p\left[ \frac{1}{2}(n-1)(u-1)(1+h)\right] }{\varGamma _p\left[ \frac{1}{2}(n-1)(u-1)\right] }\\&\cdot \frac{\varGamma _p\left[ \frac{1}{2}(n-1)(1+h)\right] }{\varGamma _p\left[ \frac{1}{2}(n-1)\right] } \end{aligned}$$

in the same way as in Section 10.4 of Anderson [1]. Since we can write the criterion as

$$\begin{aligned} \varLambda&= \left\{ \frac{nu}{n(u-1)}\right\} ^{\frac{1}{2}pn(u-1)} \left( \frac{nu}{n}\right) ^{\frac{1}{2}pn} \lambda = \left\{ \left( \frac{1}{k_1}\right) ^{k_1} \left( \frac{1}{k_2}\right) ^{k_2} \right\} ^{\frac{1}{2}pnu} \lambda , \end{aligned}$$

where \(k_1=(u-1)/u\) and \(k_2=1/u\), the h-th moment of \(\varLambda \) is as follows:

$$\begin{aligned} E[\varLambda ^h] = \left\{ \left( \frac{1}{k_1}\right) ^{k_1} \left( \frac{1}{k_2}\right) ^{k_2} \right\} ^{\frac{1}{2}pnuh} E[\lambda ^h]. \end{aligned}$$

Using the general theory of asymptotic expansions from Section 8.5 of Anderson [1], we have the modified LRC \(-2\rho \log \varLambda \), which converges to the chi-squared distribution more quickly than \(-2\log \varLambda \), where

$$\begin{aligned} \rho =1-\frac{u^2-u+1}{(n-1)u(u-1)}\cdot \frac{2p^2+3p-1}{6(p+1)}. \end{aligned}$$
(3.14)

The effect of this modification is confirmed in the simulation described in Sect. 4.
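A sketch of the modified criterion: multiplying \(-2\log \varLambda \) by \(\rho \) of (3.14) and referring the result to the chi-squared distribution with \(p(p+1)/2\) degrees of freedom. The helper below uses SciPy and is illustrative, not part of the paper.

```python
from scipy.stats import chi2

def modified_lrc_test(minus2_log_lambda, n, u, p):
    """Return (-2 rho log Lambda, asymptotic p-value) using the factor rho in (3.14)."""
    rho = 1 - (u**2 - u + 1) / ((n - 1) * u * (u - 1)) * (2 * p**2 + 3 * p - 1) / (6 * (p + 1))
    stat = rho * minus2_log_lambda
    df = p * (p + 1) // 2
    return stat, chi2.sf(stat, df)
```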

3.2 Wald Statistic

From Theorem 2.4, we can construct the Wald statistic. Since we have \({\varvec{\varDelta }}_1={\varvec{\varDelta }}_2={\varvec{\varSigma }}_0\) under the null hypothesis \(H_0\), the asymptotic covariance matrix is

$$\begin{aligned} {\varvec{\varPhi }}_1=\frac{1}{u(u-1)}\left( {\varvec{I}}_{p^2} + {\varvec{K}}_{p,p}\right) \left( {\varvec{\varSigma }}_0\otimes {\varvec{\varSigma }}_0\right) . \end{aligned}$$

Hence, we obtain the following theorem.

Theorem 3.1

When the null hypothesis \(H_0\) is true, the vector \((n-1)^{1/2}\text{ vec }\left( \tilde{{\varvec{\varSigma }}}_1\right) \) is asymptotically distributed as a \(p(p+1)/2\)-variate normal distribution with mean vector \({\varvec{0}}\) and covariance matrix \(\left( {\varvec{I}}_{p^2} + {\varvec{K}}_{p,p}\right) \left( {\varvec{\varSigma }}_0\otimes {\varvec{\varSigma }}_0\right) /\{u(u-1)\}.\)

Noting that

$$\begin{aligned} \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) ^{-} = \frac{1}{4} \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) , \quad \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) \text{ vec }\left( \tilde{{\varvec{\varSigma }}}_1\right) = 2 \text{ vec }\left( \tilde{{\varvec{\varSigma }}}_1\right) , \end{aligned}$$

using Theorem 3.1, the Wald statistic

$$\begin{aligned} W=\frac{(n-1)u(u-1)}{2}\text{ vec }'\left( \tilde{{\varvec{\varSigma }}}_1\right) \left( \tilde{{\varvec{\varSigma }}}_0^{-1}\otimes \tilde{{\varvec{\varSigma }}}_0^{-1}\right) \text{ vec }\left( \tilde{{\varvec{\varSigma }}}_1\right) \end{aligned}$$
(3.15)

is asymptotically distributed as a chi-squared distribution with \(p(p+1)/2\) degrees of freedom, where \({\varvec{A}}^{-}\) denotes the Moore–Penrose inverse matrix of \({\varvec{A}}\).
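A sketch of the Wald statistic (3.15) built from the unbiased estimators of Theorem 2.1; the \((n, u, p)\) data layout and the function name are our illustrative assumptions.

```python
import numpy as np

def wald_statistic(x):
    """Wald statistic W of (3.15) from data of shape (n, u, p)."""
    n, u, p = x.shape
    resid = x - x.mean(axis=0)
    c0 = np.einsum('rsi,rsj->ij', resid, resid)
    c1 = np.einsum('rsi,rtj->ij', resid, resid) - c0
    s0 = c0 / (u * (n - 1))                          # unbiased estimator of Sigma0
    s1 = c1 / (u * (u - 1) * (n - 1))                # unbiased estimator of Sigma1
    s0_inv = np.linalg.inv(s0)
    v = s1.reshape(-1)                               # vec of Sigma1 tilde
    return 0.5 * (n - 1) * u * (u - 1) * v @ np.kron(s0_inv, s0_inv) @ v
```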

3.3 Rao’s Score Statistic

Assuming the BCS covariance structure, the log-likelihood function (3.5) is represented as follows:

$$\begin{aligned} \log L(\bar{{\varvec{x}}}, {\varvec{\varSigma }}_*)&= -\frac{nup}{2}\log (2\pi ) -\frac{n}{2}\log \left| {\varvec{\varSigma }}\right| \nonumber \\&\qquad -\frac{1}{2}\mathrm{tr}\left[ \left( {\varvec{I}}_n\otimes {\varvec{I}}_u\otimes {\varvec{A}}\right) {\varvec{C}}\right] -\frac{1}{2}\mathrm{tr}\left\{ \left( {\varvec{I}}_n\otimes {\varvec{J}}_u\otimes {\varvec{B}}\right) {\varvec{C}}\right\} . \end{aligned}$$
(3.16)

Details are given in “Appendix C”, but the derivative of the log-likelihood function with respect to \({\varvec{\varSigma }}_1\) is

$$\begin{aligned} {\varvec{U}}({\varvec{\varDelta }}_1, {\varvec{\varDelta }}_2)&= \frac{\partial }{\partial {\varvec{\varSigma }}_1} \log L(\bar{{\varvec{x}}}, {\varvec{\varSigma }}_*) \nonumber \\&= -\frac{n(u-1)}{2}\left\{ {\varvec{\varDelta }}_1^{-1}\left( \hat{{\varvec{\varDelta }}}_1-{\varvec{\varDelta }}_1\right) {\varvec{\varDelta }}_1^{-1}\right\} \nonumber \\&\qquad +\frac{n(u-1)}{2} \left\{ {\varvec{\varDelta }}_2^{-1}\left( \hat{{\varvec{\varDelta }}}_2-{\varvec{\varDelta }}_2\right) {\varvec{\varDelta }}_2^{-1}\right\} . \end{aligned}$$
(3.17)

From this result, the information matrix is as follows:

$$\begin{aligned} {\varvec{I}}\left( {\varvec{\varDelta }}_1, {\varvec{\varDelta }}_2\right)&= E\left[ \text{ vec }\left( {\varvec{U}}({\varvec{\varDelta }}_1, {\varvec{\varDelta }}_2)\right) \text{ vec }'\left( {\varvec{U}}({\varvec{\varDelta }}_1, {\varvec{\varDelta }}_2)\right) \right] \nonumber \\&= \frac{(n-1)(u-1)}{4} \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) \left( {\varvec{\varDelta }}_1^{-1} \otimes {\varvec{\varDelta }}_1^{-1}\right) \nonumber \\&\quad + \frac{(n-1)(u-1)^2}{4} \left( {\varvec{I}}_{p^2}+{\varvec{K}}_{p,p}\right) \left( {\varvec{\varDelta }}_2^{-1} \otimes {\varvec{\varDelta }}_2^{-1}\right) . \end{aligned}$$
(3.18)

Let \(\check{{\varvec{\varDelta }}}_1\) and \(\check{{\varvec{\varDelta }}}_2\) be MLEs of \({\varvec{\varDelta }}_1\) and \({\varvec{\varDelta }}_2\), respectively, under the null hypothesis \(H_0\). When the null hypothesis \(H_0\) is true, we have

$$\begin{aligned} \check{{\varvec{\varDelta }}}_1=\check{{\varvec{\varDelta }}}_2=\hat{{\varvec{\varSigma }}}_0. \end{aligned}$$

Since the score \(\text{ vec }\left( {\varvec{U}}(\check{{\varvec{\varDelta }}}_1, \check{{\varvec{\varDelta }}}_2)\right) \) is

$$\begin{aligned} \text{ vec }\left( {\varvec{U}}(\check{{\varvec{\varDelta }}}_1, \check{{\varvec{\varDelta }}}_2)\right)&= \frac{nu(u-1)}{2} \left( \hat{{\varvec{\varSigma }}}_0^{-1} \otimes \hat{{\varvec{\varSigma }}}_0^{-1}\right) \text{ vec }\left( \hat{{\varvec{\varSigma }}}_1\right) , \end{aligned}$$
(3.19)

Rao’s score statistic is

$$\begin{aligned} S=\frac{nu(u-1)}{2} \text{ vec }'\left( \hat{{\varvec{\varSigma }}}_1\right) \left( \hat{{\varvec{\varSigma }}}_0^{-1} \otimes \hat{{\varvec{\varSigma }}}_0^{-1}\right) \text{ vec }\left( \hat{{\varvec{\varSigma }}}_1\right) . \end{aligned}$$
(3.20)

Using the score (3.19) under the null hypothesis \(H_0\), we find that the gradient statistic is the same as Rao’s score statistic.
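Rao's score statistic (3.20) has the same quadratic form as the Wald statistic but uses the maximum likelihood estimators and the factor \(nu(u-1)/2\); a sketch under the same illustrative \((n, u, p)\) data layout:

```python
import numpy as np

def score_statistic(x):
    """Rao's score statistic S of (3.20) from data of shape (n, u, p)."""
    n, u, p = x.shape
    resid = x - x.mean(axis=0)
    c0 = np.einsum('rsi,rsj->ij', resid, resid)
    c1 = np.einsum('rsi,rtj->ij', resid, resid) - c0
    s0 = c0 / (n * u)                                # MLE (3.7)
    s1 = c1 / (n * u * (u - 1))                      # MLE (3.8)
    s0_inv = np.linalg.inv(s0)
    v = s1.reshape(-1)
    return 0.5 * n * u * (u - 1) * v @ np.kron(s0_inv, s0_inv) @ v
```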

3.4 Bootstrap Test

Following Efron and Tibshirani [4], we perform the bootstrap test using the criteria \(-2\log \varLambda \), \(-2\rho \log \varLambda \), W, and S as follows:

(i) Calculate the mean vector \(\bar{{\varvec{x}}}\), the unbiased covariance matrix \(\tilde{{\varvec{\varSigma }}}_0\), and the criteria \(-2\log \varLambda _x\), \(-2\rho \log \varLambda _x\), \(W_x\), and \(S_x\) from the original sample \({\varvec{x}}\).

(ii) Form B bootstrap datasets \({\varvec{y}}\) of size n from the normal population \(N(\bar{{\varvec{x}}}, {\varvec{I}}_u\otimes \tilde{{\varvec{\varSigma }}}_0)\).

(iii) Evaluate the criteria \(-2\log \varLambda _y\), \(-2\rho \log \varLambda _y\), \(W_y\), and \(S_y\) from each dataset \({\varvec{y}}\).

(iv) Approximate an achieved significance level (ASL) as:

$$\begin{aligned} {\widehat{ASL}}_1&=\frac{\#\left\{ -2\log \varLambda _y>-2\log \varLambda _x\right\} }{B},\\ {\widehat{ASL}}_2&=\frac{\#\left\{ -2\rho \log \varLambda _y>-2\rho \log \varLambda _x\right\} }{B}, \\ {\widehat{ASL}}_3&=\frac{\#\left\{ W_y>W_x\right\} }{B}, \quad {\widehat{ASL}}_4 =\frac{\#\left\{ S_y>S_x\right\} }{B}. \end{aligned}$$

If the value of \({\widehat{ASL}}\) is less than the significance level \(\alpha \), we reject the null hypothesis.

We use the bootstrap test in our simulations because it can be applied to hypothesis testing with each of these statistics, and the ASLs of the bootstrap test are guaranteed to be accurate as the sample size becomes large. A code sketch of steps (i)-(iv) is given below.
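The following is a minimal sketch of steps (i)-(iv) for a single generic criterion; the `statistic` argument (e.g., the `wald_statistic` sketch from Sect. 3.2), the \((n, u, p)\) data layout, and the function names are our illustrative assumptions.

```python
import numpy as np

def bootstrap_asl(x, statistic, B=1000, seed=None):
    """Approximate the achieved significance level of `statistic` by steps (i)-(iv)."""
    rng = np.random.default_rng(seed)
    n, u, p = x.shape
    t_x = statistic(x)                               # step (i): criterion on the original sample
    xbar = x.mean(axis=0).reshape(-1)                # up-dimensional vector of site means
    resid = x - x.mean(axis=0)
    c0 = np.einsum('rsi,rsj->ij', resid, resid)
    sigma0_tilde = c0 / (u * (n - 1))
    cov_h0 = np.kron(np.eye(u), sigma0_tilde)        # I_u (x) Sigma0 tilde
    exceed = 0
    for _ in range(B):                               # steps (ii) and (iii)
        y = rng.multivariate_normal(xbar, cov_h0, size=n).reshape(n, u, p)
        if statistic(y) > t_x:
            exceed += 1
    return exceed / B                                # step (iv): ASL estimate
```

For example, `bootstrap_asl(x, wald_statistic)` would approximate \({\widehat{ASL}}_3\), assuming the `wald_statistic` sketch given earlier.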

4 Numerical Example

In this section, we investigate the accuracy of the test using the above criteria and apply them to real data. The simulation uses 100,000 samples.

4.1 Numerical Simulation

First, we investigate the accuracy of the significance level for the test using the criteria \(-2\log \varLambda \), \(-2\rho \log \varLambda \), W, and S under the null hypothesis. Letting

$$\begin{aligned} {\varvec{\varSigma }}_0=\sigma ^2\begin{pmatrix} 1 &{} \varrho &{} \cdots &{} \varrho ^{p-1} \\ \varrho &{}1 &{} \cdots &{} \varrho ^{p-2} \\ \vdots &{} \vdots &{} &{}\vdots \\ \varrho ^{p-1} &{} \varrho ^{p-2} &{} \cdots &{} 1 \end{pmatrix}, \end{aligned}$$
(4.1)

where \(\sigma =2\) and \(\varrho =0.5\), we set the population distribution such that the mean vector \({\varvec{\mu }}\) is the zero vector and the covariance matrix is

$$\begin{aligned} {\varvec{\varSigma }}={\varvec{I}}_{u}\otimes {\varvec{\varSigma }}_0. \end{aligned}$$
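For completeness, a sketch of this simulation design: \({\varvec{\varSigma }}_0\) as in (4.1) with \(\sigma =2\) and \(\varrho =0.5\), and samples drawn from the null-hypothesis population \(N_{up}({\varvec{0}}, {\varvec{I}}_u\otimes {\varvec{\varSigma }}_0)\); the function names are illustrative.

```python
import numpy as np

def ar1_sigma0(p, sigma=2.0, varrho=0.5):
    """Sigma0 of (4.1): sigma^2 * varrho^{|i-j|}."""
    idx = np.arange(p)
    return sigma**2 * varrho ** np.abs(idx[:, None] - idx[None, :])

def sample_under_null(n, u, p, seed=None):
    """Draw n observations of shape (u, p) from N(0, I_u (x) Sigma0)."""
    rng = np.random.default_rng(seed)
    cov = np.kron(np.eye(u), ar1_sigma0(p))
    x = rng.multivariate_normal(np.zeros(u * p), cov, size=n)
    return x.reshape(n, u, p)
```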

We change the dimension p and the number u of sites, and set the sample size n for each case. Table 1 presents the ASLs using the 95th percentile of the chi-squared distribution.

The results show that the ASLs of the likelihood ratio test and the modified likelihood ratio test are greater than 0.05, meaning that these tests fail to control the significance level. In contrast, the ASL of the Wald test is less than 0.05, and Rao's score test retains the significance level. We have found that the correction using \(\rho \) improves the convergence to the significance level.

Table 1 Achieved significance level of normal test for \(\alpha =0.05\)

We consider the bootstrap test using these test statistics because their ASLs are different.

Table 2 presents the ASL for the bootstrap test using these statistics for the significance level \(\alpha =0.05\). The number of bootstrap replications is 1000. The results show that the ASLs of the bootstrap test using \(-2\log \varLambda \), \(-2\rho \log \varLambda \), and W are greater than 0.05, and the ASL of the bootstrap test using S is less than 0.05. We have found that the bootstrap test is advantageous in terms of stabilizing the significance level. When the sample size is large, the bootstrap test using \(-2\rho \log \varLambda \) or W retains the significance level.

Table 2 Achieved significance level of bootstrap test for \(\alpha =0.05\)

Next, we investigate the power of the test in two cases. We set the sample size n, dimension p, and number u of sites as for the situation under the null hypothesis. Since the convergence of each statistic to the significance level is different, we cannot make a simple comparison of the powers of the test, but instead compare the powers of the bootstrap test by taking the convergence to the significance level into consideration. Let \({\varvec{\varSigma }}_0\) be as in (4.1), and consider Case 1: \({\varvec{\varSigma }}_1=\tau _1{\varvec{I}}_p\) and Case 2: \({\varvec{\varSigma }}_1=\tau _2{\varvec{1}}_p{\varvec{1}}_p'\). We set \(\tau _1\) and \(\tau _2\) as shown in the following table:

$$\begin{aligned} \begin{array}{cccc} &{} p=3 &{} p=5 &{} p=9 \\ \tau _1 &{} 0.8 &{} 0.3 &{} 0.15 \\ \tau _2 &{} 1.3 &{} 0.5 &{} 0.2 \end{array} \end{aligned}$$

Under the alternative hypothesis, the population covariance matrix is

$$\begin{aligned} {\varvec{\varSigma }}={\varvec{I}}_u\otimes ({\varvec{\varSigma }}_0-{\varvec{\varSigma }}_1)+{\varvec{J}}_u\otimes {\varvec{\varSigma }}_1. \end{aligned}$$

The upper part of Table 3 presents the powers of the test in Case 1. Since the criteria \(-2\log \varLambda \) and \(-2\rho \log \varLambda \) are essentially the same, the powers of the bootstrap test using these criteria are equal. When the dimension is high, the power of the bootstrap test using W is the largest, followed by the power of the bootstrap test using S. The powers of the modified likelihood ratio test are the largest when the dimension is low.

Table 3 Power of bootstrap test (Significance level: \(\alpha =0.05\))

The lower part of Table 3 presents the powers of the test in Case 2. The same tendencies as in Case 1 can be observed. The power of the bootstrap test using W is largest, followed by the power of the bootstrap test using S; the power of the bootstrap test using the modified LRC is the third largest when the dimension is high. The powers of the modified likelihood ratio test are largest when the dimension is low, but the powers of the bootstrap test using the modified LRC, W and S are almost the same in the case of a low dimension and large sample.

4.2 Example Using Real Data

We apply the hypothesis tests to real data taken from Johnson and Wichern [7]. To examine whether dietary supplements would slow bone loss in 25 older women, the mineral content of bones (radius, humerus, and ulna) was measured by photon absorptiometry. Measurements were recorded for three bones on the dominant and non-dominant sides, i.e., \(p = 3\) and \(u = 2\). Roy and Leiva [10] demonstrated that, for these data, the null hypothesis that the covariance structure is of BCS form is not rejected (p-value = 0.5786). The unbiased estimator for \({\varvec{\mu }}\) is

$$\begin{aligned} (0.8438,\ 1.7927,\ 0.7044,\ 0.8183,\ 1.7348,\ 0.6938)', \end{aligned}$$

and the unbiased estimators for \({\varvec{\varSigma }}_0\) and \({\varvec{\varSigma }}_1\) are

$$\begin{aligned} \tilde{{\varvec{\varSigma }}}_0= \begin{pmatrix} \ 0.0122\ &{}\ 0.0217\ &{}\ 0.0090\ \\ \ 0.0217\ &{}\ 0.0749\ &{}\ 0.0168\ \\ \ 0.0090\ &{}\ 0.0168\ &{}\ 0.0111\ \end{pmatrix}, \quad \tilde{{\varvec{\varSigma }}}_1= \begin{pmatrix} \ 0.0104\ &{}\ 0.0193\ &{}\ 0.0082\ \\ \ 0.0193\ &{}\ 0.0668\ &{}\ 0.0153\ \\ \ 0.0082\ &{}\ 0.0153\ &{}\ 0.0081\ \end{pmatrix}. \end{aligned}$$

The maximum likelihood estimators are

$$\begin{aligned} \hat{{\varvec{\varSigma }}}_0= \begin{pmatrix} \ 0.0117\ &{}\ 0.0209\ &{}\ 0.0087\ \\ \ 0.0209\ &{}\ 0.0719\ &{}\ 0.0161\ \\ \ 0.0087\ &{}\ 0.0161\ &{}\ 0.0106\ \end{pmatrix}, \quad \hat{{\varvec{\varSigma }}}_1= \begin{pmatrix} \ 0.0100\ &{}\ 0.0185\ &{}\ 0.0079\ \\ \ 0.0185\ &{}\ 0.0641\ &{}\ 0.0147\ \\ \ 0.0079\ &{}\ 0.0147\ &{}\ 0.0077\ \end{pmatrix}. \end{aligned}$$

Noting that \(\rho =0.9323\), the criteria are

$$\begin{aligned}&-2\log \varLambda =71.7279, \quad -2\rho \log \varLambda =66.8713,\\&W_1=38.3102, \quad W_2=24.8056, \quad S=39.9065. \end{aligned}$$

Since the upper 5% point of the chi-squared distribution with 6 degrees of freedom is 12.5916, we reject the null hypothesis \({\varvec{\varSigma }}_1={\varvec{O}}\) at the 0.05 significance level. We also applied the bootstrap test using the same criteria. The ASL values for each statistic are approximately 0.0000, so the conclusion is the same as that of the chi-squared-based test.

5 Conclusions

We have treated hypothesis testing for independence under the BCS covariance structure. The LRC, modified LRC, Wald statistic, and Rao’s score statistic have been derived. We have shown that the test using these statistics is effective in specific situations. In particular, we found that the bootstrap test is superior in terms of convergence to the significance level, that the power of the bootstrap test using the Wald statistic is largest when the dimension is high, that the power of the bootstrap test using the modified LRC is largest when the dimension is low, and that the power of the bootstrap test using the Wald statistic is the same as the power of the bootstrap test using the modified LRC when the dimension is low and the sample size is large. We recommend the bootstrap test using the Wald statistic.

Recently, high-dimensional multivariate analysis has been extensively studied (see Fujikoshi and Ulyanov [5] and Pourahmadi [9]). It may also be possible to study hypothesis testing for independence under the BCS covariance structure in high-dimensional situations (\(up>n\)). However, we cannot employ statistics based on the determinant, such as the LRC, because the matrices \({\varvec{W}}_i\) are singular under high-dimensional conditions. Thus, it is necessary to consider new test statistics based on the trace of \({\varvec{W}}_i\) for hypothesis testing, which is left as a future problem.