1 Introduction

We are interested in detecting possible differences between the distributions of real-valued random variables \(X_1, X_2, \ldots ,X_n\). In practice, this issue can be of primary importance for data from industrial quality control, financial markets, medical diagnostics, hydrology, climatology, etc.

This statistical matter is known as the change-point problem, whose theory, well developed for independent data, has been studied extensively in the literature from both parametric and non-parametric points of view. Since Page (1954), many tests have been proposed for testing changes in the distribution of iid data. Among others, Chernoff and Zacks (1964) propose test statistics for detecting shifts in the mean of a normal distribution function. Their results are generalized to the exponential family by Kander and Zacks (1966), Gardner (1969) and MacNeill (1974). Matthews et al. (1985) study maximal score statistics for testing a constant hazard against a change-point alternative. Haccou et al. (1988) propose a likelihood ratio test for a change-point in a sequence of independent exponentially distributed random variables, and prove that their test is optimal in the sense of Bahadur. Yao and Davis (1986) consider the asymptotic behavior of the likelihood ratio statistic for testing a shift in mean in a sequence of independent Gaussian random variables. Csörgő and Horváth (1987) propose statistics based on processes of linear rank statistics with quantile scores. A review of non-parametric procedures is given by Wolfe and Schechtman (1984), who summarize, among others, pioneering works on change-points such as Page (1954, 1955), Bhattacharya and Zhou (2017), Sen and Srivastava (1975) and Pettitt (1979).

In recent years there has been a growing interest in the study of change-points in time series data. Most of the techniques and approaches used are based either on testing for the existence of changes or for their locations, or on estimating the locations. Some relevant references on change-point estimation are Härdle and Tsybakov (1997), Härdle et al. (1998), Bardet and Wintenberger (2009), Döring (2010, 2011), Ciuperca (2011), Bardet and Kengne (2014), Amano (2012), Horváth and Hušková (2005), Yang and Song (2014), Mohr and Selk (2020) and Yang et al. (2020). A non-exhaustive list of references on testing approaches includes, among others, Kengne (2012), Chen et al. (2011), Dehling et al. (2013), Dehling et al. (2015), Wang and Phillips (2012), Francq and Zakoïan (2012), Bardet et al. (2012), Zhou (2014), Fotopoulos et al. (2009), Huh (2010), Enikeeva et al. (2018), Gombay (2008), Gombay and Serban (2009), Hlávka et al. (2020) and Ma et al. (2020), the last of which uses both estimation and testing. Meintanis (2016) gives an interesting survey of testing procedures based on the empirical characteristic function. We would also like to mention the interesting and related work of Rackauskas and Wendler (2020), who deal with a robust test based on the Wilcoxon statistic for detecting epidemic changes. While the asymptotic behavior of their test statistic is studied under the null hypothesis, with techniques close to ours, its consistency is only discussed.

With the possible exceptions of Fotopoulos et al. (2009), Khakhubia (1987), Bhattacharyya and Johnson (1968), Dehling et al. (2013), Dehling et al. (2017a) and Dehling et al. (2017b), the local power is rarely studied in the existing testing papers. This issue is considered in the present paper, where the tests studied are derived from basic processes of the general form

$$\begin{aligned} Z_n^*(\lambda )= n^{-3/2}\sum _{i=1}^{[n\lambda ]}\sum _{j=[n\lambda ]+1}^{n}h(X_{i},X_{j}), \ 0\le \lambda \le 1, \end{aligned}$$

with \(h:{\mathbb {R}}^{2}\rightarrow {\mathbb {R}}\) a kernel function.

The asymptotic distribution of a related process has been studied in a Hölder space by Rackauskas and Wendler (2020) for stationary mixing data and antisymmetric h, while this process has been studied in a Skorohod space, for instance, by Csörgő and Horváth (1988) for iid data, and by Dehling et al. (2015) for data assumed to be functions of mixing random variables satisfying some further conditions such as 1-approximating functionals, 1-sided functionals and bounded density functions, with kernels h being 1-continuous. These conditions are avoided here by our mixing assumption on the data, and our study is done in a Skorohod space. Furthermore, besides the Kolmogorov-Smirnov (KS) type test usually studied in the literature, we study a Cramér-von Mises (CM) version, which has the advantage that its theoretical limiting distribution under the null hypothesis can be approximated for any kernel h. We restrict our study to the classical case of one change-point detection, but our results can be generalized to multiple change-point detection, which we postpone to a future paper.

The paper is organized as follows. In Sect. 2, we define useful quantities such as the test statistics, and we list some assumptions. In Sect. 3 we study the asymptotic properties of our test statistics under the null hypothesis, under a sequence of local alternatives and under fixed alternatives. Practical considerations are presented and discussed in Sect. 4, while the last section contains the proofs of the main results.

2 General definitions and assumptions

For cumulative distribution functions Q and R, denote by \(\theta (Q,R)\) the following real number

$$\begin{aligned} \theta (Q,R)=\int \int h(x,y)dQ(x)dR(y). \end{aligned}$$

For any \(i=1,2, \ldots ,n\), let \(F_i\) be the cumulative distribution function of \(X_i\). We aim to check possible differences between the \(F_i\)’s. We restrict ourselves to checking if there exists only one index \(i_0\) for which \(F_{i_0}\) and \(F_{i_0+1}\) are different. We study this problem by testing the hypothesis \({\mathcal {H}}_0\) against the alternative \({\mathcal {H}}_1\), defined respectively by

$$\begin{aligned}&{\mathcal {H}}_0: F_1(x)=F_2(x)= \ldots =F_n(x), \ x\in {\mathbb {R}}\\&{\mathcal {H}}_1: \exists \ \lambda _0\in (0,1) \text{ such } \text{ that } F_1(x)=F_2(x)= \ldots =F_{[n\lambda _0]}(x)=F(x), x\in {\mathbb {R}}\ \text {and}\\&\quad F_{[n\lambda _0]+1}(x)= \ldots =F_n(x)=G(x), x\in {\mathbb {R}},\ \text {and}\ \theta (F,F)\ne \theta (F,G). \end{aligned}$$

Figure 1 exhibits the chronograms of some time series, each of size 200 and each with a change-point at \(t=100\). The first graphic in the first row shows a change in the mean of a shifted Gaussian white noise, while the one to its right shows a change in its variance. The first graphic in the second row shows a change in both the mean and the variance of a shifted Gaussian white noise, and the second shows a change in the autocorrelation of a Gaussian AR(1) model.

In order to evaluate the capacity of the tests to detect weak changes, we also consider the local alternatives \({\mathcal {H}}_{1,n}\) of the form

$$\begin{aligned}&{\mathcal {H}}_{1,n}: \exists \ \lambda _0\in (0,1) \text{ such } \text{ that } F_1(x)=F_2(x)= \ldots =F_{[n\lambda _0]}(x)=F(x),\ \text {and}\\&\quad F_{[n\lambda _0]+1}(x)= \ldots =F_n(x)=G^{(n)}(x), x\in {\mathbb {R}},\ \text {and}\\&\quad \theta (F,G^{(n)})= \theta (F,F) +n^{-1/2}[A+o(1)],\ \text {for some}\ A\in {\mathbb {R}}^*. \end{aligned}$$

Remark 1

Particular examples of local alternatives \({\mathcal {H}}_{1,n}\) are those for which there exists a constant \(\gamma \) such that \(G^{(n)}(x)=F(x+n^{-1/2}\gamma )\) and the kernel function h is twice differentiable with finite integral \(\int \int (\partial h(x,y)/\partial y)dF(x)dF(y)\) and bounded second-order derivative \(\partial ^2h(x,y)/\partial y^2\). This can be checked easily by a suitable application of the Taylor-Young formula. With this example, one finds

$$\begin{aligned} A=-\gamma \int \int {\partial h \over \partial u}(x,u)dF(x)dF(u). \end{aligned}$$
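
For the reader's convenience, here is a sketch of the Taylor-Young step leading to this value of A. If \(Y\sim G^{(n)}\), one can write \(Y=U-n^{-1/2}\gamma \) with \(U\sim F\), so that

$$\begin{aligned} \theta (F,G^{(n)})=\int \int h(x,u-n^{-1/2}\gamma )dF(x)dF(u) =\theta (F,F)-n^{-1/2}\gamma \int \int {\partial h \over \partial u}(x,u)dF(x)dF(u)+{\mathcal {O}}(n^{-1}), \end{aligned}$$

where the remainder is uniform because the second-order derivative is bounded; this is exactly \(\theta (F,F)+n^{-1/2}[A+o(1)]\) with the above A.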

For the purpose of solving our testing problem, the tests we use are based on the following KS and CM statistics

$$\begin{aligned} T_{1,n}&= \max _{1 \le k \le n-1}\left| n^{-3/2}\sum _{i=1}^k \sum _{j=k+1}^{n}\big \{h(X_{i},X_{j})-\theta ({{\widehat{F}}}, {{\widehat{F}}})\big \} \right| \end{aligned}$$
(1)
$$\begin{aligned} T_{2,n}&= {1 \over n}\sum _{1 \le k \le n-1}\left\{ n^{-3/2}\sum _{i=1}^k \sum _{j=k+1}^{n}\left[ h(X_{i},X_{j})-\theta ({{\widehat{F}}}, {{\widehat{F}}})\right] \right\} ^2, \end{aligned}$$
(2)

where \({{\widehat{F}}}\) stands for any consistent estimator of F, a simple example being the empirical cumulative distribution function.
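
For concreteness, the following Python sketch shows one possible way of computing the statistics (1) and (2) for a generic kernel h; the function names and the illustrative indicator kernel below are ours and are not part of the original text.

```python
import numpy as np

def theta_hat(x, kernel):
    """Plug-in estimate of theta(F, F) based on the empirical distribution:
    the average of kernel(x_i, x_j) over all ordered pairs (i, j)."""
    k = kernel(x[:, None], x[None, :])        # n x n matrix of kernel values
    return k.mean()

def ks_cm_statistics(x, kernel):
    """Compute the KS type statistic T_{1,n} (eq. 1) and the CM type
    statistic T_{2,n} (eq. 2) for a sample x and a kernel h."""
    n = len(x)
    k = kernel(x[:, None], x[None, :]) - theta_hat(x, kernel)  # centred kernel values
    col_cumsum = np.cumsum(k, axis=0)          # partial sums over i, for each j
    # s[c-1] = sum_{i<=c} sum_{j>c} of the centred kernel values, c = 1, ..., n-1
    s = np.array([col_cumsum[c - 1, c:].sum() for c in range(1, n)])
    z = n ** (-1.5) * s                        # values of the process Z_n on the grid k/n
    t1 = np.max(np.abs(z))                     # T_{1,n}
    t2 = (z ** 2).sum() / n                    # T_{2,n}
    return t1, t2

# Example with the "indicator" kernel h(x, y) = 1{x < y}
indicator = lambda x, y: (x < y).astype(float)
rng = np.random.default_rng(0)
print(ks_cm_statistics(rng.standard_normal(200), indicator))
```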

Denote by [x] the integer part of any real number x. Noting that for any \(k \in \{1, \ldots , n-1 \}\), there exists \(\lambda _* \in [0,1]\) such that \(k=[\lambda _*n]\), one can write, asymptotically,

$$\begin{aligned} T_{1,n}&= \sup _{\lambda \in [0,1]}\left| Z_n(\lambda )\right| \\ T_{2,n}&= \int _0^1 Z_n^2(\lambda ) d \lambda , \end{aligned}$$

where \(Z_n\) stands for the following stochastic process

$$\begin{aligned} Z_n(\lambda )= n^{-3/2}\sum _{i=1}^{[n\lambda ]}\sum _{j=[n\lambda ]+1}^{n}\left[ h(X_{i},X_{j})-\theta (F,F)\right] , \ 0\le \lambda \le 1. \end{aligned}$$
(3)

Define the following U-statistic \(U_n\) with kernel h, and the following functions

$$\begin{aligned}&U_n =\frac{2}{n(n - 1)}\sum _{1\le i< j\le n}h(X_i,X_j) \\&h_{F,1}(x)=\int h(x,y)dF(y)-\theta (F,F)\\&h_{F,2}(y)=\int h(x,y)dF(x)-\theta (F,F)\\&h_{G,1}(x)=\int h(x,y)dG(y)- \theta (F,G)\\&h_{G,2}(y)=\int h(x,y)dF(x)- \theta (F,G)\\&g_F(x,y)=h(x,y)-h_{F,1}(x)-h_{F,2}(y)-\theta (F,F)\\&g_G(x,y)=h(x,y)- h_{G,1}(x)- h_{G,2}(y)- \theta (F,G). \end{aligned}$$

Consider the Hoeffding decomposition of \(U_n\) under \({\mathcal {H}}_0\)

$$\begin{aligned} U_n = \theta (F,F) + U^{(1)}_{n,1} + U^{(1)}_{n,2} + U^{(2)}_n, \end{aligned}$$
(4)

where

$$\begin{aligned}&U^{(1)}_{n,1} = n^{-1}\sum ^n_{i=1}h_{F,1}(X_i)\\&U^{(1)}_{n,2} = n^{-1}\sum ^n_{i=1}h_{F,2}(X_i)\\&U^{(2)}_n = \frac{2}{n(n - 1)}\sum _{1\le i< j\le n}\left[ h(X_i,X_j) - h_{F,1}(X_i) - h_{F,2}(X_j)\right] - \theta (F,F). \end{aligned}$$

Also, define the following numbers

$$\begin{aligned} \sigma _{kl} = {\mathbb {E}}\left[ h_{F,k}(X_1)h_{F,l}(X_1)\right] + 2\sum _{j=1}^{\infty }{\mathbb {C}}\text{ ov } \left( h_{F,k}(X_1), h_{F,l}(X_{1+j})\right) , \ k, l = 1, 2. \end{aligned}$$

We make the following assumptions:

  1. (A1)

    The sequence \(\{{X}_i\}_{i\in {\mathbb {N}}}\) is absolutely regular with the rate

    $$\begin{aligned} \beta (k)={\mathcal {O}}(\tau ^k), \ \ 0<\tau <1. \end{aligned}$$
    (5)
  2. (A2)

    \(\{{X}_i\}_{i\in {\mathbb {N}}}\) is stationary.

  3. (A3)

    We consider \((Y_{i})_{1\le i\le n}\) a sequence of stationary and absolutely regular random variables with rate (5). We assume the cumulative distribution function of the \(Y_i\)’s is G.

  4. (A4)

    We consider \((Y_{ni})_{1\le i\le n, n\ge 1}\) a sequence of stationary and absolutely regular random variables with cumulative distribution function \(G^{(n)}(x)=F(x+\eta n^{-{1\over 2}})\). We assume the cumulative distribution functions \(G_{ij}^{(n)}\) and \(G_{ij}^{*(n)}\) of the \((Y_{ni}, Y_{nj})\)’s and \((X_i, Y_{nj})\)’s respectively satisfy

    $$\begin{aligned} \lim _{n \rightarrow \infty }G_{ij}^{(n)}(x, y)=F_{ij}(x, y) \text{ and } \lim _{n \rightarrow \infty }G_{ij}^{*(n)}(x, y)= F_{ij}(x, y), \ 1\le i<j \le n, \end{aligned}$$

    where \(F_{ij}\) is the cumulative distribution function of \((X_i,X_j)\).

    • We recall from Harel and Puri (1994) that a not necessarily stationary triangular sequence \(\{{\mathcal {V}}_{ni}, 1\le i \le n,n \ge 1\}\) is absolutely regular if \(\beta (k) \longrightarrow 0\), as \(k \rightarrow \infty \), where

      $$\begin{aligned} \beta (k)=\sup _{n\in {\mathbb {N}}}\sup _{k \le n}\max _{1\le j\le n-k} {\mathbb {E}}\left( \sup _{A\in {\mathcal {A}}_{n,j+k}^\infty }\left| {\mathbb {P}}(A\mid {\mathcal {A}}_{n,0}^j)-{\mathbb {P}}(A)\right| \right) , \end{aligned}$$

      with \({\mathcal {A}}_{n,i}^j\) standing for the \(\sigma \)-algebra generated by \({\mathcal {V}}_{ni},\ldots ,{\mathcal {V}}_{nj}, \ i, j \in {\mathbb {N}}\cup \{\infty \}\). It will be said to be strong mixing or \(\alpha \)-mixing if \(\alpha (k) \longrightarrow 0\) as \(k \rightarrow \infty \), where

      $$\begin{aligned} \alpha (k)=\sup _{n\in {\mathbb {N}}}\max _{1\le j\le n-k} \sup _{A\in {\mathcal {A}}_{n,j+k}^\infty , B \in {\mathcal {A}}_{n,0}^j}\left| {\mathbb {P}}(A \cap B)-{\mathbb {P}}(A) {\mathbb {P}}(B)\right| . \end{aligned}$$
    • Note that a \(\beta \)-mixing sequence of random variables is also \(\alpha \)-mixing.

Fig. 1

First row: change in the mean and change in the variance of a shifted white noise. Second row: change in both the mean and the variance of a shifted white noise, and change in the autocorrelation of an AR(1) model. The red lines represent the means. The green parts of the chronograms correspond to the observations before the change occurred, and the blue parts to those after the change occurred

Remark 2

We assume a geometric mixing rate for convenience. We believe that our results can also be established for arithmetic mixing rates, which remain to be determined.

3 Asymptotics

In this section, we state all our theoretical results. Most of the proofs are postponed to the last section.

Theorem 1

Assume that assumptions (A1)–(A3) hold. Under \({\mathcal {H}}_0\), if \(\max \{\sup _{i,j} {\mathbb {E}}\left[ |h(X_i, X_j)|^{2+\delta } \right] , \int \int _{{\mathbb {R}}^2}|h(x, y)|^{2+\delta }dF(x)dF(y) \} < \infty \) for some \(\delta > 0\) and the absolute regularity condition (5) is satisfied, then \(\sigma _{kl} < \infty \) for any \(k,l=1,2\).

If in addition \(\sigma _{kl} >0\), \(1\le k,l \le 2\), then the sequence of processes \(\{Z_{n}(\lambda ); 0\le \lambda \le 1\}_{n\in {\mathbb {N}}}\) converges in distribution towards a zero-mean Gaussian process with representation

$$\begin{aligned} Z(\lambda ) = (1 - \lambda )W_1(\lambda ) + \lambda (W_2(1) - W_2(\lambda )), \ 0 \le \lambda \le 1, \end{aligned}$$

where \(\{W_1(\lambda ), W_2(\lambda )\}_{0\le \lambda \le 1}\) is a two-dimensional zero-mean Brownian motion with covariance kernel matrix with entries \({\mathbb {C}}{ov}(W_k(s), W_l(t)) =\min (s, t)\sigma _{kl}, \ k,l=1,2.\)

Proof

See Appendix.

Remark 3

The covariance kernel of the Gaussian process Z defined in Theorem 1 is given for all \(s, t \in [0,1]\) by

$$\begin{aligned} \varDelta (s,t)&= {\mathbb {C}}\text{ ov }(Z(s), Z(t)) \nonumber \\&= \sigma _{11}(1-s)(1-t) \min (s, t)+\sigma _{22}st[1-s-t+\min (s, t)]\nonumber \\&\quad +\sigma _{12}\{t(1-s)[s-\min (s, t)]+s(1-t)[t-\min (s, t)]\}. \end{aligned}$$
(6)
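
As an illustration of how this covariance kernel can be used, here is a possible Monte-Carlo sketch (ours, not taken from the paper) that simulates the limit process Z on a grid through a Cholesky factorisation of (6), and thereby approximates the distributions of \(\sup _{\lambda }|Z(\lambda )|\) and \(\int _0^1 Z^2(\lambda )d\lambda \), which appear below as the limits of \(T_{1,n}\) and \(T_{2,n}\) under \({\mathcal {H}}_0\); the argument sigma stands for an estimate of the matrix \((\sigma _{kl})\).

```python
import numpy as np

def simulate_limit_statistics(sigma, n_grid=200, n_rep=5000, rng=None):
    """Simulate the limiting Gaussian process Z of Theorem 1 on a grid of (0, 1)
    using the covariance kernel (6), and return Monte-Carlo draws of
    sup |Z| and of the integral of Z^2 (approximated by a Riemann sum)."""
    rng = np.random.default_rng(0) if rng is None else rng
    lam = np.arange(1, n_grid) / n_grid                  # interior grid points
    s, t = np.meshgrid(lam, lam, indexing="ij")
    m = np.minimum(s, t)
    cov = (sigma[0, 0] * (1 - s) * (1 - t) * m
           + sigma[1, 1] * s * t * (1 - s - t + m)
           + sigma[0, 1] * (t * (1 - s) * (s - m) + s * (1 - t) * (t - m)))
    chol = np.linalg.cholesky(cov + 1e-10 * np.eye(len(lam)))   # small jitter for stability
    z = rng.standard_normal((n_rep, len(lam))) @ chol.T          # draws of Z on the grid
    sup_abs = np.abs(z).max(axis=1)                              # approximates sup |Z|
    int_sq = (z ** 2).mean(axis=1)                               # approximates int_0^1 Z^2
    return sup_abs, int_sq
```

Empirical quantiles of these draws then give approximate critical values for \(T_{1,n}\) and \(T_{2,n}\).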

Theorem 2

Assume (A1), (A2) and (A4) hold, h is twice differentiable with bounded second-order derivatives \(\partial ^{2}h(x,y)/\partial x\partial y\), and the integral \(\int \int (\partial h(x,y)/\partial y)dF(x)dF(y)\) is finite. Then, under \({\mathcal {H}}_{1,n}\), if

$$\begin{aligned}&\sup _{1\le i,j \le n}{\mathbb {E}}\left[ |h(X_i,X_j)|^{2+\delta } \right] , \ \ \sup _{n \ge 1}\sup _{i,j}{\mathbb {E}}\left[ |h(Y_{ni},Y_{nj})|^{2+\delta } \right] ,\\&\sup _{n \ge 1} \sup _{1\le i,j \le n}{\mathbb {E}}\left[ |h(X_i,Y_{nj})|^{2+\delta }\right] , \ \ \int \int _{{\mathbb {R}}^2}|h(x,y)|^{2+\delta }dF(x)dF(y),\\&\sup _{n \ge 1} \int \int _{{\mathbb {R}}^2}|h(x,y)|^{2+\delta }dG^{(n)}(x)dG^{(n)}(y), \ \ \sup _{n \ge 1}\int \int _{{\mathbb {R}}^2}|h(x,y)|^{2+\delta } dF(x)dG^{(n)}(y) \end{aligned}$$

are finite for some \(\delta > 0\), if for any \(k,l=1,2\), \(\sigma _{kl} >0\), then the sequence of processes \(\{Z_{n}(\lambda ); 0\le \lambda \le 1\}_{n\in {\mathbb {N}}}\) converges in distribution towards a Gaussian process \({{\widetilde{Z}}}\) with mean \((1-\lambda )\lambda A\) and representation

$$\begin{aligned} {{\widetilde{Z}}}(\lambda )=(1- \lambda )\lambda A+ Z(\lambda ), \ \ 0\le \lambda \le 1, \end{aligned}$$

where \(\{Z(\lambda )\}_{0\le \lambda \le 1}\) is the zero-mean Gaussian process defined in Theorem 1.

Proof

See Appendix. \(\square \)

Theorem 3

Assume (A1)–(A3) hold and that under \({\mathcal {H}}_1\), the integrability conditions in Theorem 2 hold. Then,

$$\begin{aligned} n^{-1/2}Z_n^*(t){\mathop {\longrightarrow }\limits ^{ a.s. }}_{n\rightarrow \infty } \left\{ \begin{array}{ll} \theta (F,F) t(\lambda _0-t)+\theta (F,G) t(1-\lambda _0), &{} 0\le t\le \lambda _0\\ \theta (G,G)(t-\lambda _0)(1-t)+\theta (F,G)\lambda _0(1-t), &{} \lambda _0\le t<1. \end{array} \right. \end{aligned}$$
(7)

Proof

See Appendix.

Theorem 4

Assume that the assumptions of Theorem 2 hold. Let \((Z(\lambda ) : 0 \le \lambda \le 1)\) be the limiting process defined in Theorems 1 and 2, and \(\varDelta \) its covariance kernel. Then

  1. (i)

    Under \({\mathcal {H}}_0\), as n tends to infinity, one has the following convergence in distribution,

    $$\begin{aligned}&T_{1,n} \longrightarrow \sup _{\lambda \in [0,1]}\left| Z(\lambda ) \right| \\&T_{2,n} \longrightarrow \sum _{j \ge 1} \zeta _j \chi _j^2, \end{aligned}$$

    where the \( \chi _j^{2}\)’s are iid chi-square random variables with one degree of freedom and the \(\zeta _j\)’s stand for the eigenvalues of the linear integral operator \(\nabla \) defined for any square integrable function \(\tau \) on [0, 1] by

    $$\begin{aligned} \nabla [\tau (\cdot )]= \int _0^1 \varDelta (\cdot ,s) \tau (s)ds. \end{aligned}$$
    (8)
  2. (ii)

    Under \({\mathcal {H}}_{1,n}\), as n tends to infinity, one has the following convergence in distribution,

    $$\begin{aligned}&T_{1,n} \longrightarrow \sup _{\lambda \in [0,1]}\left| (1-\lambda ) \lambda A+ Z(\lambda ) \right| \\&T_{2,n} \longrightarrow \sum _{j \ge 1} \zeta _j \chi _j^{*2}, \end{aligned}$$

    where the \( \chi _j^{*2}\)’s are iid non-central chi-square random variables with one degree of freedom and non-centrality parameters \(\rho _j^2\zeta _j^{-1}\), with the \(e_j\)’s standing for the eigenfunctions of the integral operator \(\nabla \) associated with the eigenvalue \(\zeta _j\), and

    $$\begin{aligned} \rho _j=A \int _0^1 \lambda (1-\lambda ) e_j(\lambda ) d \lambda . \end{aligned}$$
  3. (iii)

    Under \({\mathcal {H}}_1\), as n tends to infinity, one has the following convergence in probability,

    $$\begin{aligned} T_{1,n} \longrightarrow \infty , \ \ \ T_{2,n} \longrightarrow \infty . \end{aligned}$$

Proof

(i):

From Theorem 1 and the continuous mapping theorem, \(T_{1,n}\) and \(T_{2,n}\) converge in distribution respectively to \(\sup _{\lambda \in [0,1]} |Z(\lambda )|\) and \(\int _0^1 Z^2(\lambda )d\lambda \).

Now, we show that \(\int _0^1 Z^2(\lambda )d\lambda \) has the same distribution as a weighted sum of iid chi-square random variables with one degree of freedom. Noting that \(\varDelta \) is a Mercer kernel (this is easy to prove), it follows from Riesz and Nagy (1972) that the integral operator defined by (8) admits eigenvalues \(\zeta _1\ge \zeta _2\ge \ldots \ge 0\) with associated eigenfunctions \(e_1,e_2, \ldots \) forming an orthonormal basis of \(L^2[0,1]\), the set of square integrable functions on [0, 1]. From this result, it follows easily that the zero-mean Gaussian process Z, viewed as a random element of \(L^2[0,1]\), has the Karhunen-Loève representation

$$\begin{aligned} Z(\lambda )=\sum _{j \ge 1} N_j e_j(\lambda ), \lambda \in [0,1], \end{aligned}$$

with the independent random variables \(N_j\) defined as \(N_j=\int _{[0,1]}Z(\lambda )e_j(\lambda )d \lambda \sim {\mathcal {N}}(0, \zeta _j)\). One easily deduces from this that, in distribution,

$$\begin{aligned} \int _0^1 Z^2(\lambda )d\lambda =\sum _{j \ge 1} \zeta _j \chi _j^2, \end{aligned}$$

where the \(\chi _j^2\)’s are iid chi-square random variables with one degree of freedom.

(ii):

From Theorem 2 and the continuous mapping theorem, \(T_{1,n}\) and \(T_{2,n}\) converge in distribution respectively to \(\sup _{\lambda \in [0,1]} |(1-\lambda ) \lambda A+Z(\lambda )|\) and \(\int _0^1 |(1-\lambda ) \lambda A+Z(\lambda )|^2 d\lambda \).

For the same reasons as above, one has the decomposition

$$\begin{aligned} (1-\lambda ) \lambda A+Z(\lambda )=\sum _{j \ge 1} {{\widetilde{N}}}_j e_j(\lambda ), \lambda \in [0,1], \end{aligned}$$

with the independent random variables \({{\widetilde{N}}}_j\) defined as \({{\widetilde{N}}}_j=\int _0^1\left[ (1-\lambda )\lambda A+Z(\lambda )\right] e_j(\lambda )d \lambda \sim {\mathcal {N}}(\rho _j, \zeta _j)\).

It follows from this that, in distribution,

$$\begin{aligned} \int _0^1 | (1-\lambda ) \lambda A+Z(\lambda )|^2d\lambda =\sum _{j \ge 1} \zeta _j \chi _j^{*2}, \end{aligned}$$

where the \(\chi _j^{*2}\)’s are non-central iid chi-square random variables with one degree of freedom and non-centrality parameter \(\rho _j^2 \zeta _j^{-1}\).

(iii):

The last part is an easy consequence of Theorem 3.

\(\square \)

Define \(\sigma ^2\) by

$$\begin{aligned} \sigma ^2={\mathbb {V}}\text{ ar }(h_{F,1}(X_1))+2 \sum _{j\ge 1} {\mathbb {C}}\text{ ov } (h_{F,1}(X_1),h_{F,1}(X_{1+j})). \end{aligned}$$

Corollary 1

Assume that the assumptions of Theorem 2 hold, and that h is such that its associated \(h_{F,1}\) and \(h_{F,2}\) satisfy \(h_{F,1}(x)=-h_{F,2}(x)\). Then

  1. (i)

    Under \(H_0\), as n tends to infinity, one has the following convergences in distribution

    $$\begin{aligned}&T_{1,n} \longrightarrow \sigma \sup _{\lambda \in [0,1]}\left| W^0(\lambda ) \right| \\&T_{2,n} \longrightarrow \sigma ^2 \sum _{j \ge 1} {1 \over j^2 \pi ^2} \chi _j^2. \end{aligned}$$
  2. (ii)

    Under \(H_{1,n}\), as n tends to infinity, one has the following convergences in distribution

    $$\begin{aligned}&T_{1,n} \longrightarrow \sup _{\lambda \in [0,1]}\left| (1-\lambda ) \lambda A+ \sigma W^0(\lambda ) \right| \\&T_{2,n} \longrightarrow \sum _{j \ge 1} {1 \over j^2 \pi ^2} \chi _j^{*2}, \end{aligned}$$

where \(W^0\) is the Brownian bridge on [0, 1], and the \( \chi _j^{2}\)’s and \( \chi _j^{*2}\)’s are as in Theorem 4, but with non-centrality parameters \(2A^2 \left\{ {2[1-(-1)^j] / j\pi } \right\} ^2 \sigma ^{-2}\).

Proof

i- If h is such that its associated \(h_{F,1}\) and \(h_{F,2}\) satisfy \(h_{F,1}(x)=-h_{F,2}(x)\), then from the proof of Theorem 1 one sees that \(W_1(\lambda )=-W_2(\lambda )\). Whence, the representation of the limit process Z in that theorem reduces to

$$\begin{aligned} Z(\lambda )=W_1(\lambda )-\lambda W_1(1), \ \lambda \in [0,1], \end{aligned}$$

where for any \( \lambda \in [0,1]\), \(W_1(\lambda )= \sigma W^0(\lambda )\), with \(W^0(\lambda )\) standing for the Brownian bridge on [0, 1]. Thus, again from the continuous mapping theorem, one has that \(T_{1,n}\) and \(T_{2,n}\) converge in distribution respectively to \(\sup _{\lambda \in [0,1]} |\sigma W^0(\lambda ) |\) and \(\sigma ^2 \int _0^1| W^0(\lambda )|^2d\lambda \). Using the Karhunen-Loève expansion of the Brownian bridge given e.g. in Shorack and Wellner (1986) or Pycke (2001), one has \(\zeta _j={1 \over j^2 \pi ^2}\) and \(e_j(\lambda )=\sqrt{2} \sin (j \pi \lambda )\), \(j \ge 1\), and the convergence in distribution of \(T_{2,n}\) stated in the corollary follows by elementary computations.

ii- From Theorem 4, \(T_{1,n}\) and \(T_{2,n}\) converge in distribution respectively to \(\sup _{\lambda \in [0,1]} |(1-\lambda ) \lambda A+\sigma W^0(\lambda )|\) and \(\int _0^1 |(1-\lambda ) \lambda A+\sigma W^0(\lambda )|^2 d\lambda \).

The convergence stated for \(T_{2,n}\) in the corollary follows from the Karhunen-Loève expansion of the Gaussian process \((1-\lambda ) \lambda A+\sigma W^0(\lambda )\), as sketched in Sect. 4, and for which more details can be found in Ngatchou-Wandji (2009).

Remark 4

It is easy to check that anti-symmetric kernels h are such that their associated \(h_{F,1}\) and \(h_{F,2}\) satisfy the property \(h_{F,1}(x)=-h_{F,2}(x)\).

Remark 5

In the context of Corollary 1, under \({\mathcal {H}}_0\), the asymptotic distributions of \(T_{1,n}\) and \(T_{2,n}\) can be approximated, as done in Sect. 4. Under \({\mathcal {H}}_{1,n}\), it is not easy to do this for the asymptotic distribution of \(T_{1,n}\). This is a serious disadvantage compared with \(T_{2,n}\), whose asymptotic distribution under \({\mathcal {H}}_0\) as well as under \({\mathcal {H}}_{1,n}\) can be approximated, even for more general kernels h, by using Theorem 4 and proceeding in the way fully described in Ngatchou-Wandji (2009).

4 Practical considerations

4.1 Numerical simulations

Here, we apply our results to detecting a change in the mean and/or in the variance and/or in the correlation of data from some simple models. Concretely, we check the difference between the distributions of the observations only by checking the differences between their means and/or variances and/or autocorrelations. For this purpose, we consider the kernels \(h(x,y)= \mathrm {1\!l}(x<y)\) and \(h(x,y)= x-y\). For \(h(x,y)= \mathrm {1\!l}(x<y)\), simple computations show that \(\theta (F,F)=1/2\) and \(h_{F,1}(x)=1/2-F(x)=-h_{F,2}(x)\). For \(h(x,y)= x-y\), it is a trivial matter that \(\theta (F,F)=0\) and that, by Remark 4, \(h_{F,1}(x)=-h_{F,2}(x)\) since h is anti-symmetric. Consequently, Corollary 1 holds for these two kernels, whose corresponding \(\sigma ^2\) are respectively

$$\begin{aligned} \sigma _1^2=\sigma ^2= {\mathbb {V}}\text{ ar }\left[ F(X_1) \right] +2 \sum _{j\ge 1} {\mathbb {C}}\text{ ov }(F(X_1), F(X_{1+j})) \end{aligned}$$

and

$$\begin{aligned} \sigma _2^2=\sigma ^2= {\mathbb {V}}\text{ ar }\left( X_1 \right) +2 \sum _{j\ge 1} {\mathbb {C}}\text{ ov }(X_1, X_{1+j}). \end{aligned}$$
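
When the observations are dependent, \(\sigma _1^2\) and \(\sigma _2^2\) involve infinite series of autocovariances and have to be estimated. The paper does not prescribe a particular estimator; purely as an illustration, the following sketch uses a Bartlett-kernel (Newey-West type) long-run variance estimate with an arbitrary bandwidth rule, and is not taken from the original text.

```python
import numpy as np

def long_run_variance(u, bandwidth=None):
    """Bartlett-kernel estimate of Var(u_1) + 2 * sum_{j>=1} Cov(u_1, u_{1+j})
    for a stationary sequence u; the bandwidth rule n**(1/3) is arbitrary."""
    u = np.asarray(u, dtype=float)
    n = len(u)
    u = u - u.mean()
    if bandwidth is None:
        bandwidth = int(n ** (1 / 3))
    lrv = np.dot(u, u) / n                                  # lag-0 autocovariance
    for j in range(1, bandwidth + 1):
        gamma_j = np.dot(u[:-j], u[j:]) / n                 # lag-j autocovariance
        lrv += 2 * (1 - j / (bandwidth + 1)) * gamma_j
    return lrv

def sigma1_sq(x):
    """sigma_1^2 uses F(X_i); here F is replaced by the empirical cdf (ranks / n)."""
    ranks = np.argsort(np.argsort(x)) + 1
    return long_run_variance(ranks / len(x))

def sigma2_sq(x):
    """sigma_2^2 uses the raw observations themselves."""
    return long_run_variance(x)
```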

We sampled 1000 sets of \(n=200\) data \(X_1, X_2, \ldots , X_n\) from the model

$$\begin{aligned} X_i= \left\{ \begin{array}{cc} \varepsilon _i &{} i=1, \dots , 100\\ \mu +\rho X_{i-1}+\omega \varepsilon _i&{} i=101, \dots , 200, \end{array} \right. \end{aligned}$$
(9)

where \(\mu \) is a real number, \(\omega \) is a positive number, the \(\varepsilon _i\)’s are iid and for all \(i=1, \ldots , 200\), \(\varepsilon _i \sim {\mathcal {N}}(0,1)\), or \(\varepsilon _i \sim {\mathcal {T}}(3)\) (Student distribution with 3 degrees of freedom), or \(\varepsilon _i = {\mathcal {E}}_i-1\) with \({\mathcal {E}}_i \sim {\mathcal {E}}(1)\) (\({\mathcal {E}}(1)\) exponential distribution with parameter 1).
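
As a minimal illustration, one possible way to draw a sample from model (9) in Python is sketched below; the function name and its defaults are ours.

```python
import numpy as np

def simulate_model_9(mu=0.0, rho=0.0, omega=1.0, n=200, change=100,
                     noise="gaussian", rng=None):
    """Draw one sample of size n from model (9): white noise up to the
    change-point, then an AR(1)-type recursion with intercept mu,
    coefficient rho and scale omega."""
    rng = np.random.default_rng() if rng is None else rng
    if noise == "gaussian":
        eps = rng.standard_normal(n)
    elif noise == "student3":
        eps = rng.standard_t(df=3, size=n)
    else:                                     # centred exponential noise
        eps = rng.exponential(1.0, size=n) - 1.0
    x = np.empty(n)
    x[:change] = eps[:change]                 # X_i = eps_i before the change
    for i in range(change, n):                # after the change
        x[i] = mu + rho * x[i - 1] + omega * eps[i]
    return x
```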

We first apply our Kolmogorov-Smirnov and Cramér-von Mises type tests to testing \(\mu =0\) against \(\mu \ne 0\) for \(\omega =1\) and \(\rho =0\) (testing a change in the mean of a shifted white noise). Next, we apply the two tests to testing \(\omega =1\) against \(\omega \ne 1\) for \(\mu =0\) and \(\rho =0\) (testing change in the variance of a white noise). Finally, we consider testing \(\rho =0\) against \(\rho \ne 0\) for \(\mu =0\) and \(\omega =1\) (testing a change in the autocorrelation of an AR(1) model).

For \(j=1,2\) let \(c_{\alpha j}\) be the critical value, at level of significance \( \alpha \in [0,1]\), of the test based on \(T_{j,n}\). Then the empirical power of the test can be computed as the ratio of the number of samples for which \(T_{j,n} > c_{\alpha j}\) over the number of replications (taken here to be 1000). However, the critical values of our tests are difficult to compute. For this reason, as in Ngatchou-Wandji (2009), we use the p-value method as follows: instead of counting the number of samples for which \(T_{j,n} > c_{\alpha j}\), we rather count the number of samples for which the p-value is less than \(\alpha \). Denoting by \({{\widehat{\sigma }}}_j\) an estimator of \(\sigma _j\) when this quantity is unknown, from Corollary 1, the p-values of our tests are given respectively by

$$\begin{aligned} p_1={\mathbb {P}}\left( \sup _{\lambda \in [0,1]}|W^0(\lambda )| > {T_{1,n} \over {{\widehat{\sigma }}}_1} \right) \end{aligned}$$

and

$$\begin{aligned} p_2={\mathbb {P}}\left( \sum _{j \ge 1} {1 \over j^2 \pi ^2} \chi _j^2 > {T_{2,n} \over {{\widehat{\sigma }}}_2^2} \right) , \end{aligned}$$

where we recall that \(W^0\) is the Brownian bridge on [0, 1] and the \(\chi _j^2\)’s are iid chi-square random variables with one degree of freedom. Note that for iid \(X_i\)’s the covariance terms in the expressions of \(\sigma ^2\) vanish, so that under \({\mathcal {H}}_0\), for \(h(x,y)= \mathrm {1\!l}(x<y)\), \(\sigma ^2={\mathbb {V}}\text{ ar }[F(X_1)]=1/12\) and need not be estimated, while for \(h(x,y)=x-y\), \(\sigma ^2={\mathbb {V}}\text{ ar }(X_1)\), which can be estimated by the sample variance, as done in the trials.

From Billingsley (1999) p. 103, one has

$$\begin{aligned} p_1=-2 \sum _{j=1}^{\infty } (-1)^j \exp \left[ -2j^2 \left( {T_{1,n} \over {{\widehat{\sigma }}}_1}\right) ^2 \right] . \end{aligned}$$

Truncating the sum to its most significant terms yields an approximation of \(p_1\). Likewise, after truncating the sum in the expression of \(p_2\), many well-known results can be used to approximate \(p_2\); here, we use those of Imhof (1961).
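
As an illustration, here is one possible numerical sketch of both approximations; the truncation orders are arbitrary, and for \(p_2\) we use a simple Monte-Carlo approximation of the truncated weighted chi-square series rather than Imhof's (1961) exact inversion.

```python
import numpy as np

def p_value_ks(t1, sigma1_hat, n_terms=100):
    """Truncated alternating series for p_1 = P(sup |W0| > t1 / sigma1_hat)."""
    x = t1 / sigma1_hat
    j = np.arange(1, n_terms + 1)
    p = -2.0 * np.sum((-1.0) ** j * np.exp(-2.0 * j ** 2 * x ** 2))
    return float(min(1.0, max(0.0, p)))        # clip to [0, 1] for very small x

def p_value_cm(t2, sigma2_hat_sq, n_terms=100, n_mc=50_000, rng=None):
    """Monte-Carlo approximation of p_2 = P(sum_j chi_j^2 / (j pi)^2 > t2 / sigma2_hat_sq),
    with the weighted chi-square series truncated after n_terms terms."""
    rng = np.random.default_rng(0) if rng is None else rng
    weights = 1.0 / (np.arange(1, n_terms + 1) ** 2 * np.pi ** 2)
    chi2 = rng.chisquare(df=1, size=(n_mc, n_terms))
    series = chi2 @ weights                    # Monte-Carlo draws of the truncated series
    return float(np.mean(series > t2 / sigma2_hat_sq))
```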

In all the graphics below, the green color represents the level of significance of the test. In Fig. 2, the blue color represents the empirical power of the test based on \(T_{1,n}\), while the red one represents that of the test based on \(T_{2,n}\). The upper graphics in Fig. 2 display the empirical power functions of both tests as functions of \(\mu \in \{0, 0.1, 0.2, \ldots , 1.1\}\), for observations from (9) with \(\rho =0\), \(\omega =1\) and standard Gaussian noises. The first graphic corresponds to the kernel \(h(x,y)= \mathrm {1\!l}(x<y)\), while the second corresponds to the kernel \(h(x,y)= x-y\). On both graphics, one can observe that at \(\mu =0\) the test based on \(T_{2,n}\) estimates the nominal level of the test more accurately than the one based on \(T_{1,n}\). Moreover, it has larger power for smaller values of the mean.

The lower graphics in Fig. 2 display the power functions for the same kernels, for data from (9) with \(\rho =0\) and \(\omega =1\), with Student noise with 3 degrees of freedom and centered exponential noise (from the exponential distribution with parameter 1), respectively. One can see from these graphics that the powers grow a bit more slowly than those in the upper graphics.

The upper graphics in Fig. 3 present the power functions of both tests as functions of \(\omega ^2 \in \{1, 1.2, 1.4, \ldots , 2, 2.2, \ldots , 2.8 \}\), for observations from (9) with \(\mu =0\) and \(\rho =0\). From these graphics, one sees that for the kernel \(h(x,y)= \mathrm {1\!l}(x<y)\) neither test is sensitive to a change in the variance, while both are for \(h(x,y)= x-y\). Here, in contrast to the previous situations, the test based on \(T_{1,n}\) does better than the one based on \(T_{2,n}\).

The lower graphics in Fig. 3 exhibit the power functions of both tests as functions of \(\rho \in \{0,.05,.1,.2,.3,.4,.5,.6,.7,.8,.85,.9,.95\}\), for observations from (9) with \(\mu =0\) and \(\omega =1\). One can see that for the kernels \(h(x,y)= \mathrm {1\!l}(x<y)\) and \(h(x,y)= x-y\), both tests are able to detect a change in the autocorrelation. In the vicinity of the null hypothesis the test based on \(T_{2,n}\) does better than the one based on \(T_{1,n}\), while far from this hypothesis, the test based on \(T_{1,n}\) does better than that based on \(T_{2,n}\).

We carried out many other trials. In particular, we studied the detection of a change in the mean of a shifted Student white noise with 3, 4 and 5 degrees of freedom (\({\mathcal {T}}(j), j=3,4,5\)) associated with \(h(x,y)= x-y\). Our tests did not detect a mean change in the \({\mathcal {T}}(3)\) case. This is likely due to the fact that, with this kernel, the theoretical results assume the existence of moments of order larger than 2, which is not the case for \({\mathcal {T}}(3)\). For the \({\mathcal {T}}(4)\) and \({\mathcal {T}}(5)\) cases, a mean change was detected. We do not present these results as they were very similar to those already presented.

Fig. 2

Empirical power of the CM test (red), empirical power of the KS test (blue), nominal level 5% (green). First row: change in the mean of a shifted Gaussian white noise, respectively with the "indicator" and "difference" kernels. Second row: change in the mean of a shifted Student white noise with the "indicator" kernel, and change in the mean of a shifted centered exponential white noise with the "difference" kernel

Fig. 3

Empirical power of the CM test (red), empirical power of the KS test (blue), nominal level 5% (green). First row: change in the variance of a shifted Gaussian white noise, respectively with the "indicator" and "difference" kernels. Second row: change in the correlation of an AR(1) model, respectively with the "indicator" and "difference" kernels

4.2 Concluding remarks

The main theoretical results in this paper are similar to those of Dehling et al. (2015), but they are established for more general kernels. In addition to the traditional Kolmogorov-Smirnov (KS) type test used for change-point detection, a Cramér-von Mises (CM) type test is studied. For the kernels and the data considered, the CM test seems to have better power properties than the KS test for detecting small changes in the mean of a shifted Gaussian white noise. For the kernel \(h(x,y)=\mathrm {1\!l}(x<y)\), neither test is sensitive to a change in the variance of the observations studied. This may be explained by the fact that this function involves the ranks of the observations rather than the observations themselves. Furthermore, the corresponding tests are associated with uniform random variables (\(F(X_j) \sim {\mathcal {U}}[0,1]\)) through \(\sigma ^2\). This is not the case for the tests based on \(h(x,y)=x-y\), which involves the observations directly and is related to their variance through \(\sigma ^2\), and for which a change in the variance can be detected by the tests studied here (with an advantage for KS). Our study also shows that, for each of these kernels, our tests are able to detect changes in the autocorrelation of an AR(1) model. The KS test does better far from the null hypothesis, while the CM test does better in the vicinity of the null hypothesis. However, the results seem to show that the tests are better suited to detecting changes in the mean than changes in the variance or the autocorrelation. Indeed, when testing the mean, the power of the tests grows quickly to 1 as one moves away from the null hypothesis.