Higher-degree eigenvalue complementarity problems for tensors

Ling, Chen; He, Hongjin; Qi, Liqun

doi:10.1007/s10589-015-9805-x

Higher-degree eigenvalue complementarity problems for tensors

Published: 03 November 2015

Volume 64, pages 149–176, (2016)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Computational Optimization and Applications Aims and scope Submit manuscript

Higher-degree eigenvalue complementarity problems for tensors

Download PDF

Chen Ling¹,
Hongjin He¹ &
Liqun Qi²

491 Accesses
24 Citations
Explore all metrics

Abstract

In this paper, we introduce a unified framework of Tensor Higher-Degree Eigenvalue Complementarity Problem (THDEiCP), which goes beyond the framework of the typical Quadratic Eigenvalue Complementarity Problem for matrices. First, we study some topological properties of higher-degree cone eigenvalues of tensors. Based upon the symmetry assumptions on the underlying tensors, we then reformulate THDEiCP as a weakly coupled homogeneous polynomial optimization problem, which might be greatly helpful for designing implementable algorithms to solve the problem under consideration numerically. As more general theoretical results, we present the results concerning existence of solutions of THDEiCP without symmetry conditions. Finally, we propose an easily implementable algorithm to solve THDEiCP, and report some computational results.

On the cone eigenvalue complementarity problem for higher-order tensors

Article 06 June 2015

Tensor Z-eigenvalue complementarity problems

Article 03 January 2021

A Class of Second-Order Cone Eigenvalue Complementarity Problems for Higher-Order Tensors

Article 26 October 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Let A, B, $C\in {\mathbb {R}}^{n\times n}$ be given matrices, the generic Quadratic Eigenvalue Complementarity Problem (QEiCP) widely studied in recent literature, e.g., see [6, 14, 38], is that of finding $(\lambda ,x)\in {\mathbb {R}}\times {\mathbb {R}}^n$ such that

$$\begin{aligned} \left\{ \begin{array}{l} K\ni x \bot (\lambda ^2A+\lambda B+C)x\in K^*,\\ e^\top x=1, \end{array} \right. \end{aligned}$$

(1.1)

where $e=(1,1,\ldots , 1)^\top \in {\mathbb {R}}^n$, K is a closed convex cone in ${\mathbb {R}}^n$, and $K^*$ refers to the dual cone of K, which is defined by

$$\begin{aligned} K^*:=\left\{ y\in {\mathbb {R}}^n~|~\langle y,x\rangle \ge 0,~\forall ~x\in K\right\} . \end{aligned}$$

The linear constraint $e^\top x=1$ in (1.1) plays an important role in preventing the x component of a solution to vanish. Notice that the leading matrix A could be singular, and in particular, the QEiCP immediately reduces to the classical Eigenvalue Complementarity Problem (EiCP) when $A=0$. Clearly, QEiCP is an interesting generalization of the classical EiCP, embracing an extra quadratic term on $\lambda $. When K is taken as the whole space ${\mathbb {R}}^n$, then (1.1) becomes the well studied unconstrained quadratic eigenvalue problem, which frequently arises in areas such as the electric power systems [28], the dynamic analysis of acoustic systems [3] and linear stability of flows in fluid mechanics [27], to name a few. We refer the reader to [41] for a survey on the unconstrained version. If the matrices A, B, C are all symmetric, QEiCP and EiCP are called symmetric, respectively.

Since the seminal work on EiCP [10] devoted to the study of static equilibrium states of mechanical systems with unilateral friction, both EiCP and QEiCP have been well discussed from theoretical and numerical perspective in the literature, e.g., see [1, 11, 14–16, 22–24, 38], where these papers only focus on matrix cases. For instance, in order to study the sufficient conditions for the existence of solutions of QEiCP, the so-called co-regularity and co-hiperbolicity properties were introduced in Ref. [38]. Usually, the co-regularity on matrix $A\in {\mathbb {R}}^{n\times n}$, i.e., $x^\top Ax\ne 0$ for any $x\in K$, means that A or $-A$ is strictly K-positive. We say that $(A,B,C)\in {\mathcal M}_n:={\mathbb {R}}^{n\times n}\times {\mathbb {R}}^{n\times n}\times {\mathbb {R}}^{n\times n}$ satisfies co-hiperbolicity property, if

$$\begin{aligned} (x^\top Bx)^2\ge 4(x^\top Ax)(x^\top Cx),\quad \forall ~x\in K. \end{aligned}$$

However, checking whether a given matrices triplet $(A,B,C)\in {\mathcal M}_n$ satisfies co-hyperbolicity or not is co-NP-complete, which is essentially the verification problem of copositiveness (see Definition 2.1) of a related fourth order n-dimensional tensor [32, 39].

In recent decades, tensor, which is a natural extension of the concept of matrix, is on the timely topic of high-dimensional data representation in terms of theoretical analysis and algorithmic design because of its widespread applications in engineering. A tensor, namely, is a multidimensional array, whose order is the number of dimensions. Let m and n be positive integers. We call ${\mathcal {A}}= (a_{i_1\ldots i_m} )$, where $a_{i_1\ldots i_m} \in {\mathbb {R}}$ for $1\le i_1, \ldots , i_m\le n$, a real mth order n-dimensional real square tensor. For the sake of convenience, we denote by ${\mathcal {T}}_{m,n}$ the space of mth order n-dimensional real square tensors. Furthermore, a tensor ${\mathcal {A}}\in {\mathcal {T}}_{m,n}$ is called symmetric if its entries are invariant under any permutation of its indices. For a vector $x = (x_1, \ldots ,x_n)^\top \in {\mathbb {C}}^n$ and a tensor ${\mathcal {A}}= (a_{i_1\ldots i_m})\in {\mathcal {T}}_{m,n}$, ${\mathcal {A}}x^{m-1}$ is an n-dimensional vector with its ith component defined by

$$\begin{aligned} ({\mathcal {A}}x^{m-1})_i=\sum _{i_2,\ldots ,i_m=1}^na_{ii_2\ldots i_m}x_{i_2}\cdots x_{i_m},\quad {\mathrm{for}}\, i=1,2,\ldots ,n, \end{aligned}$$

and ${\mathcal {A}}x^m$ is the value at x of a homogeneous polynomial, defined by

$$\begin{aligned} {\mathcal {A}}x^{m}=\sum _{i_1,i_2,\ldots ,i_m=1}^na_{i_1i_2\ldots i_m}x_{i_1}x_{i_2}\cdots x_{i_m}. \end{aligned}$$

Although tensor-related problems have received considerable attention many years ago, the history of research on eigenvalues (eigenvectors) of a square tensor can be traced back to the pioneer works independently introduced by Qi [31] and Lim [25]. Comparatively speaking, the developments of eigenvalue-related problems for tensors are still in their infancy. For given tensors ${\mathcal {A}},{\mathcal {B}}\in {\mathcal {T}}_{m,n}$, we say that $({\mathcal {A}},{\mathcal {B}})$ is an identical singular pair, if

$$\begin{aligned} \left\{ x\in {\mathbb {C}}^n\backslash \{0\}~|~{\mathcal {A}}x^{m-1}=0,\;{\mathcal {B}}x^{m-1}=0\right\} \ne \emptyset . \end{aligned}$$

Assuming that $({\mathcal {A}},{\mathcal {B}})$ is not an identical singular pair, we say $(\lambda , x)\in {\mathbb {C}}\times ({\mathbb {C}}^n\backslash \{0\})$ is an eigenvalue-eigenvector pair of $({\mathcal {A}}, {\mathcal {B}})$, if the following n-system of equations

$$\begin{aligned} ({\mathcal {A}}-\lambda {\mathcal {B}})x^{m-1}=0 \end{aligned}$$

(1.2)

possesses a nonzero solution x. This unified definition of eigenvalue-eigenvector pair for tensors was introduced by Chang et al. [8]. In recent years, it is well documented in the literature that tensors and eigenvalues/eigenvectors of tensors have fruitful applications in various fields such as magnetic resonance imaging [4, 34], higher-order Markov chains [29] and best-rank one approximation in data analysis [33], whereby many nice properties such as the Perron–Frobenius theorem for eigenvalues/eigenvectors of nonnegative square tensors have been established, see, e.g., [7, 43]. All these encourage us to consider tensor eigenvalue complementarity problems. However, to the best of our knowledge, the most recent paper [26] is the first work devoted to the Tensor Generalized Eigenvalue Complementarity Problem (TGEiCP), thus leaving higher-degree cases a big gap. Therefore, the main objective of this paper is to fill out this gap.

In this paper, we consider the Tensor Higher-Degree Eigenvalue Complementarity Problem (THDEiCP), which goes beyond the framework of QEiCP and further generalizes TGEiCP. Mathematically, the THDEiCP can be characterized as finding a scalar $\lambda \in {\mathbb {R}}$ and a vector $x\in {\mathbb {R}}^n\backslash \{0\}$ such that

$$\begin{aligned} K\ni x\perp ( \lambda ^m{\mathcal {A}}+\lambda {\mathcal {B}}+{\mathcal {C}}) x^{m-1}\in K^*, \end{aligned}$$

(1.3)

where ${\mathcal {A}}=(a_{i_1i_2\ldots i_m})$, ${\mathcal {B}}=(b_{i_1i_2\ldots i_m})$, and ${\mathcal {C}}=(c_{i_1i_2\ldots i_m})\in {\mathcal {T}}_{m,n}$. Correspondingly, the scalar $\lambda $ and the nonzero vector x satisfying system (1.3) are respectively called an m-degree K-eigenvalue of the tensors triplet ${\mathcal {Q}}:=({\mathcal {A, B, C}})\in {\mathcal {F}}_{m,n}:={\mathcal {T}}_{m,n}\times {\mathcal {T}}_{m,n}\times {\mathcal {T}}_{m,n}$ and an associated K-eigenvector. Alternatively, $(\lambda , x)$ is also called an m-degree K-eigenpair of ${\mathcal {Q}}$. Throughout, the set of all m-degree K-eigenvalues of ${\mathcal {Q}}$ is called the m-degree K-spectrum of ${\mathcal {Q}}$, i.e.,

$$\begin{aligned} \sigma ({\mathcal {Q}},K):=\left\{ \lambda \in {\mathbb {R}}~|~\exists ~ x\in {\mathbb {R}}^n\backslash \{0\}, ~K\ni x\perp (\lambda ^m{\mathcal {A}} + \lambda {\mathcal {B}}+{\mathcal {C}} )x^{m-1}\in K^*\right\} . \end{aligned}$$

(1.4)

If $K:={\mathbb {R}}_+^n$, the m-degree K-eigenvalue/eigenvector of the tensors triplet ${\mathcal {Q}}$ is called the m-degree Pareto-eigenvalue/eigenvector of ${\mathcal {Q}}$, and the m-degree K-spectrum of ${\mathcal {Q}}$ is called the m-degree Pareto-spectrum of ${\mathcal {Q}}$. If $x\in {\mathrm{int}}(K)$ (resp. $x\in {\mathbb {R}}_{++}^n)$, then $\lambda $ is called a strict m-degree K-eigenvalue (resp. Pareto-eigenvalue) of ${\mathcal {Q}}$.

It is clear from (1.3) that THDEiCP covers TGEiCP and QEiCP as the special cases. More concretely, by taking ${\mathcal {A}}=0$, model (1.3) immediately reduces to TGEiCP studied in [26]. When we set $m=2$, THDEiCP clearly corresponds to the QEiCP (1.1). Like [37], on the other hand, we can further establish the connection between THDEiCP and a class of differential inclusions with nonconvex processes $\varGamma $ defined by

$$\begin{aligned} Gr(\varGamma ):=\left\{ (x,y)\in {\mathbb {R}}^n\times {\mathbb {R}}^n~|~K\ni x\bot ({\mathcal {A}}y^{m-1}-{\mathcal {B}}x^{m-1})\in K^* \right\} . \end{aligned}$$

Accordingly, for the differential inclusions defined by $\dot{{\varvec{u}}}(t)\in \varGamma ({\varvec{u}}(t))$, as noticed already by Rockafellar [35], the change of variables ${\varvec{u}}(t)={\mathrm{exp}}(\lambda t)x$ with $\lambda >0$ leads to the equivalent system $\lambda x\in \varGamma (x)$. Therefore, if the pair $(\lambda , x)$ satisfies $\lambda x\in \varGamma (x)$, then the trajectory $t\mapsto {\mathrm{exp}}(\lambda t)x$ is a solution to the considered differential inclusions. Moreover, if the trajectory constructed above is nonconstant, then x must be a nonzero vector; this requires $(\lambda , x)$ to be a solution of THDEiCP with ${\mathcal {C}}=0$ because of $\lambda >0$.

The paper is divided into six sections. As far as we know, it might be the first work on THDEiCP, we thus do not know whether the topological properties of QEiCP still hold for the newly introduced model. In Sect. 2, we first study, as briefly as possible, some topological properties such as closedness, boundedness, and upper-semicontinuity of the m-degree K-spectrum given by (1.4), in addition to estimating upper bounds on the number of eigenvalues of THDEiCP. With the preparations on these topological properties, one may be further concerned with that the issue of solving the model under consideration. In Sect. 3, we reformulate the special case of THDEiCP with symmetric ${\mathcal {A}}$ and ${\mathcal {B}}$ as a weakly coupled polynomial optimization problem for the case where $K:={\mathbb {R}}_+^n$, which would potentially facilitate the algorithmic design. From a theoretical perspective, in Sect. 4, we establish the results concerning existence of solutions of THDEiCP without symmetry assumptions on ${\mathcal {A}}$ and ${\mathcal {B}}$. Based upon the augmented Lagrangian method, in Sect. 5, we propose an implementable splitting algorithm to solve the resulting polynomial optimization reformulation of the symmetric THDEiCP and report some preliminary results. Finally, we give some concluding remarks in Sect. 6.

Notation

Let ${\mathbb {R}}^n$ denote the real Euclidean space of column vectors of length n, which is equipped with the standard inner product $\langle y,x\rangle =y^\top x$ and the associated norm. The superscript ‘$\top $’ indicates transposition and the symbol ‘$\bot $’ represents orthogonality. Denote ${\mathbb {R}}_+^n=\{x\in {\mathbb {R}}^n~|~x\ge 0\}$ and ${\mathbb {R}}_{++}^n=\{x\in {\mathbb {R}}^n~|~x> 0\}$. For a vector $x\in {\mathbb {R}}^n$ and a subset J of the index set $[n]:=\{1,2,\ldots ,n\}$, we use the notation $x_J$ for the |J| dimensional sub-vector of x, which is obtained by deleting all components $i\not \in J$, where the symbol |J| denotes the cardinality of J. For a vector $x\in {\mathbb {R}}^n$ and an integer $r\ge 0$, denote $x^{[r]}=(x_1^r,x_2^r,\ldots , x_n^r)^\top $, and denote by $\mathrm{diag}(x)$ the $n\times n$ diagonal matrix containing $x_i$ in its diagonal. Moreover, for ${\mathcal {A}}\in {\mathcal {T}}_{m,n}$, we denote the principal sub-tensor of ${\mathcal {A}}$ by ${\mathcal {A}}_J$, which is obtained by homogeneous polynomial ${\mathcal {A}}x^m$ for all $x=(x_1,x_2,\ldots ,x_n)^\top $ with $x_i=0$ for $[n]\backslash J$. So, ${\mathcal {A}}_J\in {\mathcal {T}}_{m,|J|}$. Denote by $e\in {\mathbb {R}}^n$ with all entries being 1, i.e., $e=(1,1,\ldots ,1)^\top $. Denote by ${\mathcal {I}}=(\delta _{i_1\ldots i_m})$ the unit tensor in ${\mathcal {T}}_{m,n}$, where $\delta _{i_1\ldots i_m}$ is the Kronecker symbol

$$\begin{aligned} \delta _{i_1\ldots i_m}=\left\{ \begin{array}{ll} 1,&{}\;\;{\mathrm{if}~~}i_1=\ldots =i_m,\\ 0,&{}\;\;{\mathrm{otherwise}}. \end{array} \right. \end{aligned}$$

2 Some basic properties of K-spectrum

In this section, we summarize some basic definitions and study some basic topological properties of the m-degree cone spectrum, which will be used in subsequent sections.

We first introduce the concept of cone positive square tensors, which is a generalization of the concept of copositive square tensor introduced in Ref. [32] and studied in Ref. [39].

Definition 2.1

Let K be a closed convex cone in ${\mathbb {R}}^n$ and ${\mathcal {G}}\in {\mathcal {T}}_{m,n}$. We say that ${\mathcal {G}}$ is a (resp. strictly) K-positive tensor, if ${\mathcal {G}}x^m\ge 0$ (resp. $>0$) for any $x\in K\backslash \{0\}$. If $K={\mathbb {R}}_+^n$, the (strictly) K-positive tensor ${\mathcal {G}}$ is said the (strictly) copositive tensor.

It is obvious that ${\mathcal {F}}_{m,n}$ is a linear space. The distance between two elements ${\mathcal {Q}}_i=({\mathcal {A}}_i,{\mathcal {B}}_i,{\mathcal {C}}_i)\in {\mathcal {F}}_{m,n}~(i=1,2)$ is measured by means of the expression

$$\begin{aligned} \Vert {\mathcal {Q}}_1-{\mathcal {Q}}_2\Vert _F=\left\{ \Vert {\mathcal {A}}_1-{\mathcal {A}}_2\Vert _F^2+\Vert {\mathcal {B}}_1-{\mathcal {B}}_2\Vert _F^2+\Vert {\mathcal {C}}_1-{\mathcal {C}}_2\Vert _F^2\right\} ^{\frac{1}{2}}, \end{aligned}$$

where

$$\begin{aligned} \Vert {\mathcal {A}}\Vert _F=\sqrt{\sum _{1\le i_1,\ldots ,i_m\le n}a^2_{i_1\ldots i_m}}, \quad \forall ~{\mathcal {A}}=(a_{i_1\ldots i_m})\in {\mathcal {T}}_{m,n}. \end{aligned}$$

Denote by ${\mathscr {C}}({\mathbb {R}}^n)$ the set of nonzero closed convex cones in ${\mathbb {R}}^n$, which is associated with the natural metric defined by

$$\begin{aligned} \delta (K_1,K_2) := \sup _{\Vert z\Vert \le 1}|{\mathrm{dist}}(z,K_1)- \mathrm{dist}(z,K_2)|, \end{aligned}$$

where ${\mathrm{dist}}(z,K) := {\mathrm{inf}}_{u\in K}\Vert z-u\Vert $ stands for the distance from z to K. An equivalent way of defining $\delta $ is

$$\begin{aligned} \delta (K_1,K_2) = {\mathrm{haus}}(K_1\cap {\mathscr {B}}_n,K_2 \cap {\mathscr {B}}_n), \end{aligned}$$

where ${\mathscr {B}}_n$ is the closed unit ball in ${\mathbb {R}}^n$, and

$$\begin{aligned} {{\mathrm{haus}}}({\mathscr {C}}_1,{\mathscr {C}}_2) := \max \left\{ \sup _{z\in {\mathscr {C}}_1}{\mathrm{dist}}(z,{\mathscr {C}}_2), \sup _{z\in {\mathscr {C}}_2}{\mathrm{dist}}(z,{\mathscr {C}}_1)\right\} \end{aligned}$$

stands for the Hausdorff distance between the compact sets ${\mathscr {C}}_1,{\mathscr {C}}_2\subset {\mathbb {R}}^n$ (see [2, pp. 85–86]). General information on the metric $\delta $ can be consulted in the book by Rockafellar and Wets [36]. According to [42], the operation $K\mapsto K^*$ is an isometry on the space $({\mathscr {C}}({\mathbb {R}}^n), \delta )$, that is to say,

$$\begin{aligned} \delta (K_1^*,K_2^*)=\delta (K_1,K_2), \quad {{\mathrm{for~ all}~~}} K_1,K_2\in {\mathscr {C}}({\mathbb {R}}^n). \end{aligned}$$

The basic topological properties of the mapping $\sigma : {\mathcal {F}}_{m,n} \times {\mathscr {C}}({\mathbb {R}}^n)\rightarrow 2^{{\mathbb {R}}}$, defined in (1.4), are listed in the next proposition. This proposition is a tensor version of generalizing the results presented in Ref. [38]. As far as semicontinuity concepts are concerned, we use the following terminology (cf. Section 6.2 in [2]):

Definition 2.2

Let W and Y be two topological spaces. The mapping $\varPsi :W\rightarrow 2^Y$ is said to be upper-semicontinuous (resp. lower-semicontinuous) if the set

$$\begin{aligned} \{w\in W~|~\varPsi (w)\subset U\}~~~\left( {\mathrm{resp.~~}}\{w\in W~|~\varPsi (w)\cap U\ne \emptyset \}\right) \end{aligned}$$

is open, whenever $U\subset Y$ is open.

Definition 2.3

Let ${\mathcal {Q}}=({\mathcal {A,B,C}})\in {\mathcal {F}}_{m,n}$ and $K\in {\mathscr {C}}({\mathbb {R}}^n)$. We say that ${\mathcal {Q}}$ is K-regular if the leading tensor ${\mathcal {A}}$ satisfies

$$\begin{aligned} {\mathcal {A}}x^m\ne 0,~~ \forall ~ x\in K\backslash \{0\}. \end{aligned}$$

It is obvious that, if ${\mathcal {Q}}$ is K-regular, then either the leading tensor ${\mathcal {A}}$ in ${\mathcal {Q}}$ or $-{\mathcal {A}}$ is K-positive.

Proposition 2.1

The following three statements are true:

(i)
The set $\varSigma :=\{({\mathcal {Q}},K,\lambda ) \in {\mathcal {F}}_{m,n} \times {\mathscr {C}}({\mathbb {R}}^n)\times {\mathbb {R}}~|~\lambda \in \sigma ({\mathcal {Q}},K)\}$ is closed in the product space ${\mathcal {F}}_{m,n} \times {\mathscr {C}}({\mathbb {R}}^n)\times {\mathbb {R}}$. In particular, for any $(\bar{{\mathcal {Q}}},\bar{K})\in {\mathcal {F}}_{m,n} \times {\mathscr {C}}({\mathbb {R}}^n)$, $\sigma (\bar{{\mathcal {Q}}},\bar{K})$ is a closed subset of ${\mathbb {R}}$;
(ii)
Let $(\bar{{\mathcal {Q}}},\bar{K})\in {\mathcal {F}}_{m,n}\times {\mathscr {C}}({\mathbb {R}}^n)$. If $\bar{{\mathcal {Q}}}$ is $\bar{K}$-regular, then the mapping $\sigma :{\mathcal {F}}_{m,n} \times {\mathscr {C}}({\mathbb {R}}^n)\rightarrow 2^{\mathbb {R}}$ is locally bounded at $(\bar{{\mathcal {Q}}},\bar{K})$, i.e., $\bigcup _{({\mathcal {Q}},K)\in {\mathcal {N}}}\sigma ({\mathcal {Q}},K)$ is bounded for some neighborhood ${\mathcal {N}}$ of $(\bar{{\mathcal {Q}}},\bar{K})$.
(iii)
Let $(\bar{{\mathcal {Q}}},\bar{K})\in {\mathcal {F}}_{m,n}\times {\mathscr {C}}({\mathbb {R}}^n)$. If $\bar{{\mathcal {Q}}}$ is $\bar{K}$-regular, then $\sigma $ is upper-semicontinuous at $(\bar{{\mathcal {Q}}},\bar{K})$.

Proof

(i). The closedness of $\varSigma $ amounts to saying that

$$\begin{aligned} \left. \begin{array}{r} ({\mathcal {Q}}_\nu , K_\nu )\rightarrow (\bar{{\mathcal {Q}}},\bar{K}),~\lambda _\nu \rightarrow \bar{\lambda }\\ \lambda _\nu \in \sigma ({\mathcal {Q}}_\nu ,K_\nu ) \end{array} \right\} \Rightarrow \bar{\lambda }\in \sigma (\bar{{\mathcal {Q}}},\bar{K}). \end{aligned}$$

Since $\lambda _\nu \in \sigma ({\mathcal {Q}}_\nu ,K_\nu ) $, there exists a vector $x_\nu \in {\mathbb {R}}^n\backslash \{0\}$ such that

$$\begin{aligned} K_\nu \ni x_\nu \perp (\lambda ^m_\nu {\mathcal {A}}_\nu +\lambda _\nu {\mathcal {B}}_\nu +{\mathcal {C}}_\nu )x_\nu ^{m-1}\in K_\nu ^*. \end{aligned}$$

(2.1)

Let $\bar{x}_\nu =x_\nu /\Vert x_\nu \Vert $. From the homogeneity of (2.1) on x, we know that

$$\begin{aligned} K_\nu \ni \bar{x}_\nu \perp (\lambda ^m_\nu {\mathcal {A}}_\nu +\lambda _\nu {\mathcal {B}}_\nu +{\mathcal {C}}_\nu )\bar{x}_\nu ^{m-1}\in K_\nu ^*. \end{aligned}$$

(2.2)

It is clear that $\Vert \bar{x}_\nu \Vert =1$ for every $\nu =1,2,\ldots $. Without loss of generality, we assume that $\bar{x}_\nu \rightarrow \bar{x}$. It is obvious that $\Vert \bar{x}\Vert =1$, which means $\bar{x}\in {\mathbb {R}}^n\backslash \{0\}$. Since $\delta (K_1^*,K_2^*)=\delta (K_1,K_2)$ for any $K_1,K_2\in {\mathscr {C}}({\mathbb {R}}^n)$, by passing to the limit in (2.2), one gets

$$\begin{aligned} \bar{K} \ni \bar{x}\perp \left( \bar{\lambda }^m \bar{{\mathcal {A}}} +\bar{\lambda } \bar{{\mathcal {B}}}+\bar{{\mathcal {C}}}\right) \bar{x}^{m-1}\in \bar{K}^*, \end{aligned}$$

which implies $\bar{\lambda }\in \sigma (\bar{{\mathcal {Q}}},\bar{K})$ due to $\bar{x}\in {\mathbb {R}}^n\backslash \{0\}$. We proved the first part (i) of this proposition.

(ii). Suppose that the map $\sigma $ is not locally bounded at $(\bar{{\mathcal {Q}}},\bar{K})$. Then there exists a sequence $\{{\mathcal {Q}}_\nu ,K_\nu ,\lambda _\nu \}$ satisfying

$$\begin{aligned} \Vert {\mathcal {Q}}_\nu -\bar{{\mathcal {Q}}}\Vert _F\rightarrow 0,~~~\delta (K_\nu ,\bar{K})\rightarrow 0,~~~\mathrm{and}~~~|\lambda _\nu |\rightarrow +\infty , \end{aligned}$$

such that $\lambda _\nu \in \sigma ({\mathcal {Q}}_\nu ,K_\nu )$ for any $\nu =1,2,\ldots $. Consequently, there exist vectors $x_\nu \in K_{\nu }$ with $\Vert x_\nu \Vert =1$, such that

$$\begin{aligned} K_\nu \ni x_\nu \perp \left( \lambda ^m_\nu {\mathcal {A}}_\nu +\lambda _\nu {\mathcal {B}}_\nu +{\mathcal {C}}_\nu \right) x_\nu ^{m-1}\in K_\nu ^*. \end{aligned}$$

(2.3)

By (2.3), we have

$$\begin{aligned} \left( \lambda ^m_\nu {\mathcal {A}}_\nu +\lambda _\nu {\mathcal {B}}_\nu +{\mathcal {C}}_\nu \right) x_\nu ^{m}=0, \end{aligned}$$

which implies

$$\begin{aligned} \left( {\mathcal {A}}_\nu +\frac{{\mathcal {B}}_\nu }{\lambda _\nu ^{m-1}} +\frac{{\mathcal {C}}_\nu }{\lambda _\nu ^{m}}\right) x_\nu ^{m}=0. \end{aligned}$$

We assume, without loss of generality, that $x_\nu \rightarrow \bar{x}$. It is obvious that $\Vert \bar{x}\Vert =1$, which means $\bar{x}\in {\mathbb {R}}_+^n\backslash \{0\}$. By passing to the limit in the above expression, it holds that $\bar{{\mathcal {A}}} \bar{x}^{m}=0$. It contradicts the $\bar{K}$-regularity of $\bar{{\mathcal {Q}}}$ because $\bar{x}\in K\backslash \{0\}$.

(iii). Suppose that $\sigma $ is not upper-semicontinuous at $(\bar{{\mathcal {Q}}},\bar{K})$. Then we could find an open set $\bar{U}\subset {\mathbb {R}}$ and a sequence $\{({\mathcal {Q}}_\nu , K_\nu )\}$ satisfying $({\mathcal {Q}}_\nu , K_\nu )\rightarrow (\bar{{\mathcal {Q}}}, \bar{K})$, such that

$$\begin{aligned} \sigma (\bar{{\mathcal {Q}}},\bar{K})\subset \bar{U}~~~{\mathrm{but}}~~\sigma ({\mathcal {Q}}_\nu , K_\nu )\cap ({\mathbb {R}}\backslash \bar{U})\ne \emptyset , ~~~~{\mathrm{for~any}}~\nu =1,2,\ldots . \end{aligned}$$

Now, for each $\nu $, pick up $\lambda _\nu \in \sigma ({\mathcal {Q}}_\nu , K_\nu )\cap ({\mathbb {R}}\backslash \bar{U})$. It follows from (ii) that the sequence $\{\lambda _\nu \}$ admits a converging subsequence. By (i), the corresponding limit must be in $\sigma (\bar{{\mathcal {Q}}}, \bar{K})\cap ({\mathbb {R}}\backslash \bar{U})$, which together with $\sigma (\bar{{\mathcal {Q}}},\bar{K})\subset \bar{U}$ leads to a contradiction. $\square $

From the first two parts (i) and (ii) of Proposition 2.1, we have the following corollary.

Corollary 2.1

Let $({\mathcal {Q}},K)\in {\mathcal {F}}_{m,n}\times {\mathscr {C}}({\mathbb {R}}^n)$. If ${\mathcal {Q}}$ is K-regular, then $\sigma ({\mathcal {Q}},K)$ is compact.

Below, we present a preliminary estimation on the numbers of m-degree Pareto-eigenvalues. We first present the following proposition which fully characterizes the m-degree Pareto-spectrum of THDEiCP.

Proposition 2.2

Let ${\mathcal {Q}}=({\mathcal {A}}, {\mathcal {B}}, {\mathcal {C}})\in {\mathcal {F}}_{m,n}$. A real number $\lambda $ is an m-degree Pareto-eigenvalue of ${\mathcal {Q}}$, if and only if there exists a nonempty subset $J\subseteq [n]$ and a vector $w\in {\mathbb {R}}_{++}^{|J|}$ such that

$$\begin{aligned} (\lambda ^m{\mathcal {A}}_J+\lambda {\mathcal {B}}_J+{\mathcal {C}}_J)w^{m-1}=0 \end{aligned}$$

(2.4)

and

$$\begin{aligned} \sum _{i_2,\ldots ,i_m\in J}\left( \lambda ^m a_{ii_2\ldots i_m}+\lambda b_{ii_2\ldots i_m}+c_{ii_2\ldots i_m}\right) w_{i_2}\cdots w_{i_m}\ge 0, \quad \forall ~i\in [n]\backslash J. \end{aligned}$$

In such a case, the vector $x\in {\mathbb {R}}^n_+$ defined by

$$\begin{aligned} x_i=\left\{ \begin{array}{ll} w_i,&{}i\in J,\\ 0,&{}i\in [n]\backslash J \end{array} \right. \end{aligned}$$

is a Pareto-eigenvector of ${\mathcal {Q}}$, associated to the m-degree Pareto-eigenvalue $\lambda $.

Proof

It can be proved in a way similar to that used in [40]. $\square $

It is well known that, on the left-hand side of (2.4), $(\lambda ^m{\mathcal {A}}_J+\lambda {\mathcal {B}}_J+{\mathcal {C}}_J)w^{m-1}$ is indeed a set of |J| homogeneous polynomials in |J| variables, denoted by $\{P^{\lambda }_i(w) ~|~ 1\le i\le |J|\}$, of degree $(m-1)$. In the complex field, in order to study the solution set of a system of |J| homogeneous polynomials $(P_1,\ldots ,P_{|J|})$, in |J| variables, the concept of the resultant ${\mathrm{Res}}(P_1,\ldots , P_{|J|})$ is well defined and introduced; we refer to [9] for details. Applying to our current problem, ${\mathrm{Res}}(P_1,\ldots , P_{|J|})$ has the following properties.

Proposition 2.3

The following statements hold:

(i)
${\mathrm{Res}}(P_1,\ldots , P_{|J|})=0$, if and only if there exists $(\lambda ,x)\in {\mathbb {C}}\times ({\mathbb {C}}^{|J|}\backslash \{0\})$ satisfying the relation (2.4).
(ii)
The degree of $\lambda $ in ${\mathrm{Res}}(P_1,\ldots , P_{|J|})$ is at most $m|J|(m-1)^{|J|-1}$.

By Proposition 2.2, if $\lambda $ is an m-degree Pareto-eigenvalue of ${\mathcal {Q}}=({\mathcal {A}},{\mathcal {B}}, {\mathcal {C}})\in {\mathcal {F}}_{m,n}$, then there exists a nonempty subset $J\subseteq [n]$ such that $\lambda $ is a strict m-degree Pareto-eigenvalue of ${\mathcal {Q}}_J=({\mathcal {A}}_J, {\mathcal {B}}_J, {\mathcal {C}}_J)\in {\mathcal {F}}_{m,|J|}$. We now state and prove one of main results in this section.

Theorem 2.1

Let ${\mathcal {Q}}=({\mathcal {A}},{\mathcal {B}}, {\mathcal {C}})\in {\mathcal {F}}_{m,n}$. Assume that ${\mathcal {Q}}$ is ${\mathbb {R}}_+^n$-regular. Then, there are at most ${\varvec{\tau }}_{m,n}:=nm^{n}$ m-degree Pareto-eigenvalues of ${\mathcal {Q}}$.

Proof

It is obvious that, for every $k=0,1,\ldots ,n-1$, there are $\left( {\begin{array}{c}n\\ n-k\end{array}}\right) $ corresponding principal sub-tensors triplet of order m dimension $(n-k)$. Moreover, by Proposition 2.3, we know that every principal sub-tensors triplet of order m dimension $(n-k)$ can have at most $m(n-k)(m-1)^{n-k-1}$ strict m-degree Pareto-eigenvalues. By Proposition 2.2, in this way one obtains the upper bound

$$\begin{aligned} {\varvec{\tau }}_{m,n}=\sum _{k=0}^{n-1}\left( {\begin{array}{c}n\\ n-k\end{array}}\right) m(n-k)(m-1)^{n-k-1}=nm^{n}, \end{aligned}$$

which concludes the proof. $\square $

Now we extend the above result to the more general case where K is a polyhedral convex cone. A closed convex cone K in ${\mathbb {R}}^n$ is said to be finitely generated if there is a linearly independent collection $\{\eta _1,\eta _2,\ldots ,\eta _p\}$ of vectors in ${\mathbb {R}}^n$ such that

$$\begin{aligned} K=\mathrm{cone}\{\eta _1,\eta _2,\ldots ,\eta _p\}=\left\{ \sum _{i=1}^pa_j\eta _j~|~\alpha =(a_1,a_2,\ldots ,a_p)^\top \in {\mathbb {R}}_+^p\right\} . \end{aligned}$$

(2.5)

It is clear that $K=\{H^\top \alpha ~|~\alpha \in {\mathbb {R}}_+^p\}$, where $H=(h_{ij}):=[\eta _1,\eta _2,\ldots ,\eta _p]^\top $. Moreover, it is easy to see that the dual cone $K^*$ of K is equivalent to $\{z\in {\mathbb {R}}^n~|~Hz\ge 0\}$.

Theorem 2.2

Let ${\mathcal {Q}}=({\mathcal {A}},{\mathcal {B}}, {\mathcal {C}})\in {\mathcal {F}}_{m,n}$. Let K be represented by (2.5). Assume that ${\mathcal {Q}}$ is K-regular. Then, there are at most ${\varvec{\tau }}_{m,p}:=pm^{p}$ m-degree K-eigenvalues of ${\mathcal {Q}}$.

Proof

We first prove that problem (1.3) with K defined by (2.5) is equivalent to finding a vector $\bar{\alpha }\in {\mathbb {R}}^p\backslash \{0\}$ and $\bar{\lambda }\in {\mathbb {R}}$ such that

$$\begin{aligned} \bar{\alpha } \ge 0, \quad \left( \bar{\lambda }^m {\mathcal {D}}+\bar{\lambda }{\mathcal {G}}+{\mathcal {S}}\right) \bar{\alpha }^{m-1}\ge 0, \quad \left\langle \bar{\alpha },\left( \bar{\lambda }^m {\mathcal {D}}+\bar{\lambda }{\mathcal {G}}+{\mathcal {S}}\right) \bar{\alpha }^{m-1}\right\rangle =0, \end{aligned}$$

(2.6)

where ${\mathcal {D}}$, ${\mathcal {G}}$ and ${\mathcal {S}}$ are three mth order p-dimensional tensors, whose elements are denoted by

$$\begin{aligned} d_{i_1i_2\ldots i_m}= & {} \sum _{j_1,j_2,\ldots ,j_m=1}^na_{j_1j_2\ldots j_m}h_{i_1j_1}h_{i_2j_2}\ldots h_{i_mj_m},\\ g_{i_1i_2\ldots i_m}= & {} \sum _{j_1,j_2,\ldots ,j_m=1}^nb_{j_1j_2\ldots j_m}h_{i_1j_1}h_{i_2j_2}\ldots h_{i_mj_m}, \end{aligned}$$

and

$$\begin{aligned} s_{i_1i_2\ldots i_m}=\sum _{j_1,j_2,\ldots ,j_m=1}^nc_{j_1j_2\ldots j_m}h_{i_1j_1}h_{i_2j_2}\ldots h_{i_mj_m} \end{aligned}$$

for $1\le i_1,i_2,\ldots , i_m\le p$, respectively.

Let $(\bar{\lambda },\bar{x})\in {\mathbb {R}}\times ({\mathbb {R}}^n\backslash \{0\})$ be an m-degree K-eigenpair of ${\mathcal {Q}}$. Since $\bar{x}\in K\backslash \{0\}$, by the definition of K, there exists a nonzero vector $\bar{\alpha }\in {\mathbb {R}}^p_+$ such that $\bar{x}=H^\top \bar{\alpha }$. Consequently, from the fact that $(\bar{\lambda }^m {\mathcal {A}}+\bar{\lambda }{\mathcal {B}}+{\mathcal {C}})\bar{x}^{m-1}\in K^*$ and the expression of $K^*$, it holds that $H(\bar{\lambda }^m {\mathcal {A}}+\bar{\lambda }{\mathcal {B}}+{\mathcal {C}})\bar{x}^{m-1}\ge 0$, which implies

$$\begin{aligned} H(\bar{\lambda }^m {\mathcal {A}}+\bar{\lambda }{\mathcal {B}}+{\mathcal {C}})(H^\top \bar{\alpha })^{m-1}\ge 0. \end{aligned}$$

(2.7)

By the definitions of ${\mathcal {D}}$, ${\mathcal {G}}$ and ${\mathcal {S}}$, we know that (2.7) can be equivalently written as

$$\begin{aligned} \left( \bar{\lambda }^m {\mathcal {D}} +\bar{\lambda }{\mathcal {G}}+{\mathcal {S}}\right) \bar{\alpha }^{m-1}\ge 0. \end{aligned}$$

Moreover, it is easy to verify that $\langle \bar{\alpha },(\bar{\lambda }^m {\mathcal {D}} +\bar{\lambda }{\mathcal {G}}+{\mathcal {S}})\bar{\alpha }^{m-1}\rangle =0$. Conversely, if $(\bar{\lambda },\bar{\alpha })\in {\mathbb {R}}\times ({\mathbb {R}}^p\backslash \{0\})$ satisfies (2.6), then we can prove that $(\bar{\lambda },\bar{x})$ with $\bar{x}=H^\top \bar{\alpha }$ satisfies (1.3) in a similar way.

Since ${\mathcal {Q}}$ is K-regular, it is easy to verify that $({\mathcal {D,G,S}})$ is ${\mathbb {R}}^p_+$-regular. Consequently, by applying Theorem 2.1 to problem (2.6), we get that ${\mathcal {Q}}$ has at most ${\varvec{\tau }}_{m,p}=pm^{p}$ m-degree K-eigenvalues. $\square $

The above theorem shows that $\sigma ({\mathcal {Q}},K)$ has finitely many elements in case where K is a polyhedral convex cone. However, the situation can be worse in the nonpolyhedral case. For instance, Iusem and Seeger [21] successfully constructed a symmetric matrix C (i.e., ${\mathcal {Q}}=(O,I,-C)\in {\mathcal {F}}_{2,n}$) and a nonpolyhedral convex cone K such that $\sigma ({\mathcal {Q}}, K)$ behaves like the Cantor ternary set, i.e., it is uncountable and totally disconnected.

3 Optimization formulation of THDEiCP

In this section, for the purpose of finding solutions of THDEiCP, we introduce an optimization reformulation, which paves the way of designing algorithms. Here, we only consider the case where ${\mathcal {A}}$ and ${\mathcal {B}}$ are two symmetric tensors, ${\mathcal {C}}:=-{\mathcal {I}}$, and $K:={\mathbb {R}}^n_+$.

We consider the following homogeneous polynomial optimization problem.

$$\begin{aligned} \begin{array}{cl} {\mathrm{max}}&{}\varphi _0(u,v):=m(m-1)^{\frac{1}{m}-1}v^\top u^{[m-1]}-{\mathcal {B}}u^m\\ {\mathrm{s.t.}}&{}{\mathcal {A}}u^m+v^\top v^{[m-1]}=1,\\ &{}u\ge 0,v\ge 0. \end{array} \end{aligned}$$

(3.1)

Let $\phi _0(u,v):={\mathcal {A}}u^m+v^\top v^{[m-1]}$, and let $\varphi _0$ be defined by (3.1). We derive that

$$\begin{aligned} \nabla _u \varphi _0(u,v)= & {} m(m-1)^{\frac{1}{m}}{\mathrm{diag}}(v )u^{[m-2]}-m{\mathcal {B}}u^{m-1}, \end{aligned}$$

(3.2a)

$$\begin{aligned} \nabla _v \varphi _0(u,v)= & {} m(m-1)^{\frac{1}{m}-1}u^{[m-1]}, \end{aligned}$$

(3.2b)

$$\begin{aligned} \nabla _u\phi _0(u,v)= & {} m{\mathcal {A}}u^{m-1}, \end{aligned}$$

(3.2c)

$$\begin{aligned} \nabla _v\phi _0(u,v)= & {} mv^{[m-1]}. \end{aligned}$$

(3.2d)

Now, we state the relationship between (1.3) and (3.1) as follows.

Theorem 3.1

Let ${\mathcal {Q}}=({\mathcal {A}},{\mathcal {B}}, -{\mathcal {I}})\in {\mathcal {F}}_{m,n}$. Assume that ${\mathcal {A}}$ and ${\mathcal {B}}$ are both symmetric. Let $(\bar{u},\bar{v})$ with $\bar{u}\ne 0$ be a stationary point of problem (3.1). Then $(\bar{\lambda }, \bar{u})$ is an m-degree Pareto-eigenpair of ${\mathcal {Q}}$, where $\bar{\lambda }=(\varphi _0(\bar{u},\bar{v}))^{\frac{1}{m-1}}$.

Proof

Since $(\bar{u},\bar{v})$ is a stationary point of problem (3.1), it follows from (3.2) that there exist three multipliers $\bar{\alpha }, \bar{\beta } \in {\mathbb {R}}^n$ and $\bar{\gamma }\in {\mathbb {R}}$, such that

$$\begin{aligned}&m{\mathcal {B}}\bar{u}^{m-1}-m(m-1)^{\frac{1}{m}}{\mathrm{diag}}(\bar{v} )\bar{u}^{[m-2]}=\bar{\alpha }+\bar{\gamma } m{\mathcal {A}}\bar{u}^{m-1}, \end{aligned}$$

(3.3a)

$$\begin{aligned}&-m(m-1)^{\frac{1}{m}-1}\bar{u}^{[m-1]}=\bar{\beta }+\bar{\gamma } m\bar{v}^{[m-1]}, \end{aligned}$$

(3.3b)

$$\begin{aligned}&\bar{\alpha }\ge 0,\;\bar{u}\ge 0,\;\bar{\alpha }^\top \bar{u}=0, \end{aligned}$$

(3.3c)

$$\begin{aligned}&\bar{\beta }\ge 0,\;\bar{v}\ge 0,\;\bar{\beta }^\top \bar{v}=0, \end{aligned}$$

(3.3d)

$$\begin{aligned}&{\mathcal {A}}\bar{u}^m+\bar{v}^\top \bar{v}^{[m-1]}=1. \end{aligned}$$

(3.3e)

Rearranging terms of (3.3b) yields

$$\begin{aligned} -(m-1)^{\frac{1}{m}-1}\bar{u}^{[m-1]}-\bar{\gamma } \bar{v}^{[m-1]}=\bar{\beta }/m. \end{aligned}$$

(3.4)

We claim that $\bar{\beta }=0$. Otherwise, if $\bar{\beta }\ne 0$, then there exists an index $i_0\in [n]$ such that $\bar{\beta }_{i_0}>0$, which implies $\bar{v}_{i_0}=0$ from (3.3d), and hence, it holds that $-(m-1)^{\frac{1}{m}-1}\bar{u}_{i_0}^{m-1}=\bar{\beta }_{i_0}/m>0$, which is a contradiction. Therefore, it follows from (3.4) that

$$\begin{aligned} -(m-1)^{\frac{1}{m}-1}\bar{u}^{[m-1]}=\bar{\gamma } \bar{v}^{[m-1]}. \end{aligned}$$

(3.5)

Moreover, it is clear from the facts that $\bar{u}\ne 0$ and (3.5) that $\bar{\gamma }<0$ and

$$\begin{aligned} \bar{v}=(-\bar{\gamma })^{-\frac{1}{m-1}}(m-1)^{-\frac{1}{m}} \bar{u}. \end{aligned}$$

(3.6)

By invoking (3.3a) and (3.6), we have

$$\begin{aligned} {\mathcal {B}}\bar{u}^{m-1}-(-\bar{\gamma })^{-\frac{1}{m-1}}\bar{u}^{[m-1]}-\bar{\gamma } {\mathcal {A}}\bar{u}^{m-1}=\frac{\bar{\alpha }}{m}\ge 0,\\ \end{aligned}$$

which implies

$$\begin{aligned} (-\bar{\gamma } )^{\frac{m}{m-1}}{\mathcal {A}}\bar{u}^{m-1}+(-\bar{\gamma })^{\frac{1}{m-1}}{\mathcal {B}}\bar{u}^{m-1}-\bar{u}^{[m-1]}\ge 0. \end{aligned}$$

(3.7)

Moreover, using (3.6), (3.3a), and (3.3c), it is not difficult to verify that

$$\begin{aligned} \left\langle \bar{u}, (-\bar{\gamma } )^{\frac{m}{m-1}}{\mathcal {A}}\bar{u}^{m-1}+(-\bar{\gamma })^{\frac{1}{m-1}}{\mathcal {B}}\bar{u}^{m-1}-\bar{u}^{[m-1]}\right\rangle =0. \end{aligned}$$

(3.8)

On the other hand, it follows from (3.3a) and (3.3c) that

$$\begin{aligned} m{\mathcal {B}}\bar{u}^m-m\bar{\gamma } {\mathcal {A}}\bar{u}^m-m(m-1)^{\frac{1}{m}}\bar{v}^\top \bar{u}^{[m-1]}=0, \end{aligned}$$

which implies

$$\begin{aligned} -m\varphi _0(\bar{u},\bar{v})&=m\bar{\gamma } {\mathcal {A}}\bar{u}^m-m(m-1)^{\frac{1}{m}-1}\bar{v}^\top \bar{u}^{[m-1]} \nonumber \\&=m\bar{\gamma } {\mathcal {A}}\bar{u}^m+m\bar{\gamma } \bar{v}^\top \bar{v}^{[m-1]} \nonumber \\&=m\bar{\gamma }, \end{aligned}$$

(3.9)

where the second equality comes from (3.5), and the last equality is due to (3.3e). Hence, we conclude from (3.9) that $\varphi _0(\bar{u},\bar{v})=-\bar{\gamma }>0$, and both (3.7) and (3.8) mean that $\bar{u}$ is an eigenvector of (3.1) associated to the eigenvalue $\bar{\lambda }$. $\square $

Theorem 3.2

Let ${\mathcal {Q}}=({\mathcal {A}},{\mathcal {B}}, -{\mathcal {I}})\in {\mathcal {F}}_{m,n}$. Assume that ${\mathcal {A}}$ and ${\mathcal {B}}$ are both symmetric and ${\mathcal {A}}$ is copositive. Let $(\bar{\lambda }, \bar{x})$ be an m-degree Pareto-eigenpair of ${\mathcal {Q}}$. If $\bar{\lambda }>0$, then $(\bar{u}, \bar{v})$ is a stationary point of (3.1), where

$$\begin{aligned} (\bar{u}, \bar{v})=\frac{1}{\left( {\mathcal {A}}\bar{x}^m+\bar{y}^\top \bar{y}^{[m-1]}\right) ^\frac{1}{m}}(\bar{x},\bar{y}) \end{aligned}$$

(3.10)

with $\bar{y}={(m-1)^{-\frac{1}{m}}} ({\bar{\lambda }})^{-1}{\bar{x}}$.

Proof

Since $\bar{y}\in {\mathbb {R}}_+^n\backslash \{0\}$ and ${\mathcal {A}}$ is copositive, we have ${\mathcal {A}}\bar{x}^m+\bar{y}^\top \bar{y}^{[m-1]}>0$. Moreover, it is easy to check that $(\bar{u}, \bar{v})$ given in (3.10) is a feasible solution of (3.1). Take

$$\begin{aligned} \bar{\alpha }=\frac{m}{\bar{\lambda }\left( {\mathcal {A}}\bar{x}^m+\bar{y}^\top \bar{y}^{[m-1]}\right) ^{\frac{m-1}{m}}}\left( \bar{\lambda }^m{\mathcal {A}}\bar{x}^{m-1}+\bar{\lambda } {\mathcal {B}}\bar{x}^{m-1}-\bar{x}^{[m-1]}\right) , \end{aligned}$$

$\bar{\beta }=0$ and $\bar{\gamma }=-\bar{\lambda }^{m-1}.$ It is obvious that $\bar{\alpha }\ge 0$, since $(\bar{\lambda }, \bar{x})$ satisfies (1.3) and $\bar{\lambda }>0$. Moreover, it is not difficult to see that $\bar{\alpha }^\top \bar{u}=0$ and $\bar{\beta }^\top \bar{v}=0$. Finally, with the definition of $(\bar{u},\bar{v})$ given in (3.10), we can verify that (3.3a) and (3.3c) hold, which means that $(\bar{u},\bar{v})$ is a stationary point of (3.1). $\square $

Denote $w:=(u^\top ,v^\top )^\top $ and $\phi _i(w)=w_i$ for $i=1,2,\ldots ,2n$. When ${\mathcal {A}}$ is strictly copositive, the feasible set of (3.1) is compact. Hence, the globally optimal value of (3.1), denoted by $\varphi _0^{\mathrm{max}}$, exists. Denote by $\bar{w}$ the corresponding globally optimal solution, and denote

$$\begin{aligned} \bar{d}=\left( (e_{I(\bar{w})}\right) ^\top ,-\bar{t}(\bar{w}_{I^c(\bar{w})})^\top )^\top ~~~{\mathrm{with}}~~\bar{t}=\sum _{i\in I(\bar{w})}\left( {\mathcal {A}}\bar{u}^{m-1}\right) _i, \end{aligned}$$

where $I(\bar{w})=\{i\in [2n]~|~\bar{w}_i=0\}$ and $I^c(\bar{w})=[n]\backslash I(\bar{w})$. From the homogeneity of $\phi _0$ and the fact that $\phi _0(\bar{w})=1$, it is easy to verify that $\bar{w}^\top \nabla \phi _0(\bar{w})=m \phi _0(\bar{w})=m\ne 0$, which implies that $\nabla \phi _0(\bar{w})\ne 0$ and hence $\{\nabla \phi _0(\bar{w})\}$ is linearly independent. Moreover, it is not difficult to show that

$$\begin{aligned} \bar{d}^\top \nabla \phi _0(\bar{w})=m\left( \sum _{i\in I(\bar{w})}({\mathcal {A}}\bar{u}^{m-1})_i-\bar{t}\right) =0 \end{aligned}$$

and $\bar{d}^\top \nabla \phi _i(\bar{w})=1>0$ for every $i\in I(\bar{w})$. This means that the Mangasarian-Fromovitz constraint qualification (MFCQ) holds at $\bar{w}$. Therefore, we know that $\bar{w}$ is a stationary point of (3.1). Moreover, we claim that $\bar{u}\ne 0$. In fact, by taking $u_t=te$ and $v_t=\left( \frac{1-\bar{a}t^m}{n}\right) ^{1/m}e$ with $\bar{a}=\sum _{i_1,\ldots ,i_m=1}^na_{i_1\ldots i_m}$, we see that $w_t=(u_t,v_t)$ is a feasible solution of (3.1), and the corresponding objective value is

$$\begin{aligned} \varphi _0(u_t,v_t)=t^m\left( nm(m-1)^{\frac{1}{m}-1}\left( \frac{1-\bar{a}t^m}{n}\right) ^{\frac{1}{m}}- \bar{b} t\right) , \end{aligned}$$

where $\bar{b}=\sum _{i_1,\ldots ,i_m=1}^nb_{i_1\ldots i_m}$. Hence, we have that $\varphi _0(u_t,v_t)>0$ for $t>0$ small enough, which implies that $\varphi _0(\bar{u},\bar{v})>0$ due to the fact that $(\bar{u},\bar{v})$ is an optimal solution of problem (3.1). Consequently, it holds that $\bar{u}\ne 0$. Moreover, by Theorem 3.1, we know that $(\bar{\lambda }, \bar{u})$ with $\bar{\lambda }=(\varphi _0(\bar{u},\bar{v}))^{\frac{1}{m-1}}$ is a solution of (1.3), which implies that (1.3) has at least a positive m-degree Pareto-eigenvalue. Therefore, one has $\varphi _0^{\max }\le \lambda _{\max }^{m-1}$, where

$$\begin{aligned} \lambda _{\max } ={\max }\left\{ \lambda \in {\mathbb {R}}~|~\exists ~ x \in {\mathbb {R}}^n,~(\lambda ,x)~\mathrm{is~ an~}m\hbox {-degree Pareto-}\hbox {eigenpair of }{\mathcal {Q}}\right\} . \end{aligned}$$

Theorem 3.3

Let ${\mathcal {Q}}=({\mathcal {A}},{\mathcal {B}},-{\mathcal {I}})\in {\mathcal {F}}_{m,n}$. Assume that ${\mathcal {A}}$ and ${\mathcal {B}}$ are both symmetric and ${\mathcal {A}}$ is strictly copositive. Then, we have

$$\begin{aligned} \lambda _{\max }^{m-1}=\varphi _0^{\max }. \end{aligned}$$

Proof

Let $(\bar{\lambda }, \bar{x})$ with $\bar{\lambda }>0$ be an m-degree Pareto-eigenpair of ${\mathcal {Q}}$. By the homogeneity of the complementarity system (1.3) with respect to x, without loss of generality, we assume that $\bar{x}$ satisfies $e^\top \bar{x}=1$. As a consequence, it immediately follows from the strict copositiveness of ${\mathcal {A}}$ that ${\mathcal {A}}\bar{x}^m>0$. Denote

$$\begin{aligned} \bar{y}=\frac{(m-1)^{-\frac{1}{m}}}{\bar{\lambda }}\bar{x}~~~~\mathrm{and}\quad (\bar{u},\bar{v})=\frac{(\bar{x},\bar{y})}{\left( {\mathcal {A}}\bar{x}^m+\bar{y}^\top \bar{y}^{[m-1]}\right) ^{\frac{1}{m}}}. \end{aligned}$$

(3.11)

It is trivial that $(\bar{u},\bar{v})\in {\mathbb {R}}_+^n\times {\mathbb {R}}_+^n$,

$$\begin{aligned} {\mathcal {A}}\bar{u}^m=\frac{1}{{\mathcal {A}}\bar{x}^m+\bar{y}^\top \bar{y}^{[m-1]}}{\mathcal {A}}\bar{x}^m,~~~{\mathrm{and}}~~~\bar{v}^\top \bar{v}^{[m-1]}=\frac{1}{{\mathcal {A}}\bar{x}^m+\bar{y}^\top \bar{y}^{[m-1]}}\bar{y}^\top \bar{y}^{[m-1]}, \end{aligned}$$

which implies that ${\mathcal {A}}\bar{u}^m+\bar{v}^\top \bar{v}^{[m-1]}=1$ holds. Hence, $(\bar{u},\bar{v})$ is a feasible solution of (3.1) and $\varphi _0(\bar{u},\bar{v})\le \varphi _0^\mathrm{max}$.

On the other hand, since $(\bar{\lambda }, \bar{x})$ is an m-degree Pareto-eigenpair of ${\mathcal {Q}}$, we know that $\bar{\lambda } ^m{\mathcal {A}}\bar{x}^{m}+\bar{\lambda }{\mathcal {B}}\bar{x}^{m}-\bar{x}^\top \bar{x}^{[m-1]}=0$. Substituting $(\bar{u},\bar{v})$ into $\varphi _0(u,v)$ yields

$$\begin{aligned} \varphi _0(\bar{u},\bar{v})&=m(m-1)^{\frac{1}{m}-1}\bar{v}^\top \bar{u}^{[m-1]}-{\mathcal {B}}\bar{u}^{m}\\&=\frac{m\bar{x}^\top \bar{x}^{[m-1]}-(m-1)\bar{\lambda }{\mathcal {B}}\bar{x}^m}{(m-1)\bar{\lambda }^m{\mathcal {A}}\bar{x}^m+\bar{x}^\top \bar{x}^{[m-1]}}\bar{\lambda }^{m-1}\\&=\bar{\lambda }^{m-1}, \end{aligned}$$

where the second equality comes from (3.11). Therefore, it holds that $\bar{\lambda }^{m-1}\le \varphi _0^{\mathrm{max}}$, which implies that $\lambda _{\mathrm{max}}^{m-1}\le \varphi _0^{\mathrm{max}}$, which concludes the proof. $\square $

4 Existence of solutions for THDEiCP

In this section, we study more general results on the existence of solutions of THDEiCP with ${\mathcal {C}}:=\pm {\mathcal {I}}$ and $K={\mathbb {R}}_+^n$, but without symmetry assumptions on ${\mathcal {A}}$ and ${\mathcal {B}}$. We first present the existence result for symmetric tensors.

Theorem 4.1

Let ${\mathcal {Q}}=({\mathcal {A}},{\mathcal {B}},-{\mathcal {I}})\in {\mathcal {F}}_{m,n}$. Assume that ${\mathcal {A}}$ and ${\mathcal {B}}$ are both symmetric and ${\mathcal {A}}$ is strictly copositive. Then ${\mathcal {Q}}$ has at least an m-degree Pareto-eigenpair.

Proof

Consider the homogeneous polynomial optimization problem (3.1). Since ${\mathcal {A}}$ is strictly copositive, it is easy to see that the feasible set of (3.1) is compact. Consequently, from the continuity of the objective function $\varphi _0$ in (3.1), its globally optimal solution, denoted by $(\bar{u}, \bar{v})$, exists. As established above, the constraint qualification MFCQ holds at $(\bar{u},\bar{v})$. Hence, $(\bar{u},\bar{v})$ is a stationary point of (3.1). Moreover, we also know that $\varphi _0(\bar{u},\bar{v})>0$, which implies that $\bar{u}\ne 0$. Therefore, the assertion of Theorem 3.1 shows that $(\bar{\lambda }, \bar{u})$ is an m-degree Pareto-eigenpair of ${\mathcal {Q}}$, where $\bar{\lambda }=\left( \varphi _0(\bar{u},\bar{v})\right) ^{\frac{1}{m-1}}$. $\square $

The above theorem is a fundamental result for THDEiCP. However, many real-world problems often violate the symmetry condition. In other words, the symmetry assumptions on ${\mathcal {A}}$ and ${\mathcal {B}}$ are relatively strong. Indeed, a general existence theorem of solutions of QEiCP have been well established in [38], which states that, if (A, B, C) satisfies co-hyperbolicity properties and the leading matrix A is co-regular, then the considered QEiCP has at least one solution. As a generalization of QEiCP, we are naturally concerned with whether such a similar result of QEiCP also holds for tensors. Hereafter, we study a more general result without assuming the symmetry of ${\mathcal {A}}$ and ${\mathcal {B}}$, in addition to presenting some checkable conditions on ${\mathcal {Q}}=({\mathcal {A,B,C}})$, instead of the co-hyperbolicity.

Theorem 4.2

Let ${\mathcal {Q}}=({\mathcal {A,B,-I}})\in {\mathcal {F}}_{m,n}$. Assume that ${\mathcal {A}}$ is strictly copositive. If ${\mathcal {A}}$ and ${\mathcal {B}}$ satisfy the following condition

$$\begin{aligned} (a_{ii\ldots i}+1-m)(m-1)^{\frac{1}{m}-1}-b_{ii\ldots i}>0, ~~~\forall ~i\in [n], \end{aligned}$$

(4.1)

then ${\mathcal {Q}}$ has at least an m-degree Pareto-eigenpair.

Proof

We first denote two sets by

$$\begin{aligned} S=\{(x,y)\in {\mathbb {R}}^n_+\times {\mathbb {R}}^n_+~|~x\ge 0,e^\top x=1,y\ge 0\}\quad {\mathrm{and}}\quad S_0=\{(x,y)\in S~|~\Vert y\Vert \le 1\}. \end{aligned}$$

It is clear that $S_0$ is a compact convex subset of S. Define the function $F:S\times S\rightarrow {\mathbb {R}}$ by

$$\begin{aligned} F(x,y;z,w)=&\left\langle - {\mathcal {B}}x^{m-1}-f(x,y){\mathcal {A}}x^{m-1}+(m-1)^{\frac{1}{m}}{\mathrm{diag}}(y)x^{[m-2]},z\right\rangle \nonumber \\&+\left\langle (m-1)^{\frac{1}{m}-1}x^{[m-1]}-f(x,y)y^{[m-1]},w\right\rangle , \end{aligned}$$

(4.2)

where

$$\begin{aligned} f(x,y)=\frac{m(m-1)^{\frac{1}{m}-1}y^\top x^{[m-1]}-{\mathcal {B}}x^m}{{\mathcal {A}}x^m+y^\top y^{[m-1]}}. \end{aligned}$$

Evidently, $F(x,y;x,y)=0$ holds for any $(x,y)\in S$. Moreover, it can be seen that $F(\cdot ,\cdot ;z,w)$ is lower-semicontinuous on S for any fixed $(z,w)\in S$, and $F(x,y;\cdot ,\cdot )$ is concave on S for any fixed $(x,y)\in S$. With the given condition (4.1), we claim that

$$\begin{aligned} \varOmega :=\{(z,w)\in S~|~F(x,y;z,w)\le 0,~\forall ~(x,y)\in S_0\} \end{aligned}$$

is compact. Otherwise, there exists a sequence $\{(z^{(k)},w^{(k)})\}$ of $\varOmega $ such that

$$\begin{aligned} \Vert (z^{(k)},w^{(k)})\Vert \rightarrow +\infty \quad \text {as}\quad k\rightarrow +\infty . \end{aligned}$$

Since $\{z^{(k)}\}$ is bounded, without loss of generality, we claim that $\Vert w^{(k)}\Vert \rightarrow +\infty $. As a consequence, there exists $i_0\in [n]$ such that $w_{i_0}^{(k)}\rightarrow +\infty $. By taking $x^{(k)}=y^{(k)}=e_{i_0}\in S_0$ with $e_{i_0}$ being the $i_0$th unit vector in ${\mathbb {R}}^n$, we have

$$\begin{aligned} F\left( x^{(k)},y^{(k)};z^{(k)},w^{(k)}\right) =\theta _k+\frac{(a_{i_0\ldots i_0}+1-m)(m-1)^{\frac{1}{m}-1}-b_{i_0\ldots i_0}}{a_{i_0\ldots i_0}+1}w_{i_0}^{(k)}, \end{aligned}$$

where

$$\begin{aligned} \theta _k= & {} \left\langle - {\mathcal {B}}\left( x^{(k)}\right) ^{m-1}-f\left( x^{(k)},y^{(k)}\right) {\mathcal {A}}\left( x^{(k)}\right) ^{m-1}\right. \\&\left. +\,(m-1)^{\frac{1}{m}}{\mathrm{diag}}\left( y^{(k)}\right) \left( x^{(k)}\right) ^{[m-2]},z^{(k)}\right\rangle . \end{aligned}$$

Clearly, the sequence $\{\theta _k\}$ is bounded. It follows from the condition (4.1) that

$$\begin{aligned} F\left( x^{(k)},y^{(k)};z^{(k)},w^{(k)}\right) >0 \end{aligned}$$

for k large enough, which contradicts the fact that $(z^{(k)},w^{(k)})\in \varOmega $. By Theorem 6 in [13], there exists $(\bar{x},\bar{y})\in S$ such that

$$\begin{aligned} F(\bar{x},\bar{y};z,w)\le 0, \quad \forall ~(z,w)\in S. \end{aligned}$$

(4.3)

Take $w=0$ in (4.3), we know that, for any $z\in D:=\{z\in {\mathbb {R}}^n~|~z\ge 0,e^\top z=1\}$,

$$\begin{aligned} F(\bar{x},\bar{y};z,0)=\left\langle - {\mathcal {B}}\bar{x}^{m-1}-\bar{f}{\mathcal {A}}\bar{x}^{m-1}+(m-1)^{\frac{1}{m}}\mathrm{diag}(\bar{y})\bar{x}^{[m-2]},z\right\rangle \le 0, \end{aligned}$$

where $\bar{f}=f(\bar{x},\bar{y})$, which implies

$$\begin{aligned} {\mathcal {B}}\bar{x}^{m-1}+\bar{f}{\mathcal {A}}\bar{x}^{m-1}-(m-1)^{\frac{1}{m}}{\mathrm{diag}}(\bar{y})\bar{x}^{[m-2]}\ge 0, \end{aligned}$$

(4.4)

since D is a basis of ${\mathbb {R}}_+^n$. Take again any $w\in {\mathbb {R}}_+^n$; it is clear that $(\bar{x},\bar{y}+w)\in S$. Consequently, it holds that

$$\begin{aligned} F(\bar{x},\bar{y};\bar{x}, \bar{y}+w)&=F(\bar{x},\bar{y};\bar{x}, \bar{y})+\left\langle (m-1)^{\frac{1}{m}-1}\bar{x}^{[m-1]}-\bar{f}\bar{y}^{[m-1]}, w\right\rangle \\&=\left\langle (m-1)^{\frac{1}{m}-1}\bar{x}^{[m-1]}-\bar{f}\bar{y}^{[m-1]}, w\right\rangle \\&\le 0, \end{aligned}$$

where the second equality is due to the fact that $F(\bar{x},\bar{y};\bar{x}, \bar{y})=0$. Hence,

$$\begin{aligned} \bar{f}\bar{y}^{[m-1]}-(m-1)^{\frac{1}{m}-1}\bar{x}^{[m-1]}\ge 0. \end{aligned}$$

(4.5)

Since $\bar{x}\ge 0$ and $\bar{x}\ne 0$, there exists $i_0\in [n]$ such that $\bar{x}_{i_0}> 0$. An immediate consequence is

$$\begin{aligned} \bar{f}\bar{y}_{i_0}^{m-1}\ge (m-1)^{\frac{1}{m}-1}\bar{x}_{i_0}^{m-1}>0, \end{aligned}$$

which implies $\bar{f}>0$. Denote $I(\bar{y}):=\{i\in [n]~|~\bar{y}_i=0\}$. It is clear that $I(\bar{y})$ is a proper subset of [n]. By (4.5), it is obvious that $\bar{x}_i=0$ for any $i\in I(\bar{y})$. So

$$\begin{aligned} \bar{f}\bar{y}_i^{m-1}=(m-1)^{\frac{1}{m}-1}\bar{x}_i^{m-1},~~~~\forall ~i\in I(\bar{y}). \end{aligned}$$

(4.6)

For any $i\in [n]\backslash I(\bar{y})$, taking $w=te_i$ with $t\in {\mathbb {R}}$, it follows from $\bar{y}_i>0$ that $(\bar{x},\bar{y}+w)\in S$ for any real number t with enough small |t|. Recalling (4.3), we have

$$\begin{aligned} F(\bar{x},\bar{y};\bar{x}, \bar{y}+w)\le 0, \end{aligned}$$

which implies that

$$\begin{aligned} \left( (m-1)^{\frac{1}{m}-1}\bar{x}_i^{m-1}-\bar{f}\bar{y}_i^{m-1}\right) t\le 0 \end{aligned}$$

for $i\in [n]\backslash I(\bar{y})$ and any real number t with enough small |t|. We immediately obtain

$$\begin{aligned} (m-1)^{\frac{1}{m}-1}\bar{x}_i^{m-1}=\bar{f}\bar{y}_i^{m-1},~~~\forall ~i\in [n]\backslash I(\bar{y}). \end{aligned}$$

(4.7)

By (4.6) and (4.7), it holds that

$$\begin{aligned} \bar{f}\bar{y}^{[m-1]}=(m-1)^{\frac{1}{m}-1}\bar{x}^{[m-1]}, \end{aligned}$$

(4.8)

or equivalently,

$$\begin{aligned} \bar{f}^{\frac{1}{m-1}}\bar{y}=(m-1)^{-\frac{1}{m}}\bar{x}. \end{aligned}$$

(4.9)

Combining (4.4) and (4.8) leads to

$$\begin{aligned} 0&\le {\mathcal {B}}\bar{x}^{m-1}+\bar{f}{\mathcal {A}}\bar{x}^{m-1}-\bar{f}^{-\frac{1}{m-1}}\bar{x}^{[m-1]}\\&=\bar{f}^{-\frac{1}{m-1}}\left\{ \bar{f}^{\frac{1}{m-1}}{\mathcal {B}}\bar{x}^{m-1}+\bar{f}^{\frac{m}{m-1}}{\mathcal {A}}\bar{x}^{m-1}-\bar{x}^{[m-1]}\right\} , \end{aligned}$$

which implies

$$\begin{aligned} \bar{\lambda } ^m{\mathcal {A}}\bar{x}^{m-1}+\bar{\lambda }{\mathcal {B}}\bar{x}^{m-1}-\bar{x}^{[m-1]}\ge 0, \end{aligned}$$

(4.10)

where $\bar{\lambda }=\bar{f}^{\frac{1}{m-1}}$. Now we verify that

$$\begin{aligned} \left\langle \bar{x},\; \bar{\lambda } ^m{\mathcal {A}}\bar{x}^{m-1}+\bar{\lambda }{\mathcal {B}}\bar{x}^{m-1}-\bar{x}^{[m-1]}\right\rangle =0. \end{aligned}$$

We only need to check

$$\begin{aligned} \left\langle \bar{x}, \bar{f}{\mathcal {A}}\bar{x}^{m-1}+{\mathcal {B}}\bar{x}^{m-1}-\bar{f}^{-\frac{1}{m-1}}\bar{x}^{[m-1]}\right\rangle =0, \end{aligned}$$

that is,

$$\begin{aligned} \bar{f}{\mathcal {A}}\bar{x}^{m}+{\mathcal {B}}\bar{x}^{m}-\bar{f}^{-\frac{1}{m-1}}\sum _{i=1}^n\bar{x}_i^m=0. \end{aligned}$$

Since $F(\bar{x},\bar{y};\bar{x},\bar{y})=0$, that is,

$$\begin{aligned} \bar{f}{\mathcal {A}}\bar{x}^{m}+{\mathcal {B}}\bar{x}^{m}=m(m-1)^{\frac{1}{m}-1}\bar{y}^\top \bar{x}^{[m-1]}-\bar{f} \sum _{i=1}^n\bar{y}_i^m, \end{aligned}$$

we only need to further verify

$$\begin{aligned} m(m-1)^{\frac{1}{m}-1}\bar{y}^\top \bar{x}^{[m-1]}-\bar{f} \sum _{i=1}^n\bar{y}_i^m-\bar{f}^{-\frac{1}{m-1}}\sum _{i=1}^n\bar{x}_i^m=0. \end{aligned}$$

(4.11)

Actually, the left hand of (4.11) amounts to

$$\begin{aligned}&\;m(m-1)^{\frac{1}{m}-1}\bar{y}^\top \bar{x}^{[m-1]}-\bar{f} \bar{y}^\top \bar{y}^{[m-1]}-\bar{f}^{-\frac{1}{m-1}}\sum _{i=1}^n\bar{x}_i^m\\&\; =m(m-1)^{\frac{1}{m}-1}\bar{y}^\top \bar{x}^{[m-1]}-(m-1)^{\frac{1}{m}-1}\bar{y}^\top \bar{x}^{[m-1]}-\bar{f}^{-\frac{1}{m-1}}\sum _{i=1}^n\bar{x}_i^m\\&\; =(m-1)^{\frac{1}{m}}\bar{y}^\top \bar{x}^{[m-1]}-\bar{f}^{-\frac{1}{m-1}}\bar{x}^\top \bar{x}^{[m-1]}\\&\; =(m-1)^{\frac{1}{m}}\bar{y}^\top \bar{x}^{[m-1]}-\bar{f}^{-\frac{1}{m-1}}(m-1)^{\frac{1}{m}}\bar{f}^{\frac{1}{m-1}}\bar{y}^\top \bar{x}^{[m-1]}\\&\;=0, \end{aligned}$$

where the first equality is due to (4.8), and the second equality comes from (4.9). Therefore, $(\bar{\lambda }, \bar{x})$ is an m-degree Pareto-eigenpair of ${\mathcal {Q}}$. $\square $

Similarly, when we deal with the case of ${\mathcal {C}}:={\mathcal {I}}$, we can also establish the following result.

Theorem 4.3

Let ${\mathcal {Q}}=({\mathcal {A,B,I}})\in {\mathcal {F}}_{m,n}$. Assume that ${\mathcal {A}}$ is strictly copositive. If ${\mathcal {A}}$ and ${\mathcal {B}}$ satisfy the following condition

$$\begin{aligned} (m+a_{ii\ldots i}-1)(m-1)^{\frac{1}{m}-1}+b_{ii\ldots i}>0, \quad \forall ~i=1,2,\ldots ,n, \end{aligned}$$

then ${\mathcal {Q}}$ has at least an m-degree Pareto-eigenpair.

Proof

Define the function $h:{\mathbb {R}}_+^n\times {\mathbb {R}}_+^n\rightarrow {\mathbb {R}}$ by

$$\begin{aligned} h(x,y)=\frac{(2-m)(m-1)^{\frac{1}{m}-1}y^\top x^{[m-1]}-{\mathcal {B}}x^m}{{\mathcal {A}}x^m+y^\top y^{[m-1]}}. \end{aligned}$$

and the function $G:S\times S\rightarrow {\mathbb {R}}$ by

$$\begin{aligned} \begin{array}{lll} G(x,y;z,w)&{}=&{}\left\langle - {\mathcal {B}}x^{m-1}-h(x,y){\mathcal {A}}x^{m-1}-(m-1)^{\frac{1}{m}}{\mathrm{diag}}(y)x^{[m-2]},z\right\rangle \\ &{}&{}+\,\left\langle (m-1)^{\frac{1}{m}-1}x^{[m-1]}-h(x,y)y^{[m-1]},w\right\rangle , \end{array} \end{aligned}$$

where S is defined in the proof of Theorem 4.2. We can prove the assertion in a way similar to that used in Theorem 4.2, and skip its details here. $\square $

As a byproduct of Theorem 4.3, we immediately obtain the following existence result of the solution for QEiCP, which differs from the one presented in Ref. [38].

Corollary 4.1

Consider QEiCP corresponding to the special case of THDEiCP with $m=2$. Let ${\mathcal Q}:=(A,B,I)\in {\mathcal {M}}_n$. Assume that A is strictly copositive matrix. If A and B satisfy that $a_{ii}+b_{ii}+1>0$ for every $i\in [n]$, then ${\mathcal Q}$ has at least one quadratic Pareto-eigenpair.

5 Numerical algorithm and experiments

In this section, we first introduce an implementable splitting algorithm based upon the augmented Lagrangian method, which efficiently exploits the weakly coupled structure of the resulting optimization formulation of THDEiCP. Then, we conduct some computational experiments to show the reliability and convergence behavior of the proposed algorithm.

5.1 The algorithm

Note that model (3.1) can be recast as the standard minimization problem:

$$\begin{aligned} \min&\;\; {\mathcal {B}}u^m + {{\varvec{\vartheta }}}v^\top u^{[m-1]}\nonumber \\ {\mathrm{s.t.}}&\;\; {\mathcal {A}}u^m + {\mathcal {I}}v^m = 1, \nonumber \\&\;\; u\ge 0,\;\;v\ge 0, \end{aligned}$$

(5.1)

where ${\varvec{\vartheta }}$ is a constant given by ${\varvec{\vartheta }}:=-m(m-1)^{\frac{1}{m}-1}$. Here, we should notice that it is possible to employ the powerful semismooth and smoothing Newton methods to solve the model under consideration. However, we show below that a first-order structure-exploiting algorithm can be developed, which is much easier to implement than the second-order type methods.

Revisiting on (5.1), we observe that (5.1) is an equality constrained optimization problem, and we know that the Augmented Lagrangian Method (ALM) [20, 30] is a benchmark solver for this type of model. Let ${\varvec{\zeta }}\in {\mathbb {R}}$ be the Lagrangian multiplier associated to the equality constraint. The augmented Lagrangian function is given by

$$\begin{aligned} {\mathscr {L}}(u,v,{\varvec{\zeta }}):= {\mathcal {B}}u^m + {\varvec{\vartheta }}v^\top u^{[m-1]} -{\varvec{\zeta }}\left( {\mathcal {A}}u^m + {\mathcal {I}}v^m -1\right) + \frac{\beta }{2}\left( {\mathcal {A}}u^m + {\mathcal {I}}v^m -1\right) ^2,\nonumber \\ \end{aligned}$$

(5.2)

where $\beta >0$ is the penalty parameter. Consequently, for a given ${\varvec{\zeta }}^{(k)}\in {\mathbb {R}}$, the iterative scheme of ALM reads as follows:

$$\begin{aligned} {}&\left( u^{(k+1)},v^{(k+1)}\right) =\arg \min _{u,v} \left\{ {\mathscr {L}}(u,v,{\varvec{\zeta }}^{(k)})\;|\;u\ge 0,\;v\ge 0\right\} ; \end{aligned}$$

(5.3a)

$$\begin{aligned}&{\varvec{\zeta }}^{(k+1)}={\varvec{\zeta }}^{(k)} - \beta \left( {\mathcal {A}}\left( u^{(k+1)}\right) ^m +{\mathcal {I}}\left( v^{(k+1)}\right) ^m -1\right) . \end{aligned}$$

(5.3b)

However, it seems not easy enough to implement such an algorithm due to the coupled structure and high nonlinearity emerging in the objective function and equality constraint. To improve its implementability and numerical performance, the so-called Alternating Direction Method of Multipliers (ADMM) [17, 19] was judiciously developed for separable convex minimizations by updating the variables in an alternating (Gauss-Seidel) order. In recent years, it is well documented that ADMM has a surge of popularity in the areas such as signal/image processing, statistical learning, data mining, and so on. Here we just refer to [5, 12, 18] for some surveys on ADMM.

Following the spirit of ADMM, we split the first subproblem (5.3a) into two parts. For given $(v^{(k)},{\varvec{\zeta }}^{(k)})$, we immediately have the following ADMM scheme:

$$\begin{aligned} {} u^{(k+1)}= & {} \arg \min _{u} \left\{ {\mathscr {L}}(u,v^{(k)},{\varvec{\zeta }}^{(k)})\;|\;u\ge 0\right\} ; \end{aligned}$$

(5.4a)

$$\begin{aligned} v^{(k+1)}= & {} \arg \min _{v} \left\{ {\mathscr {L}}(u^{(k+1)},v,{\varvec{\zeta }}^{(k)})\;|\;v\ge 0\right\} ; \end{aligned}$$

(5.4b)

$$\begin{aligned} {\varvec{\zeta }}^{(k+1)}= & {} {\varvec{\zeta }}^{(k)} - \beta \left( {\mathcal {A}}\left( u^{(k+1)}\right) ^m +{\mathcal {I}}\left( v^{(k+1)}\right) ^m -1\right) . \end{aligned}$$

(5.4c)

It seems that such an algorithm exploits the weakly separable structure of model (5.1). However, it also fails to be easily implemented, because the first two subproblems are not easy enough to have closed-form solutions. Indeed, we can clearly observe that both subproblems (5.4a) and (5.4b) have very simple convex sets as their constraints, thereby making the projections onto these sets very easy. Hence, it would greatly simplify (5.4a) if both subproblems could reduce to the computation of projections. Below, we consider the linearized version of (5.4a) so that each subproblem has closed-form representation. Since ${\mathscr {L}}(u,v,{\varvec{\zeta }})$ is nonconvex with respect to u and v in general cases, for the purpose of making both subproblems well-posed, we attach two proximal terms $\frac{\gamma _1}{2}\Vert u-u^{(k)}\Vert ^2$ and $\frac{\gamma _2}{2}\Vert v-v^{(k)}\Vert ^2$ to (5.4a) and (5.4b), respectively. Here $\gamma _1$ and $\gamma _2$ are two positive constants. More specifically, linearizing the nonlinear parts of ${\mathscr {L}}(u,v,{\varvec{\zeta }})$ (see gradients in (3.2)), we derive a linearized ADMM as follows:

$$\begin{aligned} {} u^{(k+1)}= & {} \varPi _{{\mathbb {R}}^n_{+}}\left[ u^{(k)} -\frac{{\varvec{\varPhi }}^{(k)} }{\gamma _1} \right] , \end{aligned}$$

(5.5a)

$$\begin{aligned} v^{(k+1)}= & {} \varPi _{{\mathbb {R}}^n_{+}}\left[ v^{(k)}- \frac{{\varvec{\vartheta }}\left( u^{(k+1)}\right) ^{[m-1]}+{\varvec{\varUpsilon }}^{(k)}}{\gamma _2} \right] , \end{aligned}$$

(5.5b)

$$\begin{aligned} {\varvec{\zeta }}^{(k+1)}= & {} {\varvec{\zeta }}^{(k)} - \beta \left( {\mathcal {A}}\left( u^{(k+1)}\right) ^m + {\mathcal {I}}\left( v^{(k+1)}\right) ^m -1\right) , \end{aligned}$$

(5.5c)

where $\varPi _{{\mathbb {R}}^n_+}[\cdot ]$ represents the projection onto ${\mathbb {R}}^n_+$,

$$\begin{aligned} {\varvec{\varPhi }}^{(k)}:=m{\mathcal {B}}\left( u^{(k)}\right) ^{m-1} +{\varvec{\vartheta }}(m-1){\mathrm{diag}}\left( v^{(k)}\right) \left( u^{(k)}\right) ^{[m-2]} +\beta m {\varvec{q}}^{(k)}{\mathcal {A}}\left( u^{(k)}\right) ^{m-1}, \end{aligned}$$

with ${\varvec{q}}^{(k)} := {\mathcal {A}}(u^{(k)})^m+{\mathcal {I}}(v^{(k)})^m-1-\frac{{\varvec{\zeta }}^{(k)}}{\beta }$, and

$$\begin{aligned} {\varvec{\varUpsilon }}^{(k)}:=\beta m\left( {\mathcal {A}}\left( u^{(k+1)}\right) ^m +{\mathcal {I}}\left( v^{(k)}\right) ^m-1-\frac{{\varvec{\zeta }}^{(k)}}{\beta } \right) {\mathcal {I}}\left( v^{(k)}\right) ^{m-1}. \end{aligned}$$

Obviously, the linearized version (5.5a) is more implementable than (5.4a) due to the pretty simple iterative scheme. To the best of our knowledge, there is no convergence result of such a linearized ADMM for solving the underlying nonconvex model. Therefore, it seems that our method (5.5a) goes beyond the theoretical guarantees of the traditional ADMM. However, we will illustrate that our method (5.5a) indeed is numerically convergent for model (5.1) in many cases.

5.2 Numerical experiments

We have shown that THDEiCP (1.3) is solvable when $K:={\mathbb {R}}^n_+$ in Sect. 4 and introduce an implementable splitting method in Sect. 5.1. Now, we turn our attention to verifying our theoretical results and convergence behavior of the proposed algorithm (5.5a) through preliminary computational results. We implement our algorithm in Matlab R2012b and conduct the numerical experiments on a Lenovo notebook with Intel(R) Core(TM) i5-5200U CPU@2.20 GHz and 4 GB RAM running on Windows 7 Home Premium operating system.

Notice that model (5.1) is also available for matrix cases, and it is a new formulation in the QEiCP literature. Thus, we also test here a matrix scenario for the purpose of showing the efficiency of our proposed algorithm (5.5a) in solving QEiCP. In the following experiments, we test three synthetic examples and only list the details of ${\mathcal {A}}$ (or A) and ${\mathcal {B}}$ (or B) in the coming examples. For the random data, we generate them by the Matlab script .

Example 5.1

This example considers a special case of THDEiCP, that is, QEiCP, whose matrices A and B are uniformly distributed in (0, 1) and given by

$$\begin{aligned} A=\left( \begin{array}{cccc} 0.2296 &{} 0.6870 &{} 0.7421 &{} 0.8943 \\ 0.6870 &{} 0.9403 &{} 0.1194 &{} 0.5919 \\ 0.7421 &{} 0.1194 &{} 0.9325 &{} 0.7779 \\ 0.8943 &{} 0.5919 &{} 0.7779 &{} 0.3290 \end{array}\right) ,\;\;B=\left( \begin{array}{cccc} 0.2235 &{} 0.3014 &{} 0.7879 &{} 0.5394\\ 0.3014 &{} 0.4026 &{} 0.5329 &{} 0.5453\\ 0.7879 &{} 0.5329 &{} 0.8272 &{} 0.5375\\ 0.5394 &{} 0.5453 &{} 0.5375 &{} 0.5994 \end{array}\right) . \end{aligned}$$

Example 5.2

We consider the case where ${\mathcal {A}}$ and ${\mathcal {B}}$ are two 3-rd order 4-dimensional tensors; ${\mathcal {A}}$ is strictly copositive, but not nonnegative, and ${\mathcal {B}}$ is a randomly generated tensor, whose entries are uniformly distributed in (1, 2), that is,

$$\begin{aligned} {\mathcal {A}}(:,:,1)= & {} \left( \begin{array}{cccc} 2&{}2&{}4/3&{}4/3\\ 2&{}4/3&{}2/3&{}4/3\\ 4/3&{}2/3&{}8/3&{}0\\ 4/3&{}4/3&{}0&{}2 \end{array}\right) ,~~ {\mathcal {B}}(:,:,1)= \left( \begin{array}{cccc} 1.6557 &{} 1.3572 &{} 1.7523 &{} 1.6055\\ 1.3572 &{} 1.7577 &{} 1.4572 &{} 1.2192\\ 1.7523 &{} 1.4572 &{} 1.7060 &{} 1.0645\\ 1.6055 &{} 1.2192 &{} 1.0645 &{} 1.8235 \end{array}\right) ,\\ {\mathcal {A}}(:,:,2)= & {} \left( \begin{array}{cccc} 2&{}4/3&{}2/3&{}4/3\\ 4/3&{}12&{}-2/3&{}10/3\\ 2/3&{}-2/3&{}16/3&{}-2\\ 4/3&{}10/3&{}-2&{}14/3 \end{array}\right) ,~~ {\mathcal {B}}(:,:,2)= \left( \begin{array}{cccc} 1.6551 &{} 1.5612 &{} 1.4351 &{} 1.6946\\ 1.5612 &{} 1.3404 &{} 1.4202 &{} 1.5916\\ 1.4351 &{} 1.4202 &{} 1.5060 &{} 1.6231\\ 1.6946 &{} 1.5916 &{} 1.6231 &{} 1.1386 \end{array}\right) ,\\ {\mathcal {A}}(:,:,3)= & {} \left( \begin{array}{cccc} 4/3&{}2/3&{}8/3&{}0\\ 2/3&{}-2/3&{}16/3&{}-2\\ 8/3&{}16/3&{}6&{}2/3\\ 0&{}-2&{}2/3&{}0 \end{array}\right) ,~~ {\mathcal {B}}(:,:,3)= \left( \begin{array}{cccc} 1.9172 &{} 1.3331 &{} 1.6440 &{} 1.6613\\ 1.3331 &{} 1.5678 &{} 1.4275 &{} 1.2617\\ 1.6440 &{} 1.4275 &{} 1.9340 &{} 1.0709\\ 1.6613 &{} 1.2617 &{} 1.0709 &{} 1.3371 \end{array}\right) ,\\ {\mathcal {A}}(:,:,4)= & {} \left( \begin{array}{cccc} 4/3&{}4/3&{}0&{}2\\ 4/3&{}10/3&{}-2&{}14/3\\ 0&{}-2&{}2/3&{}0\\ 2&{}14/3&{}0&{}4 \end{array} \right) ,~~ {\mathcal {B}}(:,:,4)= \left( \begin{array}{cccc} 1.5383 &{} 1.5514 &{} 1.4477 &{} 1.3513\\ 1.5514 &{} 1.9619 &{} 1.4367 &{} 1.7875\\ 1.4477 &{} 1.4367 &{} 1.0844 &{} 1.4156\\ 1.3513 &{} 1.7875 &{} 1.4156 &{} 1.9106 \end{array}\right) . \end{aligned}$$

Example 5.3

We consider two 4th order 3-dimensional symmetric tensors ${\mathcal {A}}$ and ${\mathcal {B}}$, where ${\mathcal {A}}$ and ${\mathcal {B}}$ are randomly generated and uniformly distributed in (0, 1). Specifically, ${\mathcal {A}}$ and ${\mathcal {B}}$ are taken as follows:

$$\begin{aligned} {\mathcal {A}}(:,:,1,1)=\left( \begin{array}{ccccc} 0.6229 &{}&{} 0.2644 &{}&{} 0.3567 \\ 0.2644 &{}&{} 0.0475 &{}&{} 0.7367\\ 0.3567 &{}&{} 0.7367 &{}&{} 0.1259 \end{array}\right) ,\; {\mathcal {B}}(:,:,1,1)=\left( \begin{array}{ccccc} 0.6954 &{}&{} 0.4018 &{}&{} 0.1406\\ 0.4018 &{}&{} 0.9957 &{}&{} 0.0483\\ 0.1406 &{}&{} 0.0483 &{}&{} 0.0988 \end{array}\right) , \\ {\mathcal {A}}(:,:,1,2)=\left( \begin{array}{ccccc} 0.7563 &{}&{} 0.5878 &{}&{} 0.5406\\ 0.5878 &{}&{} 0.1379 &{}&{} 0.0715\\ 0.5406 &{}&{} 0.0715 &{}&{} 0.3725 \end{array}\right) , \;{\mathcal {B}}(:,:,1,2)=\left( \begin{array}{ccccc} 0.6730 &{}&{} 0.5351 &{}&{} 0.4473\\ 0.5351 &{}&{} 0.2853 &{}&{} 0.3071\\ 0.4473 &{}&{} 0.3071 &{}&{} 0.9665 \end{array}\right) ,\\ {\mathcal {A}}(:,:,1,3)=\left( \begin{array}{ccccc} 0.0657 &{}&{} 0.4918 &{}&{} 0.9312\\ 0.4918 &{}&{} 0.7788 &{}&{} 0.9045\\ 0.9312 &{}&{} 0.9045 &{}&{} 0.8711 \end{array}\right) ,\; {\mathcal {B}}(:,:,1,3)=\left( \begin{array}{ccccc} 0.7585 &{}&{} 0.6433 &{}&{} 0.2306\\ 0.6433 &{}&{} 0.8986 &{}&{} 0.3427\\ 0.2306 &{}&{} 0.3427 &{}&{} 0.5390 \end{array}\right) , \\ {\mathcal {A}}(:,:,2,1)=\left( \begin{array}{ccccc} 0.7563 &{}&{} 0.5878 &{}&{} 0.5406\\ 0.5878 &{}&{} 0.1379 &{}&{} 0.0715\\ 0.5406 &{}&{} 0.0715 &{}&{} 0.3725 \end{array}\right) ,{\mathcal {B}}(:,:,2,1)=\left( \begin{array}{ccccc} 0.6730 &{}&{} 0.5351 &{}&{} 0.4473\\ 0.5351 &{}&{} 0.2853 &{}&{} 0.3071\\ 0.4473 &{}&{} 0.3071 &{}&{} 0.9665 \end{array}\right) , \\ {\mathcal {A}}(:,:,2,2)=\left( \begin{array}{ccccc} 0.7689 &{}&{} 0.3941 &{}&{} 0.6034\\ 0.3941 &{}&{} 0.3577 &{}&{} 0.3465\\ 0.6034 &{}&{} 0.3465 &{}&{} 0.4516 \end{array}\right) ,\;{\mathcal {B}}(:,:,2,2)=\left( \begin{array}{ccccc} 0.3608 &{}&{} 0.3914 &{}&{} 0.5230\\ 0.3914 &{}&{} 0.6822 &{}&{} 0.5516\\ 0.5230 &{}&{} 0.5516 &{}&{} 0.7091 \end{array}\right) ,\\ {\mathcal {A}}(:,:,2,3)=\left( \begin{array}{ccccc} 0.8077 &{}&{} 0.4910 &{}&{} 0.2953\\ 0.4910 &{}&{} 0.5054 &{}&{} 0.5556\\ 0.2953 &{}&{} 0.5556 &{}&{} 0.9608 \end{array}\right) ,\; {\mathcal {B}}(:,:,2,3)=\left( \begin{array}{ccccc} 0.4632 &{}&{} 0.2043 &{}&{} 0.2823\\ 0.2043 &{}&{} 0.7282 &{}&{} 0.7400\\ 0.2823 &{}&{} 0.7400 &{}&{} 0.9369 \end{array}\right) ,\\ {\mathcal {A}}(:,:,3,1)=\left( \begin{array}{ccccc} 0.0657 &{}&{} 0.4918 &{}&{} 0.9312\\ 0.4918 &{}&{} 0.7788 &{}&{} 0.9045\\ 0.9312 &{}&{} 0.9045 &{}&{} 0.8711 \end{array}\right) ,\; {\mathcal {B}}(:,:,3,1)=\left( \begin{array}{ccccc} 0.7585 &{}&{} 0.6433 &{}&{} 0.2306\\ 0.6433 &{}&{} 0.8986 &{}&{} 0.3427\\ 0.2306 &{}&{} 0.3427 &{}&{} 0.5390 \end{array}\right) , \\ {\mathcal {A}}(:,:,3,2)=\left( \begin{array}{ccccc} 0.8077 &{}&{} 0.4910 &{}&{} 0.2953\\ 0.4910 &{}&{} 0.5054 &{}&{} 0.5556\\ 0.2953 &{}&{} 0.5556 &{}&{} 0.9608 \end{array}\right) , \;{\mathcal {B}}(:,:,3,2)=\left( \begin{array}{ccccc} 0.4632 &{}&{} 0.2043 &{}&{} 0.2823\\ 0.2043 &{}&{} 0.7282 &{}&{} 0.7400\\ 0.2823 &{}&{} 0.7400 &{}&{} 0.9369 \end{array}\right) ,\\ {\mathcal {A}}(:,:,3,3)=\left( \begin{array}{ccccc} 0.7581 &{}&{} 0.7205 &{}&{} 0.9044\\ 0.7205 &{}&{} 0.0782 &{}&{} 0.7240\\ 0.9044 &{}&{} 0.7240 &{}&{} 0.3492 \end{array}\right) ,\;{\mathcal {B}}(:,:,3,3)=\left( \begin{array}{ccccc} 0.8200 &{}&{} 0.5914 &{}&{} 0.4983\\ 0.5914 &{}&{} 0.0762 &{}&{} 0.2854\\ 0.4983 &{}&{} 0.2854 &{}&{} 0.1266 \end{array}\right) .\end{aligned}$$

Before our experiments, we first introduce a reasonable stopping rule for the proposed method (5.5a). Without loss of generality, we can use

$$\begin{aligned} {\mathrm{RelErr}}:=\max \left\{ \Vert u^{(k+1)}-u^{(k)}\Vert ,\;\Vert v^{(k+1)}-v^{(k)}\Vert ,\;|{\varvec{V}}^{(k)}|\right\} \le {\mathrm{Tol}} \end{aligned}$$

(5.6)

as a termination criterion to pursue an approximate solution with a preset tolerance ‘Tol’, where $|{\varvec{V}}^{(k)}|:=|{\mathcal {A}}(u^{(k+1)})^m+{\mathcal {I}}(v^{(k+1)})^m-1|$ measures the violation of the underlying equality constraint. For the parameters involved in our algorithm, we throughout take $\beta =1$, in addition to setting the starting points $u^{(0)}$ and $v^{(0)}$ as randomly generated vectors and ${\varvec{\zeta }}^{(0)}=0$. For the other two parameters, we choose $\gamma _1=200$ and $\gamma _2=10$ for Example 5.1, $\gamma _1=1000$ and $\gamma _2=50$ for Examples 5.2–5.3. The tolerance ‘Tol’ in (5.6) is taken as ${\mathrm{Tol}}=10^{-6}$ for all tests.

As we have mentioned in the introduction, tensor-related polynomial optimization problems suffers from high nonlinearity. From a theoretical point of view, linearization may destroy structural properties of the underlying functions, thereby resulting in inadequate approximations so that the algorithm is not necessarily convergent for some cases. To investigate the performance of such a linearization, in Fig. 1, we plot evolutions of the relative error (‘RelErr’ defined by (5.6)) and the objective value (‘Obj.’) of (5.1) [i.e., $-\varphi _0(u^{(k)},v^{(k)})$] with respect to the number of iterations, respectively, and the ability of finding ideal solutions of THDEiCP.

From the left plots of Fig. 1, we can see that our linearized ADMM is convergent very fast with a random starting point. More importantly, it clearly shows that the high nonlinearity leads to severely oscillating property in terms of the relative error. Actually, our computational experiences tell us that an inappropriate initial point far away from a local solution may lead to divergence (see the right plot in Fig. 1), which also implies that designing an implementable and stable algorithm for THDEiCP is a challengeable task. To further verify the ability and reliability of our algorithm, we randomly generate 100 different starting points such that $u^{(0)}=v^{(0)}$ and their entries are uniformly distributed in (0, 1). As we have proved in Theorem 3.1, a solution $u^{(k)}$ of THDEiCP must be a nonzero vector. In our experiments, we observe that the algorithm terminates at a zero point in some cases. Accordingly, we record all results of the 100 tests and divide them into three groups: the first group corresponds to the divergent cases, which means the number of iterations exceeds the preset maximum iterations 20000; the second group refers to convergent cases but failed to find a nonzero solution of THDEiCP; the last group contains the cases of successfully finding a nonzero solution of THDEiCP. The corresponding rate of each group is graphically shown by the right plot in Fig. 1, which empirically exhibits the ability and reliability of our proposed algorithm. Indeed, we did a lot of experiments on QEiCP, and interestingly, all numerical results shows that the proposed linearized ADMM is always convergent for QEiCP. Thus, such an algorithm further enriches the solvers tailored for QEiCP. This also leaves us an open problem of whether we could prove global or local convergence of the linearized ADMM on solving the nonconvex model (5.1).

In Table 1, we list several groups of numerical results including starting points $u^{(0)}$, eigenvalue (EigVal $\lambda $), eigenvector (x), dual variable ${\varvec{\varrho }}$, number of iterations (‘Iter.’) and computing time in seconds (Time), where the dual variable is defined by

$$\begin{aligned} {\varvec{\varrho }}:=\lambda ^m{\mathcal {A}}x^{m-1}+\lambda {\mathcal {B}}x^{m-1}-{\mathcal {I}}x^{m-1}, \end{aligned}$$

which together with the eigenvector x satisfies $x^\top {\varvec{\varrho }}=0$.

Table 1 Computational results by the proposed algorithm

Full size table

It can be easily seen from the data in Table 1 that our proposed algorithm is fast and reliable for solving the model under consideration, even it is not necessarily convergent for some cases. We conducted many simulations on random data and observed that an appropriate initial point and the parameters, especially the two $\gamma _1$ and $\gamma _2$ are very important for convergence. Therefore, we will pay our attention on the study of convergence results of the iterative scheme (5.5a) in the future. Additionally, we will provide some practical suggestions on choices of $\gamma _1$ and $\gamma _2$.

6 Conclusions

This paper considers a unified model of THDEiCP including TGEiCP and QEiCP as its special cases. As the first work on finding higher-degree cone eigenvalues of tensors, we analyze some corresponding topological properties including closedness, boundedness, and upper-semicontinuity of the m-degree K-spectrum of ${\mathcal Q}$ [(i.e., $\sigma ({\mathcal Q},K)$ defined by (1.4)], and the number of m-degree Pareto- and K-eigenvalues of THDEiCP. For the special case where the underlying tensors ${\mathcal {A}}$ and ${\mathcal {B}}$ are symmetric, ${\mathcal {C}}=-{\mathcal {I}}$, and $K:={\mathbb {R}}^n_+$, we present a weakly coupled optimization formulation for THDEiCP, which is also a new formulation for QEiCP in the literature. Moreover, such a formulation could bring some numerical benefits for algorithmic design, for instance, an implementable linearized ADMM is developed in this paper. Theoretically, we establish results concerning existence of solutions of THDEiCP with general (symmetric and nonsymmetric) ${\mathcal {A}}$ and ${\mathcal {B}}$ when ${\mathcal {C}}:=\pm {\mathcal {I}}$ and $K:={\mathbb {R}}^n_+$. In the future, we will continue our study in this direction, but with general tensor ${\mathcal {C}}$ not being a unit tensor (i.e., ${\mathcal {C}}\ne \pm {\mathcal {I}}$). Of course, the convergence analysis of the proposed algorithm and designing algorithms for THDEiCP, especially for nonsymmetric cases, are also our future concerns.

References

Adly, S., Seeger, A.: A new method for solving Pareto eigenvalue complementarity problems. Comput. Optim. Appl. 55, 703–731 (2013)
Article MathSciNet MATH Google Scholar
Beer, G.: Topologies on Closed and Closed Convex Sets. Kluwer Academic Publishers, Dordrecht (1993)
Book MATH Google Scholar
Bermúdez, A., Durán, R., Rodríguez, R., Solomin, J.: Finite element analysis of a quadratic eigenvalue problem arising in dissipative acoustics. SIAM J. Numer. Anal. 38(1), 267–291 (2000)
Article MathSciNet MATH Google Scholar
Bloy, L., Verma, R.: On computing the underlying fiber directions from the diffusion orientation distribution function. In: Metaxas, D., Axel, L., Fichtinger, G., Székely, G. (eds.) Medical Image Computing and Computer-Assisted Intervention-MICCAI 2008, pp. 1–8. Springer, Berlin (2008)
Chapter Google Scholar
Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, J.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn. 3, 1–122 (2010)
Article MATH Google Scholar
Brás, C.P., Iusem, A.N., Júdice, J.J.: On the quadratic eigenvalue complementarity problem. J. Global Optim. (2014). doi:10.1007/s10898-014-0260-5
MATH Google Scholar
Chang, K.C., Pearson, K., Zhang, T.: Perron-Frobenius theorem for nonnegative tensors. Commun. Math. Sci 6, 507–520 (2008)
Article MathSciNet MATH Google Scholar
Chang, K.C., Pearson, K., Zhang, T.: On eigenvalue problems of real symmetric tensors. J. Math. Anal. Appl. 350, 416–422 (2009)
Article MathSciNet MATH Google Scholar
Cox, D.A., Little, J., O’shea, D.: Using Algebraic Geometry, vol. 185. Springer, New York (2006)
MATH Google Scholar
da Costa, A., Figueiredo, I., Júdice, J., Martins, J.: A complementarity eigenproblem in the stability analysis of finite dimensional elastic systems with frictional contact. In: Ferris, M., Pang, J.S., Mangasarian, O. (eds.) Complementarity: Applications, Algorithms and Extensions, pp. 67–83. Kluwer, New York (2001)
Chapter Google Scholar
da Costa, A.P., Seeger, A.: Cone-constrained eigenvalue problems: theory and algorithms. Comput. Optim. Appl. 45, 25–57 (2010)
Article MathSciNet MATH Google Scholar
Eckstein, J., Yao, W.: Understanding the convergence of the alternating direction method of multipliers: theoretical and compuational perspective. Pac. J. Optim. To appear
Fan, K.: Some properties of convex sets related to fixed point theorems. Mathematische Annalen 266, 519–537 (1984)
Article MathSciNet MATH Google Scholar
Fernandes, L.M., Júdice, J.J., Fukushima, M., Iusem, A.: On the symmetric quadratic eigenvalue complementarity problem. Optim. Method Softw. 59, 307–326 (2014)
MathSciNet MATH Google Scholar
Fernandes, L.M., Júdice, J.J., Sherali, H.D., Forjaz, M.A.: On an enumerative algorithm for solving eigenvalue complementarity problems. Comput. Optim. Appl. 59, 113–134 (2014)
Article MathSciNet MATH Google Scholar
Fernandes, L.M., Júdice, J.J., Sherali, H.D., Fukushima, M.: On the computation of all eigenvalues for the eigenvalue complementarity problem. J. Global Optim. 59, 307–326 (2014)
Article MathSciNet MATH Google Scholar
Gabay, D., Mercier, B.: A dual algorithm for the solution of nonlinear variational problems via finite element approximations. Comput. Math. Appl. 2, 16–40 (1976)
Article MATH Google Scholar
Glowinski, R.: On alternating direction methods of multipliers: a historical perspective. In: Fitzgibbon, W., Kuznetsov, Y.A., Neittaanmäki, P., Pironneau, O. (eds.) Modeling, Simulation and Optimization for Science and Technology, Computational Methods in Applied Sciences, vol. 34, chap. 4, pp. 59–82. Springer, New York (2014)
Google Scholar
Glowinski, R., Marrocco, A.: Approximation par éléments finis d’ordre un et résolution par pénalisation-dualité d’une classe de problèmes non linéaires. R.A.I.R.O. R2, 41–76 (1975)
MathSciNet MATH Google Scholar
Hestenes, M.: Multiplier and gradient methods. J. Optim. Theory Appl. 4, 303–320 (1969)
Article MathSciNet MATH Google Scholar
Iusem, A., Seeger, A.: On convex cones with infinitely many critical angles. Optimization 56, 115–128 (2007)
Article MathSciNet MATH Google Scholar
Júdice, J.J., Raydan, M., Rosa, S.S., Santos, S.A.: On the solution of the symmetric eigenvalue complementarity problem by the spectral projected gradient algorithm. Numer. Algorithm 47(4), 391–407 (2008)
Article MathSciNet MATH Google Scholar
Júdice, J.J., Sherali, H.D., Ribeiro, I.M.: The eigenvalue complementarity problem. Comput. Optim. Appl. 37, 139–156 (2007)
Article MathSciNet MATH Google Scholar
Judice, J.J., Sherali, H.D., Ribeiro, I.M., Rosa, S.S.: On the asymmetric eigenvalue complementarity problem. Optim. Method Softw. 24(4–5), 549–568 (2009)
Article MathSciNet MATH Google Scholar
Lim, L.H.: Singular values and eigenvalues of tensors: a variational approach. In: Proceedings of the IEEE International Workshop on Computational Advances in Multi-Sensor Addaptive Processing. CAMSAP05, pp. 129–132. IEEE Computer Society Press, Piscataway (2005)
Ling, C., He, H.J., Qi, L.: On the cone eigenvalue complementarity problem for higher-order tensors. Comput. Optim. Appl. (2015). doi:10.1007/s10589-015-9767-z
MathSciNet MATH Google Scholar
Meirovitch, L., Trefethen, L.: A spectral Petrov-Galerkin formulation for pipe flow I: Linear stability and transient growth. Technical Report 00/18, Numerical Analysis Group, Oxford University Computing Laboratory, Oxford (2000)
Misrikhanov, M.S., Ryabchenko, V.N.: The quadratic eigenvalue problem in electric power systems. Autom Remote Control 67(5), 698–720 (2006)
Article MathSciNet MATH Google Scholar
Ng, M., Qi, L., Zhou, G.: Finding the largest eigenvalue of a non-negative tensor. SIAM J. Matrix Anal. Appl. 31, 1090–1099 (2009)
Article MathSciNet MATH Google Scholar
Powell, M.: A method for nonlinear constraints in minimization problems. In: Fletcher, R. (ed.) Optimization, pp. 283–298. Academic Press, London (1969)
Google Scholar
Qi, L.: Eigenvalues of a real supersymmetric tensor. J. Symbolic Comput. 40(6), 1302–1324 (2005)
Article MathSciNet MATH Google Scholar
Qi, L.: Symmetric nonnegative tensors and copositive tensors. Linear Algebra Appl. 439, 228–238 (2013)
Article MathSciNet MATH Google Scholar
Qi, L., Wang, F., Wang, Y.: Z-eigenvalue methods for a global polynomial optimization problem. Math. Progr. 118(2), 301–316 (2009)
Article MathSciNet MATH Google Scholar
Qi, L., Yu, G., Wu, E.X.: Higher order positive semidefinite diffusion tensor imaging. SIAM J. Imaging Sci. 3, 416–433 (2010)
Article MathSciNet MATH Google Scholar
Rockafellar, R.: Convex processes and hamiltonian dynamical systems. In: Convex Analysis and Mathematical Economics. Lecture Notes in Econmics and Mathematical Systems, vol. 168, pp. 122–136. Springer, Berlin (1979)
Rockafellar, R.T., Wets, R.J.B.: Variational Analysis. Springer, Berlin (1998)
Book MATH Google Scholar
Seeger, A.: Eigenvalue analysis of equilibrium processes defined by linear complementarity conditions. Linear Algebra Appl. 292, 1–14 (1999)
Article MathSciNet MATH Google Scholar
Seeger, A.: Quadratic eigenvalue problems under conic constraints. SIAM J. Matrix Anal. Appl. 32, 700–721 (2011)
Article MathSciNet MATH Google Scholar
Song, Y., Qi, L.: Necessary and sufficient conditions for copositive tensors. Linear Multilinear A 63, 120–131 (2015)
Song, Y., Qi, L.: Eigenvalue analysis of constrained minimization problem for homogeneous polynomial. arXiv preprint arXiv:1302.6085 (2013)
Tisseur, F., Meerbergen, K.: The quadratic eigenvalue problem. SIAM Rev. 43, 235–286 (2001)
Article MathSciNet MATH Google Scholar
Walkup, D.W., Wets, R.J.B.: Continuity of some convex-cone-valued mappings. Proc. Am. Math. Soc. 18, 229–235 (1967)
Article MathSciNet MATH Google Scholar
Yang, Y., Yang, Q.: Further results for Perron-Frobenius theorem for nonnegative tensors. SIAM J. Matrix Anal. Appl. 31, 2517–2530 (2010)
Article MathSciNet MATH Google Scholar

Download references

Acknowledgments

The authors would like to thank the two anonymous referees for their careful reading and valuable comments, which significantly improved the presentation of this paper. The first two authors would like to thank Professor Deren Han for his comments on the numerical algorithm. This research of the first two authors was supported in part by National Natural Science Foundation of China (11171083, 11301123, 11571087) and the Zhejiang Provincial NSF (LZ14A010003). The third author was supported by the Hong Kong Research Grant Council (Grant Nos. PolyU 502111, 501212, 501913, and 15302114).

Author information

Authors and Affiliations

Department of Mathematics, School of Science, Hangzhou Dianzi University, Hangzhou, 310018, China
Chen Ling & Hongjin He
Department of Applied Mathematics, The Hong Kong Polytechnic University, Hung Hom, Kowloon, Hong Kong
Liqun Qi

Authors

Chen Ling
View author publications
You can also search for this author in PubMed Google Scholar
Hongjin He
View author publications
You can also search for this author in PubMed Google Scholar
Liqun Qi
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hongjin He.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ling, C., He, H. & Qi, L. Higher-degree eigenvalue complementarity problems for tensors. Comput Optim Appl 64, 149–176 (2016). https://doi.org/10.1007/s10589-015-9805-x

Download citation

Received: 14 July 2015
Published: 03 November 2015
Issue Date: May 2016
DOI: https://doi.org/10.1007/s10589-015-9805-x

Keywords

Mathematics Subject Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Higher-degree eigenvalue complementarity problems for tensors

Abstract

Similar content being viewed by others

On the cone eigenvalue complementarity problem for higher-order tensors

Tensor Z-eigenvalue complementarity problems

A Class of Second-Order Cone Eigenvalue Complementarity Problems for Higher-Order Tensors

1 Introduction

Notation

2 Some basic properties of K-spectrum

Definition 2.1

Definition 2.2

Definition 2.3

Proposition 2.1

Proof

Corollary 2.1

Proposition 2.2

Proof

Proposition 2.3

Theorem 2.1

Proof

Theorem 2.2

Proof

3 Optimization formulation of THDEiCP

Theorem 3.1

Proof

Theorem 3.2

Proof

Theorem 3.3

Proof

4 Existence of solutions for THDEiCP

Theorem 4.1

Proof

Theorem 4.2

Proof

Theorem 4.3

Proof

Corollary 4.1

5 Numerical algorithm and experiments

5.1 The algorithm

5.2 Numerical experiments

Example 5.1

Example 5.2

Example 5.3

6 Conclusions

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation