Abstract
For a \(\mathcal {C}^2\)-smooth function on a finite-dimensional space, a necessary condition for its quasiconvexity is the positive semidefiniteness of its Hessian matrix on the subspace orthogonal to its gradient, whereas a sufficient condition for its strict pseudoconvexity is the positive definiteness of its Hessian matrix on the subspace orthogonal to its gradient. Our aim in this paper is to extend those conditions for \(\mathcal {C}^{1,1}\)-smooth functions by using the Fréchet and Mordukhovich second-order subdifferentials.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Since the notion of convexity does not satisfy a variety of mathematical models used in sciences, economics, and engineering, various generalizations of convex functions have been introduced in literature [7, 11, 18] such as (strictly) quasiconvex and (strictly) pseudoconvex functions. Those functions share many nice properties of convex functions and cover some models which are effective and adaptable to real-world situations. To be more specific, the quasiconvexity of a function ensures the convexity of its sub-level sets, and the pseudoconvexity implies that its critical points are minimizers.
First-order characterizations for quasiconvexity and pseudoconvexity can be found in [4, 7, 10, 11] for smooth functions and [1, 2, 5, 14,15,16, 22, 23, 25] for nonsmooth ones. The well-known second-order necessary condition for the quasiconvexity of \(\mathcal {C}^2\)-smooth functions (see for instance [3, 7, 9, 11]) states that the Hessian matrix of a quasiconvex function is positive-semidefinite on the subspace orthogonal to its gradient. Furthermore, if the Hessian matrix of a \(\mathcal {C}^2\)-smooth function is positive definite on the subspace orthogonal to its gradient then the given function is strictly pseudoconvex [11]. Using some kinds of generalized second-order derivatives, many authors established some second-order criteria for the quasiconvexity and pseudoconvexity of functions without the \(\mathcal {C}^2\)-smooth property. By employing Taylor’s formula and an estimation formula of generalized Hessian, Luc [17] set up the necessary and sufficient conditions for the quasiconvexity of \(\mathcal {C}^{1,1}\)-smooth functions. In [12] Ginchev and Ivanov introduced the concept of second-order upper Dini-directional derivatives and utilized it to characterize the pseudoconvexity of radially upper semicontinuous functions. By using the theory of viscosity solutions of partial differential equations, Barron, Goebel, and Jensen [6] obtained some necessary conditions and sufficient ones for the quasiconvexity of upper semicontinuous functions.
It can be seen that the Fréchet and the Mordukhovich second-order subdifferentials play a crucial role in variational analysis [19, 20, 24]. Recently, Nadi and Zafarani [21] established some characterizations of the quasimonotone and pseudomonotone of set-valued mappings in terms of their Fréchet coderivatives. Using the relationship between generalized monotone mappings and generalized convex functions, they presented some second-order characterizations of quasiconvex [21, Corollary 3.16] and pseudoconvex functions [21, Corollary 3.20] via their Fréchet second-order subdifferentials. In this paper, utilizing Fréchet and Mordukhovich second-order subdifferentials, we establish directly some necessary and sufficient conditions for quasiconvexity and pseudoconvexity of \(\mathcal {C}^{1,1}\)-smooth functions without using characterizations of generalized monotone mappings. For the necessity, we prove that the Fréchet second-order subdifferential of a pseudoconvex function is positive semidefinite on the subspace orthogonal to its gradient while the Mordukhovich second-order subdifferential of a quasiconvex function is only positive semidefinite along its some selection. It is noted that although the latter can be implied from [21, Corollary 3.16], we give another simpler proof of this result via the mean value inequality and some facts of quasiconvex functions. For the sufficiency, we propose two conditions guaranteeing the strict pseudoconvexity. The first one is the positive definiteness of the Mordukhovich second-order subdifferential of a given function on the subspace orthogonal to its gradient. The second one claims that the Fréchet second-order subdifferential of a given function has some selection which is positive on the subspace orthogonal to its gradient. Moreover, a second-order sufficient condition for the strict quasiconvexity is also established by using Fréchet second-order subdifferentials. Throughout the paper, we proposed a variety of examples to illustrate and analyze the obtained results.
The paper is organized as follows. Some background material from variational analysis and generalized convexity are recalled in Sect. 2. Section 3 presents some second-order conditions for quasiconvexity and pseudoconvexity of \(\mathcal {C}^{1,1}\)-smooth functions. Sufficient conditions are given in Sect. 4. Conclusions and further investigations are discussed in the last section.
2 Preliminaries
To begin with, some necessary notions from [20] will be recalled. Let F be a set-valued mapping between Euclidean spaces \(\mathbb {R}^n\) and \(\mathbb {R}^m\). As usual, the effective domain and the graph of F are given, respectively, by
The sequential Painlevé–Kuratowski outer limit of F as \(x\rightarrow \bar{x}\) is defined as
Let us consider an extended-real-valued function \(\varphi :\mathbb {R}^n\rightarrow \overline{\mathbb {R}}:=(-\infty ,\infty ]\). We always assume that \(\varphi \) is proper and lower semicontinuous. The Fréchet subdifferential of \(\varphi \) at \(\bar{x}\in \text {dom}\varphi :=\{x\in \mathbb {R}^n:\varphi (x)<\infty \}\) (known as the presubdifferential and as the regular subdifferential) is
Then the limiting subdifferential of \(\varphi \) at \(\bar{x}\) (known also the general or basic subdifferential) is defined via the outer limit (1)
where \(x \overset{\varphi }{\rightarrow } \bar{x}\) signifies that \(x\rightarrow \bar{x}\) with \(\varphi (x)\rightarrow \varphi (\bar{x})\). Observe that both Fréchet and limiting subdifferentials reduce to the classical Fréchet derivative for continuously differentiable functions.
Given a set \(\Omega \subset \mathbb {R}^n\) with its indicator function \(\delta _\Omega (x)\) equal to 0 for \(x\in \Omega \) and to \(\infty \) otherwise, the Fréchet and the Mordukhovich normal cones to \(\Omega \) at \(\bar{x}\in \Omega \) are defined, respectively, via the corresponding subdifferentials (2) and (3) by
The Fréchet and Mordukhovich coderivatives of F at \((\bar{x},\bar{y})\in \text {gph}F\) are defined, respectively, via corresponding normal cones (4) by
We omit \(\bar{y} = f(\bar{x})\) in the above coderivative notions if \(F:= f:\mathbb {R}^n \rightarrow \mathbb {R}^m\) is single-valued.
Definition 2.1
Let \(\varphi : \mathbb {R}^n \rightarrow \overline{\mathbb {R}}\) be a function with a finite value at \(\bar{x}\).
-
(i)
For any \(\bar{y} \in \partial \varphi (\bar{x})\), the map \(\partial ^2 \varphi (\bar{x},\bar{y}): \mathbb {R}^n \rightrightarrows \mathbb {R}^n\) with the values
$$\begin{aligned} \partial ^2 \varphi (\bar{x},\bar{y})(u)= (D^*\partial \varphi )(\bar{x},\bar{y})(u) \quad (u \in \mathbb {R}^n) \end{aligned}$$is said to be the Mordukhovich second-order subdifferential of \(\varphi \) at \(\bar{x}\) relative to \(\bar{y}\).
-
(ii)
For any \(\bar{y} \in \widehat{\partial } \varphi (\bar{x})\), the map \(\widehat{\partial }^2 \varphi (\bar{x},\bar{y}): \mathbb {R}^n \rightrightarrows \mathbb {R}^n\) with the values
$$\begin{aligned} \widehat{\partial }^2 \varphi (\bar{x},\bar{y})(u)= (\widehat{D}^*\widehat{\partial }\varphi )(\bar{x},\bar{y})(u) \quad (u \in \mathbb {R}^n) \end{aligned}$$is said to be the Fréchet second-order subdifferential of \(\varphi \) at \(\bar{x}\) relative to \(\bar{y}\). We omit \(\bar{y} = \nabla \varphi (\bar{x})\) in the above second-order subdifferentials if \(\varphi \in \mathcal {C}^1\) around \(\bar{x}\), i.e., continuously Fréchet differentiable in a neighborhood of \(\bar{x}\).
In general, the Fréchet second-order subdifferential and the Mordukhovich one are incomparable. However, if \(\varphi \in \mathcal {C}^1\) around \(\bar{x}\), then
If \(\varphi \in \mathcal {C}^{1,1}\) around \(\bar{x}\), i.e., Fréchet differentiable around \(\bar{x}\) with the gradient \(\nabla \varphi \) being locally Lipschitzian around \(\bar{x}\) then the calculation of second-order subdifferentials can be essentially simplified due to the following scalarization formulas (see [19, Proposition 3.5] and [20, Proposition 1.120])
In this case, Mordukhovich second-order subdifferentials are nonempty [20, Corollary 2.25] while Fréchet ones may be empty. If \(\varphi \in \mathcal {C}^2\) around \(\bar{x}\), i.e., \(\varphi \) is twice continuously Fréchet differentiable in a neighborhood of \(\bar{x}\), then
Let us recall some well-known notions of generalized convexity.
Definition 2.2
-
(a)
A function \(\varphi : \mathbb {R}^n \rightarrow \mathbb {R}\) is said to be quasiconvex if
$$\begin{aligned} \varphi ((1-\lambda )x + \lambda y) \le \max \{\varphi (x),\varphi (y)\} \end{aligned}$$for every \(x, y \in \mathbb {R}^n\) and for every \(\lambda \in [0,1]\).
-
(b)
A function \(\varphi : \mathbb {R}^n \rightarrow \mathbb {R}\) is said to be strictly quasiconvex if
$$\begin{aligned} \varphi ((1-\lambda )x + \lambda y) < \max \{\varphi (x),\varphi (y)\} \end{aligned}$$for every \(x, y \in \mathbb {R}^n, x\ne y\) and for every \(\lambda \in (0,1)\).
-
(c)
A differentiable function \(\varphi : \mathbb {R}^n \rightarrow \mathbb {R}\) is called pseudoconvex if
$$\begin{aligned} x,y \in \mathbb {R}^n, \varphi (x)>\varphi (y) \Longrightarrow \langle \nabla \varphi (x), y - x \rangle <0. \end{aligned}$$ -
(d)
A differentiable function \(\varphi : \mathbb {R}^n \rightarrow \mathbb {R}\) is called strictly pseudoconvex if
$$\begin{aligned} x,y \in \mathbb {R}^n,\; x\ne y,\; \varphi (x)\ge \varphi (y) \Longrightarrow \langle \nabla \varphi (x), y - x \rangle <0. \end{aligned}$$
It follows, immediately, from the given definitions, that a strictly quasiconvex (pseudoconvex) function is quasiconvex (pseudoconvex). For differentiable functions, (strict) pseudoconvexity implies (strict) quasiconvexity. The next theorem shall point out that within the class of (strictly) quasiconvex functions, (strict) pseudoconvexity may be specified by means of its behaviour at critical points.
Theorem 2.1
[7, Theorem 3.2.9] Let \(\varphi :\mathbb {R}^n\rightarrow \mathbb {R}\) be a continuously differentiable function. Then, \(\varphi \) is (strictly) pseudoconvex if and only if the following conditions hold:
-
(i)
\(\varphi \) is quasiconvex;
-
(ii)
If \(\nabla \varphi (x)=0\) then x is a (strict) local minimizer for \(\varphi \).
Finally, we consider a lemma which will be used in the sequel.
Lemma 2.1
Let \(\varphi :\mathbb {R}^n\rightarrow \mathbb {R}\) be a differentiable function. If \(\varphi \) is not strictly quasiconvex, then there exist \(x_1, x_2\in \mathbb {R}^n, x_1\ne x_2\) and \(t_0\in (0,1)\) such that \(\langle \nabla \varphi (x_1+t_0(x_2-x_1)),x_2-x_1\rangle =0\) and
Proof
Since \(\varphi \) is not strictly quasiconvex, there exist \(x_1,x_2\in \mathbb {R}^n, \alpha \in (0,1)\) such that \(x_1\ne x_2\) and
Consider the function \(f:\mathbb {R}\rightarrow \mathbb {R}\) given by
Then, thanks to the Weierstrass theorem and (9), we can find a number \(t_0\in (0,1)\) for which the function f admits a maximum on the interval [0, 1] at \(t_{0}\). Hence, (8) is satisfied and by the Fermat rule we have
\(\square \)
3 Necessary conditions
Let us recall the well-known second-order necessary condition for quasiconvexity of \(\mathcal {C}^2\)-smooth functions.
Theorem 3.1
(see [3, Lemma 6.2] or [7, Theorem 3.4.2]) Let \(\varphi :\mathbb {R}^n \rightarrow \mathbb {R}\) be a \(\mathcal {C}^2\)-smooth function. If \(\varphi \) is quasiconvex, then
By using the mean value inequality in terms of limiting subdifferential for Lipschitzian functions [20, Corollary 3.51 ] we extend the above result to \(\mathcal {C}^{1,1}\)-smooth functions.
Proposition 3.1
Let \(\varphi :\mathbb {R}^n\rightarrow \mathbb {R}\) be a Lipschitz continuous function on an open set containing [a, b]. Then one has
Theorem 3.2
Let \(\varphi :\mathbb {R}^n\rightarrow \mathbb {R}\) be a \(\mathcal {C}^{1,1}\)-smooth function. If \(\varphi \) is quasiconvex then
Proof
Let \(x,u\in \mathbb {R}^n\) be such that \(\langle \nabla \varphi (x),u\rangle =0\). If \(u=0\) then \(\langle z,u\rangle = 0\) for all \(z\in {\partial }^2 \varphi (x)(u)\). Otherwise, consider the function \(f:\mathbb {R}^n\rightarrow \mathbb {R}\) given by
Then, \(f(x)=0\) and f is locally Lipschitz continuous on \(\mathbb {R}^n\) by the \(\mathcal {C}^{1,1}\)-smoothness of \(\varphi \). Moreover, \(\partial f\) is locally bounded (see [20, Corollary 1.81] or [24, Theorem 9.13]), robust [24, Proposition 8.7] on \(\mathbb {R}^n\) and for every \(y\in \mathbb {R}^n\)
For the sequences \(x_k:=x+(1/k)u,\; x^\prime _k:=x-(1/k)u\; (k\in \mathbb {N})\), one has \(x_k\rightarrow x, x^\prime _k\rightarrow x\) and, in view of Proposition 3.1, there exist \(\theta _k \in [0,1/k)\), \(\theta ^\prime _k\in (0,1/k]\) and \(z_k\in \partial f(x+\theta _ku), z^\prime _k\in \partial f(x-\theta _k^\prime u)\) such that
By the quasiconvexity of \(\varphi \), it follows from [11, Proposition 1] that
Therefore, \(\max \{\langle z_k,u\rangle ,\langle z^\prime _k,u\rangle \}\ge 0\) for all \(k\in \mathbb {N}\). Since \(\partial f\) is locally bounded around x, the sequences \((z_k), (z^\prime _k)\) are bounded. Without loss of generality, we can assume that \(z_k\rightarrow z\) and \(z^\prime _k\rightarrow z^\prime \). It follows that \(\max \{\langle z,u\rangle ,\langle z^\prime ,u\rangle \}\ge 0\) and by the robustness of \(\partial f\) we have \(z,z^\prime \in \partial f(x)=\partial ^2\varphi (x)(u)\). The proof is complete.\(\square \)
Remark 3.1
Theorem 3.2 can be deduced directly from [21, Corollary 3.16] when \(\varphi \) is a \(\mathcal {C}^{1,1}\)-smooth function. The mean value inequality allows us to give a simpler proof for this result.
The following example shows that the inequality in (11) may not be true for all points belonging to the second-order Mordukhovich subdifferentials even for pseudoconvex functions.
Example 3.1
[13, Remark 3.1] Let \(\varphi : \mathbb {R} \rightarrow \mathbb {R}\) be defined by
where
Observe that \(\varphi \) is a pseudoconvex \(\mathcal {C}^{1,1}\)-smooth function. Indeed, for every \(x\in \mathbb {R}\), we have
Hence, \(\nabla \varphi \) is locally Lipschitz and so it is \(\mathcal {C}^{1,1}\)-smooth. Moreover, \(\nabla \varphi (x) = 0\) if and only if \(x = 0\) and 0 is a local minimum of \(\varphi \). It follows from [7, Theorem 3.2.7] that \(\varphi \) is a pseudoconvex function. Clearly, one has \(\partial ^2 \varphi (0)(u) = [-|u|,|u|]\) for each \(u \in \mathbb {R}\). Thus, for all \(u\ne 0\), there exists \(z^* \in \partial ^2 \varphi (0)(u) \) such that \(\langle z^*, u \rangle < 0\).
Although the pseudoconvexity does not imply the positive semidefiniteness of the second-order Mordukhovich subdifferential, it guarantees the positive semidefiniteness of the second-order Fréchet subdifferential.
Theorem 3.3
Let \(\varphi :\mathbb {R}^n\rightarrow \mathbb {R}\) be a \(\mathcal {C}^{1,1}\)-smooth function. If \(\varphi \) is pseudoconvex then
Proof
Suppose to the contrary that there exist \(x,u\in \mathbb {R}^n\) and \(z\in \widehat{\partial }^2 \varphi (x)(u)\) such that \(\langle \nabla \varphi (x), u \rangle = 0\) and \(\langle z,u\rangle <0\). By (6), we have \(z\in \widehat{\partial }\langle u,\nabla \varphi \rangle (x)\) and so
For the sequence \(x_k:=x-(1/k)u\; (k\in \mathbb {N})\), one has \(x_k\rightarrow x\) and
The pseudoconvexity of \(\varphi \) implies that \(\varphi (x_k)\ge \varphi (x)\) and by the classical mean value theorem there exists \(\theta _k\in (0,1/k)\) such that
For the sequence \(y_k:=x-\theta _ku\; (k\in \mathbb {N})\), one has \(y_k\rightarrow x\) and \(\langle \nabla \varphi (y_k),u\rangle \le 0\). Therefore, by (13) we have
which is a contradiction to \(\langle z,u\rangle <0\).\(\square \)
The next example shows that (12) is violated if the pseudoconvexity is relaxed to quasiconvexity.
Example 3.2
Let \(\varphi : \mathbb {R} \rightarrow \mathbb {R}\) be given by
Observe that \(\varphi \) is a quasiconvex \(\mathcal {C}^{1,1}\)-smooth function and \(\nabla \varphi (x) = |x|\) for every \(x \in \mathbb {R}\). It is clear that
Note that the Fréchet coderivative of \(\nabla \varphi \) in this case is given by
Observe that for \(z=-1, u=1\), we have \(z \in \widehat{\partial }^2 \varphi (0)(u)\) and \(\langle z,u \rangle < 0\).
4 Sufficient conditions
A second-order sufficient condition for the strict pseudoconvexity in the \(\mathcal {C}^2\)-smooth case is recalled in the following theorem.
Theorem 4.1
[11, Proposition 4] Let \(\varphi : \mathbb {R}^n \rightarrow \mathbb {R}\) be a \(\mathcal {C}^2\)-smooth function satisfying
Then, \(\varphi \) is a strictly pseudoconvex function.
Our aim in this section is to establish some similar versions of Theorem 4.1 in the \(\mathcal {C}^{1,1}\)-smooth case by using the Fréchet and Mordukhovich second-order subdifferentials. The first version is the replacement of the Hessian matrices in (14) by the Mordukhovich second-order subdifferentials. Our proof is based on Theorem 2.1 and the following sufficient optimality condition for \(\mathcal {C}^{1,1}\)-smooth functions.
Proposition 4.1
[8, Corollary 4.8] Suppose that \(\varphi :\mathbb {R}^n\rightarrow \mathbb {R}\) is a \(\mathcal {C}^{1,1}\)-smooth function and \(x\in \mathbb {R}^n\). If \(\nabla \varphi (x)=0\) and
then x is a strict local minimizer of \(\varphi \).
Theorem 4.2
Let \(\varphi :\mathbb {R}^n\rightarrow \mathbb {R}\) be a \(\mathcal {C}^{1,1}\)-smooth function satisfying
Then \(\varphi \) is a strictly pseudoconvex function.
Proof
Observe that if \(\nabla \varphi (x)=0\), then (15) implies the positive semidefiniteness of \(\partial ^2\varphi (x)\) and so, by Proposition 4.1, x is a strict local minimizer of \(\varphi \). Hence, it follows from Theorem 2.1 that \(\varphi \) is strictly pseudoconvex if and only if \(\varphi \) is quasiconvex.
Assume that \(\varphi \) is not quasiconvex. Then, by Lemma 2.1, there exist \(x_1,x_2\in \mathbb {R}^n, x_1\ne x_2\) and \(t_0\in (0,1)\) such that \(\langle \nabla \varphi (x_1+t_0(x_2-x_1)),x_2-x_1\rangle =0\) and (8) is satisfied. Let \(\bar{x}:=x_1+t_0(x_2-x_1)\) and \(u:=x_2-x_1\). It follows that \(u\ne 0\) and \(\langle \nabla \varphi (\bar{x}),u\rangle =0\) and so, by (15),
For the sequence \(x_k:=\bar{x}+(1/k)u\) (\(k\in \mathbb {N}\)) we have \(x_k\rightarrow \bar{x}\). For sufficiently large k, we have \(t_0+1/k\in (0,1)\) and so \(\varphi (x_k)\le \varphi (\bar{x})\) by (8). Applying the classical mean value theorem, for sufficiently large k, there exists \(\theta _k\in (0, 1/k)\) such that
Consider the function \(\phi :\mathbb {R}^n\rightarrow \mathbb {R}\) given by
Applying Proposition 3.1, for every k, there exist \(\gamma _k\in (0,\theta _k]\) and \(z_k\in \partial \phi (\bar{x}+\gamma _ku)\) such that
Combining the above inequality with (17) we have \(\langle z_k,u\rangle \le 0\) for sufficiently large k. Since \(\partial \phi \) is locally bounded at \(\bar{x}\), the sequence \((z_k)\) is bounded. Without loss of generality, we can assume that \(z_k\rightarrow z\). It follows that \(\langle z,u\rangle \le 0\) and by the robustness of \(\partial \phi \) we have \(z\in \partial \phi (\bar{x})=\partial ^2\varphi (\bar{x})(u)\). This is a contradiction to (16). The proof is complete.\(\square \)
We consider two examples to analyze (15). The first one shows that (15) cannot be relaxed to the following condition
Moreover, (18) is not sufficient for the quasiconvexity of \(\varphi \).
Example 4.1
Let \(\varphi : \mathbb {R} \rightarrow \mathbb {R}\) be the function given by
where
Observe that \(\varphi \) is a \(\mathcal {C}^{1,1}\)-smooth function and \(\nabla \varphi (x) = \phi (x)\) for every \(x \in \mathbb {R}\). Moreover, we have \(\partial ^2 \varphi (0)(u) = \left[ -|u|,|u|\right] \) for all \(u \in \mathbb {R}\). Let \(x \in \mathbb {R}\), \(u \in \mathbb {R} \setminus \{0\}\) be such that \(\langle \nabla \varphi (x), u \rangle = 0\). It follows that \(\nabla \varphi (x) = 0\), or equivalently \(x = 0\). For \(z^* = u \in \partial ^2 \varphi (0)(u) \), we have \(\langle z^*, u \rangle = |u|^2 > 0\). The condition (18) holds for \(\varphi \). However, \(\varphi \) is not quasiconvex. Indeed, for \(x =\displaystyle \frac{1}{\pi }, y =-\frac{1}{\pi } \), we have
By [11, Proposition 1], \(\varphi \) is not quasiconvex.
The second example points out that we cannot replace the Mordukhovich second-order subdifferential in (15) by the Fréchet second-order one since it may be empty.
Example 4.2
Let \(\varphi : \mathbb {R} \rightarrow \mathbb {R}\) be the function given by
where
is a locally Lipschitz function. Hence, \(\varphi \) is \(\mathcal {C}^{1,1}\)-smooth and \(\nabla \varphi (x)=\phi (x)\) for every \(x\in \mathbb {R}\). Let \(x, u \in \mathbb {R}\), \(u \ne 0\) such that \(\langle \nabla \varphi (x),u\rangle =0\). Then, \(\nabla \varphi (x) = 0\) and so \(x = 0\). We have \(\widehat{\partial }^2 \varphi (0)(u) = \emptyset \). Thus, the below condition holds
However, \(\varphi \) is not a pseudoconvex function. Indeed, for \(x = 0, y = 1\), we have
By [11, Proposition 2], \(\varphi \) is not pseudoconvex.
When the Fréchet second-order subdifferential is nonempty, we can use it to characterize the strict quasiconvexity and strict pseudoconvexity of \(\mathcal {C}^{1,1}\)-smooth functions.
Theorem 4.3
Let \(\varphi :\mathbb {R}^n\rightarrow \mathbb {R}\) be a \(\mathcal {C}^{1,1}\)-smooth function satisfying
Then \(\varphi \) is a strictly quasiconvex function.
Proof
Assume that \(\varphi \) is not strictly quasiconvex. Then, by Lemma 2.1, there exist \(x_1,x_2\in \mathbb {R}^n\) with \(x_1\ne x_2\) and \(t_0\in (0,1)\) such that \(\langle \nabla \varphi (x_1+t_0(x_2-x_1)),x_2-x_1\rangle =0\) and (8) is satisfied. Let \(x:=x_1+t_0(x_2-x_1)\) and \(u:=x_2-x_1\). It follows that \(u\ne 0\) and \(\langle \nabla \varphi (x),u\rangle =0\) and so, by (19), there exists \(z\in \widehat{\partial }^2\varphi (x)(u) \cup -\widehat{\partial }^2\varphi (x)(-u)\) such that \(\langle z,u\rangle >0\). Since
it must happen one of the following cases.
Case 1: \(z\in \widehat{\partial }\langle u,\nabla \varphi \rangle (x)\). Since \(\langle \nabla \varphi (x),u\rangle =0\), we have
For the sequence \(x_k:=x+(1/k)u\) (\(k\in \mathbb {N}\)) we have \(x_k\rightarrow x\). For sufficiently large k, we have \(t_0+1/k\in (0,1)\) and so \(\varphi (x_k)\le \varphi (x)\) by (8). Applying the classical mean value theorem, for sufficiently large k, there exists \(\theta _k\in (0, 1/k)\) such that
For the sequence \(y_k:=x+\theta _ku\; (k\in \mathbb {N})\) we have \(y_k\rightarrow x\) and \(\langle \nabla \varphi (y_k),u\rangle \le 0\) by (21) for every \(k\in \mathbb {N}\). It follows from (20) that
which is a contradiction to \(\langle z,u\rangle > 0\).
Case 2. \(z\in -\widehat{\partial }\langle -u,\nabla \varphi \rangle (x)\). Repeating the proof of Case 1. with u, z being replaced by \(-u,-z\) we also get a contradiction.\(\square \)
Remark 4.1
Observe that the strict quasiconvexity in Theorem 4.3 cannot be improved to strict pseudoconvexity. Indeed, let \(\varphi \) be the function given in Example 3.2. We have
for all \(u \in \mathbb {R}\). Observe that if \(x \in \mathbb {R}\), \(u\in \mathbb {R}\setminus \{0\}\) are such that \(\langle \nabla \varphi (x), u \rangle = 0\) then \(x = 0\). Hence, with \(z:= u \in \widehat{\partial }^2\varphi (0)(u)\cup -\widehat{\partial }^2\varphi (0)(-u)\) we have \(\langle z, u \rangle = |u|^2 > 0\) and so (19) holds while \(\varphi \) is not strictly pseudoconvex.
We now improve (19) to get another characterization for the strict pseudoconvexity.
Theorem 4.4
Let \(\varphi : \mathbb {R}^n \rightarrow \mathbb {R}\) be a \(\mathcal {C}^{1,1}\)-smooth function satisfying
Then \(\varphi \) is a strictly pseudoconvex function.
Proof
By Theorem 4.3, \(\varphi \) is strictly quasiconvex. We will use Theorem 2.1 to prove the strict pseudoconvexity of \(\varphi \). Let \(x \in \mathbb {R}^n\) such that \(\nabla \varphi (x)=0\). It follows from (6) and (22) that
for every \(u\in \mathbb {R}^n\setminus \{0\}\). By [20, Proposition 1.87], the scalar function \(\langle u, \nabla \varphi \rangle \) is differentiable at x for every \(u\in \mathbb {R}^n\setminus \{0\}\). Hence, \(\varphi \) is twice differentiable at x and
for every \(u\in \mathbb {R}^n\setminus \{0\}\). Again, by (22), its Hessian \(\nabla ^2\varphi (x)\) is positive definite. Moreover, by [24, Theorem 13.2], the Hessian matrix \(\nabla ^2\varphi (x)\) also furnishes a quadratic expansion for \(\varphi \) at x. Therefore, since \(\nabla ^2\varphi (x)\) is positive definite and \(\nabla \varphi (x)=0\), it yields that x is a strict local minimizer of \(\varphi \). By Theorem 2.1, \(\varphi \) is strictly pseudoconvex. \(\square \)
Remark 4.2
According to the proof of Theorem 4.4, the condition (22) also implies that \(\varphi \) is twice differentiable at its critical points.
In the two next examples, we will show that (22) and (15) are incomparable.
Example 4.3
Let \(\varphi : \mathbb {R} \rightarrow \mathbb {\mathbb {R}}\) be the function defined by
Then, \(\varphi \) is \(\mathcal {C}^{1,1}\)-smooth and
Let \(x, u \in \mathbb {R}\), \(u \ne 0\) such that \(\langle \nabla \varphi (x),u \rangle =0\). Then, \(\nabla \varphi (x) =0\) and so \(x = 0\). Clearly,
Hence, (15) holds while (22) is not satisfied.
Example 4.4
Let \(\varphi : \mathbb {R} \rightarrow \mathbb {R}\) be the function defined by
where \(\phi : \mathbb {R} \rightarrow \mathbb {R}\) is given by
Since \(\phi \) is locally Lipschitz, \(\varphi \) is \(\mathcal {C}^{1,1}\)-smooth and \(\nabla \varphi (x)=\phi (x)\) for every \(x\in \mathbb {R}\). Moreover, \(\varphi \) is twice differentiable everywhere except the points \(\displaystyle \frac{1}{\pi }\) and \(\displaystyle -\frac{1}{\pi }\). Let \(x, u \in \mathbb {R}^n\), \(u \ne 0\) such that \(\langle \nabla \varphi (x),u \rangle =0\). Then \(\nabla \varphi (x)=0\). We have
when \(-\frac{1}{\pi }<x < \frac{1}{\pi }\). Hence,
Therefore, \(\nabla \varphi (x) = 0\) if and only if \(x = 0\). Clearly,
5 Conclusions and further investigations
Several second-order necessary and sufficient conditions for the (strict) quasiconvexity and the (strict) pseudoconvexity of \(\mathcal {C}^{1,1}\)-smooth functions have been established on finite-dimensional Euclidean spaces. We also propose many examples to analyze and illustrate our results. Further investigations are needed to solve the following questions:
-
1.
How to extend our results to wider classes of smooth and non-smooth functions on infinite-dimensional Hilbert or even Banach spaces?
-
2.
How to apply our results to construct second-order necessary and sufficient conditions for nonlinear programming problems with non-convex and \(\mathcal {C}^{1,1}\)-smooth data?
References
Aussel, D., Corvellec, J.N., Lassonde, M.: Subdifferential characterization of quasiconvexity and convexity. J. Convex Anal. 1, 195–201 (1994)
Aussel, D.: Subdifferential properties of quasiconvex and pseudoconvex functions: unified approach. J. Optim. Theory Appl. 97, 29–45 (1998)
Avriel, M.: r-Convex functions. Math. Program. 2, 309–323 (1972)
Avriel, M., Diewert, W.E., Schaible, S., Zang, I.: Generalized Concavity, Mathematical Concepts and Methods in Science and Engineering, vol. 36. Plenum Press, New York (1988)
Barron, E.N., Goebel, R., Jensen, R.R.: The quasiconvex envelope through first-order partial differential equations which characterize quasiconvexity of nonsmooth functions. Discrete Contin. Dyn. Syst. Ser. B 17, 1693–1706 (2012)
Barron, E.N., Goebel, R., Jensen, R.R.: Quasiconvex functions and nonlinear PDEs. Trans. Am. Math. Soc. 365, 4229–4255 (2013)
Cambina, A., Martein, L.: Generalized Convexity and Optimization. Theory and Applications. Springer, Berlin (2009)
Chieu, N.H., Lee, G.M., Yen, N.D.: Second-order subdifferentials and optimality conditions for \(C^1\)-smooth optimization problems. Appl. Anal. Optim. 1, 461–476 (2017)
Crouzeix, J.-P.: On second order conditions for quasiconvexity. Math. Program. 18, 349–352 (1980)
Crouzeix, J.P., Ferland, J.A.: Criteria for quasiconvexity and pseudoconvexity: relationships and comparisons. Math. Program. 23, 193–205 (1982)
Crouzeix J.-P.: Characterizations of generalized convexity and monotonicity, a survey. In: Generalized Convexity, Generalized Monotonicity, pp. 237 – 256 (1998)
Ginchev, I., Ivanov, V.I.: Second-order characterizations of convex and pseudoconvex functions. J. Appl. Anal. 9, 261–273 (2003)
Hiriart-Urruty, J.B., Strodiot, J.J., Hien, N.V.: Generalized Hessian matrix and second order optimality conditions for problems with \(\cal{C}^{1,1}\) data. Appl. Math. Optim. 11, 43–56 (1984)
Hoa, B.T., Khanh, P.D., Trinh, T.T.T.: Characterizations of nonsmooth robustly quasiconvex functions. J. Optim. Theory Appl. 180, 775–786 (2019)
Ivanov, V.I.: Characterizations of pseudoconvex functions and semistrictly quasiconvex ones. J. Global Optim. 57, 677–693 (2013)
Luc, D.T.: Characterizations of quasiconvex function. Bull. Aust. Math. Soc. 48, 393–405 (1993)
Luc, D.T.: Taylor’s formula for \(C^{k,1}\) functions. SIAM J. Optim. 5, 659–669 (1995)
Mangasarian, O.L.: Nonlinear Programming. Classics in Applied Mathematics. SIAM, Philadelphia (1994)
Mordukhovich, B.S., Nam, N.M., Yen, N.D.: Fréchet subdifferential calculus and optimality conditions in nondifferentiable programming. Optimization 55, 685–708 (2006)
Mordukhovich, B.S.: Variational analysis and generalized differentiation. In: Basic Theory, vol. I. Applications, vol. II. Springer, Berlin (2006)
Nadi, M.T., Zafarani, J.: Characterizations of quasiconvex and pseudoconvex functions by their second-order regular subdifferentials. J. Aust. Math. Soc. (2019). https://doi.org/10.1017/S1446788719000090
Soleimani-damaneh, M.: Characterization of nonsmooth quasiconvex and pseudoconvex functions. J. Math. Anal. Appl. 330, 1387–1392 (2007)
Penot, J.P., Quang, P.H.: On generalized convexity of functions and generalized monotonicity of set-valued maps. J. Optim. Theory Appl. 92, 343–356 (1997)
Rockafellar, R.T., Wets, R.J.-B.: Variational Analysis. Springer, Berlin (1998)
Yang, X.Q.: Continuous generalized convex functions and their characterizations. Optimization 54, 495–506 (2005)
Acknowledgements
The authors are grateful to the editors and two anonymous referees for constructive comments and suggestions, which greatly improved the paper. Pham Duy Khanh was supported, in part, by the Fondecyt Postdoc Project 3180080, the Basal Program CMM–AFB 170001 from CONICYT–Chile, and the National Foundation for Science and Technology Development (NAFOSTED) under Grant Number 101.01-2017.325.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Khanh, P.D., Phat, V.T. Second-order characterizations of quasiconvexity and pseudoconvexity for differentiable functions with Lipschitzian derivatives. Optim Lett 14, 2413–2427 (2020). https://doi.org/10.1007/s11590-020-01563-6
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11590-020-01563-6