1 Introduction

A multiobjective semi-infinite programming (MOSIP in brief) is an optimization problem where two or more objectives are to be minimized on a set of feasible solutions described by infinitely many inequality constraint functions. Optimality and duality conditions of MOSIP have been studied by many authors: (Guerra-Vazquez and Todorov 2016; Kanzi et al. 2018) in linear case, (Goberna and Kanzi 2017; Goberna et al. 2013) in convex case, (Caristi et al. 2010) in smooth case, and (Kanzi 2015; Kanzi and Nobakhtian 2013) in locally Lipschitz case. Also, Gao presented some sufficient and duality results for MOSIPs under the various generalized convexity assumptions in (Gao 2012, 2013). In almost all of the articles in MOSIP theory, the Fritz–John-type (Karush–Kuhn–Tucker-type) necessary optimality conditions are justified for continuous problems (under Slater constraint qualification); continuous MOSIPs and Slater constraint qualification will be defined in Sect. 3. The first aim of this paper is to replace these conditions by two weaker conditions, named PLV property and weak Slater constraint qualification. Also, in all the articles mentioned above (with the exception of Caristi et al. (2010), sufficient optimality conditions and duality results are presented under convexity, quasiconvexity\(\backslash \)pseudoconvexity, and invexity assumptions for objective and restriction functions. Caristi et al. (2010) took the \((\Phi ,\rho )\)-invexity to state of optimality and duality theorems for MOSIPs with differentiable data. Another aim of this paper is to extension of \((\Phi ,\rho )\)-invexity for nondifferentiable functions and also is to apply this concept for nonsmooth MOSIPs. We should mention that the \((\Phi ,\rho )\)-invexity was generalized for nonsmooth functions in some references (e.g., Antczak 2015, 2012), but our definition and results are different from theirs. Some of the issues that soared in the optimization theory, in the past four decades and suddenly declined, are the topics related to the invex functions, their extensions, and applications. The concept of invexity was originated by Hanson and Mond (1981) but so named by Craven (1981) for differentiable functions. They used this concept to extend the concept of convexity in presenting sufficient optimality conditions and duality results. After Hanson, this concept was considered by many researchers. Invexity was generalized to nonsmooth functions by Phoung et al. (1995), by Reiland (1990), and by Craven (1986). Also, the invexity was used in optimality conditions and for alternative theorems by (Hanson 1999; Brandao et al. 2000), respectively. Some researchers such as Antczak have selected the extension of the concepts of invexity and convexity as their main research areas (see, e.g., Antczak 2002, 2009a, b). These studies extended the previous concepts in a way that sufficient and duality results could remain valid. These concepts are researched in two separate forms, smooth and nonsmooth. The procedure to extend convexity was not limited to invexity. But other concepts such as abstract convex functions by Rubinov (2000), DC functions by Horst and Thoai (1999), and star-shaped functions by Cambini and Martein (2009) were also introduced. Of course, the quasiconvex analysis also plays a crucial role in weakening the convexity condition (Penot 1998, 2000). Since these concepts are not related to this study, they are not delved into. But why was invexity taken out of focus? The authors present some reasons for that: First, it seems that the extensions of invex functions become so abstract in some levels that one can hardly find a model for them in the real world and they can only be used as some artificial samples. Of course, we do not consider them as very serious because the “theory of optimization" is one of the crossing points of pure and applied mathematics and it is natural for it to inherit the abstractness from pure mathematics (specifically from mathematical analysis), while in some issues, it fully functions within the framework of the applied mathematics (e.g., modeling). Another reason seems to be the similarity (and sometimes sameness) of proving the theorems. In fact, in many theorems which are proved under various extensions of invexity, only the underlying assumptions will change and the foundations will remain unchanged and these repetitions will drastically reduce the attraction and novelty of the subject. Repetition of techniques in some articles cannot be ignored, and we think nobody would approve them. Unfortunately, this is a plague which many scientific papers suffer from (including papers related to optimization) and invexity is not the only subject affected by it. But the main reason for lack of interest in researchers on invexity is the publication of article by Zalinescu (2014). In fact, it may be claimed that the heaviest strike on invexity has been caused by this beautiful, exact, and clever paper. Zalinescu (2014) in his article, which is a criticism of several other papers, mentions some basic points.

  • One is that, in the definition of invex functions (according to some references), a logical instrument exists and its consequence is that all functions should be inevitably invex. It is clear that this problem can be solved by a replacement in the assumption of existence of \(\eta (x,y)\). Fortunately, this replacement for \(\eta (x,y)\) in the definition of invex function does not invalidate the proofs of previous theorems and the verdicts are still valid (as long as the authors have studied).

  • Another problem that Zalinescu highlights is the falsity of some of the theorems in the invexity-related articles to which many have referred. It is clear that it is not the fault by invexity and negligence in inference and argument we have caused these problems.

Considering what has been mentioned in above, we still believe that invexity and its extensions could be studied more, and for this purpose, we investigate the concept of \((\Phi ,\rho )\)-invexity which has been introduced by Caristi et al. (2010) and try to present its nonsmooth version. Of course, it should be mentioned that, in this study, if we replace “\((\Phi ,\rho )\)-invex" by “invex," the results will still be original which are the extensions of the existing theorems in other articles and we have added the concept of \((\Phi ,\rho )\)-invexity to these extensions so that our results could be more general. We organize the paper as follows. In the next section, we provide the preliminary results to be used in the rest of the paper. In Sect. 3 (resp. 4), we present some necessary and sufficient optimality conditions and duality results for weak (resp. proper) efficient solutions of nonsmooth MOSIPs.

2 Preliminaries

In this section, we briefly overview some notions of nonsmooth analysis widely used in formulations and proofs of main results of the paper. Nonsmooth analysis refers to differential analysis in the absence of differentiability. It can be regarded as a subfield of that vast subject known as nonlinear analysis. While nonsmooth analysis has classical roots, it is only in the last decades that the subject has grown rapidly. To the point, in fact, that further development has sometimes appeared in danger of being stymied, due to the plethora of definitions and unclearly related theories. For more details, discussion, and applications, see (Clarke 1983; Hiriart-Urruty and Lemarechal 1991). Our notation and terminology are basically standard. As usual, ||x|| stands for the Euclidean norm of \(x\in {\mathbb {R}}^{n}\), and \({\mathbb {B}}_n\) denotes the closed unit ball in \({\mathbb {R}}^n\). Given \(x,y\in {\mathbb {R}}^n\), we write \(x\leqq y\ (\text{ resp. }\ x<y)\) when \(x_i \le y_i\ (\text{ resp. }\ x_i <y_i)\) for all \(i\in \{1,\ldots ,n\}\). Moreover, we write \(x\le y\) when \(x\leqq y\) and \(x\ne y\). The zero vector of \({\mathbb {R}}^n\) is denoted by \(0_n\). Given a nonempty set \(A \subseteq {\mathbb {R}}^{n}\), we denote by \({\overline{A}}\), conv(A), and cone(A), the closure of A, the convex hull and convex cone (containing the origin) generated by A, respectively. Also, we denote the Clarke tangent cone of A at \(\hat{x} \in {\overline{A}}\) by \(\Gamma (A,\hat{x})\), i.e.,

$$\begin{aligned}{} & {} \Gamma (A,\hat{x}):=\big \{v\in {{\mathbb {R}}^{n}} \mid \forall \{x_r\}\subseteq A,\ x_r\rightarrow \hat{x},\ \forall t_r \downarrow 0,\\{} & {} \quad \exists v_r \rightarrow v\ \text{ such } \text{ that }\ x_r+t_r v_r \in A \ \ \forall r\in {\mathbb {N}} \big \}. \end{aligned}$$

Let \(\hat{x} \in {\mathbb {R}}^n\) and let \(\varphi :{\mathbb {R}}^n \rightarrow {\mathbb {R}}\) be a locally Lipschitz function. The Clarke directional derivative of \(\varphi \) at \(\hat{x}\) in the direction \(v \in {\mathbb {R}}^n\) and the Clarke subdifferential of \(\varphi \) at \(\hat{x}\) are, respectively, given by

$$\begin{aligned} \varphi ^0({\hat{x}};v):=\limsup _{y\rightarrow {\hat{x}},\ t\downarrow 0} \frac{\varphi (y+tv)- \varphi (y)}{t}\end{aligned}$$

and

$$\begin{aligned} \partial _c \varphi (\hat{x}):=\big \{\xi \in {\mathbb {R}}^n \mid \left<\xi ,v\right> \le \varphi ^0 (\hat{x};v) \ \ \ \ \ \text{ for } \text{ all }\ v\in {\mathbb {R}}^n \big \}.\end{aligned}$$

The Clarke subdifferential is a natural generalization of the classical derivative since it is known that when function \(\varphi \) is continuously differentiable at \(\hat{x}\), \(\partial _c \varphi (\hat{x})=\{\nabla \varphi (\hat{x})\}\). Moreover, when a function \(\varphi \) is convex, the Clarke subdifferential coincides with \(\partial \varphi (\hat{x})\), the subdifferential in the sense of convex analysis, i.e.,

$$\begin{aligned} \partial \varphi (\hat{x}):=\big \{\xi \in {\mathbb {R}}^{n} \mid \varphi (x)\ge \varphi (\hat{x})+\big <\xi ,x-\hat{x} \big >\ \ \ \ \ \ \forall \; x\in {\mathbb {R}}^{n} \big \}.\end{aligned}$$

It is worth to observe that if \(\hat{x}\) is a minimizer of locally Lipschitz function \(\phi \) on a set C, then

$$\begin{aligned} 0\in \partial _c \phi (\hat{x})+N(C,\hat{x}),\end{aligned}$$

where \(N(C,\hat{x})\) denotes the Clarke normal cone of C at \(\hat{x} \), i.e.,

$$\begin{aligned} N(C,\hat{x}):=\big \{x\in {\mathbb {R}}^{n}\mid \langle x,a \rangle \le 0,\ \ \ \forall a\in \Gamma (C,\hat{x})\big \}.\end{aligned}$$

In the following theorem, we summarize some important properties of the Clarke subdifferential from Clarke (1983) which are widely used in what follows.

Theorem 1

Let \(\varphi \) and \(\phi \) be functions from \({\mathbb {R}}^n \) to \({\mathbb {R}}\) which are Lipschitz near \(\hat{x}\). Then,

  1. (i)

    \(\partial _c (\varphi +\phi ) (\hat{x})\subseteq \partial _c \varphi (\hat{x})+\partial _c \phi (\hat{x}).\)

  2. (ii)

    \(\partial _c \big (\lambda \varphi \big )(\hat{x})=\lambda \partial _c \varphi (\hat{x}),\ \ \ \ \ \forall \lambda \in {\mathbb {R}}.\)

  3. (iii)

    \(\partial _c \big (\max \{\varphi ,\phi \}\big )(\hat{x}) \subseteq conv \big (\partial _c \varphi (\hat{x}) \cup \partial _c \phi (\hat{x}) \big ).\)

  4. (iv)

    \( \partial _c \varphi (\hat{x})\) is a nonempty, convex, and compact subset of \({\mathbb {R}}^n\).

3 On the weak efficiency

In this paper, we consider the following multiobjective semi-infinite programming problem:

$$\begin{aligned} \text{(P) }\quad{} & {} \inf \;\big (f_1(x), f_2(x), \ldots , f_p(x)\big ) \\ \text{ s.t. } \quad \quad{} & {} g_t(x)\le 0\quad \ t\in T,\\ {}{} & {} x \in {\mathbb {R}}^n, \end{aligned}$$

where \(f_i,\ i\in I:=\{1,2,\ldots ,p\}\) and \(g_t\), \(t\in T\) are locally Lipschitz functions from \({\mathbb {R}}^n\) to \({\mathbb {R}}\), and the index set T is arbitrary, not necessarily finite (but nonempty). An important feature of problem (P) is that the index set T is arbitrary, i.e., may be infinite and also noncompact. When T is finite, (P) is a multiobjective optimization problem, and when \(p=1\) and T is infinite, (P) is a semi-infinite optimization problem. The feasible set of (P) is denoted by M, i.e.,

$$\begin{aligned} M:=\{x\in {\mathbb {R}}^n \mid g_t(x)\le 0,\ \ \ \forall t\in T \}.\end{aligned}$$

For each \(\hat{x} \in M\), set

$$\begin{aligned} F_{\hat{x}}:=\bigcup _{i\in I} \partial _c f_i(\hat{x}) \ \ \ \text{ and }\ \ \ \ G_{\hat{x}}:=\bigcup _{t\in T(\hat{x})}\partial _c g_t(\hat{x}),\end{aligned}$$

where \(T(\hat{x}):=\{ t\in T \mid g_t(\hat{x})=0 \}.\) A feasible point \(\hat{x}\) is said to be efficient solution [resp. weakly efficient solution] for (P) if and only if there is no \(x\in M\) satisfying \(f(x)\le f(\hat{x})\) [resp. \(f(x) < f(\hat{x})\)]. Recall that the problem (P) is said to be continuous when T is a compact metric space, \(g_t(x)\) is a continuous function of (tx) in \(T\times {\mathbb {R}}^n\), and \(t\mapsto \partial _c g_t (x)\) is an upper semicontinuous (set-valued) mapping for each \(x\in {\mathbb {R}}^n\). At almost all articles in (multiobjective) semi-infinite programming, the continuity of problem is assumed, even in differentiable case Caristi et al. (2010). The continuity of (P) implies the compactness of \(F_{\hat{x}} \cup G_{\hat{x}}\) by Kanzi (2015); then, the strict separation theorem implies that \(0\in conv( F_{\hat{x}} \cup G_{\hat{x}})\) when \(\hat{x}\) is a weakly efficient solution of the considered problem, and hence, the Fritz–John (FJ)-type necessary condition is satisfied for continuous (P) at a weakly efficient solution. For extension of this well-known result to noncontinuous (P), we recall the following definition from (Kanzi 2011, 2014).

Definition 1

We say that (P) has the Pshenichnyi–Levin–Valadier (PLV in short) property at \(\hat{x} \in M\), if \(\Psi (\cdot )\) is finite-valued Lipschitz around \(\hat{x}\), and

$$\begin{aligned} \partial _c \Psi (\hat{x})\subseteq conv \Big ( \bigcup _{t\in T(\hat{x})} \partial _c g_t(\hat{x})\Big )=conv \big ( G_{\hat{x}} \big ),\end{aligned}$$

where \(\Psi (\cdot )\) is defined as

$$\begin{aligned} \Psi (x):=\sup _{t\in T} g_t(x),\ \ \ \ \ \ \forall x\in M.\end{aligned}$$

It should be observed from (Goberna and Kanzi 2017; Kanzi 2014) that the PLV property is strictly weaker than continuity for (P). Thus, the following simple theorem is better than its continuous versions Caristi et al. (2010).

Theorem 2

(FJ necessary condition) Let \(\hat{x}\) be a weakly efficient solution of (P). If the PLV property holds at \(\hat{x}\), then there exist \(\alpha _{i}\ge 0\) (for \(i\in I\)), and \(\beta _{t}\ge 0,\) (for \(t\in T(\hat{x}))\), with \(\beta _{t} \ne 0\) for finitely many indexes, such that

$$\begin{aligned} 0_n \in \sum _{i= 1} ^ p \alpha _i \partial _c f_i(\hat{x})+\sum _{t\in T(\hat{x})}\beta _{t}\partial _c g_{t}(\hat{x}), \ \ \ \ \ \text{ and } \ \ \ \ \ \ \ \sum _{i=1}^p \alpha _i+\sum _{t\in T(\hat{x})} \beta _t=1.\end{aligned}$$

Proof

It is easy to see that \(\hat{x}\) is a global minimizer for the function

$$\begin{aligned} \vartheta (x):=\max \{\Theta (x), \Psi (x)\},\end{aligned}$$

where \(\Theta (x):=\max _{i\in I} \{ f_i(x)-f_i(\hat{x}) \}\) and \(\Psi (x)\) is defined as Definition 1. Thus, by PLV property we deduce that

$$\begin{aligned} 0_n \in \partial _c \vartheta (\hat{x}) \subseteq conv\big (\partial _c \Theta (\hat{x}) \cup \partial _c \Psi (\hat{x})\big )\subseteq conv \Big ( \big [conv (F_{\hat{x}} )\big ] \cup \big [conv ( G_{\hat{x}}) \big ] \Big ).\end{aligned}$$

This means \( 0_n \in conv( F_{\hat{x}} \cup G_{\hat{x}})\), as required. \(\square \)

As well as in the classical case, the optimality implies the Karush–Kuhn–Tucker (KKT) condition, provided some constraint qualifications are satisfied (see Caristi et al. 2010; Gao 2012, 2013; Goberna and Kanzi 2017; Guerra-Vazquez and Todorov 2016; Goberna et al. 2013; Kanzi 2015; López and Vercher 1983). Slater’s condition (SC) is said to be satisfied for problem (P) if there exists a Slater point, \((x^*)<0\) for all \(t\in T\). It is easy to see that the SC cannot play the role of such constraint qualification without additional convexity assumption on the restriction functions. Also, the SC works only for continuous (multiobjective) semi-infinite problems (SIP). In fact, the Slater constraint qualification (SCQ) for (multiobjective) SIP consists of the following three conditions (see López and Vercher 1983):

$$\begin{aligned} \left\{ \begin{array}{ll} \text{(I): } \text{ Continuity } \text{ of } (P). \\ \text{(II): } \text{ SC. } \\ \text{(III): } \text{ Convexity } \text{ of } g_t \text{ functions } \text{ for } t\in T. \end{array}\right. \end{aligned}$$

In the line of extension of KKT necessary condition for (P), we are going to change: (I) the continuity of (P) by PLV property; (II) the SC by weak Slater’s condition, introduced in Definition 2; and (III) the convexity of \(g_t\)s by \((\Phi ,\rho )\)-invexity of them, defined in Definition 3. We will use the following weaker form of SC for SIP appeared in Hettich and Kortanek (1993).

Definition 2

We say that the weak Slater’s condition (WSC in brief) is satisfied for (P) at \(x_0 \in M\) if for each finite index set \(T^* \subseteq T( x_0)\), there exists a point \(x_{ _{T^*}}\in {\mathbb {R}}^n\) such that \(g_t(x_{ _{T^*}})<0\) for all \(t\in T^*\). Also, (P) is satisfied in global WSC (GWSC, briefly) if for each finite index set \(T^* \subseteq T\), there exists a point \(x_{ _{T^*}}\in {\mathbb {R}}^n\) such that \(g_t(x_{ _{T^*}})<0\) for all \(t\in T^*\).

Clearly, WSC at a point is strictly weaker than GWSC. The following example shows that the GWSC is strictly weaker than SC.

Example 1

Let \(T:={\mathbb {N}}\cup \{0\}\), and

$$\begin{aligned} g_t(x):= \left\{ \begin{array}{ll} x-\displaystyle \frac{1}{t}\ \ \ \ \ \ \text{ if }\ \ \ \ \ t\in {\mathbb {N}}, \\ -x \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ t=0. \end{array}\right. \end{aligned}$$

It is easy to check that \(M=\{0\}\), and SC does not hold. Now, assume that \(T^*\) is a finite subset of T. Take \(\max (T^*):=q\) and \( x_{ _{T^*}}:=\frac{1}{q+1}\). Then,

$$\begin{aligned} g_t(x_{ _{T^*}})= \left\{ \begin{array}{ll} x_{ _{T^*}}-\displaystyle \frac{1}{t}=\displaystyle \frac{1}{q+1}-\displaystyle \frac{1}{t}<0\ \ \ \ \ \ \text{ if }\ \ \ \ \ t\in {\mathbb {N}}\cap T^*, \\ \ \\ -x_{ _{T^*}}=- \displaystyle \frac{1}{q+1}<0\ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ t=0. \end{array}\right. \end{aligned}$$

Thus, the GWSC holds.

Definition 3

Suppose that the functions \(\Phi :{\mathbb {R}}^n \times {\mathbb {R}}^n \times {\mathbb {R}}^n \times {\mathbb {R}} \rightarrow {\mathbb {R}}\) and \(\rho :{\mathbb {R}}^n \times {\mathbb {R}}^n \rightarrow {\mathbb {R}}\), and the nonempty set \(X \subseteq {\mathbb {R}}^n\) are given. A locally Lipschitz function \(\hbar :{\mathbb {R}}^n \rightarrow {\mathbb {R}}\) is said to be \((\Phi ,\rho )\)-invex at \(x^* \in X \) with respect to X, if for each \(x\in X\) one has:

$$\begin{aligned}{} & {} \Phi \big (x,x^*, 0_n,r\big ) \ge 0 \ \ \ \ \ \text{ for } \text{ all } r \ge 0, \end{aligned}$$
(1)
$$\begin{aligned}{} & {} \Phi (x,x^*,.,.) \text{ is } \text{ convex } \text{ on } {\mathbb {R}}^{n}\times {\mathbb {R}}, \end{aligned}$$
(2)
$$\begin{aligned}{} & {} \Phi \Big (x,x^*,\xi ,\rho (x,x^*) \Big ) \le \hbar (x)-\hbar (x^*),\ \ \ \ \forall \xi \in \partial _c \hbar (x^*). \end{aligned}$$
(3)

Remark 1

Since in Antczak (2015), Antczak and Stasiak (2011), Antczak (2012) the considered \(\rho \) is real number, Definition 2 is more general than them.

As mentioned by Antczak (2015), the definition of \((\Phi ,\rho )\)-invexity generalizes the almost all concepts of invexity and convexity. Also, some \((\Phi ,\rho )\)-invex functions are presented by Antczak (2015) which are not convex. The following example shows that the \((\Phi ,\rho )\)-invexity is strictly weaker than invexity.

Example 2

Consider a function \(\Phi : {\mathbb {R}} \times {\mathbb {R}} \times {\mathbb {R}} \times {\mathbb {R}} \rightarrow {\mathbb {R}}\) defined by

$$\begin{aligned} \Phi (x,y,u,w):= \left\{ \begin{array}{ll} w-\displaystyle \frac{u}{3y^2}|x^3 -y^3|\ \ \ \ \ \ \text{ if }\ \ \ \ \ y\ne 0, \\ \\ w|x^3 | \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ y=0. \end{array}\right. \end{aligned}$$

Let x and \(\hat{x}\) be arbitrary elements of \({\mathbb {R}}\). Since \(\Phi (x,\hat{x},.,.)\) is a linear function and

$$\begin{aligned} \Phi (x,y,0,r)= \left\{ \begin{array}{ll} r\ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ y\ne 0, \\ r|x^3 | \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ y=0, \end{array}\right. \end{aligned}$$

conditions (1) and (2) hold. Take \(\rho (x,y):=-1\) for all \(x,y\in {\mathbb {R}}\), and \(\hbar (x):=x^3\). Since \(\hbar (.)\) is continuously differentiable on \({\mathbb {R}}\), \(\partial _c \hbar (\hat{x})=\{3 \hat{x}^2\}\). Thus, the following relations show that \(\hbar (.)\) is a \((\Phi ,\rho )\)-invex function at each \(\hat{x} \in {\mathbb {R}}\) with respect to \({\mathbb {R}}\),

$$\begin{aligned} \Phi (x,\hat{x},3\hat{x}^2,-1)= & {} \left\{ \begin{array}{ll} -1-|x^3-\hat{x}^3|\ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ \hat{x}\ne 0, \\ -|x^3 | \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ \hat{x}=0, \end{array}\right. \\ \ {}&\ {}&\ \\ \ {}\le & {} x^3-\hat{x}^3=\hbar (x)-\hbar (\hat{x}). \end{aligned}$$

Furthermore, as it follows by Ben-Israel and Mond (1986), \(\hbar (.)\) is not an invex function on \({\mathbb {R}}\) with respect to any \(\eta : {\mathbb {R}} \times {\mathbb {R}} \rightarrow {\mathbb {R}}\).

It is noteworthy that the strength of the Example 2 is that \(\hbar (.)\) is \((\Phi ,\rho )\)-invex for any \(\hat{x} \in {\mathbb {R}} ^n\) but its weakness is that \(\hbar (.)\) is continuously differentiable and \(\rho \) is a fixed number. The following example presents a function \(\hbar : {\mathbb {R}} \rightarrow {\mathbb {R}}\) that is nonsmooth at \(x^*=0\) and \((\Phi ,\rho )\)-invex at \(\hat{x}\) for a \(\rho \) that depends on x and \(x^*\).

Example 3

Consider the functions \(\Phi : {\mathbb {R}} \times {\mathbb {R}} \times {\mathbb {R}} \times {\mathbb {R}} \rightarrow {\mathbb {R}}\) and \(\rho : {\mathbb {R}} \times {\mathbb {R}} \rightarrow {\mathbb {R}}\), respectively, defined as

$$\begin{aligned} \Phi (x,y,u,w):= \left\{ \begin{array}{ll} w-\displaystyle \frac{u}{3y^2}|x^3 -y^3|\ \ \ \ \ \ \text{ if }\ \ \ \ \ y\ne 0, \\ \\ (w+u^2)|x^3 | \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ y=0. \end{array}\right. \\ \rho (x,y):= \left\{ \begin{array}{ll} (1-\frac{1}{x^2})+xy\ \ \ \ \ \ \text{ if }\ \ \ \ \ x>0, \\ \\ -(1+\frac{1}{x^2})+y \ \ \ \ \ \text{ if }\ \ \ \ \ x<0, \\ \\ 1+y\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ x=0. \end{array}\right. \end{aligned}$$

Let \(x^*=0\) and \(\hbar : {\mathbb {R}} \rightarrow {\mathbb {R}}\) as \(\hbar (x):=x^3-|x|\), obviously the conditions (1) and (2) hold. For checking condition (3), we observe that for each \(\xi \in \partial _c \hbar (x^*)=[-1,1]\) and \(x \in {\mathbb {R}}\) we have:

\(\Phi (x,x^*,\xi ,\rho (x,x^*))=(\rho (x,0)+\xi ^2) |x^3| \le \rho (x,0) |x^3|\)

$$\begin{aligned} = \left\{ \begin{array}{ll} (1-\frac{1}{x^2})|x^3|\ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ x>0, \\ \\ - (1+\frac{1}{x^2}) |x^3| \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ x<0, \\ \\ |x^3|\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ x=0. \end{array}\right. \\ = \left\{ \begin{array}{ll} (x^3-x)\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ x>0, \\ \\ (x^3+x) \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ x<0, \\ \\ 0\ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \ \text{ if }\ \ \ \ \ x=0. \end{array}\right. \end{aligned}$$

\(=(x^3-|x|)= \hbar (x) = \hbar (x)- \hbar (x^*)\).

Everywhere in the following, we will assume X equals to feasible solution of (P), i.e., \(X=M\), but for the sake of simplicity we will omit to mention X. The following definition is motivated by above comments.

Definition 4

Let \(\Phi :{\mathbb {R}}^n \times {\mathbb {R}}^n \times {\mathbb {R}}^n \times {\mathbb {R}} \rightarrow {\mathbb {R}}\) be a given function, and \(\hat{x} \in M\). We say that (P) satisfies the \(\Phi \)-weak SCQ (\(\Phi \)-WSCQ, briefly) at \({\hat{x}},\) if

  • the PLV property holds at \(\hat{x}\),

  • WSC is satisfied at \(\hat{x}\),

  • for each \(t\in T(\hat{x})\), the \(g_t \) function is \((\Phi ,\rho _t)\)-invex at \(\hat{x}\) for some given function \(\rho _t:{\mathbb {R}}^n \times {\mathbb {R}}^n \rightarrow {\mathbb {R}}\).

Normally, we are interested to show Karush–Kuhn–Tucker necessary condition for (P) under \(\Phi \)-WSCQ assumption. In fact, the following theorem guaranties that \(\Phi \)-WSCQ is a constraint qualification.

Theorem 3

(KKT necessary condition) Let \(\hat{x}\) be a weak efficient solution of (P). Suppose that the \(\Phi \)-WSCQ is satisfied at \(\hat{x}\) with \(\rho _t(x,\hat{x})\ge 0\) for every \((x,t)\in {\mathbb {R}}^n \times T\). Then, there exist \(\alpha _{i}\ge 0\) (for \(i\in I\)) with \(\sum _{i=1}^p \alpha _i=1\), and \(\beta _{t}\ge 0,\) (for \(t\in T(\hat{x}))\), with \(\beta _{t} \ne 0\) for finitely many indexes, such that

$$\begin{aligned} 0_n \in \sum _{i= 1} ^ p \alpha _i \partial _c f_i(\hat{x})+\sum _{t\in T(\hat{x})}\beta _{t}\partial _c g_{t}(\hat{x}). \end{aligned}$$
(4)

Proof

Applying Theorem 2, we find some \(\alpha _i \ge 0\) and \(\xi _i \in \partial _c f_i(\hat{x})\) for \(i\in I\), \(\beta _t \ge 0\) and \(\zeta _t \in \partial _c g_t(\hat{x})\) for \(t\in T^* \subseteq T(\hat{x})\) with \(|T^*|<\infty \), such that

$$\begin{aligned} \sum _{i\in I} \alpha _i \xi _i +\sum _{t\in T^*}\beta _t \zeta _t =0_n,\ \ \ \ \ \ \ \ \ \sum _{i\in I} \alpha _i+\sum _{t\in T^*}\beta _t =1.\end{aligned}$$

All we need to prove is that at least one \(\alpha _i\) should be positive. If it is not this case, then

$$\begin{aligned} \sum _{t\in T^*}\beta _t \zeta _t =0_n,\ \ \ \ \ \ \ \ \ \sum _{t\in T^*}\beta _t =1.\end{aligned}$$

Thus, owing to \(\displaystyle \sum _{t\in T^*}\beta _t \rho _t (x_{ _{T^*}},\hat{x}) \ge 0\) and Definition 2, we get

$$\begin{aligned} 0 \le \Phi \Big (x_{ _{T^*}},\hat{x},\sum _{t\in T^*}\beta _t \zeta _t,\sum _{t\in T^*}\beta _t \rho _t (x_{ _{T^*}},\hat{x}) \Big ) \\ \le \sum _{t\in T^*}\beta _t \Phi \big (x_{ _{T^*}},\hat{x}, \zeta _t,\rho (x_{ _{T^*}},\hat{x})\big )\\ \le \sum _{t\in T^*}\beta _t \big (g_t(x_{ _{T^*}})- g_t(\hat{x})\big )= \sum _{t\in T^*}\beta _t g_t(x_{ _{T^*}})<0. \\ \end{aligned}$$

This contradiction justifies the result. \(\square \)

In order to recall the role of \((\Phi ,\rho )\)-invexity in KKT sufficient condition, we recall the following result from Kanzi (2017). It is noteworthy that the following theorem is proved in Caristi et al. (2010) for MPVCs with differentiable data, and for the case \(p=1,\ |T|<\infty \) in Antczak and Stasiak (2011).

Theorem 4

(KKT sufficient condition) Suppose that there exist a feasible solution \(\hat{x} \in M\), and scalars \(\alpha _i \ge 0\) (for \(i\in I\)) with \(\sum _{i=1}^p \alpha _i =1\), and a finite set \(T^*:=\{t_1,t_2,\ldots ,t_m\} \subseteq T(\hat{x})\), and scalars \(\beta _{j_s} \ge 0\) (for \(s\in \{1,2,\ldots ,m\}\)) such that

$$\begin{aligned} 0\in \sum _{i=1}^p \alpha _i \partial _c f_i(\hat{x})+\sum _{s=1}^m \beta _{t_s}\partial _c g_{t_s}(\hat{x}). \end{aligned}$$
(5)

Moreover, if the \(f_i\) functions are generalized \((\Phi ,\rho _i)\)-invex at \(\hat{x}\) and the \(g_t\) functions are generalized \((\Phi ,\rho _t)\)-invex at \(\hat{x}\) (for \((i,t)\in I \times T(\hat{x})\)), and \(\sum _{i=1}^p \alpha _i \rho _i(x,\hat{x})+\sum _{s=1}^m \beta _{t_s}\rho _{t_s}(x,\hat{x})\ge 0\) for all \(x\in M\), then \(\hat{x}\) is a weak efficient solution for (P).

Now, in accordance with Caristi et al. (2006) and similar to Antczak (2009a, 2009b), we introduce a dual of Mond and Weir (1981) type for (P) that is in connection with weak efficiency. For \(y\in {\mathbb {R}}^n\), \(T^* \subseteq T\) with \(|T^*|<\infty \), and \(\beta :=(\beta _t)_{t\in T^*} \ge 0_{|T^*|}\), put

$$\begin{aligned} f^\sharp (y,\beta ,T^*):=\Big ( f_1(y)+\sum _{t\in T^*}\beta _t g_t(y),\ldots , f_p(y)+\sum _{t\in T^*}\beta _t g_t(y) \Big ).\end{aligned}$$

Consider the following dual problem:

$$\begin{aligned} (D_1^\sharp ): \ \ \ \ \ \ \ \ \ \max \big \{f^\sharp (y,\beta ,T^*) \mid \exists \alpha :=(\alpha _i)_{i\in I},\ \ \ (\alpha ,y,\beta ,T^*)\in M_1^\sharp \big \},\end{aligned}$$

where the feasible set \(M_1^\sharp \) is defined by

$$\begin{aligned} M_1^\sharp :=\Big \{(\alpha ,y,\beta ,T^*) \mid y\in {\mathbb {R}}^n,\ T^* \subseteq T,\ |T^*|<\infty ,\ (\beta _t)_{t\in T^*}\ge 0_{|T^*|},\ \\ (\alpha _i)_{i\in I}\ge 0_p, \sum _{i=1}^p \alpha _i=1,\ \ 0_n\in \sum _{i=1}^p \alpha _i \partial _c f_i(y) +\sum _{t\in T^*}\beta _t \partial _c g_t(y)\Big \}. \end{aligned}$$

Since the proofs of following two theorems are similar to Antczak (2015) and Caristi et al. (2010), we omit them. Notice that we can rewrite the proofs of Theorems 9 and 10 (which will be expressed in the next section) with a little change for the following theorems.

Theorem 5

(weak duality for \((D_1^\sharp )\)) Let \(x\in M\) and \((\alpha ,y,\beta ,T^*)\in M_1^\sharp \). If the \(f_i\) functions for \(i\in I\) are \((\Phi ,{\rho _{i}})\)-invex at y, the \(g_{t}\) functions for \(t\in T^*\) are \((\Phi ,{\rho _{t}})\)-invex at y, and \(\sum _{i=1}^p \alpha _i \rho _i(x,y)+\sum _{t\in T^*} \beta _{t}\rho _{t}(x,y)\ge 0\), then \(f(x) \nless f^\sharp (y,\beta ,T^*)\).

Theorem 6

(strong duality for \((D_1^\sharp )\)) Suppose that \(\hat{x}\) is a weak efficient of (P) and that \(\Phi \)-WSCQ is satisfied at \(\hat{x}\). If each \(f_i\) functions (for \(i\in I \)) is \((\Phi ,{\rho _i})\)-invex at \(\hat{x}\) with \(\rho _i(x,\hat{x})\ge 0\) for every \(x\in {\mathbb {R}}^n\), then there exist \(\hat{\alpha }_{i}\ge 0\) (for \(i\in I\)) with \(\sum _{i=1}^p \hat{\alpha }_i=1\), and \(T_1^* \subseteq T(\hat{x})\) with \(|T_1^*|<\infty \), and \( \hat{\beta }_t \ge 0\) (for \(t\in T_1^*\)), such that \((\hat{\alpha },\hat{x},\hat{\beta },T_1^*)\in M_1^\sharp \) and \(f(\hat{x})=f^\sharp (\hat{x},\hat{\beta },T_1^*)\). Furthermore, \((\hat{\alpha },\hat{x},\hat{\beta },T_1^*)\) is a weak efficient solution for dual problem \((D_1^\sharp )\).

4 On the proper efficiency

Proper efficiency is a very important notion used in studying multiobjective optimization problems. There are many definitions of proper efficiency in literature, as those introduced by Geoffrion, Benson, Borwein, and Henig; see Guerraggio et al. (1994) for a comparison among the main definitions of this notion. We recall the following definition from Gopfert et al. (2003).

Definition 5

A point \(\hat{x} \in M\) is called a properly efficient solution of (P) when there exists a \(\lambda >0_p\) such that

$$\begin{aligned} \langle \lambda ,f(\hat{x})\rangle \le \langle \lambda ,f( x)\rangle ,\ \ \ \ \ \ \ \ \ \ \ \forall x\in M.\end{aligned}$$

As proved in Ehrgott (2005), the above definition of proper efficiency is weaker than its other definitions (under some assumed conditions). Thus, the following theorem can be extended to other sense of proper efficiency under further assumptions.

Theorem 7

(KKT strong necessary condition) Let \(\hat{x}\) be a proper efficient solution of (P). Suppose that the \(\Phi \)-WSCQ is satisfied at \(\hat{x}\) with \(\rho _t(x,\hat{x})\ge 0\) for every \(x\in {\mathbb {R}}^n\). Then, there exist \(\alpha _{i}>0\) (for \(i\in I\)) with \(\sum _{i=1}^p \alpha _i=1\), and \(\beta _{t}\ge 0,\) (for \(t\in T(\hat{x}))\), with \(\beta _{t} \ne 0\) for finitely many indexes, such that (4) fulfilled.

Proof

By the definition of proper efficiency, there exist some scalars \(\lambda _i>0\) (for \(i\in I\)) such that \(\hat{x}\) is a minimizer of the following scalar semi-infinite problem:

$$\begin{aligned} \min _{x\in M}\sum _{i=1}^p \lambda _i f_i(x).\end{aligned}$$

Applying Theorem 3, we get

$$\begin{aligned} 0_n\in \partial \Big ( \sum _{i=1}^p \lambda _i f_i(\cdot ) \Big )(\hat{x})+\sum _{t\in T(\hat{x})}\mu _{t}\partial _c g_{t}(\hat{x})\subseteq \sum _{i=1}^p \lambda _i \partial _c f_i(\hat{x})+\sum _{t\in T(\hat{x})}\mu _{t}\partial _c g_{t}(\hat{x}),\end{aligned}$$

for some \(\mu _{t}\ge 0,\) (\(t\in T(\hat{x}))\), with \(\mu _{t} \ne 0\) for finitely many indexes. For each \(i\in I\) take \(\alpha _i:=\frac{\lambda _i}{\sum _{i=1}^p \lambda _i}\), and for each \(t\in T(\hat{x})\) put \(\beta _t:=\frac{\mu _t}{\sum _{i=1}^p \lambda _i}\). \(\square \)

Now, we state the sufficient condition for proper efficiency of (P) as follows.

Theorem 8

(KKT strong sufficient condition) Let \(\hat{x} \in M\). Suppose that there exist some scalars \(\alpha _i > 0\) (for \(i\in I\)) with \(\sum _{i=1}^p \alpha _i =1\), a finite set \(T^*:=\{t_1,t_2,\ldots ,t_m\} \subseteq T(\hat{x})\), and some scalars \(\beta _{t_s} \ge 0\) (for \(s\in \{1,2,\ldots ,m\}\)) such that (4) holds. Moreover, if the \(f_i\) functions for \(i\in I\) are \((\Phi ,{\rho })\)-invex at \(\hat{x}\) (this means that all \(\rho _i\)s are equal to \(\rho \) for \(i\in I\)), and the \(g_{t_s}\) functions for \(s\in \{1,\ldots ,m\}\) are \((\Phi ,{\rho _{t_s}})\)-invex at \(\hat{x}\), and \(\sum _{i=1}^p \alpha _i \rho _i(x,\hat{x})+\sum _{s=1}^m \beta _{t_s}\rho _{t_s}(x,\hat{x})\ge 0\) for all \(x\in M\), then \(\hat{x}\) is a proper efficient solution for (P).

Proof

Owing to (4), we can find some \(\xi _i \in \partial _c f_i(\hat{x})\) and \(\zeta _{t_s} \in \partial _c g_{t_s}(\hat{x})\) for \(i\in I\) and \(s\in \{1,\ldots ,m \}\)) such that

$$\begin{aligned} \sum _{i=1}^p \alpha _i \xi _i+\sum _{s=1}^m \beta _{t_s}\zeta _{t_s}=0_n, \ \ \ \ \ \ \ \sum _{i=1}^p \alpha _i +\sum _{s=1}^m \beta _{t_s}>0.\end{aligned}$$

Put \(\widehat{\alpha }_i:=\frac{\alpha _i}{\sum _{i=1}^p \alpha _i +\sum _{s=1}^m \beta _{t_s}}\) and \(\widehat{\beta }_{t_s}:=\frac{\beta _{t_s}}{\sum _{i=1}^p \alpha _i +\sum _{s=1}^m \beta _{t_s}}\). Due to assumptions of theorem, the following inequalities hold:

$$\begin{aligned} 0\le & {} \Phi \Big (x,\hat{x}, \sum _{i=1}^p \widehat{\alpha }_i \xi _i+\sum _{s=1}^m \widehat{\beta }_{t_s}\zeta _{t_s} , \sum _{i=1}^p \widehat{\alpha }_i \rho _i(x,\hat{x})+\sum _{s=1}^m \widehat{\beta }_{t_s}\rho _{t_s}(x,\hat{x}) \Big ) \nonumber \\\le & {} \sum _{i=1}^p \widehat{\alpha }_i \Phi \big ( x,\hat{x}, \xi _i, \rho _i (x,\hat{x}) \big ) + \sum _{s=1}^m \widehat{\beta }_{t_s} \Phi \big (x, \hat{x}, \zeta _{t_s}, \rho _{t_s}(x,\hat{x}) \big ). \end{aligned}$$
(6)

On the other hand, the \((\Phi ,{\rho _{t_s}})\)-invexity of \(g_{t_s}\) functions implies that for each \(s\in \{1,\ldots ,m\}\) we get

$$\begin{aligned} \Phi \big (x, \hat{x}, \zeta _{t_s}, \rho _{t_s}(x,\hat{x}) \big )\le g_{t_s}(x)-g_{t_s}(\hat{x})=g_{t_s}(x)\le 0. \end{aligned}$$
(7)

From (6) and (7), we conclude that

$$\begin{aligned} \sum _{i=1}^p \widehat{\alpha }_i \Phi \big ( x,\hat{x}, \xi _i, \rho _i (x,\hat{x}) \big )\ge 0\ \Rightarrow \ \sum _{i=1}^p \alpha _i \Phi \big ( x,\hat{x}, \xi _i, \rho _i (x,\hat{x}) \big )\ge 0. \end{aligned}$$
(8)

Employing [Antczak and Stasiak (2011), Proposition 11] and \(\sum _{i=1}^p \alpha _i=1\), we deduce that \(\sum _{i=1}^p \alpha _i f_i\) is a \((\Phi ,{\rho })\)-invex function. Therefore, (8) yields that

$$\begin{aligned} \sum _{i=1}^p \alpha _i f_i(\hat{x}) \le \sum _{i=1}^p \alpha _i f_i(x),\ \ \ \ \ \forall x\in M.\end{aligned}$$

Since \( \alpha _i >0\) for all \(i\in I\), we can take \(\lambda _i= \alpha _i\) in Definition 5, and the proof is complete. \(\square \)

For given \(y\in {\mathbb {R}}^n\), \(T^* \subseteq T\) with \(|T^*|<\infty \), and \(\beta :=(\beta _t)_{t\in T^*} \ge 0_{|T^*|}\), set the \(f^\sharp (y,\beta ,T^*)\) function as previous section. In connection with the problem (P), we consider the following dual of Mond–Weir-type problem (which is different with \((D_1^\sharp )\)):

$$\begin{aligned} (D_2^\sharp ): \ \ \ \ \ \ \ \ \ \max \big \{f^\sharp (y,\beta ,T^*) \mid \exists \alpha :=(\alpha _i)_{i\in I},\ \ \ (\alpha ,y,\beta ,T^*)\in M_2^\sharp \big \},\end{aligned}$$

in which \(M_2^\sharp \) is defined as

$$\begin{aligned} M_2^\sharp :=\Big \{(\alpha ,y,\beta ,T^*) \mid y\in {\mathbb {R}}^n,\ T^* \subseteq T,\ |T^*|<\infty ,\ (\beta _t)_{t\in T^*}\ge 0_{|T^*|},\\ \ (\alpha _i)_{i\in I}>0_p, \sum _{i=1}^p \alpha _i=1,\ \ 0_n\in \sum _{i=1}^p \alpha _i \partial _c f_i(y) +\sum _{t\in T^*}\beta _t \partial _c g_t(y)\Big \}, \end{aligned}$$

Remark 2

It should be noted that the dual problems presented in some articles are constructed with infinite index sets (unlike the problems \((D_1^\sharp )\) and \((D_2^\sharp )\)); to study such problems, we can refer to surveys (Hettich and Kortanek 1993; López and Still 2007) and their references. Clearly, dual problems defined by a finite index set (including the problems \((D_1^\sharp )\) and \((D_2^\sharp )\)) are more efficient and usable in terms of application.

The next two theorems describe (weak and strong) duality relations between the prime problem (P) and the dual problem \((D_2^\sharp )\).

Theorem 9

(weak duality for \((D_2^\sharp )\)) Let \(x\in M\) and \((\alpha ,y,\beta ,T^*)\in M_2^\sharp \). If the \(f_i\) functions for \(i\in I\) are \((\Phi ,{\rho _{i}})\)-invex at y, the \(g_{t}\) functions for \(t\in T^*\) are \((\Phi ,{\rho _{t}})\)-invex at y, and \(\sum _{i=1}^p \alpha _i \rho _i(x,y)+\sum _{t\in T^*} \beta _{t}\rho _{t}(x,y)\ge 0\), then \(f(x) \nleq f^\sharp (y,\beta ,T^*)\).

Proof

Assume on the contrary that \(f(x) \le f^\sharp (y,\beta ,T^*)\). Thus, there exists an index \(k\in I\) such that

$$\begin{aligned} \left\{ \begin{array}{ll} f_{i}(x) \le f_i(y)+\sum _{t\in T^*}\beta _t g_t(y),\ \ \ \ \ \forall i\in I\setminus \{k\}, \\ f_{k}(x) < f_k(y)+\sum _{t\in T^*}\beta _t g_t(y). \end{array}\right. \end{aligned}$$
(9)

For each \(i\in I\) and \(t\in T^*\), set

$$\begin{aligned} \widetilde{\alpha }_i:=\frac{\alpha _i}{1+\sum _{t\in T^*}\beta _t},\ \ \ \ \text{ and }\ \ \ \ \widetilde{\beta }_t:=\frac{\beta _t}{1+\sum _{t\in T^*}\beta _t}.\end{aligned}$$

Since \((\alpha ,y,\beta ,T^*)\in M_2^\sharp \), we have \(\sum _{i=1}^p \widetilde{\alpha }_i +\sum _{t\in T^*}\widetilde{\beta }_t =1\), and

$$\begin{aligned} \sum _{i=1}^p \widetilde{\alpha }_i \xi _i +\sum _{t\in T^*}\widetilde{\beta }_t \zeta _t=0_n, \end{aligned}$$
(10)

for some \(\xi _i \in \partial _c f_i(y),\ i\in I\), and \(\zeta _t \in \partial _c g_t(y),\ t\in T^*\). It follows from (9) and \((\widetilde{\alpha }_1,\ldots ,\widetilde{\alpha }_p)> 0_p\) that

$$\begin{aligned} \sum _{i=1}^p \widetilde{\alpha }_i \big (f_i(x)-f_i(y)\big )< & {} \sum _{i=1}^p \Big [\widetilde{\alpha }_i \sum _{t\in T^*}\beta _t g_t(y)\Big ] \nonumber \\= & {} \Big (\sum _{t\in T^*}\beta _t g_t(y)\Big )\Big (\sum _{i=1}^p \widetilde{\alpha }_i\Big )\le \sum _{t\in T^*}\beta _t g_t(y). \end{aligned}$$
(11)

At the same time, Eq. (10) implies that

$$\begin{aligned} 0\le & {} \Phi \Big (x,y,\sum _{i=1}^p \widetilde{\alpha }_i \xi _i +\sum _{t\in T^*}\widetilde{\beta }_t \zeta _t,\sum _{i=1}^p \widetilde{\alpha }_i \rho _i(x,y)+\sum _{t\in T^*} \widetilde{\beta }_{t}\rho _{t}(x,y)\Big ) \\\le & {} \sum _{i=1}^p \widetilde{\alpha }_i \Phi \big (x,y,\xi _i,\rho _i(x,y)\big )+\sum _{t\in T^*} \widetilde{\beta }_{t}\Phi \big (x,y,\zeta _t,\rho _t(x,y)\big )\\\le & {} \sum _{i=1}^p \widetilde{\alpha }_i \big (f_i(x)-f_i(y)\big )+\sum _{t\in T^*}\beta _t \big (g_t(x)-g_t(y)\big )\\\le & {} \sum _{i=1}^p \widetilde{\alpha }_i \big (f_i(x)-f_i(y)\big )-\sum _{t\in T^*}\beta _t g_t(y), \end{aligned}$$

where the last inequality holds by \(x\in M\). The above inequality contradicts (11), and the proof is complete. \(\square \)

Theorem 10

(strong duality for \((D_2^\sharp )\)) Suppose that \(\hat{x}\) is a properly efficient solution of (P) and that \(\Phi \)-WSCQ is satisfied at \(\hat{x}\). If each \(f_i\) functions (for \(i\in I \)) is \((\Phi ,{\rho _i})\)-invex at \(\hat{x}\) with \(\rho _i( x,\hat{x})\ge 0\) for every \(x\in {\mathbb {R}}^n\), then there exist \(\hat{\alpha }_{i}> 0\) (for \(i\in I\)) with \(\sum _{i=1}^p \hat{\alpha }_i=1\), and \(T_1^* \subseteq T(\hat{x})\) with \(|T_1^*|<\infty \), and \( \hat{\beta }_t \ge 0\) (for \(t\in T_1^*\)), such that \((\hat{\alpha },\hat{x},\hat{\beta },T_1^*)\in M_2^\sharp \) and \(f(\hat{x})=f^\sharp (\hat{x},\hat{\beta },T_1^*)\). Furthermore, \((\hat{\alpha },\hat{x},\hat{\beta },T_1^*)\) is an efficient solution for dual problem \((D_2^\sharp )\).

Proof

According to Theorem 7, there exist \(\hat{\alpha }:=(\hat{\alpha }_{i})_{i\in I}> 0_p,\) with \(\sum _{i=1}^p \hat{\alpha }_i=1\), and \(T_1^* \subseteq T(\hat{x})\) with \(|T_1^*|<\infty \), and \(\hat{\beta }_t \ge 0,\ t\in T_1^*\) such that

$$\begin{aligned} 0_n\in \sum _{i=1}^p \hat{\alpha }_i \partial _c f_i(\hat{x})+\sum _{t\in T_1^*}\hat{\beta }_t \partial _c g_t(\hat{x}).\end{aligned}$$

This means \((\hat{\alpha }, \hat{x},\hat{\beta },T_1^*)\in M_2^\sharp \). Obviously, \(t\in T_1^* \subseteq T(\hat{x})\) implies

$$\begin{aligned} \ \ \ \ f^\sharp (\hat{x}, \hat{\beta },T_1^*):=\Big ( f_1(\hat{x})+\sum _{t\in T_1^*}\beta _t g_t(\hat{x}),\ldots , f_p(\hat{x})+\sum _{t\in T_1^*}\hat{\beta }_t g_t(\hat{x}) \Big )=f(\hat{x}).\nonumber \\ \end{aligned}$$
(12)

Now, invoking the weak duality result in Theorem 9, and take (12) into account, we get \(f^\sharp (\hat{x},\hat{\beta },T_1^*)\nleq f^\sharp (y,\beta ,T^*)\) for any \( (\alpha ,y,\beta ,T^*)\in M_2^\sharp \). This gives us that \((\hat{x}, \hat{\beta },T_1^*)\) is an efficient solution for the dual problem \((D_2^\sharp )\). \(\square \)

Remark 3

Note that our strong duality result appeared in Theorem 10 is not in an ordinary way; that is, the solution of the dual problem is not guaranteed properly efficient, only efficient, although the solution to the prime one is properly efficient. It is noteworthy that in [Antczak (2012), Theorem 19] a relationship has been established between the properly efficient solutions of primary and dual problems; and this is made possible by choosing the concept of properly efficient in Geoffrion type in Antczak (2012), which is weaker than properly efficient in type of Definition 5 [see Ehrgott (2005), Theorem 3.11].

Remark 4

Similar to Caristi et al. (2010), we can define quasi \((\Phi ,{\rho })\)-quasiinvex functions and \((\Phi ,{\rho })\)-pseudoinvex functions, and then, we can prove some weaker optimality and duality results for (P). Since the proof of these extensions are similar to previous theorems, we omit them.