Abstract
Kantorovich-type results for generalized equations are extended with no additional conditions using Newton procedures. Iterates are shown to belong in a smaller domain resulting to tighter Lipschitz constants and a finer convergence analysis than in earlier works.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Let \(B_1, B_2\) are Banach spaces \(D\subset B_1\) be an open set \(F:D\longrightarrow B_2\) be a continuously differentiable operator in the Fréchet-sense. Consider the problem of finding a solution \(x^*\in D\) of of the equation
Numerous applications from computational sciences reduce to finding the point \(x^*.\) The points \(x^*\) is needed in closed form. But this can be achieved only in special cases. That is why most solution procedures for (1) involve iterative procedures. The convergence region for these procedures is small in general, limiting their applicability. The error bounds on the distances involved are also pessimistic (in general).
Motivated by optimization concerns we develop a technique that addresses all these problems. In particular, we determine a subset of D where the iterates also belong. But in this subset the Lipschitz-type constants involved are at least as tight as the ones in D. This modification leads to a finer convergence analysis with advantages (A):
-
(1)
Extended convergence domain.
-
(2)
Tighter error bounds on the distances involved.
-
(3)
More precise information for the location of the solution.
The advantages A are obtained with no additional conditions. We apply our technique on Newton-like procedures [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18]. But it is so general that it can be used to extend the applicability of other iterative procedures along the same lines.
In particular, we extend the results by Cibulka et al in [10] which in turn generalized earlier ones [1,2,3, 9,10,11,12,13,14,15,16,17,18].
2 Convergence
We introduce certain Lipschitz-type conditions to be used in order to compare them. Let \( \beta > 0\) be a given parameter. We use the notation U(x, a), U[x, a] to denote respectively the open and closed balls in \(B_1\) with center x and of radius \(a>0\).
Definition 2.1
Operator \(F'\) is center Lipschitz continuous on D if there exists \(K_0 > 0\) such that
for all \(u\in D.\)
Set
Definition 2.2
Operator \(F'\) is \(1-\)Restricted Lipschitz continuous on \(D_0\) if there exists \(K > 0\) such that
for all \(u\in D_0,\, v=u- F'(u)^{-1}F(u)\in D_0.\)
Definition 2.3
Operator \(F'\) is \(2-\)Restricted Lipschitz continuous on \(D_0\) if there exists \(M > 0\) such that
for all \(u,v\in D_0.\)
Definition 2.4
Operator \(F'\) is Lipschitz continuous on D if there exists \(K_1 > 0\) such that
for all \(u,v\in D.\)
Remark 2.5
(i) By the definition of \(D_0\) in (3), we have
We also assume that
If not, then the results that follow hold with \(K_0\) replacing K. Examples where (7)-(9) are strict can be found in the numerical section and in [4,5,6,7]. Hence, the new technique improves our earlier results too [4,5,6,7].
We suppose that there exists \(x_0\in D\) such that \(F'(x_0)^{-1}\in L(B_2,B_1)\) and
Notice that using (6) and (10) we obtain by the Banach lemma on invertible operators [4, 5, 13] that
for all \(x\in U(x_0, \frac{1}{\beta K_1}).\) But using the weaker and more precise (2) we have
for all \(x\in U(x_0, \frac{1}{\beta K_0}).\) This modification in the proofs of earlier works [1, 2, 8,9,10,11,12,13,14,15,16,17,18] leads to advantages A. It is worth noticing that \(K_0=K_0(F',D),\, K_1=K_1(F',D), \) but \(M=M(F', D_0),\) and \(K=K(F',D_0).\) Notice also that we require (4) to only hold for the Newton iterates and not for all elements in D or \(D_0\) (see also the numerical section).
In order to further emphasize the importance of introducing the center Lipschitz condition and using it instead of the Lipschitz condition to provide tighter upper bounds on the norm of \(\Vert F'(x)^{-1}\Vert ,\) we present a motivational example.
Example 2.6
Let \(B_1=B_2={\mathbb {R}}.\) Moreover, define the function
where \(\alpha _i,\, i=0,1,2,3\) are given real parameters. Then, it follows that for \(\alpha _3\) sufficiently large and \(\alpha _2\) sufficiently small, \(\frac{K_0}{K_1}\) can be arbitrarily small, i.e.\(\frac{K_0}{K_1}\longrightarrow 0.\)
Next, we get the following results for \(D_0=U(x_0, r), r > 0.\)
Theorem 2.7
(Extended semi-local Kantorovich Theorem [10, 13]) Suppose:
- (i)
-
(ii)
\(\Vert F'(x_0)^{-1}F(x_0)\Vert \le \gamma \)
and
-
(iii)
\(\delta = \beta K \gamma r < \frac{1}{2}\) for \(r\ge r_0=\frac{1-\sqrt{1-2\delta }}{\beta K}.\)
Then, there exists a unique sequence \(\{x_n\}\) satisfying Newton procedure
initiated at \(x_0\in D.\) Moreover, this sequence converges to a unique solution \(x^*\in U(x_0,r_0)\) of equation \(F(x)=0.\) Furthermore, the convergence rate is quadratic so that
Proof
Simply replace \(K_1\) by K, (6) by (2) and (3) in the proof of the Kantorovich theorem in [10]. Notice that M can also replace \(K_1\) in this Theorem. But we have \(K\le M\). \(\square \)
Remark 2.8
If \(D_0=D,\) then our Theorem 2.7 reduces to the corresponding one in [10]. But if \(K < K_1,\) then
where
Implications (14) justify the advantages A as stated in the introduction. Hence, the new results are always at least as good as the ones in [10]. It is worth noticing that in practice the computation of the constant \(K_1\) used before requires that of the rest of the Lipschitz constants as special cases. Hence, no additional computational effort or conditions are required to obtain advantages (A). Moreover, in view of (14) our results can hold consistently in cases the earlier ones cannot (see also the numerical Section).
Set \(B_1=B_2={\mathbb {R}}^i\) in the next result. Consider Newton-like procedure
where \(T_n\) are matrices from \(\bar{\partial }F(x_n)\) the Clarke generalized Jacobian of F [11].
Theorem 2.9
(Extended local [10]) Suppose (2) and (3) hold for \(x_0=x^*.\) Moreover, for every \(\epsilon > 0\) there exists \(\alpha > 0\) such that for all \(x\in U(x^*,\alpha )\) and \(T\in \bar{\partial }F(x)\)
Then, There exists a neighborhood U of \(x^*\) such that for each \(x_0\in U\) there exists a sequence \(\{x_n\}\) satisfying (15) so that \(\lim _{n\longrightarrow \infty }x_n=x^*.\) The convergence is supperlinear.
Proof
Simply replace \(K_1\) by K in the proof of Theorem 1.3 in [10]. \(\square \)
Next, we solve the inclusion problem
where F is as before and \(G:B_1\rightrightarrows B_2\) is a set-valued operator whose graph is closed [10]. The Newton-like procedure
shall be used.
Recall that a set-valued operator \(\psi :B_1\rightrightarrows B_2\) is said to be metrically regular at \(x_0\) for \(y_0\) if \(y_0\in \psi (x_0)\) and there exist neighborhoods \(N_1\) of \(x_0\) and \(N_2\) of \(y_0\) and a positive parameter \(\mu \) such that the graph of \(\psi \) denoted by \(gph \psi \) is such that \(gph \psi \cap (N_1\times N_2)\) is closed [11] and
The next result using the concept of metric regularity extends Theorem 3.1 in [10] which generalized Theorem 6C.1 and 6D.2 from [11].
Theorem 2.10
(Semi-local) Suppose:
-
(i)
Let \(r> 0, b> 0, \beta > 0, p\ge 0\) and \(x_0\in B_1,\, y_0\in F(x_0)+G(x_0)\) be such that \(\beta p <1\) and \(\Vert y_0\Vert < (1-\beta p)\min \{\frac{r}{\beta }, b\}.\)
- (ii)
-
(iii)
For each \(w\in U(x_0, r)\)
$$\begin{aligned} x\longrightarrow Q_w(x):=F(x_0)+F'(w)(x-x_0)+G(x) \end{aligned}$$is metrically regular at \(x_0\) for \(y_0\) with constant \(\beta \) and neighborhood \(U(x_0, r)\) and \(U(y_0,b).\)
-
(iv)
\(\Vert F(x)-F(y)-F'(x)(x-y)\Vert \le p\Vert x-y\Vert \) for each \(x\in U(x_0, r), y\in F(x)+G(x).\)
Then, there exists a sequence \(\{x_n\}\) satisfying (18) and convergent \(q-\)superlinearly to a solution \(x^*\) if the inclusion problem (17) (if (2) and (4) are not assumed). Moreover, if (2) and (3) are assumed, then \(\{x_n\}\) converges \(Q-\) quadratically to \(x^*.\)
The rest of the results in [10] can be extended along the same lines.The details are left to the motivated reader.
3 Three Examples
The older convergence criteria are compared with the new ones.
Example 3.1
Let us consider a scalar function h defined on the set \(D=[x_0-(1-\xi ), x_0+1-\xi ] \)for \(\xi \in (0,\frac{1}{2})\) by
Choose \(x_0=r=r_1=1.\) Then, we obtain the estimates \(\eta =\frac{1-\xi }{3},\, \beta =\frac{1}{3}\)
for all \(x\in D, \) so \(K_0=3(3-\xi ),\) \(D_0=U(x_0,\frac{1}{\beta K_0})\cap D=U(x_0, \frac{1}{\beta K_0}),\)
for all \(x,y\in D\) and so \(K_1=6(2-\xi ).\) Notice that for all \(\xi \in (0, \frac{1}{2})\)
Next, set \(y=x-F'(x)^{-1}F(x), x\in D.\) Then, we have
Define function \(\bar{h}\) on the interval \(D=[\xi , 2-\xi ]\) bt
Then, we get by this definition that
where \(q=\root 3 \of {\frac{2\xi }{5}}\) is the critical point of function \(\bar{h}.\) Notice that \(\xi< q < 2-\xi .\) It follows that this function is decreasing on the interval \((\xi ,q)\) and increasing on the interval \((q, 2-\xi ),\) since \(x^2+xq+q^2 > 0\) and \(x^3 > 0.\) So, we can set
Thus, we have
But if \(x\in D_0=[1-\frac{1}{\beta K_0}, 1+\frac{1}{\beta K_0}],\) then
where \(\lambda =\frac{4-\xi }{3-\xi }.\) Then, we also get \(K < M_1.\) Hence, K can replace \(K_1\) or M in all previous results [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18]. Next, we verify existing convergence criteria [1, 2, 9,10,11,12,13,14,15].
for \(\xi \in (0.5,1)\)
But this criterion does not hold for all \(\xi \in (0, \frac{1}{2}).\) Hence, there is no guarantee that Newton’s method converges. In particular, the results in [10] cannot apply.
and those of Theorem 2.6 for \(\xi \in (0,1).\)
Clearly, the new results extend the range of values \(\xi \) for which Newton’s procedure (13) converges.
Next, we present an example to show that our conditions can be used to solve equations in cases where the ones in [1, 2, 9,10,11,12,13,14,15,16,17] cannot.
Example 3.2
Consider \(B_1=B_2=C[0,1]\) with the norm-max. Set \(D=U(x_0,3).\) Define operator G on D by
\(w\in [0,1], \, z\in C[0,1],\) where \(y\in C[0,1]\) is fixed and Q is a Green’s Kernel defined by
Then, the derivative \(G'\) according to Fréchet is defined by
\(w\in [0,1],\, z\in C[0,1].\) Let \(y(w)=x_0(w)=1.\) Then, using (2)-(21), we obtain \(G'(x_0)^{-1}\in L(B_2,B_1),\) \(\Vert I-G'(x_0)\Vert < \frac{3}{8},\, \Vert G'(x_0)^{-1}\Vert \le \frac{8}{5}:=\beta ,\, \gamma =\frac{1}{5},\) \(K_0=\frac{12}{5},\, K_1=\frac{18}{5},\) and \(D_0=U(1,3)\cap U(1,\frac{5}{12})=U(1,\frac{5}{12}),\) so \(M=\frac{3}{2},\) and \(K_0< K_1,\, M < K_1.\) Set \(M=K.\) Then, the old sufficient convergence criterion is not satisfied, since \(\gamma \beta K_1 =\frac{1}{5}\frac{8}{5}\frac{18}{5}=\frac{144}{125}> \frac{1}{2}\) holds. Therefore, there is no guarantee that Newton’s method (14) converges to \(x^*\) under the conditions of the aforementioned references. But our condition hold, since \(\gamma \beta K=\frac{1}{5}\frac{8}{5}\frac{3}{2}=\frac{24}{50} <\frac{1}{2}.\) Hence, the conclusions of Theorem 2.7 follow.
Example 3.3
Consider \(B_1=B_2=C[0,1]\) and \(D=U[0,1].\) Then the boundary value problem (BVP) [4]
can be also given as
where \(\sigma \) is a constant and \(P(t_2,t_1)\) is the Green’s function
Consider \(F:D\longrightarrow B_2\) as
Let us set \(\tau _0(t_2)=t_2\) and \(D=U(\tau _0, \rho _0).\) Then, clearly \(U(\tau _0,\rho _0)\subset U(0, \rho _0+1),\) since \(\Vert \tau _0\Vert =1.\) If \(2\sigma < 5.\) Then,
Hence, \(K_0 < K_1.\)
4 Conclusions
A technique is introduced by which the convergence analysis of Newton methods (13 and (18) is consistently extended under weaker conditions. Researchers and practitioners will always prefer to use these results over the earlier ones, since they are at least as applicable. The same idea can be used on other methods in order to extend their applicability along the same lines. This will be the focus of our future research.
References
Adly, S., H.V. Ngai, and V.V. Nguyen. 2016. Newton’s procedure for solving generalized equations: Kantorovich’s and Smale’s approaches. Journal of Mathematical Analysis and Applications 439: 396–418.
Adly, S., R. Cibulka, and H.V. Ngai. 2015. Newton’s procedure for solving inclusions using set-valued approximations. SIAM Journal on Optimization 25 (1): 159–184.
Aubin, J.-P., and H. Frankowska. 1990. Set Valued Analysis. Boston: Birkháuser.
Argyros, I.K., and S. Hilout. 2010. Inexact Newton-type procedures. Journal of Complex 26 (6): 577–590.
Argyros, I.K. 2008. Convergence and Applications of Newton-Type Iterations. New York: Springer.
Argyros, I.K., and A.A. Magréñan. 2018. A Contemporary Study of Iterative Procedures. New York: Elsevier (Academic Press).
Argyros, I.K., and S. George. 2021. Mathematical Modeling for the Solution of Equations and Systems of Equations with Applications, vol. IV. New York: Nova Publisher.
Behl, R., P. Maroju, E. Martinez, and S. Singh. 2020. A study of the local convergence of a fifth order iterative procedure. Indian Journal of Pure and Applied Mathematics 51 (2): 439–455.
Ciarlet, P.G., and C. Madare. 2012. On the Newton-Kantorovich theorem. Analysis and Applications (Springer) 10: 249–269.
Cibulka, R., A.L. Dontchev, J. Preininger, V. Veliov, and T. Roubai. 2018. Kantorovich-type theorems for generalized equations. Journal of Convex Analysis 25 (2): 459–486.
Dontchev, A.L., and R.T. Rockafellar. 2012. Parameter stability of solutions in models of economic equilibrium. Journal of Convex Analysis 19: 975–997.
Ezquerro, J.A., and M.A. Hernandez. 2018. Newton’s procedure: an updated approach of Kantorovich’s theory, Cham, Switzerland.
Kantorovich, L.V., and G.P. Akilov. 1964. Functional Analysis in Normed Spaces. New York: The Macmillan Co.
Magréñan, A.A., and J.M. Gutiérrez. 2015. Real dynamics for damped Newton’s procedure applied to cubic polynomials. Journal of Computational and Applied Mathematics 275: 527–538.
Magréñan, A.A., I.K. Argyros, J.J. Rainer, and J.A. Sicilia. 2018. Ball convergence of a sixth-order Newton-like procedure based on means under weak conditions. Journal of Mathematical Chemistry 56: 2117–2131. https://doi.org/10.1007/s10910-018-0856-y.
Potra, F.A., and V. Pták. 1984. Nondiscrete induction and iterative processes, Research Notes in Mathematics 103, Boston: Pitman.
Robinson, S.M. 1972. Extension of Newton’s procedure to nonlinear functions with values in a cone. Numerical Mathematics 19: 341–347.
Silva, G.N. 2016. Kantorovich’s theorem on Newton’s procedure for solving generalized equations under the majorant condition. Applied Mathematics and Computation 286: 178–188.
Acknowledgements
We would like to express our gratitude to the reviewers for their constructive criticism of this paper.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of Interest
The authors declare no conflict of interest.
Additional information
Communicated by S Ponnusamy.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Regmi, S., Argyros, I.K., George, S. et al. Kantorovich-type results for generalized equations with applications. J Anal 31, 1191–1200 (2023). https://doi.org/10.1007/s41478-022-00510-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41478-022-00510-1