Abstract
A wide general class of Ostrowski's families without memory proposed by Behl et al. (Int J Comput Math 90(2):408–422, 2013) is extended to solve systems of nonlinear equations. This extension uses multidimensional divided differences of first order. Many new derivative-free iterative families with higher order of local convergence are presented. In addition, the proposed iterative family for \(\alpha _1\in \mathbb {R}-\{0\}\) and \(\alpha _2=0\) contains the fourth- and sixth-order iterative schemes of Grau et al. (J Comput Appl Math 237:363–372, 2013) as special cases. The computational efficiency is compared with that of some known methods, and it is proved that the proposed methods are equally competent with their existing counterparts. Moreover, we present the local convergence analysis of the proposed family of methods based on Lipschitz constants and hypotheses on the divided difference of order one, in the more general setting of a Banach space. This expands the applicability of these methods, since the convergence analysis in Sect. 3 uses higher derivatives even though such derivatives do not appear in the methods themselves. Numerical experiments are performed which support the theoretical results.
1 Introduction
One of the most basic and earliest problems of numerical analysis concerns finding, efficiently and accurately, the approximate solution of a nonlinear system
where \( G(X)=(\widetilde{g}_{1}(X),\;\widetilde{g}_{2}(X), \; \ldots , \;\widetilde{g}_{t}(X))^{T}, \;\;\; X=(x_{1},\; x_{2},\; \ldots ,\; x_{t})^{T}\) and \(G:{\mathbb {R}^{t} \rightarrow \mathbb {R}^{t}}\) is a sufficiently differentiable vector function. Analytical methods for solving such problems are generally non-existent, so one can only obtain approximate solutions by relying on numerical techniques based on iterative procedures. The simplest and most common iterative method for this purpose is Newton's method (Kelley 2003; Traub 1964), which converges quadratically and is defined by
where \({\big \{G'({X^{k}})\big \}}^{-1}\) is the inverse of the first Fréchet derivative \({G'({X^{k}})}\) of the function G. The practice of numerical functional analysis for approximating solutions iteratively is essentially connected to Newton-like methods (Kelley 2003; Traub 1964; Amat et al. 2005, 2008, 2010; Behl et al. 2013; Grau-Sánchez et al. 2014; Ostrowski 1960; Ortega and Rheinboldt 1970; Petković 2011; Sharma and Arora 2014). However, the main practical difficulty associated with this method is the need to calculate the first-order derivative at each step of the computation, which is sometimes very difficult and time consuming.
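As a concrete illustration, the Newton iteration above can be sketched in a few lines of Python (the function names and the toy system are ours, not from the paper); the linear system \(G'(X^{k})\,d=-G(X^{k})\) is solved rather than forming the inverse explicitly:

```python
import numpy as np

def newton_system(G, JG, X0, tol=1e-12, max_iter=50):
    """Newton's method for G(X) = 0; JG(X) returns the Jacobian G'(X).

    Each step solves G'(X^k) d = -G(X^k) and sets X^{k+1} = X^k + d.
    """
    X = np.asarray(X0, dtype=float)
    for _ in range(max_iter):
        d = np.linalg.solve(JG(X), -G(X))
        X = X + d
        if np.linalg.norm(d) < tol:
            break
    return X

# Toy system: x^2 + y^2 - 4 = 0, x*y - 1 = 0
G = lambda X: np.array([X[0]**2 + X[1]**2 - 4.0, X[0] * X[1] - 1.0])
JG = lambda X: np.array([[2 * X[0], 2 * X[1]], [X[1], X[0]]])
root = newton_system(G, JG, [2.0, 0.5])
```

The explicit Jacobian here is exactly the derivative information that the derivative-free schemes below avoid.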
In 2013, Behl et al. (2013) proposed new optimal families of Ostrowski-like methods for solving scalar nonlinear equations, having a cubic scaling factor of functions in the correction factor, given by
where \(\alpha _{1},~\alpha _{2}\in \mathbb {R}\) are chosen such that neither \(\alpha _{1}=0\) nor \(\alpha _{1}=\alpha _{2}\).
In 1964, Traub (1964) introduced the quadratically convergent scheme defined as
where \(Y^{k}=X^{k}+\beta G(X^{k}),\) \(\beta \in \mathbb {R}-\{0\}\), and \([Y^{k},X^{k};G]\) is the first-order divided difference of G in t-dimensional space, a \(t\times t\) matrix with elements
where \(X^{k}=(x_1^{k},\ldots ,x_{j-1}^{k},x_{j}^{k},x_{j+1}^{k},\ldots ,x_t^{k}),Y^{k}=(y_1^{k},\ldots ,y_{j-1}^{k},y_{j}^{k},y_{j+1}^{k},\ldots ,y_t^{k})\) and \(1\le i,j\le t\) (see Grau-Sánchez et al. 2011; Potrá and Pták 1984). For \(\beta =1\), Traub's scheme reduces to Steffensen's method (Steffensen 1933). Inspired by this work, many researchers have recently approximated the derivatives using first-order divided difference operators while preserving the local convergence order of the iterative methods (Argyros et al. 2015; Ezquerro et al. 2015; Ezquerro and Hernández 2009; Grau-Sánchez and Noguera 2011; Grau-Sánchez et al. 2013; Sharma and Arora 2013, 2014). In this study, we construct a higher order generalization to several variables of the given families of Ostrowski's methods (1) using a first-order divided difference operator. For computational purposes, we use another tool to compute \(X^{k+1}\), which is defined as
where \({[X^{k}+G,X^{k}-G;G]}=\big (G(X^{k}+H^{k}e^1)-G(X^{k}-H^{k}e^1),\ldots ,G(X^{k}+H^{k}e^t)-G(X^{k}-H^{k}e^t)\big )\{H^{k}\}^{-1}\) with \(H^{k}=\mathrm{diag}(\widetilde{g}_1(X^{k}),\widetilde{g}_2(X^{k}),\ldots ,\widetilde{g}_t(X^{k})).\)
The notation \(X\pm G\) means \(X\pm G(X)\); we shall use either notation in this paper.
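For illustration, the componentwise divided difference and the central operator \([X+G,X-G;G]\) can be sketched as follows (Python, with invented names; the central operator is obtained here by applying the generic divided difference at the shifted points \(X\pm G(X)\)):

```python
import numpy as np

def divided_difference(Y, X, G):
    """First-order divided difference [Y, X; G] as a t x t matrix whose
    (i, j) entry is
    ( g_i(y_1..y_j, x_{j+1}..x_t) - g_i(y_1..y_{j-1}, x_j..x_t) ) / (y_j - x_j).
    """
    Y, X = np.asarray(Y, float), np.asarray(X, float)
    t = len(X)
    M = np.empty((t, t))
    for j in range(t):
        upper = np.concatenate([Y[:j + 1], X[j + 1:]])  # y_1..y_j, x_{j+1}..x_t
        lower = np.concatenate([Y[:j], X[j:]])          # y_1..y_{j-1}, x_j..x_t
        M[:, j] = (G(upper) - G(lower)) / (Y[j] - X[j])
    return M

def central_operator(X, G):
    """[X+G, X-G; G]: divided difference at the shifted points X +/- G(X)."""
    GX = G(np.asarray(X, float))
    return divided_difference(X + GX, X - GX, G)

# For an affine map G(X) = A X - b, any divided difference reproduces A exactly.
A = np.array([[2.0, 1.0], [1.0, 3.0]])
b = np.array([1.0, 2.0])
G = lambda X: A @ X - b
M = divided_difference(np.array([1.3, 0.4]), np.array([0.9, 0.1]), G)
```

The affine case gives a convenient sanity check, since both operators must then coincide with the matrix A.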
2 Construction of iterative family
We propose the following modification of iterative scheme (1):
Here,
Here \(\alpha _1\) and \(\alpha _2\) are real parameters. From Eq. (5), various multi-step methods can be obtained by taking different values of \(\alpha _1\) and \(\alpha _2\), as follows:
-
(i)
For \(\alpha _1\in \mathbb {R}-\{0\}\) and \(\alpha _2=0\), the first two steps \((i=2)\) of family (5) reduce to:
$$\begin{aligned} \left. \begin{aligned} \psi _1^{k}&=X^{k+1}=X^{k}-[X^{k}+G,X^{k}-G;G]^{-1}G(X^{k}),\\ \psi _2^{k}&=\psi _1^{k}-{\big \{2[\psi _1^{k},X^{k};G]-[X^{k}+G,X^{k}-G;G]}\big \}^{-1}{G(\psi _1^{k})}. \end{aligned} \right\} . \end{aligned}$$This is a fourth-order iterative scheme derived by Grau-Sánchez et al. (2013).
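A minimal sketch of this two-step fourth-order scheme (illustrative Python; helper and variable names are ours, and the divided differences are formed componentwise as in the definition above):

```python
import numpy as np

def dd(Y, X, G):
    """Componentwise first-order divided difference [Y, X; G]."""
    t = len(X)
    M = np.empty((t, t))
    for j in range(t):
        up = np.concatenate([Y[:j + 1], X[j + 1:]])
        lo = np.concatenate([Y[:j], X[j:]])
        M[:, j] = (G(up) - G(lo)) / (Y[j] - X[j])
    return M

def fourth_order_step(X, G):
    """One iteration of the two-step derivative-free scheme:
    psi1 = X - [X+G, X-G; G]^{-1} G(X)
    psi2 = psi1 - {2[psi1, X; G] - [X+G, X-G; G]}^{-1} G(psi1)."""
    GX = G(X)
    C = dd(X + GX, X - GX, G)               # central operator [X+G, X-G; G]
    psi1 = X - np.linalg.solve(C, GX)
    B = 2.0 * dd(psi1, X, G) - C
    return psi1 - np.linalg.solve(B, G(psi1))

# Toy system: x^2 + y^2 - 4 = 0, x*y - 1 = 0
G = lambda X: np.array([X[0]**2 + X[1]**2 - 4.0, X[0] * X[1] - 1.0])
X = np.array([1.9, 0.52])
for _ in range(5):
    if np.linalg.norm(G(X)) < 1e-12:
        break
    X = fourth_order_step(X, G)
```

Note that no Jacobian is supplied; all derivative information comes from the two divided difference matrices.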
-
(ii)
For \(\alpha _1\in \mathbb {R}-\{0\}\) and \(\alpha _2=0\), the first three steps \((i=3)\) of family (5) read as follows:
$$\begin{aligned} \left. \begin{aligned} \psi _1^{k}&=X^{k+1}=X^{k}-[X^{k}+G,X^{k}-G;G]^{-1}G(X^{k}),\\ \psi _2^{k}&=\psi _1^{k}-{\big \{2[\psi _1^{k},X^{k};G]-[X^{k}+G,X^{k}-G;G]}\big \}^{-1}{G(\psi _1^{k})},\\ \psi _3^{k}&=\psi _2^{k}-{\big \{2[\psi _1^{k},X^{k};G]-[X^{k}+G,X^{k}-G;G]}\big \}^{-1}{G(\psi _2^{k})}. \end{aligned} \right\} . \end{aligned}$$This is a sixth-order iterative scheme derived by Grau-Sánchez et al. (2013).
-
(iii)
For \(\alpha _{1}\in \mathbb R-\{0\}\) and \(\alpha _2=0\), the first four steps \((i=4)\) of family (5) reduce to:
$$\begin{aligned} \left. \begin{aligned} \psi _1^{k}&=X^{k+1}=X^{k}-[X^{k}+G,X^{k}-G;G]^{-1}G(X^{k}),\\ \psi _2^{k}&=\psi _1^{k}-{\big \{2[\psi _1^{k},X^{k};G]-[X^{k}+G,X^{k}-G;G]}\big \}^{-1}{G(\psi _1^{k})},\\ \psi _3^{k}&=\psi _2^{k}-{\big \{2[\psi _1^{k},X^{k};G]-[X^{k}+G,X^{k}-G;G]}\big \}^{-1}{G(\psi _2^{k})},\\ \psi _4^{k}&=\psi _3^{k}-{\big \{2[\psi _1^{k},X^{k};G]-[X^{k}+G,X^{k}-G;G]}\big \}^{-1}{G(\psi _3^{k})}. \end{aligned} \right\} . \end{aligned}$$
This is a new eighth-order iterative scheme.
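The three cases above share one pattern: after the first sub-step, the frozen operator \(\big \{2[\psi _1^{k},X^{k};G]-[X^{k}+G,X^{k}-G;G]\big \}^{-1}\) is reused for every further sub-step, each of which raises the order by two. A sketch of this \(\alpha _1\in \mathbb {R}-\{0\}\), \(\alpha _2=0\) special case for an arbitrary number of sub-steps (illustrative Python; names are ours):

```python
import numpy as np

def dd(Y, X, G):
    """Componentwise first-order divided difference [Y, X; G]."""
    t = len(X)
    M = np.empty((t, t))
    for j in range(t):
        up = np.concatenate([Y[:j + 1], X[j + 1:]])
        lo = np.concatenate([Y[:j], X[j:]])
        M[:, j] = (G(up) - G(lo)) / (Y[j] - X[j])
    return M

def family_step(X, G, p=4):
    """One iteration with p sub-steps (local order 2p): p = 2, 3, 4 give
    the fourth-, sixth- and eighth-order schemes above."""
    GX = G(X)
    C = dd(X + GX, X - GX, G)              # [X+G, X-G; G]
    psi = X - np.linalg.solve(C, GX)       # psi_1
    B = 2.0 * dd(psi, X, G) - C            # frozen for all remaining sub-steps
    for _ in range(p - 1):
        psi = psi - np.linalg.solve(B, G(psi))
    return psi

G = lambda X: np.array([X[0]**2 + X[1]**2 - 4.0, X[0] * X[1] - 1.0])
X = np.array([1.9, 0.52])
for _ in range(3):
    if np.linalg.norm(G(X)) < 1e-12:
        break
    X = family_step(X, G, p=4)
```

Because the linear operator is frozen, every extra sub-step costs only one function evaluation and one triangular back-substitution pair if the LU factors of B are reused.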
3 Convergence analysis
We consider the first-order divided difference operator of G on \(\mathbb {R}^{t}\) as a mapping \([\cdot ,\;\cdot ;\;G]: D\times D \subset \mathbb {R}^{t} \times \mathbb {R}^{t} \rightarrow L(\mathbb {R}^{t})\), which is defined by (see Grau-Sánchez and Noguera 2011; Grau-Sánchez et al. 2013)
Developing \(G'(X^{k}+uh)\) in a Taylor series at \(X^{k}\) and integrating, one obtains
Taking into account \(e^{k}~=~X^{k}~-~X^*\), we develop \(G(X^{k})\) and its derivatives in a neighborhood of \(X^*\), where \(X^* \in \mathbb {R}^{t}\) is the solution of the system \(G(X)=0\). Assuming that \(\Gamma =\big \{G'(X^*)\big \}^{-1}\) exists, one has
where \(A_i=\frac{1}{i!}\Gamma G^{(i)}(X^*) \in L_i( \mathbb {R}^{t}, \mathbb {R}^{t}), \quad i=2,3,\ldots \)
From Eq. (9), the derivative of \(G(X^{k})\) can be written as
where I is an identity matrix of order t.
Setting \( \psi _1^{k}=X^{k}+h\) and \(\epsilon _1^{k}=\psi _1^{k}-X^*\), one has \(h=\psi _1^{k}-X^{k}=\epsilon _1^{k}-e^{k}\).
By substituting Eqs. (10)–(12) into Eq. (8), one gets
In our analysis, we consider the central difference operator
which is obtained by replacing \(\epsilon _1^{k}\) with \(e^{k}+G(X)\) and \(e^{k}\) with \(e^{k}-G(X)\) in Eq. (13). The convergence of iterative scheme (5) is established in the following theorem:
Theorem 1
Let \(X^*\in \mathbb {R}^{t}\) be a solution of the system \(G(X)=0\) and let \(G:D \subset \mathbb {R}^{t} \rightarrow \mathbb {R}^{t} \) be sufficiently differentiable in an open neighborhood D of \(X^*\) at which \(G'(X^*)\) is nonsingular. Then, for an initial approximation sufficiently close to \(X^*\), iterative scheme (5) has local order of convergence \(2i\) with error equation
provided that \(\alpha _1 \in \mathbb R-\{0\}\), \(\alpha _2 \in \mathbb R\) and \(\alpha _1 \ne \alpha _2\), where
\(\lambda =\alpha _1(\alpha _1-\alpha _2)(1+\gamma ^2),\;~Q=(\alpha _1^{2}-3\alpha _1\alpha _2+\alpha _2^{2}),\;~P=(2\alpha _1^{2}-4\alpha _1\alpha _2+\alpha _2^{2})\) and \(\gamma =G'(X^*).\)
Proof
We shall prove the theorem by induction.
For \(i=2\), relation (5) reduces to
The inverse operator of Eq. (14) is
Using (9) and (16) in the first step of Eq. (5), one can get the following error equation:
Expanding \(G(\psi _1^{k})\) in a Taylor series around the solution \(X^*\) and using (17), one gets
By substituting Eqs. (13) and (14) in Eq. (6), one can obtain
The second step of Eq. (15) can be rewritten as \(\psi _{2}^{k}-X^*=\psi _1^{k}-X^*-\eta ~G(\psi _1^{k})\),
Substituting the values of \(\epsilon _1^{k}\), \(G(\psi _1^{k})\) and \(\eta \) from Eqs. (17)–(19), respectively, into Eq. (20) yields
Thus, the first two steps of iterative family (5) have fourth order of convergence.
For \(i=n,\) the iterative scheme (5) is written as
Let us assume that scheme (22) has order of convergence \(2n\) for the first n steps, with error equation
Further, for \(i=n+1,\) the iterative family (5) is represented as
Now we shall show that the result holds for \(i=n+1\); i.e., we must prove that iterative method (24) has order of convergence \(2(n+1)\) for the first \(n+1\) steps.
Since the result is assumed to hold for the first n steps, expanding \(G(\psi _n^{k})\) in a Taylor series around the solution \(X^*\) gives
Now, the last step of Eq. (24) can be rewritten as
Substituting the values of \(\eta \), \(\epsilon _n^k\) and \(G(\psi _{n}^{k})\) from Eqs. (19), (23) and (25), respectively, into Eq. (26) and simplifying, one obtains the error equation
which shows that iterative scheme (24) has order of convergence \(2(n+1)\) for the first \(n+1\) steps; that is, the result holds for \(i=n+1\). Hence, by induction, the result holds for all \(i=2,3,4,\ldots \) \(\square \)
4 Computational efficiency
For the estimation of the efficiency of the proposed families, the efficiency index is used. The efficiency of an iterative method is given by \(E=\rho ^{1/C}\) (Ostrowski 1960), where \(\rho \) is the order of convergence and C is the computational cost per iteration. For a system of t nonlinear equations in t variables, the computational cost per iteration is given by
where A(t) denotes the number of evaluations of scalar functions used in the evaluation of G and [X, Y; G], and \(P(t,\ell )\) denotes the number of products needed per iteration. To express the value of \(C(\nu ,t,\ell )\) in terms of products, a ratio \(\nu >0\) between products and evaluations of scalar functions and a ratio \(\ell \ge 1\) between products and quotients are required.
To compute G in any iterative function, we evaluate t scalar functions \((\widetilde{g}_1,\widetilde{g}_2,\ldots ,\widetilde{g}_t)\), and to compute a divided difference [X, Y; G] we evaluate \(t(t-1)\) scalar functions, where G(X) and G(Y) are computed separately. In addition, for the central divided difference operator \([X+G,X-G;G]\), \(t(t+1)\) scalar functions are evaluated. We must add \(t^2\) quotients for any divided difference and \(5t^2\) products for the multiplication of a vector by a scalar. To apply an inverse linear operator, we solve a linear system, which requires \(\frac{t(t-1)(2t-1)}{6}\) products and \(\frac{t(t-1)}{2}\) quotients for the LU decomposition, plus \(t(t-1)\) products and t quotients for the solution of the two triangular linear systems.
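These counts can be collected into a small products-equivalent cost calculator (a sketch; the function names are ours, and one quotient is priced at \(\ell \) products as in this section):

```python
def lu_solve_cost(t, ell):
    """Products-equivalent cost of applying one inverse linear operator
    (solving a t x t system): t(t-1)(2t-1)/6 products and t(t-1)/2
    quotients for the LU decomposition, plus t(t-1) products and t
    quotients for the two triangular solves."""
    products = t * (t - 1) * (2 * t - 1) // 6 + t * (t - 1)
    quotients = t * (t - 1) // 2 + t
    return products + ell * quotients

def divided_difference_cost(t, nu, ell, central=False):
    """Cost of forming [X, Y; G] (or the central operator [X+G, X-G; G]):
    t(t-1) (resp. t(t+1)) scalar-function evaluations, each priced at nu
    products, plus the t^2 quotients of the difference quotients."""
    evaluations = t * (t + 1) if central else t * (t - 1)
    return nu * evaluations + ell * t * t
```

Summing such pieces for one full iteration gives \(C(\nu ,t,\ell )\), from which the efficiency index \(CEI=\rho ^{1/C}\) follows directly.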
For the comparison of the computational efficiencies of the proposed schemes \(\psi _{1}^{k},\psi _{2}^{k},\psi _{3}^{k}\) and \(\psi _{4}^{k}\), of orders of convergence two, four, six and eight, respectively, the efficiency indices are denoted by \(CEI_{i}\) and the computational costs (calculated according to (28)) by \(C_{i}\). Taking the above considerations into account, one has
4.1 Comparison between efficiencies
To compare the iterative families \(\psi _{i},~1\le i\le 4\), the following ratio is defined:
It is clear that if \(R_{i,j}>1\), iterative method \(\psi _{i}\) is more efficient than \(\psi _{j}\). Since the border between two computational efficiencies is given by \(R_{i,j}=1\), this boundary is described by expressing \(\nu \) as a function of \(\ell \) and t, that is, \(\nu =M_{i,j}(\ell ,t)\). Here \(\nu >0\), \(\ell \ge 1\) and t is a positive integer with \(t\ge 2\).
Case 1: Iterative method \(\psi _1\) versus iterative family \(\psi _3\)
The boundary \(R_{3,1}=1\) expressed by \(\nu \) written as a function of \(\ell \) and t is
This function has a vertical asymptote at \(t=-3.70951\). Note that the numerator of Eq. (33) is negative for \(t\ge 25\) and the denominator of Eq. (33) is positive for \(t\ge 2\). Consequently, \(\nu \) is always positive for \(2\le t < 25\) and for all \(\ell \ge 1\).
So, one can have \( CEI_{3}>CEI_{1},~~~~~~\forall ~ \nu >0,~\ell \ge 1~ \& ~2\le t < 25.\)
Case 2: Iterative method \(\psi _1\) versus iterative family \(\psi _4\)
The boundary \(R_{4,1}=1\) expressed by \(\nu \) written as a function of \(\ell \) and t is
This function has a vertical asymptote at \(t=-2\). Note that the numerator of Eq. (34) is negative for \(t>20\) and the denominator of Eq. (34) is positive for \(t\ge 2\). Consequently, \(\nu \) is positive for \(2\le t<20\) and for all \(\ell \ge 1\).
So, one gets \( ~CEI_{4}>CEI_{1},~~~~~~\forall ~ \nu >0,~\ell \ge 1~ \& ~~2\le t<20.\)
Case 3: Iterative family \(\psi _2\) versus iterative family \(\psi _3\)
The boundary \(R_{3,2}=1\) expressed by \(\nu \) written as a function of \(\ell \) and t is
This function has a vertical asymptote at \(t=0.7095\). Note that the numerator of Eq. (35) is negative for \(t\ge 0\) and the denominator of Eq. (35) is positive for \(t>0\). Consequently, \(\nu \) is always negative for \(t\ge 2\) and for all \(\ell \ge 1\).
So, one can get \( ~CEI_{3}<CEI_{2},~~~~~~\forall ~ \nu >0,~\ell \ge 1~ \& ~~t\ge 2.\)
Case 4: Iterative family \(\psi _2\) versus iterative family \(\psi _4\)
The boundary \(R_{4,2}=1\) expressed by \(\nu \) written as a function of \(\ell \) and t is
This function has a vertical asymptote at \(t=1\). Note that the numerator of Eq. (36) is negative for \(t\ge 0\) and the denominator of Eq. (36) is positive for \(t>1\). Consequently, \(\nu \) is always negative for \(t\ge 2\) and for all \(\ell \ge 1\).
So, one can obtain \( ~CEI_{4}<CEI_{2},~~~~~~\forall ~ \nu >0,~\ell \ge 1~ \& ~~t\ge 2.\)
Case 5: Iterative family \(\psi _3\) versus iterative family \(\psi _4\)
The boundary \(R_{4,3}=1\) expressed by \(\nu \) written as a function of \(\ell \) and t is
This function has a vertical asymptote at \(t=1.61413\). Note that the numerator of Eq. (37) is positive for \(t\ge 1\) and the denominator of Eq. (37) is negative for \(t>1\). Consequently, \(\nu \) is always negative for \(t\ge 2\) and for all \(\ell \ge 1\).
So, one can have \( ~CEI_{4}<CEI_{3},~~~~~~\forall ~ \nu >0,~\ell \ge 1~ \& ~~t\ge 2.\)
Theorem 4.1
For all \(\nu >0\) and \(\ell \ge 1\), we have
5 Local convergence
In this section, we present the local convergence analysis of the proposed family of methods, based on Lipschitz constants and hypotheses on the divided difference of order one. In this way, we further expand the applicability of the proposed methods, since higher derivatives were used to show convergence in Sect. 3, although such derivatives do not appear in method (5). The local convergence analysis of method (5) is based on some scalar functions and parameters. This analysis is given for \(G:D \subset B \rightarrow B\), in a more general setting than in the previous sections, since B is a Banach space. Let \(K_0>0,\; K>0, \; c_0>0,\; c>0,\; c_1>0\) and \(p=1,\;2,\; \ldots \) be parameters. Define the function \(g_1\) on the interval \([0,\;\frac{1}{r_0})\) by
where
and parameter \(r_1\) by
Then, we have that \(g_1(r_1)=1\), \(0<r_1<\frac{1}{r_0}\), and for each \(s \in [0,\;r_1)\), \(0 \le g_1(s)<1\). Let \(\alpha _1\) and \(\alpha _2\) be real or complex parameters. Define b and \(b_i,\; i=1,\;2,\;3,\;4,\;5\) by \(b_1=-\alpha _1 ^2-\alpha _1 \alpha _2,\; b_2=2\alpha _1^2+\alpha _2^2-3\alpha _1\alpha _2,\; b_3=-\alpha _1\alpha _2,\; b_4=2\alpha _1\alpha _2-\alpha _2^2,\; b_5=\alpha _1^2+\alpha _2^2-3\alpha _1\alpha _2\) and \(b=b_1+b_2\). Define the functions q and \(h_q \) in the following way:
and
Suppose that
we get by (38) that \(h_q(0)=-1<0\) and \(h_q(s) \rightarrow + \infty \) as \(s \rightarrow \left( \frac{1}{r_0}\right) ^{-}\). It then follows from the intermediate value theorem that the function \(h_q\) has zeros in the interval \((0,\;\frac{1}{r_0})\). Let \(r_q\) be the smallest such zero. Moreover, define functions \(g_i\) and \(h_i\) on the interval \([0,\;r_q)\) for \(i=2,\;3,\; \ldots , \; p\) in the following way:
Then, we have that \(h_i(0)=-1<0\) and \(h_i(s) \rightarrow +\infty \) as \(s \rightarrow r_q^{-}\). Denote by \(r_i,\; i=2,\;3,\; \ldots , \; p\) the smallest zeros of the functions \(h_i\) on the interval \((0,\;r_q)\). Notice that \(h_i(r_{i-1})= c\left( |b_2|+ \frac{|b_4|c}{|b|(1-r_0r_{i-1})(1-q(r_{i-1}))}\right) >0\), which implies that
Define
Then, we have that for each \(s \in [0,\;r^*)\)
Let \(U(\gamma ,\; \rho )\) and \(\bar{U}(\gamma , \; \rho )\) stand, respectively, for the open and closed balls in B with center \(\gamma \in B\) and radius \(\rho >0\). Next, we present the local convergence analysis of method (5) using the preceding notation.
Theorem 2
Let \(G: D \subset B \rightarrow B\) be a continuous operator. Suppose that there exist a divided difference of order one for the operator G, \([\cdot ,\; \cdot ; \;G]:D^2 \rightarrow L(B)\), a point \(X^* \in D\) for which \(G'(X^*)^{-1}\) exists, parameters \(\alpha _1,\; \alpha _2 \in \mathbb {R} \;( \text {or}\; \mathbb {C})\), \(K_0>0,\; K>0,\; c_0>0,\;c>0,\; c_1>0\) and \(p=1,\;2,\;3,\; \ldots \) such that (38) holds, \(b \ne 0\), and for each \(X,\;Y,\;Z \in D\): \(G(X^*)=0\), \(G'(X^*)^{-1} \in L(B)\), \(\left\| G'(X^*)^{-1} \right\| \le c_1\)
and
Then, the sequence generated by method (5) for \(X^0 \in U(X^*,\; r^*)-\{X^*\}\) is well defined, remains in \(U(X^*,\;r^*)\) and converges to \(X^*\). Moreover, the following estimates hold:
for each \(i=1,\;2,\; \ldots , p\), where the \(``g''\) functions are defined previously. Furthermore, for \(T\in \left[ r^*,\;\frac{1}{K_0}\right) \), the limit point \(X^*\) is the only solution of the equation \(G(X)=0\) in \(\bar{U}(X^*,\;T)\cap D.\)
Proof
We shall show that estimate (46) holds with the help of mathematical induction. By the hypothesis \(X^0\in U(X^*,\; r^*)-\{X^*\}\) and by (39), (40) and (41), we get that
Notice that \(\Vert X^0+G-X^*\Vert \le \Vert X^0-X^*\Vert + \Vert G(X^0)-G(X^*)\Vert \le (1+c_0) \Vert X^0-X^*\Vert < (1+c_0)r^*\), so \(X^0+G \in \bar{U}(X^*,\; (1+c_0) r^*) \subset D\). Similarly, we get that \(X^0-G(X^0)\in D\). Then, it follows from (47) and the Banach lemma on invertible operators (Argyros 2008; Argyros and Hilout 2013) that \(\psi _1^0\) is well defined by the first sub-step of method (5) and
We can write by (40) and the first sub step of method (5) that
Using (39), (40) (for \(i=1\)), (42) and (48), we obtain in turn that
which shows (46) for \(k=0,\; i=1\) and \(\psi _1 ^0 \in U(X^*,\; r^*)\). Let us define
and
where the “b” parameters are defined previously. Next, we shall show that \(B_0^{-1} \in L(B)\). Using (39), (40), (41), (44), (50) and (52), and since \(b \ne 0\), we get in turn that
Then, it follows from (53) that
By (44), (48) and (51), we get that
Then, using (39), (40), (50), (54), (55) and the definition of the \(``g''\) functions, we obtain from the second sub step of method (5) that
which shows (46) for \(i=2, \; k=0\) and \(\psi _2^0 \in U(X^*,\; r^*)\). Similarly, we show
until
where \(\mu =g_p(r^*) \in (0,\; 1)\). By simply replacing \(\psi _1^0\), \(\psi _2^0, \; \ldots , \psi _p^0, \; X^0\) by \(\psi _1^m\), \(\psi _2^m, \; \ldots , \psi _p^m, \; X^m\) in the preceding estimates, we complete the induction for (46). Then, in view of the estimate \( \Vert X^{m+1}-X^*\Vert \le \mu \Vert X^m-X^*\Vert \) (see (57)), we deduce that \(\{X^m\}\) converges to \(X^*\) and \(X^{m}\in U(X^*,\; r^*)\) for each \(m=0,\;1,\;2,\; \ldots \) Finally, to show the uniqueness part, let \(H=[X^*, Y^*; G]\), where \(G(Y^*)=0\) and \(Y^*\in \bar{U}(X^*, T)\). Then, using (41), we get that
Hence, \(H^{-1} \in L(B)\). Then, from the identity \(0=G(X^*)-G(Y^*)=H(X^*-Y^*)\), we conclude that \(X^*=Y^*\). \(\square \)
Remark 5.2
-
(a)
If \(B=\mathbb {R}^t\), then Theorem 2 specializes to the case studied in the earlier sections.
-
(b)
The convergence of method (5) in the previous sections was shown using hypotheses on higher order derivatives, which limit the applicability of method (5). In Argyros et al. (2015), we presented some examples where the third or higher derivatives do not exist. In Example 1, we present another such case, for which the analysis of Sect. 3 is not applicable. However, in Theorem 2, we only use hypotheses on the divided difference of order one and on \(G'(X^*)\), which actually appear in method (5). In this way, we expand the applicability of method (5). Moreover, we present a computable radius of convergence and error bounds on the distances involved (see (46)) using only Lipschitz constants.
6 Numerical results
In this section, some numerical problems are considered to illustrate the convergence behavior and computational efficiency of the proposed methods. All computations and CPU-time measurements in the numerical experiments were carried out in the programming package Mathematica 7.1 (Wolfram 2003) with multiple-precision arithmetic with 2048 digits. The CPU time has been calculated with the TimeUsed[] command of Mathematica 7.1. For the comparison of computational efficiencies, the proposed schemes (5) are denoted by \(\psi _{2,1},\psi _{3,1}\) (which are special cases of Grau-Sánchez et al. 2013) and \(\psi _{4,1}\) for \( \alpha _{1}\in \mathbb {R}-\{0\}\) and \(\alpha _2=0\); by \(\psi _{2,2},\psi _{3,2},\psi _{4,2}\) for \( \alpha _1={\pm }\,10^{20}\) and \(\alpha _2={\pm }\,10^{-1000}\); and by \(\psi _{2,3},\psi _{3,3},\psi _{4,3}\) for \( \alpha _1={\pm }\,\sqrt{3}\) and \(\alpha _2={\pm }\,10^{-2000}\). They are compared with existing schemes of fourth order, namely \(M_{4,1},M_{4,2}\) of Sharma and Arora (2013), and of seventh order, \(S_{7}\) (Sharma and Arora 2014). To verify the theoretical order of convergence, we use the computational order of convergence (COC) (Ezquerro and Hernández 2009)
or the approximate computational order of convergence (ACOC) (Ezquerro and Hernández 2009)
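Both quantities are straightforward to evaluate from the iteration history; a sketch (illustrative Python, our names; `coc` uses the known solution, `acoc` only successive iterates):

```python
import numpy as np

def coc(errors):
    """Computational order of convergence from known errors ||X^k - X*||."""
    e = np.asarray(errors, dtype=float)
    return np.log(e[2:] / e[1:-1]) / np.log(e[1:-1] / e[:-2])

def acoc(iterates):
    """Approximate COC from successive differences ||X^{k+1} - X^k||,
    requiring no knowledge of the solution X*."""
    d = np.array([np.linalg.norm(np.subtract(iterates[k + 1], iterates[k]))
                  for k in range(len(iterates) - 1)])
    return np.log(d[2:] / d[1:-1]) / np.log(d[1:-1] / d[:-2])

# a sequence with e_{k+1} = e_k^2 has order two
rho = coc([1e-1, 1e-2, 1e-4, 1e-8])
```

In practice the last two or three entries of these arrays are reported, since early iterates are outside the asymptotic regime.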
Notice that the computation of \(\rho \) or \(\rho ^*\) does not require higher order derivatives. According to the definition of the computational cost (28), an estimation of the factor \(\nu \) is required. To do this, one can express the cost of the evaluation of the elementary functions in terms of products, which depends on the machine, the software and the arithmetic used (Fousse and Hanrot 2007). In the following table, an estimation of the cost of the elementary functions in number of equivalent products is shown, where the running time of one product is measured in milliseconds. For the hardware and software used in the numerical work, the computational cost of a quotient with respect to a product is \(\ell =3\), and the estimated costs are given as follows:
Estimation of the computational cost of elementary functions computed with Mathematica 7.1 on an Intel(R) Core(TM) i5-2430M CPU @ 2.40 GHz (32-bit machine) under Microsoft Windows 7 Ultimate 2009, where \(x=\sqrt{3}-1\) and \(y=\sqrt{5}\)
Digits | \(x*y\) | \(x/y\) | \(\sqrt{x}\) | \(\exp(x)\) | \(\ln(x)\) | \(\sin(x)\) | \(\cos(x)\) | \(\arccos(x)\) | \(\arctan(x)\)
---|---|---|---|---|---|---|---|---|---
2048 | \(0.0301\,\mathrm{ms}\) | 3 | 1.5 | 77 | 78 | 78 | 77 | 119 | 118
Example 1
As a motivational example, define function F on \(\mathbb {X}=\mathbb {Y}=\mathbb {R}\), \(D=[-\frac{1}{\pi },\;\frac{2}{\pi }]\) by
Then, we have that
and
One can easily see that the function \(F'''(x)\) is unbounded on D at the point \(x=0\). Therefore, the results before Sect. 5 cannot be applied to show the convergence of method (5). In particular, hypotheses on the third or even higher derivatives of the function F are assumed to prove convergence of method (5) in Sect. 3. However, according to this section, we need only hypotheses on the first order. Moreover, we have
and the required zero is \(X^*=\frac{1}{\pi }\). The different radii of convergence, COC (\(\rho \)) and n are reported in Table 1.
Other such examples can be found in Argyros et al. (2015).
Example 2
Consider the mixed Hammerstein integral equation (see Ortega and Rheinboldt (1970), pp. 19–20):
\(x(s)=1+\frac{1}{5}\int _{0}^{1} G(s,t)x(t)^3\mathrm{d}t\) where \(x\in C[0,1];s,t\in [0,1]\) and the kernel G is
We transform the above equation into a finite-dimensional problem using the Gauss–Legendre quadrature formula \(\int _{0}^{1} f(t)\mathrm{d}t\simeq \sum _{j=1}^{8} w_jf(t_j)\). Denoting the approximations of \(x(t_i)\) by \(x_i~ (i=1,2,\ldots ,8)\), one gets the system of nonlinear equations \(5x_i-5-\sum _{j=1}^{8} a_{ij}x_j^3=0,\) \(i=1,2,\ldots ,8,\)
where the abscissas \(t_j\) and the weights \(w_j\) are known and given in the following table for \(t=8\).
Abscissas and weights of Gauss–Legendre quadrature formula for \(t=8\)
j | \(t_j\) | \(w_j\) |
---|---|---|
1 | \(0.01985507175123188415821957\ldots \) | \(0.05061426814518812957626567\ldots \) |
2 | \(0.10166676129318663020422303\ldots \) | \(0.11119051722668723527217800\ldots \) |
3 | \(0.23723379504183550709113047\ldots \) | \(0.15685332293894364366898110\ldots \) |
4 | \(0.40828267875217509753026193\ldots \) | \(0.18134189168918099148257522\ldots \) |
5 | \(0.59171732124782490246973807\ldots \) | \(0.18134189168918099148257522\ldots \) |
6 | \(0.76276620495816449290886952\ldots \) | \(0.15685332293894364366898110\ldots \) |
7 | \(0.89833323870681336979577696\ldots \) | \(0.11119051722668723527217800\ldots \) |
8 | \(0.98014492824876811584178043\ldots \) | \(0.05061426814518812957626567\ldots \) |
In addition, \((t,\nu )=(8,11)\) are the values used in Eqs. (29)–(32). The convergence of the methods towards the root
\(X^*=(1.00209624503115679\ldots ,1.00990031618748877\ldots ,1.01972696099317687\ldots \), \(1.02643574303062052\ldots ,1.02643574303062052\ldots ,1.01972696099317687\ldots \), \(1.00990031618748877\ldots ,1.00209624503115679\ldots )^{T}\) is tested in Table 2.
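This discretization can be reproduced as follows. The sketch assumes the standard Green's-function kernel \(G(s,t)=(1-s)t\) for \(t\le s\) and \(s(1-t)\) for \(s\le t\) (the kernel formula itself is not reproduced in this excerpt), forms \(a_{ij}=w_j\,G(t_i,t_j)\), and solves the resulting system with a plain Newton iteration for reference:

```python
import numpy as np

# 8-point Gauss-Legendre rule shifted to [0, 1] (matches the tabulated t_j, w_j)
nodes, weights = np.polynomial.legendre.leggauss(8)
tj = (nodes + 1.0) / 2.0
wj = weights / 2.0

def kernel(s, t):
    # assumed Green's-function kernel of the Hammerstein equation
    return np.where(t <= s, (1.0 - s) * t, s * (1.0 - t))

a = wj * kernel(tj[:, None], tj[None, :])          # a_ij = w_j G(t_i, t_j)

def F(x):
    """Discretized system 5 x_i - 5 - sum_j a_ij x_j^3."""
    return 5.0 * x - 5.0 - a @ x**3

def JF(x):
    """Jacobian: 5 I - 3 a_ij x_j^2."""
    return 5.0 * np.eye(8) - 3.0 * a * (x**2)[None, :]

x = np.ones(8)                                     # initial guess
for _ in range(20):
    x += np.linalg.solve(JF(x), -F(x))
```

With this kernel the computed components reproduce the symmetric root quoted above, which supports the assumed kernel.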
Example 3
(see Sharma and Arora 2013)
To calculate the computational cost and efficiency indices, the values \((t,\nu )=(2,120)\) are used in Eqs. (29)–(32). The convergence of the methods towards the root \(X^*=(1.271384307950131633\ldots \), \(0.88081907310266102\ldots )^{T}\) is tested in Table 3.
Example 4
(see Grau et al. 2007) Consider the following boundary value problem:
Further, assume the partition of the interval [0, 1], which is defined as follows:
Let us define \(y_0=y(x_0)=0,\; y_1=y(x_1),\; \ldots , \; y_{n-1}=y(x_{n-1}),\; y_n=y(x_n)=1.\)
The following discretization for the second derivative is used:
which reduces to a system of nonlinear equations of order \(n-1\):
The solution of this system \(X^*=(0.0207113\ldots ,0.0414227\ldots \),\(0.0621341\ldots \), \(0.0828453\ldots ,0.1035564\ldots ,0.1242670\ldots \),\(0.1449769\ldots ,0.1656856\ldots \), \(0.1863926\ldots ,0.2070970\ldots ,0.2277981\ldots 0.2484946\ldots ,0.2691852\ldots \), \(0.2898683\ldots ,0.3105421\ldots ,0.3312043\ldots ,0.3518526\ldots ,0.3724841\ldots \), \(0.3930958\ldots ,0.4136841\ldots ,0.4342452\ldots ,0.4547747\ldots ,0.4752682\ldots ,0.4957203\ldots \), \(0.5161257\ldots ,0.5364781\ldots ,0.5567712\ldots ,0.5769980\ldots ,0.5971509\ldots ,0.6172219\ldots \), \(0.6372025\ldots ,0.6570837\ldots ,0.6768557\ldots ,0.6965086\ldots ,0.7160316\ldots ,0.7354134\ldots \), \(0.7546422\ldots ,0.7737059\ldots ,0.7925915\ldots ,0.8112857\ldots ,0.8297745\ldots ,0.8480437\ldots \), \(0.8660785\ldots ,0.8838634\ldots ,0.9013829\ldots ,0.9186208\ldots ,0.9355607\ldots ,0.9521858\ldots \), \(0.9684789\ldots ,0.9844228\ldots )^T\) by taking \(n=51\) and the values \((t,\nu )=(50,4)\) used in Eqs. (29)–(32) are tested in Table 4.
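The discretization pattern can be sketched for a generic second-order problem \(y''=f(x,y)\), \(y(0)=0\), \(y(1)=1\) (the specific ODE of this example is not reproduced in this excerpt, so an illustrative right-hand side \(f(x,y)=y^{3}\) is used; all names are ours):

```python
import numpy as np

def solve_bvp_fd(f, dfdy, n, y0=0.0, y1=1.0, tol=1e-12):
    """Finite-difference discretization of y'' = f(x, y), y(0)=y0, y(1)=y1,
    on the uniform mesh x_i = i*h, h = 1/n, solved by Newton's method on
    the (n-1)-dimensional system
        y_{i-1} - 2 y_i + y_{i+1} - h^2 f(x_i, y_i) = 0.
    """
    h = 1.0 / n
    x = np.linspace(0.0, 1.0, n + 1)
    y = y0 + (y1 - y0) * x                       # linear initial guess
    for _ in range(50):
        Y = y[1:-1]
        F = y[:-2] - 2 * Y + y[2:] - h * h * f(x[1:-1], Y)
        J = (np.diag(np.full(n - 1, -2.0) - h * h * dfdy(x[1:-1], Y))
             + np.diag(np.ones(n - 2), 1) + np.diag(np.ones(n - 2), -1))
        d = np.linalg.solve(J, -F)
        y[1:-1] += d
        if np.linalg.norm(d) < tol:
            break
    return x, y

# illustrative nonlinear right-hand side (not the ODE of the cited example)
f = lambda x, y: y**3
dfdy = lambda x, y: 3 * y**2
x, y = solve_bvp_fd(f, dfdy, n=50)
```

The same tridiagonal structure arises for any right-hand side f, so the derivative-free schemes of Sect. 2 can replace the Newton inner solver unchanged.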
Example 5
(see Grau-Sánchez et al. 2013)
where \((t,\nu )=(20,119)\) are the values used in Eqs. (29)–(32). Solution of this problem is \(X^*=(0.08266851975958913\ldots , 0.08266851975958913\ldots ,\dots \),\(0.08266851975958913\ldots )^{T}\) and comparisons of the method are displayed in Table 5.
Example 6
Consider the gravity-flow discharge chute problem (see Burden and Faires (2014), p. 646):
where \(v_i^2=v_0^2+2gi\Delta y-2\mu \Delta y\sum _{j=1}^{20}\frac{1}{\cos x_{j}}\), \(1\le i\le 20\), and \(w_{i}=-\Delta y\, v_{i}\sum _{j=1}^{20}\frac{1}{v_{j}^3\cos x_{j}}\), \(1\le i\le 20.\)
Here, \(v_0=0\) is the initial velocity of the granular material, \(X=2\) the x-coordinate of the end of the chute, \(\mu =0\) the friction force, \(g=32.17\,\mathrm{ft/s^2}\) the gravitational acceleration, and \(\Delta y=0.2\) the step size. The solution of this system, \(X^*=(0.14062\ldots ,0.19954\ldots ,0.24522\ldots , 0.28413\ldots ,0.31878\ldots ,0.35045\ldots \), \(0.37990\ldots ,0.40763\ldots ,0.43398\ldots ,0.45920\ldots \), \(0.48348\ldots ,0.50697\ldots \), \(0.52980\ldots ,0.55205\ldots ,0.57382\ldots ,0.59516\ldots \), \(0.61615\ldots ,0.63683\ldots \), \(0.65726\ldots ,0.67746\ldots )^T\), and the values \((t,\nu )=(20,84.8)\) used in Eqs. (29)–(32) are tested in Table 6.
In Tables 2, 3, 4, 5 and 6, \(\Vert X^{(k)}-X^*\Vert \) shows the errors of the approximations to the corresponding solutions of Examples 2–6, \(\rho \) the computational order of convergence, \(C_i\) the computational costs given by Eqs. (29)–(32) in terms of products, and CEI the computational efficiencies, where \(\widetilde{b}(-\,a)\) denotes \(\widetilde{b} \times 10^{-a}\). The numerical results in Tables 2, 3, 4, 5 and 6 demonstrate that the proposed methods work more efficiently, with smaller errors, than the existing methods \(M_{4,1},~M_{4,2}\) and \(S_7\). In addition, the higher order methods work not only on simple experiments but also on application-oriented problems, as shown in Examples 4 and 6.
7 Concluding remarks
In this work, we have proposed several families of Ostrowski's method for solving nonlinear systems. The new families are completely derivative-free and are therefore suited to problems in which derivatives require lengthy computations. An inverse first-order divided difference operator for multivariable functions is developed and applied to prove the convergence order of the proposed methods. Moreover, the fourth- and sixth-order methods proposed by Grau-Sánchez et al. (2013) are recovered as special cases of the presented families. Further, the computational efficiency index is used to compare the efficiency of the different proposed families. Computational results have confirmed the robust and efficient character of the proposed families. We have also presented a local convergence analysis based on divided differences of order one and Lipschitz constants. In this way, we expand the applicability of method (5), since in Sect. 3 we had to use hypotheses on higher order derivatives, which may not exist (Argyros et al. 2015). Numerical experiments have also been carried out on a number of additional problems, with results on a par with those presented here. Thus, the new methods are very suitable and applicable for solving nonlinear systems.
References
Amat S, Busquier S, Plaza S (2005) Dynamics of the King and Jarratt iterations. Aequationes Math 69(3):212–223
Amat S, Busquier S, Plaza S (2010) Chaotic dynamics of a third-order Newton-type method. J Math Anal Appl 366(1):24–32
Amat S, Hernández MA, Romero N (2008) A modified Chebyshev’s iterative method with at least sixth order of convergence. Appl Math Comput 206(1):164–174
Argyros IK (2008) Convergence and application of Newton-type iterations. Springer, New York
Argyros IK, George S, Magréñan ÁA (2015) Local convergence for multi-point-parametric Chebyshev–Halley–type methods of high convergence order. J Comput Appl Math 282:215–224
Argyros IK, Hilout S (2013) Numerical methods in nonlinear analysis. W. Sci. Publ. Comp, New Jersey
Behl R, Kanwar V, Sharma KK (2013) Optimal equi-scaled families of Jarratt’s method. Int J Comput Math 90:408–422
Burden RL, Faires JD (2014) Numerical analysis, 9th edn. Cengage Learning Publishing Company, Boston
Ezquerro JA, Hernández MA (2009) New iterations of R-order four with reduced computational cost. BIT Numer Math 49:325–342
Ezquerro JA, Grau-Sánchez M, Hernández-Verón MA, Noguera M (2015) A family of iterative methods that uses divided differences of first and second orders. Numer Algorithms 70(3):571–589
Grau-Sánchez M, Grau À, Noguera M (2011) Ostrowski type methods for solving systems of nonlinear equations. Appl Math Comput 218:2377–2385
Grau-Sánchez M, Noguera M (2011) Frozen divided difference scheme for solving system of nonlinear equations. J Comput Appl Math 235:1739–1743
Grau-Sánchez M, Noguera M, Amat S (2013) On the approximation of derivatives using divided difference operators preserving the local convergence order of iterative methods. J Comput Appl Math 237:363–372
Grau-Sánchez M, Noguera M, Diaz-Barrero JL (2014) On the local convergence of a family of two-step iterative methods for solving nonlinear equations. J Comput Appl Math 255:753–764
Grau M, Peris JM, Gutiérrez JM (2007) Accelerated iterative methods for finding solutions of a system of nonlinear equations. Appl Math Comput 190(2):1815–1823
Fousse L, Hanrot G, Lefèvre V, Pélissier P, Zimmermann P (2007) MPFR: a multiple-precision binary floating-point library with correct rounding. ACM Trans Math Softw 33(2):1–14
Kelley CT (2003) Solving nonlinear equations with Newton’s method. SIAM, Philadelphia
Ortega JM, Rheinboldt WC (1970) Iterative solution of nonlinear equations in several variables. Academic, New York
Ostrowski AM (1960) Solution of equations and systems of equations. Academic, New York
Petković MS (2011) Remarks on a general class of multipoint root finding methods of high computational efficiency. SIAM J Numer Anal 49:1317–1319
Potra FA, Pták V (1984) Nondiscrete induction and iterative processes. Pitman Publishing, Boston
Sharma JR, Arora H (2013) An efficient derivative free iterative method for solving systems of nonlinear equations. Appl Anal Discret Math 7:390–403
Sharma JR, Arora H (2014) A novel derivative free algorithm with seventh-order convergence for solving systems of nonlinear equations. Numer Algorithms 67:917–933
Steffensen JF (1933) Remarks on iteration. Skand Aktuar Tidskr 16:64–72
Traub JF (1964) Iterative methods for the solution of equations. Prentice-Hall, Englewood Cliffs
Wolfram S (2003) The Mathematica book, 5th edn. Wolfram Media, USA
Acknowledgements
The author, Sanjeev Kumar, is extremely thankful to Thapar Institute of Engineering and Technology, Patiala, India, for the financial support under Grant TU/DORS/57/157.
Communicated by Andreas Fischer.
Bhalla, S., Kumar, S., Argyros, I.K. et al. A family of higher order derivative free methods for nonlinear systems with local convergence analysis. Comp. Appl. Math. 37, 5807–5828 (2018). https://doi.org/10.1007/s40314-018-0663-x
Keywords
- System of nonlinear equations
- Order of convergence
- Steffensen’s method
- Computational efficiency
- Derivative free methods