1 Introduction

Let a system of nonlinear point (with real coefficients) equations be given

$$\begin{aligned} F(x)=0, \end{aligned}$$
(1)

where \(F(x)=\bigl (F_1(x),F_2(x),\ldots ,F_n(x)\bigr )^{\top }\), \(x=(x_1,x_2,\ldots ,x_n)^{\top }\).

To apply interval methods, the formulation of the problem of solving systems of nonlinear point (1) is given in the following form:

$$\begin{aligned} \left[ \begin{array}{c} \textit{For each solution of the system of equations}\\ F(x)=0\\ \textit{on a given set D}\subseteq \mathbb {R}^n \textit{find guaranteed two-sided}\\ \textit{boundaries (if possible, the mast accurate)} \end{array}\right] \end{aligned}$$
(2)

And the problem of outer estimation for interval systems is formulated as follows:

for systems of equations of the form

$$\begin{aligned} F(\varvec{a},x)=0, \end{aligned}$$
(3)

\(F(\varvec{a},x)=\bigl (F_1(\varvec{a},x),F_2(\varvec{a},x),\ldots ,F_n(\varvec{a},x)\bigr )^{\top }, \;\;\varvec{a}\in \mathbb{I}\mathbb{R}^l\), find the outer interval estimate for the united set of solutions

$$\begin{aligned} \Xi _{uni}(F,\varvec{a})=\{x\in \mathbb {R}^n\;|\;(\exists a\in \varvec{a})\bigl (F(a,x)=0\bigr )\}, \end{aligned}$$
(4)

within the interval box \(\varvec{X}\in \mathbb{I}\mathbb{R}^n\).

Nonlinear systems of equations and interval methods for solving them are described in detail in monographs [1,2,3,4,5,6,7]. Currently, there are many interval iterative methods for solving nonlinear point systems of equations, the most popular of which are Newton’s method, Krawczyk’s method [8] and the Hansen-Sengupta method [9]. These methods are considered classical and fundamental because many recently developed methods use them as a component or are based on their ideas (we refer to these methods as classical interval methods below). Other interval methods have also been developed, for example, T. Eftekhari [10] proposes an interval analogue of the classical Ostrovsky method. In [11], S. Singh et al. develop higher order interval multi-step methods. B. Kubica in [12] presents a multi-threaded interval solver for subdetermined and well-defined nonlinear systems, where some options are proposed for ensuring consistency of boxes and eliminating initial boxes. E. Hansen et al. [13] offer a test for the existence of solutions to a nonlinear point system, where sufficient conditions for the existence of a solution for the Krawczyk method are checked. L. Kolev in [14] proposes a linearization method using modified affine arithmetic, which provides a more optimal shell compared to the well-known linear interval form [15].

It should be noted that most of the developed interval methods are intended for solving point nonlinear systems (1), and methods for estimating sets of solutions to interval systems of the form (3) are not yet sufficiently developed.

In this work, we limit ourselves to considering some special version of the equations, which is a special case of system (3):

$$\begin{aligned} F(a,x)=\varvec{b}, \end{aligned}$$
(5)

where \(F(a,x)=\bigl (F_1(a,x),F_2(a,x),\ldots ,F_n(a,x)\bigr )^{\top }\), \(x=(x_1,x_1,\ldots ,x_n)^{\top }\in \varvec{X}\in \mathbb{I}\mathbb{R}^n\), \(a\in \mathbb {R}\), \(\varvec{b}\in \mathbb{I}\mathbb{R}\), that the coefficients on the left side are point, and the coefficients on the right side are interval and the solutions are obtained in interval form.

Systems of the form (5) often arise in tolerance problems, optimization, and also in practice, for example, in calculating the parameters of electrical networks, when the parameters of the right side of the system are specified with limited errors in the form of intervals.

The set of solutions (4) of interval systems of equations in the general case geometrically form heterogeneous, complexly structured objects. Accurately describing such objects is an NP-hard problem [16, 17]. Therefore, such problems are reduced to interval estimation problems, i.e. tasks with less complexity of description. For clarity, we illustrate the solution areas and their approximate interval estimates (Fig. 1) for problems (2) and (5) using the following point and partially interval systems as examples, respectively:

$$\begin{aligned} \begin{array}{ccc} \left\{ \begin{array}{c}\cos (x_1)+x_2=1,\\ \frac{x_1^2}{3}-\frac{x_1^2}{3}=1, \end{array}\right. & \textrm{and} & \left\{ \begin{array}{c}\cos (x_1)+x_2=[1-r,1+r],\\ \frac{x_1^2}{3}-\frac{x_1^2}{3}=[1-r,1+r], \end{array}\right. \end{array} \end{aligned}$$
(6)

where r is the radius of the interval.

Fig. 1
figure 1

Solutions and their approximate interval estimates for systems (6): a) point case; b) partial-interval case

To solve problem (2), you can use classical interval methods, but with sufficiently small sizes of the initial box. To find global solutions, it becomes necessary to use constraint propagation methods [12]. In classical (real) numerical analysis there are almost no developed methods for solving this problem. Commonly used approaches with limited success include analytical research, multistart, and continuation methods [18].

The fundamental difference between interval systems of equations and point systems is that in the interval case, outer estimates of solution sets obtained using classical interval methods can have noticeable or even large errors, differing significantly from optimal (exact) outer estimates. This coarsening is naturally related to the width of the interval parameters. In addition, due to the increase in the size of decision sets due to interval coefficients, the feasible region for classical interval methods will become incompatible with the region of decision sets. Therefore, these methods lose their applicability to systems (3) or (5).

Based on these considerations, in this paper we propose a combined vertex method that works regardless of the size of the solution sets of interval systems.

The structure of the article is organized in the following order. The basic facts and notations necessary for the article are given in Section 2. Section 3 provides an analysis of the theoretical and practical aspects of the Newton and Krawczyk methods. The proposed method and its algorithm are presented in Section 4. The results of a numerical experiment and a comparative analysis of the methods under consideration are given in Section 5. In the final Section 6, conclusions on this study are drawn.

2 Preliminaries

An informal international standard has been adopted for notation in interval analysis; the latest modified version of this text is available on the Internet and published in the article [19]. According to this standard, all interval quantities in the text of the article are highlighted in italic bold mathematical font, and point (real, non-interval) quantities are not specially highlighted in any way.

The interval number \(\varvec{a}=[\underline{a},\overline{a}]\subset \mathbb {R}\) is defined as the set of all numbers of the real axis located between the given boundaries \(\underline{a}\) and \(\overline{a}\) including them, i.e.

$$\begin{aligned} \varvec{a}=[\underline{a},\overline{a}]:=\{x\in \mathbb {R}\;|\; \underline{a}\le x\le \overline{a}\}. \end{aligned}$$

Next we use some interval characteristic functions: width: \(\text {wid}(\varvec{a})=\overline{a}-\underline{a}\); radius: \(\text {rad}(\varvec{a})=\frac{1}{2}(\overline{a}-\underline{a})=\frac{1}{2}\text {wid}(\varvec{a})\); Middle (center): \(\text {mid}(\varvec{a})=\frac{1}{2}(\underline{a}+\overline{a})\); Absolute value (modulus or magnitude): \(|\varvec{a}|=\max \{|a|\;|\;a\in \varvec{a}\}=\max \{|\underline{a}|,|\overline{a}|\}\).

Definition 1

Let \(\varvec{a},\varvec{b}\in \mathbb{I}\mathbb{R}\). Then the distance between intervals \(\varvec{a}\) and \(\varvec{b}\) is defined as:

$$\begin{aligned} \texttt {dist}(\varvec{a},\varvec{b}):=\max \{|\underline{a}-\underline{b}|, |\overline{a}-\overline{b}|\}. \end{aligned}$$

Definition 2

Let \(\varvec{x},\varvec{y}\in \mathbb{I}\mathbb{R}^n\). Then the metric on a multidimensional interval space for vectors \(\varvec{x}\) and \(\varvec{y}\) is defined as:

$$\begin{aligned} \texttt {dist}(\varvec{x},\varvec{y}):=\max \{\Vert \underline{x}-\underline{y}\Vert , \Vert \overline{x}-\overline{y}\Vert \}, \end{aligned}$$

where \(\Vert \cdot \Vert \) is the absolute vector norm in \(\mathbb {R}^n\).

3 Analysis of interval iterative methods for solving point systems

In this section we discuss the application of Newton and Krawczyk methods for solving point systems of nonlinear equations according to problem (2).

3.1 Newton’s interval method

Suppose that system (1) with n equations and n unknowns is given on the box \(\varvec{X}\). Let \(\varvec{L}\) be the interval Lipschitz matrix of the mapping F onto \(\varvec{X}\). In particular, \(\varvec{L}\) can be an interval extension of the Jacobian of the mapping F onto \(\varvec{X}\). Then for any points of \(x,\tilde{x}\in \varvec{X}\) the following representation holds:

$$\begin{aligned} F(x)\in F(\tilde{x})+\varvec{L}(x-\tilde{x}). \end{aligned}$$

In particular, if \(x=x^*\) is a solution to the system of equations (1), i.e. \(F(x^*)=0\), then

$$\begin{aligned} 0\in F(\tilde{x})+\varvec{L}(x^*-\tilde{x}. \end{aligned}$$
(7)

We present Beck’s criterion for the united set of solutions of interval systems of linear algebraic equations (ISLAE):

Theorem 1

(Beeck [20]) Let \(\varvec{A}\in \mathbb{I}\mathbb{R}^{m\times n}\) and \(\varvec{b}\in \mathbb{I}\mathbb{R}^{m}\) be given. The vector \(\tilde{x}\in \mathbb {R}^n\) belongs to the united set of solutions \(\Xi (\varvec{A},\varvec{b})\) of the interval system of linear algebraic equations \(\varvec{A}x=\varvec{b}\) if and only if \(\varvec{A}\tilde{x}\cap \varvec{b}\ne \varnothing \; \Leftrightarrow \; 0\in \varvec{A}\tilde{x}- \varvec{b}\).

It follows that the point \(x^*\) satisfies inclusion (3) if and only if it belongs to the united set of solutions of the interval linear system

$$\begin{aligned} \varvec{L}(x-\tilde{x})=- F(\tilde{x}). \end{aligned}$$

Let Encl [6] be a procedure for outer estimation of the set of solutions of the interval linear system \(\varvec{A}x=\varvec{b}\), it is easy to show that the following relation \(\Xi (\varvec{A},\varvec{b})\subseteq \textit{Encl}(\varvec{A},\varvec{b})\) holds. Then the inclusion

$$\begin{aligned} x^*-\tilde{x}\in \textit{Encl}\bigl (\varvec{L},-F(\tilde{x})\bigr ) \quad \Rightarrow \quad x^*\in \tilde{x}-\textit{Encl}\bigl (\varvec{L},F(\tilde{x})\bigr ), \end{aligned}$$

because \(\textit{Encl}\bigl (\varvec{L},-F(\tilde{x})\bigr )= -\textit{Encl}\bigl (\varvec{L},-F(\tilde{x})\bigr )\).

Here, it is preferable to use the Gauss-Seidel method as the Encl procedure, since this method does not require the regularity of the Jacobi matrix. Even if the Jacobian matrix is singular and the solution set is unlimited, the algorithm gives the result on a finite box.

Let the interval Lipschitz matrix \(\varvec{L}\in \mathbb{I}\mathbb{R}^{n\times n}\) be known for the mapping \(F:\mathbb {R}\supseteq D\rightarrow \mathbb{I}\mathbb{R}^n\). Then the mapping \(\mathcal {N}:\mathbb{I}\mathbb{D}\times \mathbb {R} \rightarrow \mathbb{I}\mathbb{R}^n\), given by rule

$$\begin{aligned} \mathcal {N}\bigl (\varvec{X},\tilde{x}\bigr ):=\tilde{x}-\textit{Encl}\bigl (\varvec{L},F(\tilde{x})\bigr ), \end{aligned}$$
(8)

defines the interval Newton operator on \(\mathbb{I}\mathbb{D}\) relative to the point \(\tilde{x}\). In this case, the iterative scheme of Newton’s interval method is determined by the formula:

$$\begin{aligned} \left\{ \begin{array}{l} \varvec{X}^{(0)}\leftarrow \varvec{X}\\ \varvec{X}^{(k+1)}\leftarrow \varvec{X}^{(k)}\cap \mathcal {N}\bigl (\varvec{X}^{(k)},\tilde{x}^{(k)}\bigr ),\quad \tilde{x}^{(k)}\in \varvec{X}^{(k)},\quad k=0,1,2,\ldots \end{array}\right. \end{aligned}$$
(9)

As a consequence, if \(\varvec{X}\cap \mathcal {N}\bigl (\varvec{X},\tilde{x}\bigr )=\varnothing \), then the box \(\varvec{X}\) contains no solutions to the system \(F(x)=0\). If for some \(\tilde{x}\) the inclusion \(\mathcal {N}\bigl (\varvec{X},\tilde{x}\bigr )\subseteq \varvec{X}\) holds, then according to Brouwer’s theorem there is exactly one solution to the system \(F(x)=0\) in the box \(\varvec{X}\). The most unfavorable situation when operating Newton’s interval method is the appearance of an inclusion

$$\begin{aligned} \mathcal {N}\bigl (\varvec{X},\tilde{x}\bigr )\supseteq \varvec{X}. \end{aligned}$$
(10)

In case (10), all subsequent steps are looped on the box \(\varvec{X}\) and do not provide any additional information about the required solutions of the system.

3.1.1 Discussion on the application of Newton’s method

Here we discuss some issues on the application of Newton’s interval method.

1. Why does Newton’s interval method not fully correspond to the point version?

In the point (real) case, the calculation formula of Newton’s method for solving a system of equations is usually written in the form

$$\begin{aligned} X^{(k+1)}\leftarrow X^{(k)}-\bigl (J(X^{(k)})\bigr )^{-1}\cdot F(X^{(k)}), \end{aligned}$$

where the matrix appears, inverse to the Jacobian \(J(X^{(k)})\) of the mapping F at point \(X^{(k)}\). But we do not need this inverse matrix itself “in its pure form”, but only its product by the vector \(F(X^{(k)})\), which, when implemented, is more convenient to represent as a solution to a system of linear algebraic equations with the matrix \(J(X^{(k)})\). This results in a gain in labor intensity and better accuracy of the result. These arguments remain fully valid in the interval case and, in addition, another good reason is added to them. Writing an interval estimate of the values of the product \(\bigl (J(X^{(k)})\bigr )^{-1}\cdot F(X^{(k)})\) through outer evaluation of the set of ISLAE solutions reduces the coarsening of the result. That is why the calculation formulas of Newton’s multidimensional interval method are defined in the form (8), which does not fully correspond to the point case.

2. What is the best way to choose the center of the \(\tilde{x}\) expansion when using Newton’s interval method?

It makes sense to do this so that the value of \(\Vert F(\tilde{x})\Vert \) is as small as possible. The smaller the norm of the vector function \(F(\tilde{x})\), the smaller will be the norm of the vectors forming the set of solutions of the interval linear system

$$\begin{aligned} \varvec{L}(x-\tilde{x})=-F(\tilde{x}), \end{aligned}$$

which we must intersect with the original box, the smaller its dimensions will be. In this case, we will most likely obtain a narrower outer estimate of the set of solutions of the original nonlinear system and more accurately determine the status of the box under study. Numerical experiments conducted by E. Hansen and R. Greenberg [21] confirm this conclusion. The book [13] discusses a simple iterative procedure called “inner iteration” which is designed to minimize \(\Vert F(\tilde{x})\Vert \) within a given box.

3. When does inclusion (10) appear and what to do in such situations?

If the width of the box \(\varvec{X}\) is large, then the applied solution methods may not be successful. In such situations, we have inclusion (10), which does not give us any information about the existence of a solution or its absence on a given box \(\varvec{X}\). One of the simple and universal techniques that allows you to move the refinement of solutions from the “dead point” is to split the original box into smaller sub-boxes. We consider this technique in more detail in the next Section 3.1.2

3.1.2 Forced crushing of interval box

The most common technique in such situations is bisection - dividing the box \(\varvec{X}\) into two equal or unequal parts along some edge, for example, into halves (Fig. 2).

$$\begin{aligned} \varvec{X}=\varvec{X}'\cup \varvec{X}'', \end{aligned}$$

where

$$\begin{aligned} \varvec{X}'&=\bigl (\varvec{X}_1,\ldots ,[\underline{X}_i,\text {mid}(\varvec{X}_i)],\ldots ,\varvec{X}_n\bigr ), \nonumber \\ \varvec{X}''&=\bigl (\varvec{X}_1,\ldots ,[\text {mid}(\varvec{X}_i),\overline{X}_i],\ldots ,\varvec{X}_n\bigr ), \end{aligned}$$

for some \(i\in \{1,2,\ldots ,n\}\). In this case, we select the component for crushing according to its maximum width, i.e. \(\varvec{X}_i=\max \{\text {wid}(\varvec{X})\}\).

Fig. 2
figure 2

Forced crushing of boxes into descendants

Then we can apply Newton’s interval method to \(\varvec{X}'\) and \(\varvec{X}''\) separately and combine the results. In this case, the sub-boxes \(\varvec{X}'\) and \(\varvec{X}''\) have a smaller width than the original \(\varvec{X}\), then the application of Newton’s interval method to them can be more successful. Next, these descendants can be divided again, as many as necessary to achieve the desired smallness of their sizes, then we can successfully perform tests of the existence of solutions on these boxes. To organize such crushing, the book [6] suggests creating a list \(\mathcal {L}\) of all descendants of the initial box \(\varvec{X}\) that are suspicious for the content of solutions. At the same time, in order to ensure that the operating time of the algorithm is limited, it is necessary to set a certain size \(\delta \) for the boxes; when this size is reached, it no longer makes sense to further crush the box.

Algorithms for crushing boxes are usually called branch-and-cut methods, where branching is the splitting of the original search area for solutions into subdomains, and cutting is the discarding of unpromising parts of the original area that certainly do not contain solutions.

Algorithm 1
figure a

Algorithm of the branching and cutting method for the global probative solution of equations [6].

Applying Algorithm 1, we get three lists of boxes:

\(\mathcal {L}_1\) is a list consisting of boxes of size no more than \(\delta \), which are guaranteed to contain solutions, systems of equations in \(\varvec{X}\);

\(\mathcal {L}_2\) is a list consisting of boxes of size no more than \(\delta \) that are suspected of containing a solution to a system of equations in \(\varvec{X}\).

\(\mathcal {L}_3\) - a list consisting of “under-processed” boxes that have a size greater than \(\delta \).

All solutions of the system of equations under consideration that do not belong to the boxes from the list \(\mathcal {L}_1\) are contained in the boxes from the lists \(\mathcal {L}_2\) and \(\mathcal {L}_3\). This technique can be applied not only to Newton’s interval method itself, but also to its other modifications, for example, the Krawczyk [8] and Hansen-Sengupta [9] methods.

3.2 Krawczyk method

The Krawczyk operator for the mapping \(\mathcal {K}: \mathbb{I}\mathbb{D}\times \mathbb {R}\rightarrow \mathbb{I}\mathbb{R}^n\) is given by the expression

$$\begin{aligned} \mathcal {K}(\varvec{X},\tilde{x}):= \tilde{x}-\Lambda F(\tilde{x})+(I-\Lambda \varvec{L})(\varvec{X}-\tilde{x}), \end{aligned}$$
(11)

where \(\Lambda \in \mathbb {R}^{n\times n}\) is a real matrix.

If the spectral radius is \(\rho \bigl (|I-\Lambda \varvec{L}|\bigr )<1\), then, according to Schröder’s theorem [6], the mapping has a unique fixed point, which is a solution to the system of equations under consideration. Thus, the box \(\varvec{X}\) contains exactly one solution to the system \(F(x)=0\).

The iterative scheme of the Krawczyk method has the following form:

$$\begin{aligned} \left\{ \begin{array}{l} \varvec{X}^{(0)}\leftarrow \varvec{X}\\ \varvec{X}^{(k+1)}\leftarrow \varvec{X}^{(k)}\cap \mathcal {K}\bigl (\varvec{X}^{(k)},\tilde{x}^{(k)}\bigr ),\quad \tilde{x}^{(k)}\in \varvec{X}^{(k)},\quad k=0,1,2,\ldots \end{array}\right. \end{aligned}$$
(12)

The iterative scheme (12) is called the Krawczyk method, and its special case is also used for outer estimation of solution sets of ISLAE.

Each solution of the system \(F(x)=0\) on the box \(\varvec{X}\) also lies in \(\mathcal {K}\bigl (\varvec{X},\tilde{x}\bigr )\), therefore, to determine more accurate boundaries, we find \(\varvec{X}\cap \mathcal {K}\bigl (\varvec{X},\tilde{x}\bigr )\). If \(\varvec{X}\cap \mathcal {K}\bigl (\varvec{X},\tilde{x}\bigr )=\varnothing \), then the box \(\varvec{X}\) contains no solutions to the system \(F(x)=0\).

Let us present two main facts about the existence of a solution from [6]:

  • if \(\mathcal {K}\bigl (\varvec{X},\tilde{x}\bigr )\subseteq \varvec{X}\) and the matrix \(\Lambda \) in the Krawczyk operator is regular, then the box \(\varvec{X}\) contains at least one solution to the system \(F(x)=0\);

  • if \(\mathcal {K}_i\bigl (\varvec{X},\tilde{x}\bigr ) \subset \varvec{X}_i,\;\;i=1,2,\ldots ,n\), then the interval Lipschitz matrix \(\varvec{L}\) is strongly regular and the box \(\varvec{X}\) contains exactly one solution to the system \(F(x)=0\).

The Krawczyk operator and the iterative scheme of the Krawczyk method work for fairly arbitrary \(\tilde{x}\) and \(\Lambda \), but to obtain the best results it is advisable to choose them so that the result of the Krawczyk operator is, if possible, the narrowest box possible. It can be noted that the smallest width of values of the Krawczyk operator is achieved when choosing \(\tilde{x}=\text {mid}(\varvec{X})\), and for the matrix \(\Lambda \) the most favorable value is \(\bigl (\text {mid}(\varvec{L})\bigr )^{-1}\), since at \(\tilde{x}=\text {mid}(\varvec{X})\) the width of values \(\mathcal {K}\bigl (\varvec{X},\tilde{x}\bigr )\) is minimized.

Another improvement of the Krawczyk operator is associated with a strengthened version of Brouwer’s fixed point theorem, in which it is necessary to check the nesting conditions for the image not for the entire box, but only for its boundary [22].

4 The problem of estimating a set of solutions for partially interval nonlinear systems

Classical interval methods were considered in the previous section for finding solutions to point systems of nonlinear equations in a given interval box. For outer estimation of solution sets of interval systems of the form (5), they are not always applicable, i.e. can sometimes be used when the interval coefficients have fairly small widths. This circumstance limits the applicability of these methods. Below we propose a new method that works in combination with some well-known algorithms.

4.1 The idea of the proposed method

W. Dong and H. Shah [23] proposed the so-called “Vertex Method” for calculating the boundaries of the value domains of functions of fuzzy variables. Z.Qiu and his co-authors in [24, 25] use the vertex method to determine the dynamic response boundary of mechanical systems with interval parameters.

Here we use the idea of this method to solve systems of nonlinear equations of the form (5). To do this, we write system (5) in a more understandable form:

$$\begin{aligned} \left\{ \begin{array}{cccc}F_1& (x_1,x_2,\ldots ,x_n)& =& [\underline{b}_1,\overline{b}_1],\\ F_2& (x_1,x_2,\ldots ,x_n)& =& [\underline{b}_2,\overline{b}_2],\\ \vdots & \ddots & \vdots & \vdots \\ F_n& (x_1,x_2,\ldots ,x_n)& =& [\underline{b}_n,\overline{b}_n]. \end{array}\right. \end{aligned}$$
(13)

Next, with all possible combinations of left and right boundaries of intervals \(\varvec{b}_i=[\underline{b}_i,\overline{b}_i]\; (i=1,2,\ldots ,n)\), we will create a sequence of point systems of nonlinear equations. Then for n intervals \(\varvec{b}_1,\varvec{b}_2,\ldots ,\varvec{b}_n\in \mathbb{I}\mathbb{R}\) - this will be 2n combinations. For example, for \(n=2\), we have:

$$\begin{aligned} \left\{ \begin{array}{c}F_1(x_1,x_2)=[\underline{b}_1,\overline{b}_1],\\ F_2(x_1,x_2)=[\underline{b}_2,\overline{b}_2]. \end{array}\right. \end{aligned}$$
(14)

Thus, we get \(2^2=4\) point equations:

$$\begin{aligned} \begin{array}{cccc} \left\{ \begin{array}{c}F_1(x_1,x_2)=\underline{b}_1\\ F_2(x_1,x_2)=\underline{b}_2 \end{array}\right. & \left\{ \begin{array}{c}F_1(x_1,x_2)=\underline{b}_1\\ F_2(x_1,x_2)=\overline{b}_2 \end{array}\right. & \left\{ \begin{array}{c}F_1(x_1,x_2)=\overline{b}_1\\ F_2(x_1,x_2)=\underline{b}_2 \end{array}\right. & \left\{ \begin{array}{c}F_1(x_1,x_2)=\overline{b}_1\\ F_2(x_1,x_2)=\overline{b}_2 \end{array}\right. \end{array}. \end{aligned}$$
(15)

Having solved each system of nonlinear equations (15) on a given box \(\varvec{X}\) using the Newton or Krawczyk interval method, we obtain four pairs of interval solutions \((\varvec{x}_{1j},\varvec{x}_{2j}) \;(j=1,2,3,4)\), which can be written in more detail in the following form:

$$\begin{aligned} \begin{array}{cc}\varvec{x}_{11}=[\underline{x}_{11},\overline{x}_{11}],& \qquad \varvec{x}_{21}=[\underline{x}_{21},\overline{x}_{21}],\\ \varvec{x}_{12}=[\underline{x}_{12},\overline{x}_{12}],& \qquad \varvec{x}_{22}=[\underline{x}_{22},\overline{x}_{22}],\\ \varvec{x}_{13}=[\underline{x}_{13},\overline{x}_{13}],& \qquad \varvec{x}_{23}=[\underline{x}_{23},\overline{x}_{23}],\\ \varvec{x}_{14}=[\underline{x}_{14},\overline{x}_{14}],& \qquad \varvec{x}_{24}=[\underline{x}_{24},\overline{x}_{24}]. \end{array} \end{aligned}$$
(16)

From the calculated intervals we find the components of the solution to (15), calculating the minimum and maximum of the corresponding boundaries of the intervals from (16), as a result we obtain the final outer interval estimates:

$$\begin{aligned} \varvec{x}_{1}=[\underline{x}_{1},\overline{x}_{1}]= \Bigl [\min _{j=1}^4\{\underline{x}_{1j}\},\max _{j=1}^4\{\overline{x}_{1j}\}\Bigr ],\nonumber \\ \varvec{x}_{2}=[\underline{x}_{2},\overline{x}_{2}]= \Bigl [\min _{j=1}^4\{\underline{x}_{2j}\},\max _{j=1}^4\{\overline{x}_{2j}\}\Bigr ].\nonumber \end{aligned}$$

This can be generalized to n equations:

$$\begin{aligned} \varvec{x}_{i}=[\underline{x}_{i},\overline{x}_{i}]= \Bigl [\min _{j=1}^n\{\underline{x}_{ij}\},\max _{j=1}^n\{\overline{x}_{ij}\}\Bigr ],\quad i=1,2,\ldots ,n. \end{aligned}$$
(17)

Here, the found components \(\varvec{x}_i \;\; (i=1,2,\ldots ,n)\) determine the interval estimate of the set of solutions of system (13) in the form of an interval vector-bar \(\varvec{X}=(\varvec{x}_1,\varvec{x}_2,\ldots ,\varvec{x}_n)\).

Thus, by reducing the interval system to point systems, we will simplify a complex problem into a simpler problem. At the same time, we can prevent overestimation of interval estimates, the so-called “Wrapping effect”, which is often found in interval calculations [26].

4.2 Algorithm

The main algorithm of this method includes two auxiliary algorithms.

1) To determine the tuple of coefficients (vector components) of the right-hand sides of point systems of equations having the form (15), we propose the following simple and effective auxiliary algorithm (Algorithm 2), which can be tested directly on the Python interpreter:

Algorithm 2
figure b

Selecting vertices \(c_i \;(i=1,2,\ldots ,2^n)\), for point equations.

Here, to implement Algorithm 2, the boundaries of the intervals on the right side are entered sequentially as elements of a one-dimensional array.

Example 1

Let’s show the work of Algorithm 2 for \(n=3\), \(\varvec{b}_1=[13,15]\), \(\varvec{b}_2=[7,8]\), \(\varvec{b}_3=[11,12]\):

$$\begin{aligned} \begin{array}{cc}c_{1}=[13,7,11],& \qquad c_{5}=[15,7,11],\\ c_{2}=[13,7,12],& \qquad c_{6}=[15,7,12],\\ c_{3}=[13,8,11],& \qquad c_{7}=[15,8,11],\\ c_{4}=[13,8,12],& \qquad c_{8}=[15,8,12]. \end{array} \end{aligned}$$
(18)

Here, each tuple \(c_i\) defines the vector of the right side of a point systems of the form \(F(x)=c_i \; (i=1,2,\ldots ,8)\), for example, the first system for the tuple \(c_1=[13, 7, 11]\) is defined as

$$\begin{aligned} \left\{ \begin{array}{l}F_1(x_1,x_2,x_3)=13,\\ F_2(x_1,x_2,x_3)=7,\\ F_3(x_1,x_2,x_3)=11. \end{array}\right. \end{aligned}$$

2) To localize solutions, we connect Algorithm 1. This algorithm creates lists of boxes that are guaranteed to contain solutions to the system, removing boxes that do not contain solutions. In this case, Algorithm 1 is run for each point system separately.

Finally, we present the general algorithm design for the proposed combined vertex method (Algorithm 3).

Algorithm 3
figure c

Combined vertex method for solving partially interval systems.

When applying iterative algorithms to solve point systems of equations (step 3), the criterion for stopping the iterative process is the condition \(\texttt {dist}\bigl (\varvec{x}^{(k)},\varvec{x}^{(k+1)}\bigr )<\varepsilon \), where \(\varepsilon \) is the required accuracy.

5 Numerical experiments

In this section we provide several examples showing the process of solving point and partially interval systems. First, Example 2 analyzes the operation of the interval methods of Newton and Krawczyk for point systems, comparing their results in terms of the permissible area of application and the number of iterations. Example 3 demonstrates the direct application of these methods to partially interval systems. Next, in Example 4, the performance of the proposed method is tested, comparing the numerical results on the quality of the found interval estimates, relative to the Krawczyk method. The last example 5 considers the application of the proposed method to one problem of electrical circuit analysis, where classical interval methods are not applicable.

All numerical experiments were carried out on an Intel\(\circledR \) CoreTM i3-3217U laptop, CPU 1.83 GHz, 4 GB DDR3 Memory, the program code was written in GNU Octave v.4.2.1, using the “octave-interval v.3.2.0-3_amd64” software package [27]. This package complies with IEEE Std 1788-2015 for interval arithmetic [28].

Example 2

Let us be given a system of nonlinear equation

$$\begin{aligned} \left\{ \begin{array}{l}x_1^2+x_2^2=1,\\ x_1+x_2^2=0. \end{array}\right. \end{aligned}$$
(19)
Table 1 Results for various initial data for system (19)
Table 2 Numerical results with the initial approximation \(\varvec{X}^{(0)}=[0.5,1]\times [0.5,1]\) of system (19)

For this system we define the corresponding functions

$$\begin{aligned} \left\{ \begin{array}{l}F_1(x_1,x_2)=x_1^2+x_2^2-1,\\ F_2(x_1,x_2)=x_1+x_2^2. \end{array}\right. \end{aligned}$$

The results of applying the interval methods of Newton (9) and Krawczyk (12) for system (19) are given in Table 1 (where \(\varvec{X}^{(0)}\) is the initial approximation, \(k_{\mathcal {N}},k_{\mathcal {K}}\) is the iteration number of the interval methods of Newton and Krawczyk, respectively).

From Table 1 it is clear that in the first three cases, Newton’s interval method does not provide a solution, this is due to the fact that the interval Lipschitz matrix \(\varvec{L}\) contains singular point matrices and therefore the set of solutions is unlimited. Krawczyk’s method converges more slowly than Newton’s interval method, but provides a solution in areas where Newton’s method is not applicable. Newton’s method works with quadratic convergence when the width of the interval vector of the initial approximation is small enough. Both methods produce the same and more accurate interval estimates.

If we are only interested in the point solution, then we can take the midpoint of these interval estimates as

$$\begin{aligned} x_1=\text {mid}\varvec{X}_1=\frac{\underline{X}_1+\overline{X}_1}{2}=0.618035,\quad x_2=\text {mid}\varvec{X}_2=\frac{\underline{X}_2+\overline{X}_2}{2}=0.786155. \end{aligned}$$

For graphical illustration, let us consider in detail the case with the initial approximation \(\varvec{X}^{(0)}=[0.5,1]\times [0.5,1]\), in which the results of the iterative calculation are given in Table 2.

Fig. 3
figure 3

Iterative process of Newton’s method for system (19)

Fig. 4
figure 4

Iterative process of the Krawczyk method for system (19)

A graphical representation of the iterative process of interval methods is illustrated in Figs. 3 and 4, where nested sequences of rectangles are interval estimates in the corresponding iterations.

The dimensions of the boxes \(\varvec{X}^{(3)},\varvec{X}^{(4)},\varvec{X}^{(5)}\) (see Table 2) are quite small, so they are almost invisible in the graphs (Figs. 3 and 4).

Example 3

Consider the system of (19), in partially interval form

$$\begin{aligned} \left\{ \begin{array}{l} x_1^2+x_2^2=[1-r,1+r],\\ x_1+x_2^2=0, \end{array}\right. \end{aligned}$$
(20)

where r is the radius of the created interval. Let, for this example, \(r=0.1\).

Using this fairly simple example, we discuss the work of Newton’s and Krawczyk’s interval methods in the process of solving interval systems.

Defining the relevant functions

$$\begin{aligned} \left\{ \begin{array}{l} \varvec{F}_1(x_1,x_2)=x_1^2+x_2^2-[0.9,1.1],\\ F_2(x_1,x_2)=x_1+x_2^2, \end{array}\right. \end{aligned}$$

where \(\varvec{F}_1(x_1,x_2)\) is an interval-valued function, and \(F_2(x_1,x_2)\) is a point (real) function. The results of the numerical experiment are presented in Table 3.

Table 3 Numerical results with the initial approximation \(\varvec{X}^{(0)}=[0.5,1]\times [0.5,1]\) of system (20)

As the results from Table 3 show, when applying the Newton and Krawczyk interval methods for interval systems (20), the number of iterations increases compared to the application for point systems (19), but both methods always give the same final results. The process of approaching the interval solution for system (20) is presented graphically in Figs. 5 and 6.

Fig. 5
figure 5

Iterative process of Newton’s method for system (20)

Fig. 6
figure 6

Iterative process of the Krawczyk method for system (20)

From Figs. 5 and 6 it is clear that the interval-valued function \(\varvec{F}_1(x)\) forms families of point functions. The sequence of rectangles signifies approximations of the outer interval estimates of the united solution set, and the resulting interval estimate is highlighted in red.

Example 4

Now consider an interval system of third-order nonlinear algebraic equations

$$\begin{aligned} \left\{ \begin{array}{c} x_1+x_2^2+x_3^2=[11.9,12.1],\\ x_1^2+x_2+x_3^2=[13.8,14.2],\\ x_1^2+x_2^2+x_3=[7.8,8.1]. \end{array}\right. \end{aligned}$$
(21)

To create an iterative process, we select the initial approximation \(\varvec{X}^{(0)}=[1.5,2.5]\times [0.5,1.5]\times [2.5,3.5]\), since the point (2; 1; 3) is satisfied by all equations in the system (21). In this case, the spectral radius is \(\rho \bigl (|I-\Lambda \varvec{L}|\bigr )\approx 1.2896>1\), which means convergence is not guaranteed.

Based on this example, we will conduct a comparative analysis of the results on the quality of interval estimates between the direct application of the Krawczyk method to solve system (21) and the proposed approach, which uses the Krawczyk method to solve the generated systems of point equations. To do this, we will consider two approaches.

First approach (direct application of the Krawczyk method)

First, let us consider the case of direct application of the Krawczyk method for the interval system (21). We create the corresponding interval-valued functions \(\varvec{F}:\mathbb{I}\mathbb{R}^3\rightarrow \mathbb{I}\mathbb{R}\):

$$\begin{aligned} \varvec{F}(x)=\left( \begin{array}{l}\varvec{F}_1(x_1,x_2,x_3)=x_1+x_2^2+x_3^2-[11.9,12.1],\\ \varvec{F}_2(x_1,x_2,x_3)=x_1^2+x_2+x_3^2-[13.8,14.2],\\ \varvec{F}_3(x_1,x_2,x_3)=x_1^2+x_2^2+x_3-[7.8,8.1]. \end{array}\right) . \end{aligned}$$

The interval Lipschitz matrix of the system is equal to:

$$\begin{aligned} \varvec{L}(\varvec{X})= \begin{pmatrix} [1,1] & 2\cdot \varvec{X}_2 & 2\cdot \varvec{X}_3 \\ 2\cdot \varvec{X}_1 & [1,1] & 2\cdot \varvec{X}_3 \\ 2\cdot \varvec{X}_1 & 2\cdot \varvec{X}_2 & [1,1] \end{pmatrix}. \end{aligned}$$

We find the inverse matrix \(\Lambda =\Bigl (\varvec{L}\bigl (\text {mid}(\varvec{X})\bigr )\Bigr )^{-1}\). The first step of the process is decisive, if the box \(\bigl (I-\Lambda \varvec{L}\bigr )\bigl (\varvec{X}-\text {mid}(\varvec{X})\bigr )\) from expression (11) decreases in size, then the process is successful. The results of the computational experiment are shown in Table 4.

From Table 4 you will notice that the values of \(\varvec{X}_2\) do not change, since \(\rho \bigl (|I-\Lambda \varvec{L}|\bigr )>1\) is the same in all iterations.

So, as a result of direct application of Krawzcyk’s method, we obtained the outer interval estimate \(\varvec{X}_{\mathcal {K}}\) (\(\mathcal {K}\) - Krawzcyk):

$$\begin{aligned} \varvec{X}_{\mathcal {K}}= \begin{pmatrix} \varvec{X}_1\\ \varvec{X}_2\\ \varvec{X}_3 \end{pmatrix} = \begin{pmatrix} [1.6334,2.3553]\\ [0.5,1.5]\\ [2.7680,3.2452] \end{pmatrix} . \end{aligned}$$
(22)
Table 4 Results of direct application of the Krawczyk method for system (21)
Table 5 Interval estimates of point systems of equations, compiled from the interval system (21)

Second approach (application of the proposed method)

To apply the proposed method, we divide system (21) into eight \((2^3=8)\) point systems using Algorithm 2, as shown in Example 1. Therefore, to solve these point systems we use the Krawczyk method, since this method is more applicable than Newton’s interval method.

Next, we present the interval estimates of the corresponding point systems of equations obtained by the Krawczyk method with the initial approximation \(\varvec{X}^{(0)}=[1.5,2.5]\times [0.5,1.5]\times [2.5,3.5]\) (Table 5), where k is the total number of iterations, and \(c=(c_1,c_2,c_3)^{\top }\) are the coefficients of the right side obtained using Algorithm 2, as shown in (18).

From the obtained local interval estimates for point systems, we find global interval estimates for the interval system (21):

$$\begin{aligned} \varvec{X}_{1}=[\underline{x}_1,\overline{x}_1]=\Bigl [\min _{j=1}^8\{\underline{x}_{1j}\},\max _{j=1}^8\{\overline{x}_{1j}\}\Bigr ]=[1.9170,2.0693],\nonumber \\ \varvec{X}_{2}=[\underline{x}_2,\overline{x}_2]=\Bigl [\min _{j=1}^8\{\underline{x}_{2j}\},\max _{j=1}^8\{\overline{x}_{2j}\}\Bigr ]=[0.7698,1.1439],\nonumber \\ \varvec{X}_{3}=[\underline{x}_3,\overline{x}_3]=\Bigl [\min _{j=1}^8\{\underline{x}_{3j}\},\max _{j=1}^8\{\overline{x}_{3j}\}\Bigr ]=[2.9565,3.0570].\nonumber \end{aligned}$$

Thus, we obtained an interval estimate of \(\varvec{X}_{\mathcal {V}}\) (\(\mathcal {V}\) - vertex) of system (21):

$$\begin{aligned} \varvec{X}_{\mathcal {V}}= \begin{pmatrix} \varvec{X}_1\\ \varvec{X}_2\\ \varvec{X}_3 \end{pmatrix} = \begin{pmatrix} [1.9170,2.0693]\\ [0.7698,1.1439]\\ [2.9565,3.0570] \end{pmatrix} . \end{aligned}$$
(23)

From the obtained numerical results (22) and (23), we note that the proposed method (second approach) gives better interval estimates than the direct application of interval iterative methods (first approach) for solving interval systems of nonlinear algebraic equations:

$$\begin{aligned} \varvec{X}_{\mathcal {V}}= \begin{pmatrix} [1.9170,2.0693]\\ [0.7698,1.1439]\\ [2.9565,3.0570] \end{pmatrix} \subset \begin{pmatrix} [1.6334,2.3553]\\ [0.5,1.5]\\ [2.7680,3.2452] \end{pmatrix} =\varvec{X}_{\mathcal {K}}. \nonumber \end{aligned}$$

Example 5

(Application) In this example, consider the problem of analyzing a known electrical circuit [29, 30], where the voltage transfer function (Butterworth) is defined as

$$\begin{aligned} T(s)=\frac{1/R_i}{2+s(C_1+C_3+L_2/R_i)\cdot s^2 \cdot (L_2C_3+C_1L_2/R_i)}+\frac{1/R_i}{C_1L_2C_3s^3}= \nonumber \end{aligned}$$
$$\begin{aligned} = \frac{1/R_i}{2+a_1s+a_2s^2+a_3s^3}. \end{aligned}$$

By introducing the designations for the circuit parameters as \(x_1=C_1\), \(x_2=L_2\), \(x_3=C_3\), the coefficients \(a_1\), \(a_2\), \(a_3\) are expressed by the equations:

$$\begin{aligned} \begin{array}{lll} a_1& =& x_1+x_3+x_2/R_i,\\ a_2& =& x_2x_3+x_1x_2/R_i,\\ a_3& =& x_1x_2x_3. \end{array} \end{aligned}$$
(24)

As shown in [30], at \(R=0.5\) we obtain a specific system of equations for calculating the circuit parameters

$$\begin{aligned} \left\{ \begin{array}{l}x_1+2x_2+x_3=6,\\ 2x_1x_2+x_2x_3=6,\\ x_1x_2x_3=3. \end{array}\right. \end{aligned}$$
(25)

System (25) has two point solutions \(x_1=(1,1.5,2)^{\top }\) and \(x_2=(3.261,0.779,1.181)^{\top }\).

Let us assume that the values of the coefficients \(a_1\), \(a_2\), \(a_3\) in (24) vary within certain intervals

$$\begin{aligned} \left\{ \begin{array}{l}x_1+2x_2+x_3=[5.8,6.5],\\ 2x_1x_2+x_2x_3=[5.5,6.4],\\ x_1x_2x_3=[2.6,3.5]. \end{array}\right. \end{aligned}$$
(26)

For this example, the Newton and Krawczyk methods are not applicable, since these methods work with sufficiently small sizes of the initial box, and the region of the solution set of system (26) is larger than the size of the admissible region of these iterative methods, for which they can lead us to a convergent iterative process.

When applying the proposed method, step 2 of Algorithm 3 narrows the size of the initial box by discarding the part that does not contain a solution. As a result, we will obtain an admissible region in which Krawczyk’s method will be applicable for each point system of type (15). For example, Krawczyk’s method with initial box \(\varvec{X}^{(0)}=[3,3.5]\times [0.5,1]\times [1,1.5]\) does not converge to a solution for the first point equation

$$\begin{aligned} \left\{ \begin{array}{l}x_1+2x_2+x_3=5.8,\\ 2x_1x_2+x_2x_3=5.5,\\ x_1x_2x_3=2.6. \end{array}\right. \end{aligned}$$

At the very first crushing (step 2 Algorithm 3), we get two sub-boxes \(\varvec{X}^{(0)}=\left( \varvec{X}^{(0)}\right) '\bigcup \left( \varvec{X}^{(0)}\right) '' =\left( [3,3.25]\times [0.5,1]\times [1,1.5]\right) '\cup \left( [3.25,3.5]\times [0.5,1]\times [1,1.5]\right) ''\), which the first sub-box \(\left( [3,3.25]\times [0.5,1]\times [1,1.5]\right) '\) already advances the process towards a solution and in 7 iterations we get \(\varvec{X}=[3.2455,3.2456]\times [0.7239,0.7239]\times [1.1066,1.1067]\), and discard the second half \(([3.25,3.5]\times [0.5,1]\times [1,1.5])''\), since there are no solutions in this box. Here we can choose a larger initial box \(\varvec{X}^{(0)}\), but the execution time of the algorithm increases significantly.

The numerical results when applying Algorithm 3 for system (26) are given in Table 6. Here we present the final result, omitting all intermediate data.

Table 6 Numerical results for system (26)

From Table 6 it is clear that we have found two partially intersecting parallelotopes that cover the set of solutions of the interval system (26), corresponding to point solutions of the system (25).

Partially interval systems often arise in the analysis of parameters of mathematical models, for example, if in system (25) the value of the parameter \(\varvec{x}_1=[\underline{x}_1,\overline{x}_1]\) is given in intervals, and it is necessary to estimate the values of the remaining variables \(\varvec{x}_2\) and \(\varvec{x}_3\), then by transferring all \(\varvec{x}_1\) in the system equations to one side of the equality, we can create a new partially interval system. This transformation is useful when analyzing individual parameters of a mathematical model if they have an interval type of uncertainty.

6 Conclusion

The main objects considered in this article were the problems of estimating a set of solutions to point and interval systems of nonlinear equations and their solution methods. In order to study the advantages and disadvantages of the known classical interval methods for solving these problems, we first analyzed the Newton and Krawczyk methods from a theoretical and practical point of view. The results of the analysis showed that these methods work for fairly small sizes of the initial boxes, and in the general case are not suitable for solving systems with interval coefficients, even when used in combination with branch-and-prune type methods [12]. Based on the refined properties of these methods, a combined vertex method was proposed for estimating the united set of solutions of interval nonlinear systems. Due to the limited application of Newton’s and Krawczyk’s methods, we cannot directly apply them to solve systems with wider interval coefficients. Numerical experiments have shown that the proposed method is more efficient and gives more accurate estimates in feasible areas than the direct application of classical interval iterative methods. In the work, in particular, in the examples given, we limited ourselves to considering only partially interval systems with a small size in order to outline the essence of the idea of this approach. In future research, we intend to develop a theoretical rationale for this approach and consider larger-scale examples. The approaches considered and statements made in this work can be used by engineers when solving practical problems where systems of nonlinear equations arise. For example, when solving nodal nonlinear equations for calculating parameters in electrical circuits [31, 32], as well as when using the method of integrating differential equations [33] under conditions of interval data uncertainty.