1 Introduction

Environmental problems in the modern world occupy one of the priority places among the scientific community (Oves et al. 2018; Sładkowski 2020; Kapustnyk et al. 2019). This is primarily caused by a rapid increase in the average temperature on the planet (Assad et al. 2019; Lelieveld et al. 2016; Gasparrini et al. 2017; Caloiero 2017) due to an increase in greenhouse gases in the air (Meyer and Newman 2020; Van Vuuren et al. 2017; Zhu-Barker et al. 2017), which in turn is associated with the development of industry (Fraccascia and Giannoccaro 2019; Meng et al. 2017), an increase in the number of automobile (Kumar and Gupta 2016; Shiraki et al. 2020) and aviation (Hudda et al. 2020) transport, and other natural phenomena and anomalies (Herndon 2018). This also results in a general deterioration in the condition of atmospheric air.

Several approaches are now being used to reduce the amount of air pollutants:

  1. 1.

    Modernization of existing power equipment to improve its technical and economic characteristics (Zaporozhets 2019; Babak et al. 2020; Zaporozhets 2020; Dzikuć et al. 2020; Li et al. 2019);

  2. 2.

    Using of new types of fuel (biofuels, hydrogen, etc.) (Li et al. 2019; Tang et al. 2019; Devarajan et al. 2018; Slenkin and Geletukha 2007; Hasan and Rahman 2017);

  3. 3.

    Increasing the share of renewable energy sources (solar power, wind power, hydropower) (Hu et al. 2018; Liang et al. 2019; Hannan et al. 2019; Liddle and Sadorsky 2017; Pfeifer et al. 2019);

  4. 4.

    Development of methods and means for the capture and recycling of pollutants from the air (Nematollahi and Carvalho 2019; Baker et al. 2017; Akram et al. 2016; Hua et al. 2017; Breyer et al. 2020; Rao et al. 2017);

  5. 5.

    Increasing the amount of green spaces (Mader 2020; Dadvand and Nieuwenhuijsen 2019; Ren et al. 2017);

  6. 6.

    Optimization of the urban environment to reduce the number of road transport (Figliozzi et al. 2020; Tajalli and Hajbabaie 2018; Penazzi et al. 2019), etc.

The problem of air pollution is very acute for many countries. For example, according to the World Health Organization, Ukraine ranks 1st place in the world in terms of the number of deaths from air pollution (Popov et al. 2020a). In other cities and regions of the world, the situation is also quite difficult (Di et al. 2017; Koengkan et al. 2020; Rappazzo et al. 2019).

To monitor ambient air quality, air pollution monitoring networks have become widespread (Idrees and Zheng 2020; Dhingra et al. 2019; Grace and Manju 2019; Guanochanga et al. 2019; Su 2018). Such networks can include both high-precision devices for the quantitative and qualitative analysis of air samples and relatively inexpensive sensor devices designed for indicative monitoring (Castell et al. 2017; Rai et al. 2017). Their main task is to signal the excess of certain concentrations of pollutants in the air (for example, PM2.5, PM10, O3, NOx, SOx, and CH2O) (Zaporozhets et al. 2020; Yang and Wang 2017). But even a sufficiently large network of observation posts cannot always generate the necessary information. For this, methods of mathematical modeling are used.

Currently, a large number of models are used in practice to study the distribution of pollutants (Elperin et al. 2016; Larkin et al. 2017; Alimissis et al. 2018; Nasari et al. 2016). Such models solve both direct and inverse problems. The solution to the direct problem is reduced to creating a concentration field according to the known characteristics of the pollution source. The solution of the inverse problem is reduced to finding the characteristics of the pollution source by the known values of the concentration of pollutants, determined at the posts of the air pollution monitoring network.

In the countries of Eastern Europe (Ukraine, Belarus, Russia, etc.), at the state level, either the OND-86 methodology or other methodologies close to the OND-86 is used (Sokolov 2017; Pavlovich et al. 2016). OND-86 allows to calculate the maximum possible distribution of emission concentrations under conditions of a weakly stable state of the atmosphere and averaged values over 20–30-min intervals, but does not take into account such factors as the stability class of the atmosphere and the type of the earth’s surface (Popov et al. 2020b). This technique was applied to calculate the concentration of pollutants at a distance of up to 100 km from the emission source.

The methodology makes it possible to calculate the distribution of pollutants emitted into the atmosphere by single point, linear sources, and a group of sources, taking into account the influence of the terrain, to determine the limiting concentrations of pollutants in a two-meter layer above the earth’s surface, as well as the vertical distribution of concentrations.

One of the main disadvantages of this methodology is the way of solving the inverse problem to determine the characteristics of the pollution source (Gochakov et al. 2018; Jafari et al. 2019). It directly provides for the presence of such a source (that is, the known location of the pollution source) that emits, and comes down to calculating the emission power M and the height of the pipe H from the value of the maximum surface concentration Cm. At the same time, it is impossible to find the location of the source of atmospheric air pollution (Kendler et al. 2020).

Thus, it can be concluded that the existing models for solving inverse problems in the study of the distribution of atmospheric air pollution need to be developed.

This article is devoted to the improvement of methods for modeling the distribution of pollutants for solving inverse problems, in particular, for localizing sources of air pollution according to the data of air pollution monitoring network.

2 Methods

2.1 General Approach

For modeling atmospheric pollution, the equation of turbulent diffusion was used, which has the form:

$$ \frac{\partial \mathrm{C}}{\partial t}+\sum \limits_{i=1}^3\left[{U}_i\frac{\partial C}{\partial {X}_i}\right]=\sum \limits_{i=1}^3\left[\frac{\partial }{\partial {X}_i}\left({K}_i\frac{\partial C}{\partial {X}_i}\right)\right]- aC, $$
(1)

where t is the time parameter for the non-stationary model of the problem, described by this equation; Xi are spatial coordinates of the calculation area; Ui is the value of the average speed of movement of the pollution source along the spatial coordinates Xi; Ki is the value of the exchange coefficient along the spatial coordinates; i = 1, 2, 3 are directions of the coordinate axes of spatial coordinates; a is the coefficient showing the change in concentration due to the transformation of the pollution source during the considered period of time; and С is the concentration of the pollution source during the entire calculation period.

Applying the Cartesian coordinate system, it was defined the axes X1, X2, and X3 through x, y, and z, respectively; speed −U1 = u, U2 = v, U3 = w; and exchange coefficients −K1 = kx, K2 = ky, K3 = kz.

Therefore, Eq. (1) takes the following form:

$$ \frac{\partial \mathrm{C}}{\partial t}+u\frac{\partial \mathrm{C}}{\partial x}+v\frac{\partial \mathrm{C}}{\partial y}+w\frac{\partial \mathrm{C}}{\partial z}=\frac{\partial }{\partial x}\left({k}_x\frac{\partial C}{\partial x}\right)+\frac{\partial }{\partial y}\left({k}_y\frac{\partial C}{\partial y}\right)+\frac{\partial }{\partial z}\left({k}_z\frac{\partial C}{\partial z}\right)- aC. $$
(2)

In the process of solving real practical problems, the form of Eq. (2) can be significantly simplified. If the axis is oriented in the direction of the average wind speed, then v = 0.

The considered version of the model is some idealized, since it does not take into account the influence of other pollution sources and anthropogenic factors. This article discusses the possibility of localizing one pollution source with using one sensor. The inverse problems in the article are based on direct problems, which, in turn, are solved by both deterministic and stochastic methods. Such models effectively describe the distribution of pollutants over limited distances.

2.2 Setting Goals. Simulation of a Two-dimensional Case. Direct Problem

The equation is considered in the calculation area Ω = [0; 1]2:

$$ \frac{\partial \mathrm{C}}{\partial t}+u\frac{\partial \mathrm{C}}{\partial x}+w\frac{\partial \mathrm{C}}{\partial z}=\frac{\partial }{\partial x}\left({k}_x\frac{\partial C}{\partial x}\right)+\frac{\partial }{\partial z}\left({k}_z\frac{\partial C}{\partial z}\right), $$
(3)

where u = 0.25; w = 0.03; kx = kz = 1; t = [0; 0.5].

The initial condition for (3) has the form:

$$ {\left.C\left(x,z,0\right)\right|}_{\left(x,z\right)\in \Omega}=0. $$
(4)

Condition (4) guarantees that at the initial moment of time the concentration of air pollutant is absent, that is, equal to zero.

The conditions at the boundary Γ of the calculation area in (3) are as follows:

$$ {\left.\left(\partial C\left(x,z,t\right)/\partial \overrightarrow{\mathbf{n}}\right)\right|}_{\Gamma}=0, $$
(5)

where \( \overrightarrow{\mathbf{n}} \) is the normal vector to the boundary of the calculation area Ω.

The internal condition at the source of pollution for (3) has the form:

$$ C\left({x}_{Src},{z}_{Src},t\right)=1. $$
(6)

In Eq. (6), (xSrc, zSrc) = (0.67; 0.32) are coordinates of the geometrical position of the pollution source. For monitoring the concentration in a fixed position, the point of the calculation area Ω was chosen, which receives data during the entire time period of the calculation, namely, for t = [0; 0.5].

The data received by the sensor can be written as follows:

$$ {C}_{DS}=C\left({x}_S,{z}_S,t\right),t\in \left[0;0.5\right], $$
(7)

where (xS, zS) = (0.32; 0.83) are coordinates of the geometric position of the sensor.

The posed problem, described by Eqs. (3)–(6), modeled in the MATLAB 2019b application software package. The simulation results are shown in Fig. 1.

Fig. 1
figure 1

Results of solving the problem by formulas (3)–(6) with a pollution source (red point) and a sensor (yellow point), which determines the pollutant’s concentration: ae concentration distributions C (x, z, t) at time points t = 0.1; t = 0.2; t = 0.3; t = 0.4; and t = 0.5 respectively

As a result of modeling the problem according to Eqs. (3)–(6), data for Eq. (7) is obtained. A graphical representation of the results of solving Eq. (7) is shown in Fig. 2.

Fig. 2
figure 2

The data obtained by the sensor, which is the result of solving Eqs. (3)–(6)

So, the data from the sensor make it possible to solve a number of inverse problems (Golovnya and Khaidurov 2017; Petrov et al. 2018; Adam and Branda 2016; Penenko et al. 2019), with the help of which it is possible to establish the position of the pollution source.

2.3 Simulation of a Two-dimensional Case. Inverse Problem Models

The solution of such problems in ecology can be solved using Eq. (7). According to the results of measurements in such problems, the characteristics of the pollution source are determined. Based on the data from the sensor, it is possible to construct the first mathematical model of the inverse problem based on the turbulent diffusion Eq. (3) is a model for determining the height of the pollution source and the concentration of emissions on it.

The second mathematical model of the inverse problem, based on the turbulent diffusion Eq. (3), is a model for determining the coordinates of the location of the pollution source pipe, its height and concentration of emissions, which it generates.

First Mathematical Model of the Inverse Problem

The problem is posed in an optimization form. It is necessary to find the global minimum of a functional of the form:

$$ J\left({z}_{Src},{C}_{Src}\right)=\underset{\tau_{beg}}{\overset{\tau_{end}}{\int }}{\left({C}_{DS}\left({C}_{Src}\left({x}_{Src},{z}_{Src}\right),t\right)-{C}_{DS}(t)\right)}^2 dt, $$
(8)

where CDS(t) is data on the concentration of a pollutant in the air obtained by the sensor as a result of direct measurements, which is located in position (xS, zS); CDS(CSrc(xSrc, zSrc), t) is data of the concentration of the pollutant on the sensor, obtained as a result of indirect measurements. This value depends on the position of the pollution source and the concentration CSrc that it generates.

The restriction on functional (8) is a turbulent diffusion equation of the form (3) with initial and boundary conditions (4) and (5), respectively. The value CDS(t) is the observation statistics and is set in a table form:

$$ {C}_{DS}\left({t}_i\right)={TableData}_i,i=\overline{1,N}, $$
(9)

where N is the total number of observations. The data in Eq. (9) are given from Fig. 2.

The following mathematical model is a modification of the first model and consists in determining the geometric position of the pollution source, as well as the concentration of these emissions.

Second Mathematical Model of the Inverse Problem

The problem is posed in an optimization form. It is necessary to find the global minimum of a functional of the form:

$$ J\left({x}_{Src},{z}_{Src},{C}_{Src}\right)=\underset{\tau_{beg}}{\overset{\tau_{end}}{\int }}{\left({C}_{DS}\left({C}_{Src}\left({x}_{Src},{z}_{Src}\right),t\right)-{C}_{DS}(t)\right)}^2 dt, $$
(10)

where CDS(t) is the data on the concentration of a pollutant in the air obtained by the sensor as a result of direct measurements, which is located in position (xS, zS); CDS(CSrc(xSrc, zSrc), t) is data of the concentration of the pollutant on the sensor, obtained as a result of indirect measurements. This value depends on the position of the pollution source and the concentration CSrc that it generates.

The restriction on functional (10) is a turbulent diffusion equation of the form (3) with initial and boundary conditions (4) and (5), respectively. The value CDS(t) is the observation statistics and is set by Eq. (9).

2.4 Methods for Solving Models of Inverse Problems

The process of solving inverse problems can be divided into several stages. Considering such problems from the point of view of the main aspects, obtaining optimal estimates and the content of the problems can be disclosed in several stages (Маtsevitiy et al. 2017).

  1. Stage I:

    solving the optimization problem, the optimal mathematical model is selected with the most convenient dependence of the input and output parameters.

  2. Stage II:

    choice of the form of setting the inverse problems and the ways to solve it.

  3. Stage III:

    selection of the optimal measurement vector, including smoothing and other forms of preliminary processing of experimental information.

  4. Stage IV:

    choice of a possible method or algorithm for solving, based on the type of mathematical model of the inverse problem, the degree of incorrectness of its formulation, computing facilities that are at the disposal of the researcher, and the software for them.

  5. Stage V:

    choice of the optimality criterion, the target functional, its norm, the method for finding the regularization parameters, the order of the stabilizing functional, and the method for minimizing the target functional.

  6. Stage VI:

    compilation of an optimal algorithm for solving the inverse problem, taking into account the chosen method, the optimality criterion, the required accuracy, and the shortest path to obtain stable solutions.

  7. Stage VII:

    selection of optimal computational procedures and implementation of the corresponding algorithm.

  8. Stage VIII:

    analysis of the solutions obtained, including preliminary processing of the results (approximation, smoothing, etc.).

The process of solving the inverse problem does not necessarily include all the listed stages or all the actions of each stage. It depends on the method used. Any method can be used at different stages, and during solving one problem, as a rule, several methods are used (Babak et al. 2019; Eremenko et al. 2020; Zaporozhets et al. 2018). In order to choose the optimal research method, it is necessary to have an idea of the whole variety of existing methods, which must be clearly systematized. There are various classifications of methods for solving the main classes of inverse problems.

Functionals of the forms (8) and (10) are minimized by solving a system of nonlinear equations. Let designate all unknown parameters of functionals (8) and (10) in the form of the vector\( \overrightarrow{x}=\left({x}_1,\dots, {x}_i,\dots, {x}_n\right) \). The system of equations has the form (Van Vuuren et al. 2017; Fraccascia and Giannoccaro 2019):

$$ \left\{\frac{\partial J}{\partial {x}_i}=0\right\},i=\overline{1,N}, $$
(11)

where xi are sought-for values of the parameters of functionals of the form (8), (10); N is the total number of parameters in the corresponding functionality. The number of unknown parameters in functional (8) is 2, and 3 in functional (10).

System (11) can be solved by Newton’s method and its modifications (Schleicher and Stoll 2017; Ramos and Monteiro 2017).

Then, the optimization problem can be posed as follows. Among the numerous existing optimization methods of Newton’s type, it is necessary to choose the most effective one for finding solutions to models of the posed inverse problems and to reduce the number of calls to the procedure for solving the direct problem in them.

Example. Let \( f\left(\overrightarrow{x}\right)\to \min, x\in {R}^n \). First, it can be applied the steepest descent method to the unconstrained optimization problem. As the direction of descent, it can be chosen movement along the antigradient, for example, as (Lakhno et al. 2020):

$$ {\overrightarrow{x}}^{\left(k+1\right)}={\overrightarrow{x}}^{(k)}-{h}_k{f}^{\prime}\left({\overrightarrow{x}}^{(k)}\right),{h}_k>0,k=0,1,2,\dots . $$

The problem consists in finding at each step such a value hk that minimizes the function for a certain vector \( {\overrightarrow{x}}^{(k)} \), and which is taken from the previous iteration number k. The search for such a value hk can be implemented using any methods of one-dimensional unconstrained optimization. The value of the parameter hk ∈ R1 is taken from the condition of the minimum of the function \( f\left({\overrightarrow{x}}^{(k)}\right) \) in the direction of movement of the antigradient:

$$ f\left({\overrightarrow{x}}^{(k)}-{h}_k\cdot \nabla f\left({\overrightarrow{x}}^{(k)}\right)\right)=\min f\left({\overrightarrow{x}}^{(k)}-h\cdot \nabla f\left({\overrightarrow{x}}^{(k)}\right)\right),h>0. $$

So, using approximations to the initial condition, the sought-for value can be refined using the steepest descent method.

2.5 Modifications of the Classical Newton Method

Methods of this type replace the procedure for solving a nonlinear system of equations of the form \( {F}_i\left(\overrightarrow{x}\right)={F}_i\left({x}_1,\dots, {x}_n\right)=0, \) \( i=\overline{1,N} \) by a procedure for finding a solution to a sequence of systems of algebraic equations of the form:

$$ \left(\begin{array}{c}\frac{{\partial F}_1}{{\partial x}_1}\left(\begin{array}{l}\to (k)\\ {}x\end{array}\right)\\ {}\cdots \\ {}\frac{\partial {F}_n}{{\partial x}_1}\left(\begin{array}{l}\to (k)\\ {}x\end{array}\right)\end{array}\kern0.5em \begin{array}{c}\cdots \\ {}\cdots \\ {}\cdots \end{array}\kern0.5em \begin{array}{c}\frac{{\partial F}_1}{{\partial x}_n}\left(\begin{array}{l}\to (k)\\ {}x\end{array}\right)\\ {}\cdots \\ {}\frac{\partial {F}_n}{{\partial x}_1}\left(\begin{array}{l}\to (k)\\ {}x\end{array}\right)\end{array}\right)\kern0.5em \left(\begin{array}{c}{x}_1^{\left(k+1\right)}-{x}_1^{(k)}\\ {}\cdots \\ {}{x}_n^{\left(k+1\right)}-{x}_n^{(k)}\end{array}\right)=-\left(\begin{array}{c}{F}_1\left(\begin{array}{l}\to (k)\\ {}x\end{array}\right)\\ {}\cdots \\ {}{F}_n\left(\begin{array}{l}\to (k)\\ {}x\end{array}\right)\end{array}\right). $$
(12)

Recall that it is necessary to find a solution to a system of the form (11), where the unknown parameters are the values of the required input parameters for functionals J of the forms (8) and (10). So, the matrix of system (12) will be the Hessian of the original system, that is, the matrix consisting of the second derivatives of the equations of the system:

$$ H\left({\overrightarrow{x}}^{(k)}\right)=\left(\begin{array}{ccc}\frac{\partial^2J}{\partial {x}_1^2}\left({\overrightarrow{x}}^{(k)}\right)& \dots & \frac{\partial^2J}{\partial {x}_1\partial {x}_n}\left({\overrightarrow{x}}^{(k)}\right)\\ {}\dots & \dots & \dots \\ {}\frac{\partial^2J}{\partial {x}_n\partial {x}_1}\left({\overrightarrow{x}}^{(k)}\right)& \dots & \frac{\partial^2J}{\partial {x}_n^2}\left({\overrightarrow{x}}^{(k)}\right)\end{array}\right). $$
(13)

Since to calculate the Hessian in formula (13), the solution is differentiated with respect to the unknown parameters of the functionals, then, in the general case, it is simply impossible to find the derivative analytically in such problems. It must be found numerically using the truncated Taylor series. For the numerical search for the second derivative, it is necessary to have the value of the differentiated function at three points, and for the search for the mixed derivative—at four points.

Let us consider a simple case. Let it be required to solve a nonlinear equation of the form F(x) = 0, x ∈ R1. The classical Newton method, which solves this equation, is to apply the following relationship:

$$ {x}_{k+1}={x}_k-{\left({F}^{\prime}\left({x}_k\right)\right)}^{-1}F\left({x}_k\right). $$
(14)

The accuracy of method (14) can be improved by the following approach. Let it be required to solve a differential equation of the form y = f(x). Integrating both of its parts, we obtain the expression:

$$ \left\{\underset{x_k}{\overset{x_{k+1}}{\int }}{y}^{\prime } dx=\underset{x_k}{\overset{x_{k+1}}{\int }}f(x) dx\right\}=\left\{{y}_{k+1}-{y}_k=\underset{x_k}{\overset{x_{k+1}}{\int }}f(x) dx\right\}=\left\{{y}_{k+1}={y}_k+\underset{x_k}{\overset{x_{k+1}}{\int }}{y}^{\prime }(x) dx\right\}. $$
(15)

Obviously, the procedure for finding the integral is closely related to the accuracy of calculations. Using the right rectangles formula, Eq. (15) can be rewritten as follows:

$$ {\displaystyle \begin{array}{c}\left\{{y}_{k+1}={y}_k+\underset{x_k}{\overset{x_{k+1}}{\int }}{y}^{\prime }(x) dx\right\}=\left\{{y}_{k+1}={y}_k+h{y}^{\prime}\left({x}_k\right)\right\}=\\ {}=\left\{{y}_{k+1}={y}_k+\left({x}_{k+1}-{x}_k\right){y}^{\prime}\left({x}_k\right)\right\}.\end{array}} $$
(16)

These transformations are reduced to the classical Newton’s method if yk + 1 = 0 in Eq. (16) and solving the equation with respect to xk + 1. It is known that the accuracy of one iterative step of the broken line method is O(h2). The classical Newton’s method has a similar calculation accuracy. It is possible to increase the accuracy of one step during solving an equation using the predictor-corrector method. During applying the trapezoid method to find the integral, it will have:

$$ {\displaystyle \begin{array}{c}\left\{{y}_{k+1}={y}_k+\underset{x_k}{\overset{x_{k+1}}{\int }}{y}^{\prime }(x) dx\right\}=\left\{{y}_{k+1}={y}_k+0.5h\left({y}^{\prime}\left({x}_k\right)+{y}^{\prime}\left({x}_{k+1}\right)\right)\right\},\\ {}\left\{{y}_{k+1}={y}_k+0.5\left({x}_{k+1}-{x}_k\right)\left({y}^{\prime}\left({x}_k\right)+{y}^{\prime}\left({x}_{k+1}\right)\right)\right\}.\end{array}} $$
(17)

In Eq. (17), the value y(xk + 1) is unknown. It is being evaluated. Estimation is carried out using the polyline method as follows:

$$ z={x}_k-\beta \frac{y_k}{y_k^{\prime }},{x}_{k+1}={x}_k-\frac{2{y}_k}{y^{\prime}\left({x}_k\right)+{y}^{\prime }(z)}. $$
(18)

The parameter β can be taken equal to one. To speed up computations by reducing their total number, this parameter can be searched using the steepest descent method at each iteration in Eq. (18).

So, the finally obtained optimization method for the considered functionals is as follows:

$$ \overrightarrow{z}={\overrightarrow{x}}^{(k)}-\beta {H}^{-1}\left({\overrightarrow{x}}^{(k)}\right)J\left({\overrightarrow{x}}^{(k)}\right),{\overrightarrow{x}}^{\left(k+1\right)}={\overrightarrow{x}}^{(k)}-2{\left(H\left({\overrightarrow{x}}^{(k)}\right)+H\left(\overrightarrow{z}\right)\right)}^{-1}J\left({\overrightarrow{x}}^{(k)}\right), $$
(19)

where k is the iteration number of the optimization method; z is the intermediate values of unknown obtained parameters; \( {\overrightarrow{x}}^{(k)},{\overrightarrow{x}}^{\left(k+1\right)} \) are values of the unknown parameters obtained in the mathematical models for identifying the characteristics of the pollution source at iterations k and (k + 1), respectively; J(⋅) is the vector of derivatives of the functional; and H(⋅) is the Hesse matrix (Hessian).

3 Results

The convergence of the method is described by Eq. (19) for the task of minimizing functional (8) with constraints in the form of Eqs. (3)–(5), and (9) is shown in Table 1. The number of iterations is 30. The problem was solved by the finite difference method on different grids, which have dimensions: 40 × 40, 50 × 50, 60 × 60, 80 × 80, and 100 × 100 design nodes.

Table 1 Result of the convergence of Eq. (19) for an optimization problem with a functional of the form (8) (initial condition—zero vector, computational grid—100 × 100)

In the course of solving the problem, an approximate solution was obtained\( \left({\overset{\sim }{z}}_{\mathrm{Source}},{\overset{\sim }{C}}_{\mathrm{Source}}\right)=\left(0.3192348764;0.9946455187\right) \).

The resulting optimization method gives a more accurate result in contrast to the classical Newton method. After 30 iterations, the value of computational error of classical Newton method is 6.42∙10−6.

The convergence of the method described by Eq. (19) for the problem of minimizing functional (18) with constraints in the form of Eqs. (3)–(5), and (9) is shown in Table 2. The number of iterations is 38. The problem was solved by the finite difference method on different grids, which have dimensions: 40 × 40, 50 × 50, 60 × 60, 80 × 80, and 100 × 100 design nodes.

Table 2 Result of the convergence of Eq. (19) for an optimization problem with a functional of the form (10) (initial condition—zero vector, computational grid—100 × 100)

The resulting optimization method also gives a more accurate result in contrast to the classical Newton method. After 38 iterations, the value of computational error of classical Newton method is 1.55∙10−7.

The values in Tables 1 and 2 are given for the coefficient β = 1.

Table 3 contains a comparison of the results of the obtained method with the classical Newton’s method, as well as taking into account the steepest descent method to determine the optimal value of the β parameter.

Table 3 Comparative analysis of the efficiency of the obtained method in comparison with the classical Newton’s method

The calculation accuracy for all computational experiments was set equal to 10−15.

In the course of solving the problem, an approximate solution was obtained: \( \left({\overset{\sim }{x}}_{\mathrm{Source}},{\overset{\sim }{z}}_{\mathrm{Source}},{\overset{\sim }{C}}_{\mathrm{Source}}\right)=\left(0.6201465184894;0.3124693487;0.9954751464\right). \)

4 Conclusions

The proposed model is a special case of atmospheric air pollution using 1 source of pollution with 1 post of the monitoring system. The resulting models can be modified to account for several different pollution sources. The total number of calculations in such models is directly proportional to the number of pollution sources and the number of equations describing the process, which is directly considered. It should be noted that the obtained modifications of the classical Newton method have a number of advantages. First, the total number of calculations for obtaining solutions to the problems posed has been reduced by more than 3 times for two different functionalities. Secondly, the method is much more stable than the classical one due to the selection of the optimal parameter, which is used directly both for decreasing and for obtaining stable solutions at each iteration of the method.

During numerically realizing mathematical models, which are based on inverse problems, it must be taken into account the fact that the amount of computation can be very large. Thus, to obtain a numerical solution to inverse problems in the optimization formulation, it is necessary to solve multiple direct problems that are related to one inverse problem. Despite the fact that stochastic methods do not require information about the objective function, such as differentiability and local extrema, they can increase the number of calls to the procedure for solving the direct problem, and therefore the total number of calculations. If the objective function has local extrema, then it is advisable to use a combination of the obtained deterministic method and stochastic optimization methods in the complex.