1 Introduction

Air quality monitoring plays a crucial role in developing policies and strategies in order to achieve the aims of environmental policies. The ultimate goal of air quality monitoring is to collect data by which scientists, politicians and planners are enabled to make appropriate decisions in managing and boosting global environmental quality (Gurjar et al. 2010; Liu et al. 1986; Mofarrah and Husain 2010). Recent developments in online air quality monitoring and short-term measuring have led to the introduction of advanced warning systems and immediate notifications. These systems can reduce emissions during pollution episodes and help vulnerable inhabitants to cope with these conditions. Such developments, and also successful air pollution control policies, have led to dramatic improvements in air quality, public health, and life quality over the last few decades (Hsu et al. 2013; Kuhlbusch et al. 2014).

The aim of design of air quality monitoring network (AQMN) is to determine the number and locations of stations (configuration) which is highly essential in achieving the air pollution control. Considering the high cost of monitoring stations, i.e. equipment, maintenance and operating personnel, optimization of AQMN is crucial for air pollution control managers (Bayraktar and Turalioglu 2005; Modak and Lohani 1985a; Mofarrah and Husain 2010).

The primary tasks in the optimization of AQMN are based on empirical and quantitative approaches (Kao and Hsieh 2006; Nejadkoorki and Baroutian 2012). However, recent literatures have shown increasing tendencies to apply multi-objective and systematic approaches in the design of AQMN (Gómez-Losada et al. 2014; Liu et al. 1986; Lozano et al. 2009a, b, 2010; Maria Grazia et al. 1999; Mazzeo and Venegas 2008; Nejadkoorki et al. 2011; Pires et al. 2008a; Zarandi et al. 2008). Table 1 summarizes such descriptive studies.

Table 1 Systematic approaches in AQMN designing

The studies done in the field of AQMN design can be classified based on: (1) target objectives for the network, (2) intended pollutants for monitoring, (3) method to determine the concentration distribution of pollutants over the study area, (4) method of finding the number and locations of stations (design technique), and (5) intended areas for monitoring.

According to Table 1, most of the studies have proposed multi-objective planning methods and cost reduction has been considered as a sub-objective. In other words, the design and/or optimization aim has been to achieve the desired objective(s) with minimum cost, i.e. number of stations. Measuring the maximum concentrations is usually one of the general goals of AQMN (Zhang et al. 2014). The common fault of the methods when this objective is considered as the only criterion in the design procedure is that the proposed stations will not be truly representative of the whole area.

Having in mind the high cost of stations simultaneous measurement of several pollutants at one station is preferred. Therefore, newly proposed network design methods have been established in a way in which multiple pollutants are considered. In addition, if more than one pollutant is measured at a common station, estimating missing values will be possible by using cross-correlations between the other pollutants (Sarigiannis and Saisana 2008). Various techniques have been used to determine the concentration distribution of pollutants over the studied area. In the case of AQMN design, the area is usually divided into grids and then the pollutants concentration is determined for each grid. The centroid of each grid is considered as a potential location for a monitoring station (Nejadkoorki et al. 2011). The pollutant concentration in each grid is estimated by using different methods. Dispersion models with respect to the distribution of pollution sources are used to determine the concentration of pollutants in each grid in the source-oriented siting procedures. Sometimes, available input data makes the application of dispersion models and complex numerical tools impossible; thus, the statistical approaches such as interpolation techniques and Land Use Regression (LUR) models are used to estimate the pollutants concentration in which the available data like existing monitoring stations, environmental measurement data, and satellite turbidity data are considered as input data (Elkamel et al. 2008; Gerharz and Pebesma 2013; Liu et al. 2014b; Sarigiannis and Saisana 2008). Generally, in most methods, a score is given to each grid based on the calculated concentrations. In the case of multi-objective design (e.g., monitoring areas of high population density and detection of violations over ambient standards) another score is given to each grid based on the considered objective. After this stage, each of the presented methods will take different route. In most studies as seen in Table 1, after determining the objective function and its value for each grid, the grid ranking (ranking potential station locations) is used to select the optimal location. In these methods, simultaneous determination of the optimal number and locations of the stations is difficult and usually the number of stations has been specified in advance based on budget constraints. The methods in which the number and locations of the stations are offered simultaneously, i.e. joint solution, would be more practical. Another major difference between the proposed methods is the type of monitoring area, where different types of monitoring areas (such as industrial or urban) will lead to different objectives, pollutants, dispersion models, and design and/or optimization techniques.

Emission of wide variety of pollutants from industrial plants such as oil refineries has made them one of the largest sources of air pollution (Abdul-Wahab et al. 2011). The present study introduces an integrated approach to design a sensitive AQMN for industrial plants (i.e. oil refinery and petrochemical plants). This method optimizes the number and locations of air quality monitoring stations with respect to the sensitivity of monitoring stations to emission sources. The sensitive monitoring station has the capability of quick diagnosis of stochastic accidents in the emission sources which lead to an increase in the pollutants concentration. This character attribute of monitoring stations enables the authorities to specify the defective emission sources and to take prompt measures to control them. The optimization algorithm is developed considering (1) maximum coverage area of the monitoring network, (2) continuity of coverage area, (3) least overlap among coverage areas, (4) maximum detection of violations over ambient air standards, and (5) sensitivity of monitoring stations to emission sources. The developed integrated method leads to a source-specific monitoring network which can be used effectively to control emission sources.

The multiple cell approach (MCA), which was described previously (Fatehifar et al. 2006, 2007, 2008) was expanded and used to simulate air pollution dispersion and determine ground level concentrations. The outputs of the MCA model were used as input data for calculation of the fitness function of optimization algorithms in order to find the optimal configuration of AQMN. The solution with the least value of the fitness function offers the best configuration of monitoring stations. In order to get the best solution to the problem, Ant Colony Optimization Algorithm (ACO) and Genetic Algorithm (GA), integrated with the MCA model, were used separately as the optimization tools. ACO and GA are robust methods capable of locating near global optimal solutions for complex problems. The proposed methodology was implemented as a MATLAB program which is flexible and expandable.

2 Materials and methods

2.1 Air pollutant dispersion model

In this study, the multiple cell approach (MCA) based on the solution of the three-dimensional diffusion equation is used to predict ground level pollutant concentrations. The model based on the previous papers (Fatehifar et al. 2006, 2007, 2008) is developed in a way that can consider wind direction in the prediction of ground level pollutant concentrations in order to make it more adapted for AQMN optimization. However, a brief description of its main considerations and assumptions is included below.

The program of the model uses MATLAB-based graphical user interface (GUI) and it is applicable for network of refinery stacks, petrochemical complexes, urban and industrial stacks. The model is based on Eulerian Model and k-theory in which the advection and diffusion of pollutants from point sources in Cartesian Coordinates (considering constant wind velocity and turbulent diffusivities) are expressed by a advective–diffusive equation (Fatehifar et al. 2008). The simplifying assumptions of the model are:

  1. 1.

    Steady state conditions.

  2. 2.

    The initial conditions are arbitrarily set at zero.

  3. 3.

    Transport by bulk in wind direction exceeds diffusion in that direction, i.e., the x-axis eddy diffusion coefficients of the model were neglected.

  4. 4.

    The wind velocity is constant, a function of z and only in x direction.

  5. 5.

    There is no chemical reaction and no deposition in the system.

Figure 1 shows the schematic view of selected domains for modeling of pollutants dispersion. As can be seen, two domains for modeling procedure are introduced: main and modelling domains.

Fig. 1
figure 1

Schematic of selected domains for modeling

The main domain is defined in a way which includes all the area that pollution gets dispersed over, in different wind directions. The modeling domain is a moving domain which can rotate based on the wind direction, around the central point of the main domain. The network of the point sources is located in the center of the main domain.

The empirical equations (which are dependent on the stability classes of atmosphere and functions of surface roughness length and friction velocity) are used to calculate eddy diffusivities in the lateral and vertical directions and wind velocity at different heights. The modified Holland’s equation is used for plume rise calculation. Application of this model to industrial stacks has shown results that agree with observational data within a reasonable accuracy (Fatehifar et al. 2006, 2007, 2008).

2.2 Objectives of the air quality monitoring network

The main goal of designing an AQMN is to collect the most possible information in the least expensive way. So, the monitoring stations must be located in the points where they properly represent pollutant concentration of the relevant surrounding area and provide the maximum coverage area with the minimum number of stations. Therefore, the minimum overlap among covered areas of stations (which leads to the maximum coverage area) can be considered as an objective. Furthermore, the stations must be located in areas which have the maximum air pollutant concentration and also, high fluctuations of pollutant concentration which leads to the estimation of sensitivity of stations to pollutants sources. Maximum population coverage can also be considered when resident safety is a concern, but considering the studied area, which is located far from urban and high density areas, population protection is not an objective in this study.

2.2.1 The coverage area

To determine the coverage area of the stations, the concept of sphere of influence (SOI) along with the continuity of coverage is used. The continuity of coverage area leads to an increment of accuracy of monitoring. The concept of SOI has been widely used in the literature (Baldauf et al. 2001, 2002; Chang and Tseng 1999a, b; Elkamel et al. 2008; Kao and Hsieh 2006; Liu et al. 1986; Mofarrah and Husain 2010; Mofarrah et al. 2011; Arbeloa et al. 1993; Tseng and Chang 2001) without considering the continuity of coverage area.

The contiguous SOI of a station is defined as the surrounding area of the station where the air quality data can be considered as representative of the whole area. It is based on the similarity between the information contained in a given station compared with the rest of area. To calculate the contiguous SOI, a spatial correlation coefficient (r) based on the extent of similarity in ground level pollutants concentration is used. The ground level pollutants concentration are estimated by the MCA model. Presuming that C 1 = (C 11, C 12, C 13, …, C 1n ) and C 2 = (C 21, C 22, C 23, …, C 2n ) are the indicators of pollutant concentration in two various network points calculated by the MCA model at the same time, the spatial correlation coefficient for a sample size n can be expressed as:

$$r = \frac{{\sum\nolimits_{i = 1}^{n} {\left( {C_{1i} - \bar{C}_{1} } \right)\,\left( {C_{2i} - \bar{C}_{2} } \right)} }}{{\sqrt {\sum\nolimits_{i = 1}^{n} {\left( {C_{1i} - \bar{C}_{1} } \right)^{2} } \sum\nolimits_{i = 1}^{n} {\left( {C_{2i} - \bar{C}_{2} } \right)^{2} } } }} \,$$
(1)

where, \(\bar{C}_{1}\) and \(\bar{C}_{2}\) are the average concentrations at locations 1 and 2, respectively.

$$\bar{C}_{1} = \frac{1}{n}\sum\limits_{i = 1}^{n} {C_{1i} } \quad and\quad \bar{C}_{2} = \frac{1}{n}\sum\limits_{i = 1}^{n} {C_{2i} } {\kern 1pt}$$
(2)

To calculate contiguous SOI in each point, the r value obtained for each point is compared to a predefined value called cut-off value (r c ). If the spatial correlation coefficient (r) is higher than r c , the corresponding points, which are interconnected to each other (Fig. 2), would be considered as correlated. The coverage area of ith candidate location is defined as the number of potential locations placed inside the contiguous SOI of candidate location i can be quantified in terms of pattern scores. Pattern score for an ith candidate location is denoted by \(N_{p}^{i}\).

Fig. 2
figure 2

Continuity of SOI

Therefore, determining the optimum number and locations of monitoring stations, where the overall pattern score is maximized, is the first interest in AQMN design. High overall pattern score leads to the optimal coverage area. This objective of the design can be expressed as:

Maximize \(\sum\nolimits_{i = 1}^{m} {N_{p}^{i} }\).where, m is the number of stations.

2.2.2 Violation over ambient standards

The concept of violation score is used to determine the extent of violation over ambient standards for each point: the point with higher violation score is assumed to have a high potential of detecting the violations and a high sensitivity to emission sources. A weighted scoring is used to determine the violation scores based on the concentration of pollutants which depends on:

  1. 1.

    The threshold level of pollutant concentration;

  2. 2.

    Weighing factor between each threshold range and weighing function.

A decision on threshold levels and weighing factors is indeed pollutant-specific and dependent upon the considered average time. Several weighing functions such as linear, segmented linear, non-linear, and segmented non-linear can be employed (Elkamel et al. 2008). In this study, a segmented non-linear function is used. The violation score for each candidate location is calculated using the equation:

$$N_{v}^{i} = \,\sum\limits_{i = 1}^{T} {\sum\limits_{k = 1}^{{N_{t} }} {\frac{{\left( {w_{k + 1} - \,\,w_{k} } \right)\left( {x_{i} - \,x_{k} } \right)X}}{{\left( {x_{k + 1} - \,x_{k} } \right)}}\,\,} }$$
(3)

where, \(N_{v}^{i}\) is the violation score for the ith candidate location, w k is the weighing factor corresponding to threshold x k , x k is the kth threshold, X = 0 if (xi − xk) ≤ 0, otherwise X = 1, N t is the total number of thresholds and T is the total number of simulated observations. The threshold values are chosen based on the United States Environmental Protection Agency (USEPA) limit values of common pollutants (Elkamel et al. 2008; Sarigiannis and Saisana 2008).

The second interest of optimization could therefore be deliberated as the identification of the optimum number (m) and locations of monitoring sites so that the overall violation score is maximized. High overall violation score leads to more sensitivity of stations to pollutants sources and achieving a maximum detection of violations over ambient air standards. This objective of the designing can be expressed as:

Maximize \(\sum\nolimits_{i = 1}^{m} {N_{V}^{i} }\)

where m is the number of stations.

2.2.3 Sensitivity of candidate locations to emission sources

The sensitivity of candidate locations to emission sources is calculated using the following procedure for every meteorological scenario:

Initially, the MCA model is used to calculate the ground level concentration of pollutant at each point for the eth meteorological scenario using the average concentration of pollutant emitted from emission sources. In the next step, the concentration of pollutant is raised 25 % in the first source (concentrations of other sources are constant) and the ground level concentration of pollutant is obtained (using MCA model) at each point again. This procedure is done accordingly for all of the sources at 25, 50, 75 and 100 %. Finally, the score of sensitivity of candidate locations to emission sources are calculated using the following equation:

$$N_{s}^{i} = \,\sum\limits_{e\, = 1}^{E} {\,\,\sum\limits_{s = 1}^{S} {\;Sl_{es} \,\,} }$$
(4)

where, \(N_{s}^{i}\) is the sensitivity score for the ith candidate location, Sl es is the slope of concentration variation at ith candidate location versus concentration changes in pollution source s and meteorological scenario e, S is the number of emission sources, and E is the number of meteorological scenarios.

This objective of the designing can be expressed as:

Maximize \(\sum\nolimits_{i = 1}^{m} {N_{s}^{i} }\)

where m is the number of stations.

2.2.4 Fitness function

Given that having a network with high overall pattern score, overall violation score, and overall sensitivity score is desirable, a combination of functions is used to achieve the simultaneous maximum pattern score, violation score, and sensitivity score. Several forms of combinations (e.g., additive or multiplicative form) could be used for combining these independent variables, out of which the multiplicative form is used:

$$F = (N_{p} )^{b1} \times (N_{v} )^{b2} \times (N_{s} )^{b3} \quad (where\,b_{1} + b_{2} + b_{3} = 1)$$
(5)

The parameters b 1, b 2, and b 3 are used to weigh the relative importance given to the three objectives and their value depends on the purpose of the network to be developed. The multiplicative form implies an intensity function, and for positive high correlations, the interests for a location are quite well represented by the product (N p  × N v  × N S ), rather than the addition (Elkamel et al. 2008; Modak and Lohani 1985b). Also this structure of the objective function ensures that if any of the three sub-objectives become zero, the overall objective function (F) will also has a zero value. For example, if a potential station location has a high violation score, but zero effective pattern score, then it will be discarded in the optimal design.

It should be noted that the effect of weighting factors on F, N p , N v , and N s values of each pollutant were investigated to give appropriate weight value for each component.

Since the objective of this research is to design an AQMN for multiple pollutants, an aggregating approach is used:

$$UF_{i} (u){\kern 1pt} = {\kern 1pt} \sum\limits_{j = 1}^{p} {w_{j} F_{j}^{i} }$$
(6)

and

$$\sum\limits_{j = 1}^{p} {w_{j} {\kern 1pt} = {\kern 1pt} 1}$$
(7)

where, w j is the importance associated with pollutant j, \(F_{j}^{i}\) is the objective function for pollutant j at location i and UF i (u) is the additive/overall objective for P pollutants at location i. Considering that each pollutant had the same importance (in this case study), the equal weight values were used to give the same tendency for each pollutant.

Now, given the defined equations, the objective of the network design is to achieve the maximum sum of the UF i of stations. In order to reach this maximum value, the minimum number of overlap points among the coverage areas of stations (which is inconsistent with this maximum value) is used. For the sake of design a fitness function of ACO and GA the two dimensionless parameters are defined as:

$$UF_{n} \, = \,\frac{{UF_{\hbox{max} } {\kern 1pt} \times {\kern 1pt} m}}{{\sum\nolimits_{i = 1}^{m} {UF_{i} (u)} }}$$
(8)
$$OL_{n} \, = \,\frac{number\,\,of\,\,overlaped\,\,grids}{maximum\,\,number\,\,of\,possible\,\,\,overlaped\,\,\,grids}$$
(9)

where, UF n is Overall Objective number of network, OL n is Overlap Number, m is the number of stations and UF max is the maximum value of UF(u) among all UF(u)s of the potential station locations.

Finally, since obtaining least UF n and OL n simultaneously is desirable in network design (leading to more coverage area), an aggregate function, is used as follow:

$$G\, = \,g_{1} \, \times \,OL_{n} \, + \,g_{2} \, \times UF_{n} \quad (g_{1} \, + \,g_{2} {\kern 1pt} = \,1)$$
(10)

where, g 1 and g 2 are the weight coefficients defined according to the relative importance of each parameter. The variation of the coverage efficiency versus g 1 (g 2 = 1 − g 1) was investigated in order to find the best weight values.

In order to find the least value of G where the sum of the UF i of stations is maximum and the overlap among the coverage areas of stations is minimum, GA and ACO are used separately and then the results are compared in order to confirm the obtained results.

ACO is a metaheuristic optimization algorithm based on the behavior of ants seeking a path between their colony and a source of food. This collective trail-laying and trail-following behavior in which an ant is influenced by the pheromone trail left by other ants is the source of inspiration for ACO. This algorithm benefits from artificial ants which deposit pheromone based on the fitness and goodness of the identified trial solutions (Emami Skardi et al. 2015; Fayaed et al. 2013). The GA is a stochastic search algorithm, which mimics evolution in nature and it is an efficient tool in searching for the global optimum. This algorithm is widely used to optimize functions in various geocomputational applications (Liu et al. 2014a; Shad et al. 2009).

Overall, the AQMN designing procedure with sensitivity of monitoring stations to emission sources can be summarized by a flowchart as shown in Fig. 3.

Fig. 3
figure 3

Flowchart of proposed method for optimal allocation of monitoring stations with sensitivity to emission sources

3 Illustrative case

The algorithm defined above was used for optimizing AQMN around Tabriz Oil Refining Company. The company is located in the southwestern part of Tabriz, East Azerbaijan province, Iran. The refinery has 14 refining units and 10 utility units which include 26 furnaces. Given that the exhausts of some of furnaces are joined to each other, 20 stacks disperse the exhaust pollutants over the surrounding area. The MCA model was used to create spatial distributions for the concentrations of the pollutants (SO2, NOx, CO) using 1 year’s collected data. Normally 1 year is considered to be the minimum period which can reflect the meteorological fluctuations (Elkamel et al. 2008; Arbeloa et al. 1993). A flue gas analyzer (MRU Varioplus, Germany) was used to measure the temperature, velocity and concentration of the flue gas species. The measurement was done periodically every week during the year 2012. Totally, 48 data sets were collected. The other information of emission source that needs to be input into the model are the physical stack dimensions such as height, location and internal diameter. It should be noted that the effect of background concentration is negligible with regard to the refinery location and prevailing wind directions. Table 2 summarizes the emission sources information.

Table 2 Information of Emission sources

In addition, the meteorological data such as wind speed, wind direction, ambient temperature, height of the mixing layer, and stability class, registered at a closest weather station of Tabriz Meteorological Organization, were used as the input data. Figure 3a shows the location of Tabriz Oil Refining Company and the positions of air pollution sources. Location 1 includes 18 sources and each of the locations 2 and 3 include only one source. Investigation of the wind regimes of the study area between the years 2005 and 2012 indicated that the prevailing wind directions throughout the study area were eastern (90°) and north eastern (45°) for more than 80 % of the time. Thus, only these two directions were used in the modelling and optimization procedures. Tabriz has a semi-arid climate with dry and semi-hot summer and snowy cold winter. The temperature varied from −12 and 39 °C during the year 2012. It is worth noting that Tabriz Oil Refining Company is located at a smooth plate.

4 Results and discussion

At the beginning of AQMN designing, the study area was divided into 100 × 100 m continuous grids in which each grid represented a potential site for monitoring stations. According to the prevailing wind directions throughout the study area, the area was divided into 3245 grids (59 × 55) and they were numbered sequentially according to Fig. 4a. The MCA model was used to model the distribution of pollutants concentrations in the area based on the 48 collected data. A matrix of 48 × 3245 for SO2, NOx, and CO concentrations that has been generated at the specified 3245 candidate locations was used as an input to the optimization algorithm. Figure 4b schematically shows the modeling grids, location of sources and potential zones for locating monitoring stations according to the prevailing wind directions.

Fig. 4
figure 4

a Location of industrial plant and emission sources, b Modeling grids, location of emission sources, potential zones and wind directions

As an example, Fig. 5 shows the ground level concentration of pollutant for a selected domain at different wind directions and meteorological conditions.

Fig. 5
figure 5

Concentration of pollutants at ground level with wind blowing direction: a 45, b 90

Given the air pollution dispersion, a “potential zone” is recommended for locating the stations. A potential zone is an area in which the pollutant concentrations are higher than a predefined threshold value (Kao and Hsieh 2006). According to the Fig. 5, a potential zone can be diagnosed from the pollutant concentration profile at ground level of simulation examples.

Considering the intended objectives for the AQMN, the candidate locations with high value of N p , N v and N S will occur in the potential zone. So, the output results of the MCA model determine the location of the potential zone in each modelling scenario and subsequently the behavior of integrated optimization algorithms (ACO and GA). The length and the distance of the potential zone from the emission sources are dependent on the pattern of pollutant concentration which is a function of meteorological conditions and emission rates. Accordingly, the effects of important meteorological parameters such as wind velocity, atmospheric stability, air temperature and dispersion coefficient and also effective parameters of emission sources on the dispersion of pollution and the location of the potential zone were investigated. The results showed that when the wind velocity increases, the pollution dispersion decreases and pollutants go far from the stacks. The distribution of pollutants increases with increasing atmospheric instability, so the pollutants do not go far from the emission sources. The dispersion of pollutants increases with increasing air temperature and pollutants come down near the emission sources. Increasing the stack exit velocity and temperature leads to a decrease of the ground level concentration. Therefore, it can be concluded that the distance of potential zone from the emission sources increases by increasing the wind velocity and atmospheric stability, and decreasing the stack exit velocity and temperature and the air temperature.

In order to verify the predictions of the model, a comparison of model output with the measured ambient concentration data was done. The measured data was collected around the refinery simultaneously with stack sampling. The measurement was done twice at downwind distances of 400, 800, and 1200 m from the emission sources. Note that the measurements at sampling points were not done simultaneously.

Figure 6 illustrates a comparison between measured data and model results. As can be seen, there was good agreement between the measured data and modeling results. The mean error between measured and modelled values was less than 10 %.

Fig. 6
figure 6

Comparison of measured and modelled concentration a, c first data sets, b, d second data sets

In order to give appropriate weight value for each component in Eqs. (5) and (10), their sensitivity was investigated. In the case of Eq. (5), the results showed that by increasing the b 1 (decreasing b 2 and b 3) the overall N v and N s values of the network decrease and the F value and the coverage efficiency of the network increase. Increasing the b 2 and b 3 (decreasing b 1) showed almost the same trend (decreasing the coverage efficiency and overall N p of the network). For b 1 values lower than 0.8, the coverage efficiency decreases significantly. Having in mind that the low coverage efficiency is not appropriate for AQMN and it leads to a network with more monitoring stations which is not economically feasible, the 0.8 was considered as the minimum value of b 1 which causes a network with acceptable coverage efficiency. For the b 1 values higher than 0.9 the overlap among coverage areas of stations increases markedly and also the overall N v and N s values of the network decrease. Finally in order to have a sensitive monitoring station with high capability of detection of violations over ambient air standards, the equal weight values were used for N v and N s (b 1 = b 2 = 0.1 and b 2 = 0.8). It should be noted that the selected weight values are not applicable to any case study. Because the N p , N v and N s values are dependent to the air pollutants dispersion and the characteristics of emission sources.

The results of sensitivity analysis of Eq. (10) showed that for the g 2 values lower than 0.92 the coverage efficiency (and UF n ) decreased markedly due to the increasing overlap among covered areas of stations. On the other hand for the g 1 values of higher than 0.8 the OL n and also the coverage efficiency had acceptable values. So, the g 1 = 0.8 and g 2 = 0.92 were selected as the weight values to reach the best configuration of AQMN, in which the optimization results (e.g., Figure 9) confirm the selected weight values.

4.1 Comparison of ACO and GA results

The optimization algorithm was implemented for different cut-off values r c (0.75–0.95) and different station numbers (1–6). Table 3 shows the locations of stations which were obtained by using ACO and GA for cut-off values 0.75 and 0.95. Due to the different structure of the algorithms, the obtained station locations were different.

Table 3 Optimal locations for air quality monitoring stations, obtained using ACO and GA (r c  = 0.75 and r c  = 0.95)

Figure 7 shows the overall UF i of Network for different configurations denoted in Table 3. As shown in the figure, by decreasing the r c , the overall UF i of network increases, which causes an enlargement of the network coverage area. For a stipulated budget, an air quality monitoring organization can maintain either a high or a low r c value based network. A high r c based network may not necessarily cover the entire region, but the covered region could be well represented. A low r c based network, on the other hand, would offer more coverage of the area, but the covered area may not be satisfactorily represented (Elkamel et al. 2008).

Fig. 7
figure 7

Overall UF i of network versus number of stations as a function of r c a ACO b GA

Considering Figs. 7 and 8 it becomes obvious that the overall UF i and overlap among coverage areas of designed network by GA were higher than that of ACO. However, concurrent comparison of overall UF i of the network, the overall number of overlapped grids and also the number of covered grids of network showed that both algorithms up to three stations had the same trend. Figure 7 indicates that by increasing the number of stations (from 4 to 6), increasing rate of overall UF i of network decreased. In contrast Fig. 8 shows a sharp increase of overall number of the overlapped grids. In general, it can be concluded that increase of overall UF i of the network was due to the coverage of identical points by several stations, or in the other words, increasing the number of overlapped grids. In conclusion, establishing a network with more than three stations would not be economically justified for the study area.

Fig. 8
figure 8

Overall number of overlapped grids versus number of stations (r c  = 0.85)

A comparison between the results of the current study and a previous study (Elkamel et al. 2008) was made in order to evaluate the performance of the proposed method. The results of the comparison indicated that the new method performed better than the previously reported method. The new method led to a sensitive network with more coverage area and high accuracy of monitoring while the number of proposed stations was the same in both studies. Higher coverage area with the same number of monitoring devices leads to more information and decreasing costs of equipment, maintenance and operating personnel. The reduction in costs is due to the fact that even by increasing (almost doubling) the coverage area, AQMN was designed in a way that requires the same number of stations. However, considering the doubling of the coverage area, if the previously reported method was used at least six stations would be required.

4.2 Sensitivity of monitoring stations to emission sources

Using values obtained at sensitive monitoring stations and having the knowledge of the sensitivity of the monitoring stations to emission sources will allow to quickly identify the defective sources, and then take the necessary measures to control them. Defective sources are the emission sources in which, due to the unexpected or stochastic changes, the amount of released pollutants has been higher than normal value.

For this purpose, a database of the sensitivity of the location of monitoring stations to emission sources was created and a program capable of ranking the defective sources by using the created sensitivity database and changes in measured concentration at monitoring station was designed to be used after setting up the stations. Three candidate locations (795, 1152 and 1505) with r c  = 0.85 were selected for the placement of the monitoring stations. Development of the database was done using the data which are used to calculate the sensitivity scores of selected optimal locations.

Figure 9 shows the coverage area of AQMN for selected location of stations for r c  = 0.85 and pollutants NOx, SO2 and CO.

Fig. 9
figure 9

Coverage area of AQMN for selected location of stations: a Pollutant NOx, b Pollutant SO2, and c pollutant CO

5 Conclusions

Designing air quality monitoring networks is an important task for environmental protection authorities. It is necessary for AQMN to be planned effectively and systematically. An improved methodology is presented to optimize the number and location of air quality monitoring stations with respect to multi-pollutants and multi-objective model.

In this approach, the developed MCA model was employed to model the distribution of pollutants concentrations, and the results of dispersion model were used as inputs for the optimization algorithm. The generated monitoring network provides the optimal number and location of monitoring stations with respect to maximum coverage area with the minimum overlap among coverage areas as well as the maximum detection of violations over ambient standards and the maximum sensitivity of monitoring stations to emission sources for primary gaseous pollutants such as SO2, NOx and CO. Ant colony optimization algorithm and Genetic algorithm were used in the optimization procedure. This approach was applied to the case of Tabriz Oil refining Company.

The results of the optimization procedure were investigated for different values of the correlation coefficient (r c ) which showed that as the cut-off correlation coefficient was increased, the coverage area of stations and the overlap region were decreased.

Comparison of the performance of the ACO and GA indicated that ACO has better ability for reduction of overlap among coverage areas of monitoring station than GA. However both algorithms have good ability in finding the best configuration of AQMN and maximizing the coverage area of monitoring stations.

A comparison between the results of the current study and a previous study (Elkamel et al. 2008) indicated that the new method provides more coverage area. This increase in coverage efficiency will be remarkable when the increase of network coverage area (due to considering the prevailing wind directions) and accuracy of monitoring (due to considering the continuity of coverage area) be considered. Finally, three sensitive stations were suggested for AQMN of study area which enables the authorities to specify defective sources and then take prompt measures to control them.