1 Introduction

Urban air quality has degraded at an alarming rate in the recent decades [1]. Chennai being one of the major metropolitan and industrial area has shared the same fate. Chennai regarded as the “Detroit of India” due to its large Automobile industry has been under constant scrutiny of Central pollution control board (CPCB) and Tamilnadu pollution control board (TNPCB) [2]. Chennai city has been classified as a non-attainment area for PM10 due to the number of exceedances [3]. This leads to the challenge of informing the residents and policy makers about the air quality of region effectively and in time.

Air quality information dissemination is an integral part of any air quality management system. Air quality index plays a vital role in this purpose. Many air quality indices have been developed in the past few decades [4, 5] to serve the purpose of providing current air quality status for resident’s health and safety at the time of exceedances and informed decision making for air quality management strategies of the area.

National Air quality index (NAQI) based on a linear scale working with maximum operator has the problem of eclipsing that is showing lower concentration as compared to the observed concentration and ambiguity showing higher concentration as compared to the observed concentration at or near the class boundaries. These problems occur due to a strict boundary between air quality classes which don’t give enough information about the uncertainty and imprecision in air quality data.

The following paper is divided into 7 parts. Followed by the introduction, study area is explained with the location details and types of monitoring location. Conventional air quality indexing techniques are explained in Sect. 3. In the next section the philosophy of fuzzy logic is elaborated with the explanation of Fuzzy similarity method and distance metrics employed in the development of the model. The air quality of the study area is evaluated in the results and discussion section. The paper ends with conclusion describing the main attributes of the model and analysis and future scope of the work.

2 Study Area

Chennai city is located on the south-eastern coast of peninsular India [2]. It has a flat coastal terrain with the average elevation of 6.7 m. It stretches along a length of 25.6 km with an area of 176 Km2. Chennai is considered as the Indian capital of automobile manufacturing with a share of 30% of automobiles and 40% of auto-parts manufactured in India.

According to Indian meteorological department (IMD) the weather in Chennai is sub-tropical (hot and humid) with 4 seasons i.e. winter, summer, pre-monsoon and monsoon [6]. The season of winter occurs between January to February with an average temperature of 25.6 °C, Summer from March to May with an average temperature of 30.5 °C, Pre-monsoon or southwest monsoon occurs from June to September with an average temperature of 29.7 °C, while Post-monsoon or northeast monsoon occurs from October to December with an average temperature of 26.7 °C. 60% of rainfall occurs during the season of post monsoon or northeast monsoon.

Daily average concentration of Sulphur dioxide (SO2), Nitrogen dioxide (NO2) and Respirable suspended particulate matter (RSPM) for 2009–2013 was collected from Tamilnadu pollution control board (TNPCB). TNPCB monitors Air quality data twice a week at 6 locations as depicted in Fig. 1 and Table 1.

Fig. 1
figure 1

Map showing monitoring locations for air quality monitoring

Table 1 Details of monitoring locations for the study area

3 Air Quality Index

Air quality index has been developed for weighted and aggregated representation of complex environmental data in a simplified and intelligible form. It helps the common public in taking proper precautions in the case of exceedances and decision makers in developing new policies for reducing the harmful effects of air pollution and developing a sustainable society.

The most commonly used air quality index was developed by USEPA earlier named as Air pollution index (API) [7] and later renamed to USEPA AQI. In the above index the air quality is divided into 5 strict linear classes based on concentration breakpoints. A sub-index is calculated for each pollutant using the given Eq. (1)

$$I_{p} = \frac{{I_{HI} - I_{LO} }}{{BP_{HI} - BP_{LO} }}\left( {C_{p} - BP_{LO} } \right) + I_{LO}$$
(1)
Ip:

index for pollutant p

Cp:

Rounded concentration of pollutant p

BPHI:

Breakpoint that is higher than or equal to Cp

BPLO:

Breakpoint that is lower than or equal to Cp

IHI:

AQI value corresponding to BPHI

ILO:

AQI value corresponding to BPLO

4 Fuzzy Logic

Fuzzy logic is a multi-valued logic formed by extending the binary logic into the realm of partial membership. It was first described by Lotfi A. Zadeh in a seminal paper published in 1965 [8]. Fuzzy logic uses overlapping membership functions to model the uncertainty and ambiguity pertaining in a real world complex system like urban air quality. Fuzzy logic helps in classification using linguistic parameters developed by employing the expert knowledge-base derived by responses from a fuzzy questionnaire.

Fuzzy logic has been applied in a number of papers for the assessment of air quality and development of air quality management tools [9,10,11,12]. In assessment of a complex system like air quality fuzzy logic based techniques are able to trap and model the uncertainty arising in data due to error during monitoring, and analysis.

The membership functions were determined by employing expert’s perception using fuzzy questionnaire duly filled by different air quality experts. The mean of the membership functions developed were used as the final set of membership functions.

The membership functions help in determining the AQI by integrating the health and ecological effects of different pollutants obtained through the expert’s experience in the field of air quality into the Fuzzy based model (Fig. 2).

Fig. 2
figure 2

Fuzzy membership functions for SO2, NO2 and RSPM

5 Fuzzy Synthetic Evaluation

In Fuzzy synthetic evaluation (FSE) the air quality standards for different pollutants were used to develop linguistic classes which were represented in matrix form with the columns as pollutant and rows as classes [13]. The pollutant concentration value was compared with the standards using the min max approach.

FSE constitutes Fuzzy similarity method (FSM) and simple fuzzy classification (SFC) [14]. In the case of FSM an evaluation matrix R as shown in Eq. (2) is derived from the mean value of each trapezoidal membership function with rows representing the pollutants and columns representing the air quality class and another matrix R’ represents the membership value of each pollutant for different air quality class at a given instance [15]. The 2 matrices are compared using the similarity function in Eq. (3) and the maximum value of the similarity function was used to represent the final air quality class for the given case.

$$R = \left[ {\begin{array}{*{20}c} {r_{11} r_{12} } & {r_{15} } \\ {r_{21} r_{22} } & {r_{25} } \\ {r_{31} r_{32} } & { r_{35} } \\ \end{array} } \right]$$
(2)
$$B_{j} = \frac{{\mathop \sum \nolimits_{i = 1}^{5} \hbox{min} \left[ {r_{ij} \left( x \right),r_{ij}^{{\prime }} } \right]}}{{\mathop \sum \nolimits_{i = 1}^{5} \hbox{max} \left[ {r_{ij} \left( x \right),r_{ij}^{{\prime }} } \right]}}$$
(3)
rij:

membership value for pollutant i and class j

i:

pollutant 1, …, 3, 1 = SO2, 2 = NO2 and 3 = RSPM

j:

class 1, …, 5, 1 = very low, 2 = low, 3 = medium, 4 = high, 5 = very high

In case of SFC, weight for each pollutant was derived using Shannon’s information entropy function [16]. Information entropy signifies the amount of information contained in a given set of data and represented by the negative log of normalised data [17].

Concentration for jth reading of the ith pollutant was normalised using Eq. (4) from the evaluation matrix. The normalised value rij is divided by the sum of all the normalised values using Eq. (5) and the final value is used to calculate the Shannon’s entropy for each pollutant in Eq. (6). Weight of a pollutant is calculated using Eq. (7)

$$r_{ij} = \frac{{x_{ij} - \min_{j} \left\{ {x_{ij} } \right\}}}{{\max_{j} \left\{ {x_{ij} } \right\} - \min_{j} \left\{ {x_{ij} } \right\}}}$$
(4)
$$f_{ij} = \frac{{r_{ij} }}{{\mathop \sum \nolimits_{j = 1}^{n} r_{ij} }}$$
(5)
$$H_{i} = - k\sum\limits_{j = 1}^{n} {f_{ij} } \ln f_{ij} \quad \quad i = 1,2,\, \ldots ,\,m$$
(6)
$$w_{i} = \frac{{1 - H_{i} }}{{m - \mathop \sum \nolimits_{i = 1}^{m} H_{i} }}\quad \quad 0 \le w_{i} \le 1,\quad \sum\nolimits_{i = 1}^{m} {w_{i} = 1}$$
(7)

Distance metric for each case was calculated from the Eq. (8) utilising the membership value from the evaluation matrix (R) and weight derived using Shannon entropy. The maximum value of the distance metric for each case determines the final air quality category

$$k\left( j \right) = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{3} \left( {w_{i} .\lambda_{ij} } \right)^{2} }}{{\mathop \sum \nolimits_{j = 1}^{5} \mathop \sum \nolimits_{i = 1}^{3} \left( {w_{i} .\lambda_{ij} } \right)^{2} }}}$$
(8)
\(\lambda_{ij}\) :

Membership value for jth value of ith pollutant

k(j):

distance metric for jth value

w i :

entropy weight for pollutant I

$$k\left( f \right) = \hbox{max} \left( {k\left( j \right)} \right)$$
(9)
k(f) :

air quality category of the value

6 Results and Discussion

The AQI procedure was applied to air pollutant concentration values obtained from the 6 urban air quality monitoring stations in Chennai. The fuzzy AQI was calculated using FSM and SFC and compared with the conventional National air quality index (NAQI). The results for Kathivakkam and Manali industrial area for 2013 are shown in Tables 2 and 3 respectively.

Table 2 Comparison of Fuzzy synthetic evaluation methods with National air quality index for Kathivakkam for 2013
Table 3 Comparison of Fuzzy synthetic evaluation methods with National air quality index for Manali for 2013

The FSE results are optimistic compared to the NAQI as they show that the air quality is better than air quality determined by the NAQI. This happens because the concentration of all the pollutants are integrated for calculation of index in case of FSE techniques while in the case of NAQI the index is determined only by the concentration of the pollutant with the maximum sub index which causes the NAQI to be less sensitive to the health effect of other pollutants. The FSE techniques give a clearer picture about the air quality as the effect of all the individual pollutants are integrated in the final AQI. The use of membership functions, expert’s perception and predetermined Shannon’s entropy based weights make the FSE based methods more sensitive to changes in concentration of air pollutants.

The AQI in the winter months of January and February are poor as compared the other months of the year. The best air quality was observed during the Northeast monsoon period of November and December.

Seasonal variation for the 6 monitoring locations for 2009–2012 are shown in Figs. 3, 4, 5 for SO2, NO2 and RSPM respectively. No exceedances were observed for SO2 and NO2.

Fig. 3
figure 3

Seasonal variation of SO2 at TNPCB monitoring locations

Fig. 4
figure 4

Seasonal variation of NO2 at TNPCB monitoring locations

Fig. 5
figure 5

Seasonal variation of RSPM at TNPCB monitoring locations

In case of all pollutants highest concentration was observed during the winter season due to stable atmospheric conditions which restricts the vertical mixing of air mass and dispersion of pollutants.

In the case of RSPM concentration, exceedances were observed during the years 2009–2011 for some of the industrial locations. All the exceedances were observed during the winter season (Fig. 4).

7 Conclusion

FSE methods integrates multiple pollutant’s concentration and expert’s knowledge into the AQI as opposed to conventional NAQI in which a single pollutant with maximum subindex is considered to define the air quality. Urban air quality is not easy to assess and comprehend because of its multi-pollutant characteristics. Contribution of each pollutant can be measured and assessed in the case of FSE.

The given methods are easy to model, less data intensive and the results are comprehensible for local public and decision makers. FSE techniques show more optimistic results as compared to the conventional AQI which can reduce the burden of money and effort on air quality management. The methods overcome the uncertainty of air quality classification near the class boundaries by implying fuzziness in the class boundaries using the overlapping Fuzzy classes and produce an air quality index with higher sensitivity. The above techniques can be used to develop an effective air quality management plan for the urban area of Chennai city.