1 Introduction

Rapid degradation of urban air quality has become a major health concern worldwide in recent decades (Mage et al. 1996; Mayer 1999; Baldasano et al. 2003; Gurjar et al. 2008). Infrastructure development in urban areas including high rise buildings, flyovers and reduced green cover has resulted in poor air circulation within cities. The increase in vehicular population and emissions due to vehicles are accumulating within the lower boundary of the urban atmosphere (Masiol and Harrison 2015). The trend of the past air quality data of many cities indicated that proper land-use planning is a must to achieve better air quality (Frank et al. 2006; Hoek et al. 2008). Therefore, analysis of air quality trends and their associated relation with ambient meteorological conditions will provide useful information to the urban planners, policymakers and other stakeholders. In the present study, the temporal trends of three criteria pollutants (\(\hbox {PM}_{10}\), \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2})\) and meteorological parameters and their interrelation were studied at urban monitoring locations in the Chennai metropolitan area. A rapid increase in emissions has led to serious health issues, degraded quality of life and increased mortality rate, especially in megacities.

Sulphur dioxide (\(\hbox {SO}_{2})\) is produced by the combustion of sulphur containing compounds that are present in most of the fossil fuels (Tecer and Tagil 2013). Short-term exposure of \(\hbox {SO}_{2}\) has been linked to bronchoconstriction and to increased asthma symptoms at higher concentrations (Nadel et al. 1965). \(\hbox {SO}_{2}\) can also react with other compounds to produce fine particulates. Wet and dry depositions contribute as a sink for \(\hbox {SO}_{2}\).

Thermal power plants and automobile exhausts are the major sources for nitrogen dioxide (\(\hbox {NO}_{2})\). Inhalation can lead to respiratory tract inflammation and worsening of condition for patients with asthma, emphysema and heart diseases (Monn 2001). \(\hbox {NO}_{2}\) is removed from the atmosphere by conversion into a secondary pollutant, nitrate.

Respirable suspended particulate matter, also denoted as \(\hbox {PM}_{10}\), have aerodynamic diameter less than \(10\, \upmu \hbox {m}\) and include particles and liquid droplets. It constitutes components like acid salts (nitrate and sulphate), organic chemicals, metals, and soil and dust particles. Exposure to an excess concentration of \(\hbox {PM}_{10}\) causes aggravation of asthma symptoms, irritation of the respiratory tract, and worsening of heart conditions such as ischaemic heart disease and dysrhythmias, leading to premature death due to heart failure and cardiac arrest (Kampa and Castanas 2008). Chennai being a major metropolitan and industrial hub has shared the same fate. Chennai city is the fourth largest city in India with a population of 6.72 million. The city has been described as the ‘Detroit of India’ due to its large automobile industry, contributing to 30% of automobiles and 40% of auto-parts manufactured in India. Chennai city has been classified as a non-attainment area for \(\hbox {PM}_{10}\) due to the number of exceedances over the National Ambient Air Quality Standards (NAAQS) [Central Pollution Control Board (Ministry of Environment and Forests) 2011].

Table 1 Details of monitoring location.
Fig. 1
figure 1

Map showing monitoring locations for air quality data of Chennai, India.

Various studies have been conducted to perceive the effect of meteorological parameters on the ambient concentration of criteria pollutants. Banerjee et al. (2011) analysed the statistical dependency of criteria pollutants, \(\hbox {SO}_{2}\), \(\hbox {NO}_{2}\) and total suspended particulate matter (TSPM), on different meteorological parameters such as relative humidity (RH), wind speed (WS), precipitation (P) and mean air temperature (T) at Pantnagar, India from May 2008 to April 2009. Both \(\hbox {NO}_{2}\) and TSPM concentrations showed a higher dependency on meteorological parameters as compared to \(\hbox {SO}_{2}\). Partial correlation coefficients revealed that wind speed had the maximum influence on \(\hbox {NO}_{2}\) and \(\hbox {SO}_{2}\), closely followed by weekly mean temperature. However, in the case of TSPM, the individual contribution of ambient temperature was found to be the maximum, followed by relative humidity (Banerjee et al. 2011). Jayamurugan et al. (2013) did a regression analysis for northern Chennai during 2010–2011 and found a negative correlation of \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2}\) with temperature, whereas a significant positive correlation was found in the case of \(\hbox {PM}_{10}\) with temperature due to the absence of scrubbing action during the summer season (Jayamurugan et al. 2013). Chelani et al. (2002) developed a five class air quality index (AQI) to assess the air quality at industrial, commercial and residential locations in five major cities (Mumbai, Delhi, Calcutta, Chennai and Nagpur) in India. They found Delhi to be the most polluted city followed by Calcutta and Mumbai (Chelani et al. 2002). The current study reveals the impact of meteorology on concentration of criteria pollutants and temporal variation of pollutant concentration. The national air quality index (NAQI) for the study area is also analysed. The current study also deals with the determination of the statistical distribution of air pollutant concentration data using quantile–quantile plot (QQ plot), a graphical technique used to compare a given data distribution (air pollutant concentration in our case) with a given statistical distribution. The technique is elaborated in the methodology section. Data distribution is very helpful in the calculation of NAAQS exceedance, AQI and forecasting pollutant concentration (Jakeman 1986; Taylor et al. 1986; Jakeman et al. 1988). Data distribution identification helps us with the removal of noise caused by human error or erroneous instrumentation during the monitoring of pollutant concentration (Holland and Fitz-simons 1981; Berger et al. 1982; Gokhale and Khare 2007).

2 Materials and methods

2.1 Description of study area

2.1.1 Chennai city

Chennai city is located on the south-eastern coast of peninsular India bound between \(12^{\circ }50'49''\)\(13^{\circ }17'24'\)’ latitude and \(79^{\circ }59'53''\)\(80^{\circ }20'12''\) longitude. It has a flat coastal terrain with an average elevation of 6.7 m (Krishnamurthy and Desouza 2015). It stretches along a length of 25.6 km with an area of \(176\hbox { km}^{2}\).

The weather in Chennai is sub-tropical (hot and humid) with four seasons, i.e., winter, summer, pre-monsoon and monsoon. Winter season occurs between January and February, summer from March to May, pre-monsoon or south-west monsoon from June to September and post-monsoon or north-east monsoon from October to December with an average temperature of \(25.7{^{\circ }}\hbox {C}\), \(30.6{^{\circ }}\hbox {C}\), \(29.8{^{\circ }}\hbox {C}\) and \(26.8{^{\circ }}\hbox {C}\), respectively. Season of post-monsoon or north-east monsoon accounts for 60% of the annual rainfall.

The daily average concentration of the three criteria pollutants, \(\hbox {SO}_{2}\), \(\hbox {NO}_{2}\) and \(\hbox {PM}_{10}\), were collected over the years 2009–2012 from Tamil Nadu Pollution Control Board (TNPCB). TNPCB monitors the air quality twice a week under the National Ambient Air Quality Monitoring Programme (NAAMP) at five monitoring locations in the Chennai city as shown in figure 1. Among the five monitoring locations, three locations (Kathivakkam, Thiruvottiyur and Manali) are designated as industrial locations IS-1 to IS-3, General Hospital monitoring site is designated as traffic intersection site (TS) and Taramani as the residential site (RS) as shown in table 1.

2.2 Data collection

Air quality was monitored by the TNPCB at selected locations, as described in section 2.1. High-volume sampler (Envirotech India, Model – AQM 460 NL) was used for collection of pollutant data. \(\hbox {PM}_{10}\) data were collected through gravimetric separation using glass fibre filter paper. An integrated gas sampling assembly with 35 ml impingers for the collection of \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2}\) was connected to the high-value sampler for analysis of gaseous pollutants. Colorimetric Jacobs and Hochheiser modified method was used for the measurement of \(\hbox {NO}_{2}\) concentration. \(\hbox {NO}_{2}\) was collected in the impinger by flowing air through a solution of sodium hydroxide and sodium arsenite. The concentration of the nitrite ion (\(\hbox {NO}_{2})\) produced during sampling was determined colorimetrically by reaction of the nitrite ion with phosphoric acid, sulphanilamide and N-(1-naphthyl)-ethylenediamine di-hydrochloride (NEDA) and measuring the absorbance of the azo-dye at 540 nm. Modified West and Gaeke method was used for the measurement of \(\hbox {SO}_{2}\) concentration (Kamyotra and Basu 2011). Sulphur dioxide from the air is absorbed in a solution of potassium tetrachloro-mercurate (TCM), which forms a dichlorosulphitomercurate complex, which resists oxidation in the air. The complex is stable to strong oxidants such as ozone and oxides of nitrogen and therefore, the absorber solution can be stored for some time prior to analysis. The complex is made to react with para-rosaniline and formaldehyde to form the intensely coloured pararosaniline methylsulphonic acid. The absorbance of the solution is measured by means of a spectrophotometer. Meteorological data for the study period were collected from the Weather Underground website http://www.wunderground.com (Weather Underground 2018). Weather Underground collects the daily average meteorological data from the weather monitoring station at the Chennai international airport. Uncertainty in sampling or analysis can occur due to various reasons. It could be an instrumental error like flow measurement error in high-volume sampler or human error during sampling and data handling. TNPCB takes care of these uncertainties by validating data through a stringent quality assurance process before uploading. We have further cross-verified the data for outliers using quartile test. No outliers were found during the test.

2.3 Statistical techniques

2.3.1 Exploratory statistical analysis

Correlation analysis determines the strength of the statistical relationship, i.e., dependence among the variables (pollutant concentration in the current case). The high positive correlation indicates a strong relationship between the parameters, and high (or low) values of one parameter occur in conjunction with the high (or low) values of the other parameter. If the correlation is negative and large, the relationship is strong but inverse. In the current study, Pearson’s product moment correlation (PMCC) was used.

PMCC is calculated based on individual observations and studies based on the case of linear dependency. It was first proposed by Francis Galton (Stigler 1989) and formally developed by Karl Pearson. It is defined as the covariance of two datasets divided by the product of standard deviations as shown in

$$\begin{aligned} R_{\mathrm{p}}=\frac{\sum \nolimits _{i=1}^n \left( {x_i -\bar{x}}\right) \left( {y_i -\bar{y}} \right) }{\left[ {\sum \nolimits _{i=1}^n \left( {x_i -\bar{x}} \right) ^{2 }\sum \nolimits _{i=1}^n \left( {y_i -\bar{y}} \right) ^{2 }} \right] ^{{1 }/{2 }}}. \end{aligned}$$
(1)

Box plots depict the monthly variation and range of concentration of criteria pollutants through their quartiles. The median value for the month is shown by the red horizontal line, whereas the first and third quartiles are shown by the lower and upper boundaries of the box and the lowest and highest concentrations are demarcated by the lower and upper ends of the whiskers extending from the box.

2.4 Statistical distribution identification

The distribution of the probability of occurrence of a given value of pollutant concentration is termed as a statistical distribution or probability distribution function. Several statistical tests have been developed over the years for the identification of distribution of any set of data. In the current study, two hypothesis tests, i.e., Anderson–Darling test and Kolmogorov–Smirnov test, and a graphical method, QQ method, have been used for distribution identification and verification. The tests were applied on the pollutant concentration data for probability distributions, as shown in table 2.

Table 2 Statistical details for different Probability distribution.

The hypothesis tests are used to check whether a given data sample comes from a population with a specified probability distribution (Jakeman et al. 1988; Chelani et al. 2002; Jayamurugan et al. 2013). The cumulative distribution function (cdf) of the data is compared with the empirical cdf of a theoretical statistical distribution. The maximum absolute distance between the two distributions is used as a test statistic. The Anderson–Darling test is much preferred compared to the Kolmogorov–Smirnov test as it gives more weightage to the tails of the distribution and is better for outlier values, whereas Kolmogorov–Smirnov test is more sensitive to the central values compared to the tails.

2.5 National air quality index

AQI is a representation of air quality status in terms of health effects on the residents of a locality (Shooter and Brimblecombe 2005; Plaia and Ruggieri 2010; Kanchan et al. 2015). It is represented as a numerical or linguistic value based classification scheme. Several air quality indices have been developed over the years based on the number of pollutants, aggregation methods or health effects being considered (Holland and Fitz-simons 1981).

NAQI derived from the US EPA’s AQI is based on linear interpolation of air pollutant concentration to air quality classes based on effects of specific pollutants on health.

$$\begin{aligned} I_{\mathrm{p}} =\frac{\left( {I_{{\mathrm{HI}}} -I_{{\mathrm{LO}}} } \right) }{\hbox {BP}_{{\mathrm{HI}}} -\hbox {BP}_{{\mathrm{LO}}} }\left( {C_{\mathrm{P}} -\hbox {BP}_{{\mathrm{LO}}} } \right) +I_{{\mathrm{LO}}} , \end{aligned}$$
(2)

where \(I_{\mathrm{P}}\) is the index for pollutant P, \(C_{\mathrm{P}}\) is the rounded concentration of pollutant P, \(\hbox {BP}_{\mathrm{HI}}\) is the break point that is greater than or equal to \(C_{\mathrm{P}}\), \(\hbox {BP}_{\mathrm{LO}}\) is the Breakpoint that is less than or equal to \(C_{\mathrm{P}}\), \(I_{\mathrm{HI}}\) is the AQI value corresponding to \(\hbox {BP}_{\mathrm{HI}}\) and \(I_{\mathrm{LO}}\) is the AQI value corresponding to \(\hbox {BP}_{\mathrm{LO}}\).

3 Results and discussion

3.1 Air quality data

Tables S1–S3 in the supplementary material show the seasonal mean, standard deviation and 24-hr average minimum and maximum concentrations of \(\hbox {SO}_{2}\), \(\hbox {NO}_{2}\) and \(\hbox {PM}_{10}\) at the monitoring locations. No exceedances above the NAAQS level were observed for \(\hbox {NO}_{2}\) concentration during the study duration. Predominantly higher concentration was observed at Manali (IS-3) and Thiruvottiyur (IS-2) as compared to General Hospital (TS) and Taramani (RS). The highest \(\hbox {SO}_{2}\) concentration of \(25.7\, \upmu \hbox {g/m}^{3}\) was observed at Manali (IS-3) in winter and the lowest concentration of \(2\, \upmu \hbox {g/m}^{3}\) at General Hospital (TS) during monsoon. Table S2 shows the highest and lowest 24-hr average \(\hbox {NO}_{2}\) concentration of \(33.1\, \upmu \hbox {g/m}^{3}\) and \(3.2\, \upmu \hbox {g/m}^{3}\) at Thiruvottiyur (IS-2) during the winter and at Taramani during the monsoon season and a maximum and minimum seasonal average of \(23.8\, \upmu \hbox {g/m}^{3}\) and \(6.3\, \upmu \hbox {g/m}^{3}\) at Manali during the winter and at Taramani during the monsoon, respectively.

Table S3 depicts the seasonal maximum, minimum, average and standard deviation of \(\hbox {PM}_{10 }\) concentration. Maximum and minimum concentration was observed to be 200 and \(5.5\, \upmu \hbox {g/m}^{3}\) at Manali during the winter and General Hospital during the monsoon, respectively. Similarly, the seasonal average maximum and minimum of \(141 \,\upmu \hbox {g/m}^{3}\) during the winter at Manali and \(12.5\, \upmu \hbox {g/m}^{3}\) during the pre-monsoon at General Hospital, respectively.

Tables S7 and S8 show the exceedances of NAAQS for \(\hbox {PM}_{10}\) concentration as number as well as percentage of total observations. No exceedances were observed for \(\hbox {NO}_{2}\) and \(\hbox {SO}_{2}\) concentration for any of the locations. The highest number of exceedances was observed at Manali and Kathivakkam with 100% exceedance during the winter season for both the locations. The lowest number of exceedances was observed at Taramani with no exceedance during the summer, pre-monsoon and monsoon seasons.

3.2 Monthly and seasonal variation

Data frequency distribution of seasonal average and a monthly average of \(\hbox {SO}_{2}\), \(\hbox {NO}_{2}\) and \(\hbox {PM}_{10}\) during the study period is shown in line plots in figure 2. Monthly values were calculated from the daily average concentration monitored during the months. Seasonal average was calculated for the seasons, i.e., winter (January–March), summer (April–July), pre-monsoon (August–October) and monsoon (November and December).

Fig. 2
figure 2

Monthly average pollutant concentration for monitoring locations in the study area, where J = January, F = February, M = March, A = April, J = June, J = July, A = August, S = September, O = October, N = November and D = December. All the months were indicated in sequence for four consecutive years (2009–2012).

Fig. 3
figure 3

Boxplots of seasonal variation of pollutant concentration for five monitoring location from 2009 to 2012 for (a) \(\hbox {SO}_{2}\), (b) \(\hbox {NO}_{2}\) and (c) \(\hbox {PM}_{10}\). Trend line of monthly average concentration is also represented.

No exceedances were observed for monthly and seasonal average concentration of \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2}\). The seasonal and monthly variation for \(\hbox {SO}_{2}\) concentration was not very prominent but an increase in the concentration has been observed in the winter season as shown in figure 2. A higher concentration during the winter is attributed to stable atmospheric conditions caused by lower mixing height and lower wind velocity (Zvyagintev et al. 2014). Stable atmospheric condition restricts vertical mixing of air mass and dispersion of pollutant leading to pollutant accumulation and higher concentration. An increase in annual average concentration at 15, 55, 38 and 23%, respectively, for Manali, Taramani, Kathivakkam and General Hospital and a decrease of 29% were observed at Thiruvottiyur during the study period, respectively. A distinct difference has been observed between the \(\hbox {SO}_{2}\) concentration at the residential or traffic site and the industrial sites with Kathivakkam and Thiruvottiyur having a higher concentration among the monitoring sites, whereas Taramani and General Hospital having a lower concentration. These differences can be attributed to the industrial emission sources like thermal power station (Ennore thermal power station), automobile industries and other medium-scale industries.

The monthly average of \(\hbox {NO}_{2}\) concentrations was well below the NAAQS as shown in figure 2. Winter months had the highest concentration, whereas a decrease in concentration during the summer was attributed to high temperature and favourable atmospheric air mass exchange. Atmospheric air mass exchange reduces the concentration of pollutant through vertical wind movement below the planetary boundary layer. Monsoon season also shows a decrease in concentration because of dissolution and natural scrubbing caused by the rainwater. The annual average \(\hbox {NO}_{2}\) concentration increased by 52, 4 and 37% over the study period in the case of Taramani, Kathivakkam and General Hospital, respectively. The increase in \(\hbox {NO}_{2}\) concentration can be attributed to increase in the number of vehicles and vehicle travelled over the years (Hargreaves et al. 2000; Bell et al. 2013; Huang et al. 2013) as well as poor inspection and vehicle maintenance systems in the study area. The rapid increase in vehicle numbers (especially diesel vehicles) is attributed to population growth and buying trend of the public in urban areas specifically metropolitan cities like Chennai. In the case of government high school and Thiruvottiyur, the annual average concentration decreased by 27 and 20%, respectively.

Monthly variation of \(\hbox {PM}_{10}\) is also shown in figure 2. Exceedances were observed for Manali, Kathivakkam and Thiruvottiyur (all industrial areas) during the winter season. The higher concentration of \(\hbox {PM}_{10}\) was observed in the winter season for all the locations with summer and monsoon showing a decreasing trend with lower concentration. \(\hbox {PM}_{10}\) concentration was considerably low for Taramani. Among the industrial sites, the highest concentration has been observed at Manali and Thiruvottiyur. In 2009, \(\hbox {PM}_{10}\) concentration has violated the NAAQS in Kathivakkam and Thiruvottiyur monitoring sites for 17 and 32 days, respectively. Similarly, \(\hbox {PM}_{10}\) concentration exceeded the NAAQS at Kathivakkam and Manali in 2010 for 5 and 12 days and at Kathivakkam and Thiruvottiyur in 2011 for 45 and 10 days, respectively. A number of exceedances observed for \(\hbox {PM}_{10}\) in the year 2012 were 9 and 16 days, respectively. There is an increase in the annual average concentration of \(\hbox {PM}_{10}\) by 5, 28, 45 and 58% in the case of Manali, Kathivakkam, General Hospital and Thiruvottiyur and a decrease in the concentration by 2% in the case of Taramani, respectively, were observed.

Figure 3 shows the Box–Whisker plots for monthly average concentration of three criteria pollutants: (a) \(\hbox {SO}_{2}\), (b) \(\hbox {NO}_{2}\) and (c) \(\hbox {PM}_{10}\). The plots have been divided into five subplots with each subplot showing the seasonal trend of pollutant concentration for one of the monitoring locations in Chennai city. The edges of the box represent the upper quartile of the dataset with Whisker edges showing the max and min values. A trend line has also been added to the plot to demarcate the seasonal average trend. In figure 3(a), the first subplot i.e., the boxplot for industrial site at Kathivakkam monthly variation was not verysignificant for the years 2009 and 2010, but during 2011 and 2012 an increasing trend is observed during the winter and a decline in the concentration during the summer months. In the case of Manali industrial area, data available only for the years of 2009 and 2011 showed a trend of higher concentration during the winter season and decline during the summer months. In the plot for TS General Hospital, winter months of January–March showed a comparatively higher trend. A significant variation was not visible in the monthly time series of \(\hbox {SO}_{2}\) concentration for the monitoring stations Taramani and Thiruvottiyur as shown in the fourth and fifth subplots. When comparing all the locations, the highest concentration was observed at Manali and least at Taramani. A similar trend was observed for \(\hbox {NO}_{2}\) concentration as shown in figure 3(b) with the winter months of January and February having a higher value for all the five monitoring locations over the years. No exceedance over the NAAQS was observed for any of the observations. A dip in the concentration values was observed in the summer season. The lowest concentrations were observed at Taramani, whereas the highest concentration in the case of General Hospital and Manali. The General Hospital location is very close to traffic intersection which has proximity to vehicular sources. Manali industrial area site has proximity to industrial sources due to heavy industries like petrochemical industries. The trends of seasonal concentration are shown in figure 3(c) with rows of plots representing different monitoring locations and columns representing different years. No exceedances over the NAAQS were observed for the plot that represents TS of General Hospital and subplot that represents the RS of Taramani. A higher monthly average \(\hbox {PM}_{10}\) concentration was observed during the winter season for all the years at most locations. The higher concentration and the increasing trend were observed in the case of industrial sites at Manali having the highest concentration. Figure 4 shows the annual variation of \(\hbox {PM}_{10}\) concentration with increasing number of vehicles in Chennai city. \(\hbox {PM}_{10 }\) concentrations at two industrial locations have been used to compare the increasing trends. As shown in figure 4, there is an increase in \(\hbox {PM}_{10}\) concentration (in \(\upmu \hbox {g/m}^{3})\) and vehicle number (in millions) over the study duration. The increase in traffic density (vehicular number) is a source of contribution to increase of pollutant concentration, especially \(\hbox {PM}_{10}\), but as the paper is about the effect of meteorology on air quality, the detailed discussion is out of scope.

Fig. 4
figure 4

Plot showing an increase in annual concentration of \(\hbox {PM}_{10}\) at two industrial locations (Kathivakkam and Thiruvottiyur) with increase in vehicle count of Chennai city.

Fig. 5
figure 5

Quantile–Quantile plots (QQ plots) of air quality data from General Hospital (TS) monitoring site used for determination of statistical distribution in the case of daily average concentration of (a) \(\hbox {SO}_{2}\), (b) \(\hbox {NO}_{2}\) and (c) \(\hbox {PM}_{10}\).

Fig. 6
figure 6

Monthly variation of meteorological parameters for Chennai city.

Fig. 7
figure 7

Wind rose plots showing the distribution of wind patterns for Chennai city during the year (a) 2009, (b) 2010, (c) 2011 and (d) 2012.

Fig. 8
figure 8

Monthly variation of NAQI for the five monitoring locations in Chennai city during the study period.

Chennai data have also been compared with Mumbai city. The daily average concentration of \(\hbox {PM}_{10}\), \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2}\) was downloaded from data.gov.in website of Government of India (Govt. of India 2018) and the monthly average concentration of each of the pollutant was calculated as shown in tables S3–S6 in the supporting material. Air quality data were measured by Maharashtra Pollution Control Board (MPCB) under the NAAMP programme at two locations – Worli (RS) and Parel (industrial site) – for the duration of 2009–2012. Mumbai city was chosen because of its coastal meteorology and being one of the metropolitan cities in India. It has similar conditions in terms of industries and vehicular sources. No exceedances in the monthly values were observed in the case of \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2}\) concentrations, which is quite similar to our observations in the case of Chennai city. Exceedances were observed in \(\hbox {PM}_{10}\) concentration with higher concentration during the winter season. Monitoring locations in Mumbai showed similar variation in terms of season and meteorology.

3.3 Statistical distribution of air quality data

Determination of statistical distribution of air quality data (pollutant concentration) plays an important role in the calculation of exceedances over NAAQS as well the environmental impact of air pollution (Jakeman et al. 1988; Chelani et al. 2002; Jayamurugan et al. 2013). The QQ plot was used to identify the statistical distribution of the dataset of the daily average concentration of \(\hbox {SO}_{2}\), \(\hbox {NO}_{2}\) and \(\hbox {PM}_{10}\). The quantile method compares the probability of the values in a dataset with an emperical data distribution created using parameters extracted from the dataset. Figure 5 shows the results of QQ plot evaluation for \(\hbox {SO}_{2}\), \(\hbox {NO}_{2}\) and \(\hbox {PM}_{10}\) at the TS. The results for the rest of the four locations are shown in the supplementary section (figures S6–S9). The best fit was obtained for Weibull distribution in the case of \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2}\) concentrations with exponential showing overprediction at higher quantiles and generalised Pareto showing underprediction at the higher end quantiles. \(\hbox {PM}_{10}\) showed the best fitting with Weibull distribution at Taramani (RS) as compared to other statistical distributions and lognormal for other locations. As shown in tables S9 and S10 in the supplementary section, the Anderson–Darling test and Kolmogorov–Smirnov test showed the best fitting probability distribution as Weibull for daily average \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2}\) concentrations, whereas in the case of \(\hbox {PM}_{10}\) concentration, the lognormal distribution best represents the data, which have been reaffirmed from the literature (Chelani et al. 2002; Lu 2002) (figure 5).

Table 3 Correlation of daily average pollutant concentration with meteorological parameters for different seasons.

3.4 Meteorological variation

Monthly average data were calculated for all the available meteorological parameters (ambient temperature, humidity, wind speed, visibility and cloud cover) (figure 6).

A subtropical and humid climate of Chennai shows high ambient temperature and humidity. The highest monthly average ambient temperature was recorded in the summer months of May and June for all the 4 yr with 2012 being the hottest with an ambient temperature of \(35{^{\circ }}\hbox {C}\). The least temperature was recorded in the month of January for all the years at a monthly average temperature of \(25{^{\circ }}\hbox {C}\). The overall monthly average temperature ranges from \(25{^{\circ }}\hbox {C}\) to \(35{^{\circ }}\hbox {C}\) over the years. In the case of humidity, the highest values were recorded in the monsoon months of November and December followed by the pre-monsoon period of September and October. The highest wind speed was recorded in the month of June–August.

Peak level in humidity (80–90%) was recorded during the months of November and December during the south-west monsoon. A dip in the humidity (60%) was observed during the summer month of May–July for all the years. The highest wind speed was observed during the summer season at 12–15 km/h. High wind speed occurred due to the formation of a low-pressure regions caused by the heating of ambient air during the summer season. The least wind speed was recorded during the months of January and February in the range of 4–7 km/h. Visibility of up to 7 km was observed during the summer months between May and July with a decrease during the winter and south-west monsoon. The lowest visibility value of 4 km was observed during the monsoon.

Figure 7 shows the distribution of wind direction and wind speed for the year 2009–2012. The categories of the wind speed were designated as follows: \(337.5{^{\circ }} < \hbox {N} \le 22.5{^{\circ }}\), \(22.5{^{\circ }} < \hbox {NE} \le 67.5{^{\circ }}\), \(67.5{^{\circ }} < \hbox {E} \le 112.5{^{\circ }}\), \(112.5{^{\circ }} < \hbox {SE} \le 157.5{^{\circ }}\), \(157.5{^{\circ }} < \hbox {S} \le 202.5{^{\circ }}\), \(202.5{^{\circ }} < \hbox {SW} \le 247.5{^{\circ }}\), \(247.5{^{\circ }} < \hbox {W} \le 292.5{^{\circ }}\) and \(292.5{^{\circ }} < \hbox {NW} \le 337.5{^{\circ }}\). The predominant wind direction was found to be south-west and south–south-west during the years. Wind speed was mostly in the category \(\ge 5\hbox { m/s}\). In 2009, most wind speed observations were in \(\ge 5 \hbox { m/s}\) wind velocity class and only a few observations were in 4–5 and 1–2 m/s class. The most wind observations were recorded in the south-west direction followed by north–north-east. During the study period, most observations were found in \(\ge 5 \hbox { m/s}\) wind speed category in the south-west direction followed by a south direction.

3.5 Correlation analysis

Correlation between pollutant concentration and meteorological parameters was calculated for all the seasons and all the locations. Table 2 shows the PMCC of weekly pollutant concentration with five meteorological parameters such as temperature, humidity, visibility, wind speed and cloud cover. The correlation plots for the monitoring locations are added in the supplementary figures S1–S5. The rows of subplots in the figure signify the season for which the data were collected. In the case of General Hospital, high correlation was observed for \(\hbox {SO}_{2}\) with ambient temperature for monsoon season at (\(R = 0.57\)) \(p = 0.06\) and with cloud cover for summer season at (\(R = 0.63\)) \(p = 0.08\). All the other meteorological parameters had a low or negative correlation. In the case of \(\hbox {NO}_{2}\), higher correlation was observed in the case of temperature for monsoon at (\(R = 0.72\)) \(p = 0.1\) and cloud cover for winter at (\(R = 0.75\)) \(p = 0.07\). All the other correlation values were either low or negative. In the case of \(\hbox {PM}_{10}\), higher correlation was observed in the case of temperature for monsoon at (\(R = 0.58\)) \(p = 0.08\) and humidity for summer season (\(R = 0.51\)) at \(p = 0.08\). All the other correlation values were either low or negative. In the case of Kathivakkam, for \(\hbox {NO}_{2}\) concentration, the high positive correlation was observed with temperature during the monsoon and with humidity and cloud cover during the winter season. As detailed in table 2, all the pollutant observations showed highly positive to low negative correlation except in the case of winter season where the correlation in highly negative due to the lower mixing height and stable atmospheric conditions. Humidity showed mostly a negative or weak positive correlation. The weak positive correlation can be attributed to the partition of semi-volatile species into the aerosol phase, whereas the negative correlation is due to the deposition and dissolution of pollutants especially during the monsoon and pre-monsoon periods. All the pollutants showed mostly the negative correlation with wind speed, which is due to the horizontal dispersion caused by the increase in wind speed.

3.6 NAQI evaluation

Monthly average concentration of \(\hbox {SO}_{2}\), \(\hbox {NO}_{2}\) and \(\hbox {PM}_{10}\) was used for evaluation of NAQI for the monitoring locations over the study duration. Figure 8 shows the monthly NAQI values calculated for each location. Line plots in each subplot signify the years for which the data were available. Horizontal lines denote the class interval for each air quality category. As shown in the subplots NAQI showed higher values for the winter months of January and February for all industrial locations. The higher AQI leading to health problems can be attributed to lower mixing height causing stable meteorological conditions as well as emissions from petrochemical industries and thermal power plant in the industrial areas of northern Chennai. NAQI values were healthy for Taramani and General Hospital as no exceedances were observed for \(\hbox {PM}_{10}\) in the locations. It can be concluded from the observation of ambient concentration time series and NAQI plots that \(\hbox {PM}_{10}\) was the responsible pollutant for higher AQI as the other two criteria pollutants were (\(\hbox {NO}_{2}\) and \(\hbox {SO}_{2})\) always well below the NAAQS.

4 Conclusion

Daily average pollutant concentration from five monitoring locations in Chennai city over 4 yr (2009–2012) was analysed. The following observations were made from the data analysis (table 3):

  • Monthly and seasonal trends showed a higher concentration during the winter season and a decrease in concentration during the monsoon. Pollutant concentration (especially \(\hbox {PM}_{10})\) was distinctively higher at the industrial monitoring locations.

  • An increasing trend was observed from 2009 to 2012 for all pollutants attributed to the vehicle and industrial emissions.

  • Analysis of meteorological parameters showed an increase of temperature and wind speed over the years with the summer season showing the peak values for every year. The highest level of monthly humidity was observed during the monsoon of 2010. Wind rose plots showed a predominant wind direction in the south and south-west directions and the majority of wind speed data falls in the \(\ge 5 \hbox {m/s}\) category.

  • Correlation plots of pollutant concentration (especially in the case of \(\hbox {PM}_{10})\) with meteorological parameters showed a higher positive correlation (\(R > 0.5\)) with temperature and humidity as compared to other meteorological parameters.

  • Statistical distribution of daily average air quality parameters was also analysed using non-parametric hypothesis tests (Anderson–Darling test and Kolmogorov–Smirnov test) and graphical technique called QQ plot. The plots showed that the concentration of \(\hbox {SO}_{2}\) and \(\hbox {NO}_{2}\) mostly showed Weibull distribution, whereas \(\hbox {PM}_{10 }\) followed Weibull distribution at Taramani and lognormal distribution at all the other locations.

  • NAQI showed the worst air quality during the winter season. Air quality was good at Taramani (residential site) and General Hospital (TS). In the case of industrial locations, air quality ranged between moderately polluted to very poor during the winter season.

The results of the above study can be used for forecasting air quality in Chennai city using the air quality parameters and meteorology. It will also help policymakers in applying effective measures to mitigate air pollution in sub-humid tropical regions.