1 Introduction

Atmospheric trace gases, despite their relative scarcity, subsist as chemically reactive factors influencing the thermal budget, atmospheric chemistry, ecology and human health (Ramanathan et al. 1985) whereas particulate matter (PM), has been shown to be linked with cardiovascular mortality (Pope et al. 2004), respiratory morbidity (Schwartz 1993) and detrimental health effects (Chow et al. 2006; Pope et al. 2009), more so in biologically susceptible populations (Chow and Watson 2006; Rückerl et al. 2011; Sacks et al. 2011) as well as affecting the atmospheric photochemistry.

Tropospheric ozone (O3) (Tilton 1989) is a secondary product of the oxidation of hydrocarbons (CH4 and NMHCs) and CO via reactions catalyzed by HOX and NOX radicals (Crutzen 1995; Jacob 2000) (Fig. 1). While NOX primarily regulates the oxidizing capacity of the atmosphere through partitioning or formation of free radical species (OH, HO2 and RO2), CO and CH4-NMHCs serve as a sink for the reactive OH radicals (Wayne et al. 1991). As a greenhouse gas, O3 contributes to global warming (Staehelin et al. 2001; Hansen et al. 2005) exerting prominent impacts on environment, health and vegetation (Delfino et al. 1998; Finlayson-Pitts and Pitts 1999; Weschler 2006). It is also a strong oxidizing agent and a vital source of OH radical (Chameides and Walker 1973; Thompson 1992) accountable for major chemical transformations in the atmosphere (Crutzen and Zimmermann 1991), especially in the tropics, making it the photochemically most active region in the world (Naja and Lal 2002). Inevitably, India is at a reasonably higher risk, alongside a grim apprehension that this pollution will further extend from a local to global scale. Besides, South-East Asia is an anthropogenically active zone and model calculations have anticipated a potential for considerable environmental impacts owing to increasing surface O3 values, exclusively in India, China etc. (Berntsen et al. 1996) and NASA/GISS calculations have shown that the O3 production efficiency (OPE) is greatest over the Indian subcontinent (Saini et al. 2014). This validates a need for the continuous monitoring of atmospheric trace gases and their secondary by-products.

Fig. 1
figure 1

The details of O3 chemistry: Atmospheric oxidation efficiency is largely determined by OH radicals. Photodissociation of O3 by solar UV radiation produces electronically excited O(1D) atoms which combine with water vapor to produce OH radicals. Reaction with OH is the major atmospheric sink for most trace gases and the primary precursor pollutants (CO and hydrocarbons (HCs)) react with OH to produce hydroperoxyl (HO2) or organic peroxyl (RO2) radicals, respectively. These radicals react with NO (produced from photodissociation of NO2) to generate more NO2 and OH radicals, at the same time as the O (1D) atoms can either oxidize to produce O3 or in presence of HCs, react together to produce acyl (RCO) radicals which get oxidized to form peroxyacyl (RCO3) radicals. The RCO3 radicals can react with more HCs to produce complex organic species of aldehydes and ketones, or it can react with NO2 to produce peroxyacyl-nitrate (PANs) or get oxidized to produce O3 and inevitably generate the ingredients of photochemical smog

Several studies have hence been conducted around the globe during the past three decades showing presence of a clear seasonal cycle of surface O3 over the continents. Winter maximum was observed at many remote/ rural locations (Logan 1985; Ayers et al. 1997; Gros et al. 1998; Monks 2000; Varotsos et al. 2002) whereas spring/ summer maximum have also been reported (Oltmans and Levy 1992; Derwent et al. 1998; Pochanart et al. 1999). Likewise in India, various groups have carried out surface O3 measurements in different regions (Varshney and Aggarwal 1992; Jain and Arya 2001; Nair et al. 2002; Jain et al. 2005; Ghude et al. 2006; Mittal et al. 2007; Ghude et al. 2008; Mahapatra et al. 2012; Ojha et al. 2012; Sharma et al. 2013; Peshin et al. 2014; Sharma et al. 2016a) and their observations elucidating the regional air quality have been very useful.

Besides the trace gases, PM is an additional significant tropospheric component comprising both fine (PM2.5) and coarse (PM10) fractions (Pope and Dockery 2006). Source apportionment studies have confirmed diverse sources of PM (Sharma et al. 2014; Sharma et al. 2016b) which are integrated into the particulate phase by different processes. PM affects the atmospheric photochemistry by its ability to scatter the solar and terrestrial radiation, indirectly altering the air temperature and subsequently affecting chemical rate coefficients. The same has been established by model simulations that have calculated a reduction in the O3 mixing ratios as an effect of these UV-absorbing aerosols (Dickerson et al. 1997). It is hence desirable to comprehend the gas phase-aerosol interactions that serve as the sink of O3 precursors to condense upon (Bauer et al. 2007), dissolve within or react on their surfaces (Jacobson 1998).

The reactive chemical species are transferred to aerosols with vapor molecules initially diffusing to their surface, eventually reacting with other soluble and reactive species. The sorption of a gas onto a particle is a two body (vapor and particle) chemical reaction (reaction 1) and its rate coefficient depends on the number density of particles, the size of the particle, the nature of the diffusing species and other factors. Studies have documented this process; the alkalinity of mineral dust (Schütz and Sebert 1987) supports the uptake of gaseous species; the presence of transition metals such as Fe and Mn in the Asian dust offers reaction surfaces for soluble species such as OH, HO2 and H2O2 (Gao et al. 1992); the heterogeneous oxidation of nearly 60% of the tropospheric SO2 to SO4 2− has also been credited to aerosols (Luria and Sievering 1991). Moreover, atmospheric NOX (or N2O5) sinks on the surfaces of mineral aerosols as nitrate and eventually yields nitric acid (HNO3) which is a termination reaction in the photochemical O3 system, implying that the production rate of HNO3 is a function of ambient values of NOX and OH. These denitrification processes (reaction 2, 3) leading to formation of particulate nitrate are expected to lower the ambient air levels of NOX (Rodriguez et al. 1991) subsequently influencing the O3 chemistry (Zhang et al. 1994) and it is desirable to compute the magnitude of this influence.

$$ \mathrm{Vapor}+\mathrm{Particle}\to \mathrm{Sorbed}\ \mathrm{Vapor} $$
(1)
$$ {HNO}_3+\mathrm{aerosol}\ \mathrm{surface}\to \mathrm{Particulate}\ \mathrm{nitrate} $$
(2)
$$ {\mathrm{NO}}_3+{\mathrm{H}}_2\mathrm{O}+\mathrm{aerosol}\ \mathrm{surface}\to \mathrm{Particulate}\ \mathrm{nitrate} $$
(3)

Along with the pollutant emissions in the lower atmosphere, air pollution also depends on the favorable meteorological conditions with meteorology playing a critical function in transport, formation or accumulation of trace species within the atmospheric boundary layer (ABL) (Kleinman et al. 1994; Toh et al. 2013; Ma et al. 2016). Hence, an assessment of its behavior with O3 values is obligatory to figure out the observed trends and to analyze the controlling nature of meteorology concerning pollution levels in the city.

The key objective of this study is to elucidate the temporal variations and interrelationships of surface O3 and its precursors, oxides of nitrogen (NOX), carbon monoxide (CO), methane (CH4) and non-methane hydrocarbons (NMHCs) along with particulate matter (PM10 and PM2.5) and meteorology. Present analysis of combined data set (O3 precursor gases, UV irradiance and meteorology) which are required to generate and/or influence surface O3 provides a conclusive study for an urban site.

2 Methodology

2.1 Description of site

The sampling location at CSIR-National Physical Laboratory (CSIR-NPL), New Delhi (28°38′N, 77°10′E; 216 m amsl) represents a typical urban atmosphere, surrounded by huge roadside traffic and agricultural fields in the southwest direction (Fig. 2). Delhi experiences highly variable weather throughout the three major seasons- scorching summers (March–June) followed by humid monsoons (July–October) and cold winters (November–February). Intense heat, low humidity and dust laden hot winds from the west bound Thar Desert are a conspicuous feature of Delhi’s summer. The start of rainy season is marked by the advent of moisture laden south-western winds, traveling from the Arabian Sea. The average annual rainfall in the city is ~617 mm. Chilly north-western winds from the Himalayas are accountable for the winter onset. This shift in the wind pattern also provides access to the polluted air masses along with fog and poor visibility. High variations between summer and winter temperatures is a characteristic trait of Delhi’s climate. As an urban site with intense anthropogenic activities and fluctuating weather patterns, it provides for a suitable location to study the active chemistry of surface O3 and its precursors.

Fig. 2
figure 2

The study site (CSIR-National Physical Laboratory) at New Delhi, India (28°37′N, 77°17′E; 216 m amsl)

2.2 Experimental setup

The measurements of trace gases (O3, NOX, CO, CH4 and NMHCs), PM10 and PM2.5 samples and the meteorological parameters (temperature (Temp), relative humidity (RH), wind speed (WS), wind direction (WD), UV irradiance) were carried out during January 2012 to December 2014 with the help of ground based analyzers, particulate samplers and automatic weather station (AWS), respectively.

A UV-based O3 analyzer (Model 49C; M/s The Thermo Environmental Instruments Inc.) was used for measurement of surface O3 (accuracy ±1 ppb). The O3 analyzer was calibrated periodically with the Primary Ozone Standard of CSIR-NPL, New Delhi (Standard Reference Photometer) with an accuracy of < ± 2%. NO and NO2 were measured using NOX-Analyzer (Model CLD 88 p; M/s ECO Physics AG.) using a photocatalytic converter (Model: PLC 866; M/s ECO Physics AG.) with an accuracy of ±0.05 ppb. NOx analyzer was calibrated periodically using Pure Air Generator (Model: PAG 003; M/s ECO Physics AG, accuracy 0.01 ppb) and NIST-USA certified NO gas (500 ppb ± 5%). CO was measured using non-dispersive infrared (NDIR) gas filter correlation analyzer (Model 48C; M/s Thermo Environmental Instruments Inc.). CO analyzer was calibrated periodically using Pure Air Generator and NIST-USA certified CO gas (8.1 ppm ± 5%) with multi gas calibrator. CH4 and NMHCs were measured using flame ionization detection (FID) based hydrocarbon analyzer (Model APHA-360; M/s Horiba) and their measurements were conducted for a period of nine months (April–December 2014). Zero and span gas of hydrocarbon analyzer were calibrated using Pure Air Generator and NIST-USA certified CH4 gas (8.0 ppm ± 5%). These analyzers were operated continuously (20–25 days in each month) during the measurement period with an uninterrupted power supply. The inlets of all the analyzers were stationed at ~10 m height (above the ground). The mixing ratios of these gases were recorded at a 1 min time resolution and computed on an hourly basis. The other details of these analyzers including their working principles etc. are summarized in supplementary information S1.

PM10 and PM2.5 samples were collected simultaneously on Whatmann quartz microfiber (QM-A) filters which were initially pre-baked at 550°C in a muffle furnace for at least 5 h to eliminate all impurities and then kept in a desiccator for a period of 24 h prior to sample collection which was done using the particle samplers (APM 550, Make: M/s. Envirotech, India; one unit for PM10 and another unit for PM2.5). Ambient air was passed through a 47 mm diameter QM-A filter at a flow rate of 1 m3 h−1 (accuracy ±2%) for 24 h during the sampling period. The flow meter of the samplers was calibrated (with the accuracy of ±2% of Full Scale) with air flow calibrator traceable to national standard. The concentrations of PM10 and PM2.5(in μg m−3) were calculated on the basis of the difference between final and initial weights of the quartz filters measured by a micro balance (M/s. Sartorius, resolution ± 10 μg) and dividing the difference by the total volume of air passed during the sampling. After collecting samples, filters were stored under dry conditions at −20°C in the deep-freezer. Analysis of elemental carbon (EC) and ionic components (NO3 and SO4 2−) have been carried by an OC/EC carbon analyzer (Model: DRI 2001A, USA; Make: Atmoslytic Inc., USA) and Ion Chromatograph (Model: DIONEX-ICS-3000, USA) with conductivity detector, respectively. Details of OC-EC and ion analysis are described in Sharma et al. (2015).

In addition, the meteorological parameters (Temp, RH, WS, WD) were measured by a five-stage meteorological tower (30 m height), installed in the same campus, about 100 m away from the observational site. We made use of the meteorological data accessible at level 2 (10 m height). The solar flux measurements for total UV (direct + diffuse) have been done using a radiometer (Model: CUV-4; Make: Kipp and Zonen). The instrument is intended to measure irradiance resulting from radiant fluxes incident upon a plane surface from the hemisphere above in the bandwidth 280-400 nm. The radiant flux measurements of CUV-4 has experimental errors <10% in the total flux during the day. The flux data (in Wm−2) were obtained every 2 min, 24 h a day during the entire observation period.

2.3 Analysis methods

For analysis of a three year data-set, the first approach involved investigating the time series and comprehensively probing the temporal variations of the measured parameters using frequently used statistical methods and graphical representations. This was followed by the application of Pearson correlation test, Multiple Linear Regression (MLR) analysis and Principal Component Analysis (PCA) to identify the relationships amongst the observed atmospheric variables. This supported a detailed evaluation of our collected dataset. Source identification for surface O3 was executed utilizing the data for isentropic 1-day backward trajectories. Isentropic backward trajectories were selected since they replicate a more realistic motion of air parcels allowing for their displacement (vertical motions) in an adiabatic atmosphere (Harris and Kahl 1990; Oltmans et al. 1996). Subsequent to this, we used GIS based tools (Potential Source Contribution Function and Concentrated Weighted Trajectory analysis) for statistical examination of the trajectories in collaboration with the O3 measurements to derive results.

3 Results and discussion

3.1 Temporal variations of the trace gases

Annual mean of O3 mixing ratio was calculated to be 26.1 ± 7.5 ppb, 25.8 ± 4.3 ppb and 36.6 ± 10.0 ppb during 2012, 2013 and 2014, respectively. Annual mean of NOX and CO mixing ratios were recorded to be 34.3 ± 9.1 ppb, 30.6 ± 12.6 ppb, 39.4 ± 11.9 ppb and 1.67 ± 0.68 ppm, 1.93 ± 0.45 ppm, 1.85 ± 0.43 ppm during 2012, 2013 and 2014, respectively. The mean mixing ratio of CH4 and NMHCs during the measurement period was recorded to be 3.07 ± 0.37 ppm and 0.53 ± 0.17 ppm, respectively. Figure 3 shows box and whisker plots illustrating the average monthly variations of the gases and as a measure of variability for individual months, the mean, median, 25th and 75th percentiles and the maximum-minimum values were plotted. For surface O3, the statistical mean was greater than the median for all months, excluding July and August, suggesting that the outliers (high values) are towards the upper end of the distribution, signifying high O3 values, except in the monsoon months. For CO, NOX and CH4, the majority of the mean values were roughly equivalent to the median value, which signifies an approximately symmetric distribution. The time series of the trace gases, surface O3, NOX,CO, CH4 and NMHCs, presenting the arrangement of data (Bisht et al. 2015) is given in Fig.S1 (in supplementary information). It shows the temporal evolution of the daily mean measurements of the various parameters. A 10 day moving average of the daily mean values is made to smoothen out the fluctuations in the collected data. There exists a noteworthy daily, monthly and seasonal variability for all the measured parameters. Surface O3 shows high value during the month of May. Beginning from June, following the summer season, the mixing ratio for surface O3 reaches its lowest levels for at least two months. The month of October appears to be the time when the pollutant concentrations start hiking up after a relatively cleaner monsoonal hiatus. Table S1 (in supplementary information) summarizes the monthly average values of the variables, calculated by using hourly averages of the measured data.

Fig. 3
figure 3

Statistical analysis of O3, CO, NOX, CH4 and NMHCs at the study site (the horizontal solid line indicates the median, the square indicates the mean, top and bottom of the box indicate the 75th and 25th percentile, respectively, top and bottom crosses in the whiskers indicate the maximum and minimum values, respectively)

To determine the seasonal-diurnal trends of the trace gases during the measurement period, the observed data was initially grouped on the basis of the three major seasons, winter (November–February), summer (March–June) and monsoon (July–October) and then plotted against time (Fig. S2; in supplementary information). Seasonal variations for trace gases are mainly influenced by difference in convection processes, photochemical removal, source strengths etc. (Jobson et al. 1994; Klonecki et al. 2003). The O3 mixing ratio at this site demonstrates typical diurnal cycles with maxima during afternoon [12:00–14:00 h IST–(Indian Standard Time)] and minima in the early morning hours (05:00–07:00 h) and the late night hours (00:00–02:00 h) (Nair et al. 2002; Naja et al. 2003; Jain et al. 2005; Sharma et al. 2016b). Evidently, the diurnal profile during the monsoons depicted a shorter climb on account of a relatively cleaner atmosphere.

Unlike O3, the average diurnal cycle of the precursor gases show noticeably patterns (Ahammed et al. 2006; Beig et al. 2007; Purkait et al. 2009). The prominent diurnal profiles of CO are dual-peaked (Pun et al. 2003), one during the early morning hours (07:00–08:00 h) and the second during night (20:00–21:00 h). The CO minimum is observed during afternoon, usually around 15:00 h, because of a high mixing layer (Chow et al. 1999). Since CO is an O3 precursor, its reducing values indicates that it helps in maintaining O3 build up in presence of sunlight even after peak hours. The dual peaked behavior is particularly prominent during the winter months. This should be credited to the incomplete combustion of hydrocarbons in vehicular fuel (Chinkin et al. 2003; Parrish 2006) coupled with low boundary layer height owing to low temperatures triggering peaked behavior during the traffic hours. Similarly, the diurnal profile of NOX show minima in the afternoon hours, during the time O3 begins to show a buildup, suggesting its part in photochemical conversion (Bhugwant and Brémaud 2001) while the maximum is observed during night and early morning hours, possibly due to increased vehicular emissions due to traffic hours, given that NOX is an indicator of fossil fuel based emissions (Vellingiri et al. 2015; Sharma et al. 2016a). Related diurnal profiles are recorded for both CH4 and NMHCs indicating their role as a surface O3 precursor. CH4 shows buildup during early morning hours possibly because the study site is near to the agricultural fields of the Indian Agricultural Research Institute (IARI), New Delhi (Neue et al. 1997; Mosier et al. 1998) and its recognized that the biological processes control the exchange of CH4 (Nishanth et al. 2014). The peaks are also discernible during the evening traffic hours (20:00–21:00 h) and that is likely as the hydrocarbons are released into the ambient air due to partial or negligible burning of hydrocarbon based vehicular fuels. Clear seasonal variability was identified for CH4 and NMHCs with distinct maxima during winters. Similar set of observations have been witnessed in different studies across the world (Dueñas et al. 2002; Tu et al. 2007).

The average surface O3 values during daytime (06:00–17:00 h) and nighttime (18:00–05:00 h) for the measurement period (Table S2; in supplementary information) were used to estimate the average rate of increase of O3, calculated to be 1.12 ppb h−1. Similar observations at other sites show different results; Anantapur (1 ppb h−1; Reddy et al. 2008), Kannur (1.3 ppb h−1; Nishanth 2012) and Kullu (2.4 ppb h−1; Sharma et al. 2013). The maximum and minimum O3 values observed during individual months have also been given in Table S2 (in supplementary information). We used the data to calculate the ratio, O3max/O3min which has been proposed to be a pollution indicator (Saliba et al. 2008); for urban and highly polluted sites, the values are reported in excess of 10 (Cvitas et al. 1995) while lower values have been reported for relatively cleaner areas. O3max/O3min of the order of 22.9 indicates the study location is an urban, polluted site. Another distinct feature of an urban polluted site is a sharp daytime increase in surface O3 value and to assess that, the rate of change of O3 was calculated between 08:00–12:00 h and 17:00–21:00 h (Table S3; in supplementary information) and found to be 5.25 ppb h−1 and -3.55 ppb h−1, respectively. Since the rate of decrease is far less than the rate of increase, O3 values are increasing. The diurnal rate of change of O3(dO3/dt) for different seasons was calculated and the results have been summarized in Table S4 and Fig. S3 (in supplementary information). The rate of change was highest during the winter season primarily due to the related meteorological conditions. Additionally, sharp changes during 09:00–10:00 h and 16:00–18:00 h for all seasons indicates the influence of increased emission of precursor gases, especially from vehicles during traffic hours and the associated meteorology as well to O3 titration. Also, the dry deposition process of O3 on surfaces of leaves or soil leads to its loss (Chameides and Walker 1973; Wesely and Hicks 2000).

3.2 Relationship of O3 with particulate matter (PM10 and PM2.5)

The preceding section elucidates that trace gas pollution at the study site is increasing. Besides the trace gases, high levels of PM is also an indicator of a deteriorating air quality. It has been observed that PM exhibits highest concentrations during the winter months of November to February (Fig. S4; in supplementary information). Monthly average PM2.5 concentrations during the measurement period ranged between 34.9 ± 20.4 μgm−3 (September) to 270.2 ± 98.7 μgm−3 (January) while for PM10, it ranged between 76.7 ± 27.6 μgm−3 (September) to 367.1 ± 154.4 μgm−3 (December). Higher PM concentrations during winters and higher O3 values during summers justifies a need for a separate study of O3 and PM on the grounds that their respective high pollution periods are not concurrent on seasonal timescales but on a diurnal scale, both of them are sensitive to boundary layer variations and meteorology, however only weakly negatively correlated (Table 1). Yet, PM and O3 are known to share a chemical coupling (Meng et al. 1997) but unanswered questions linger concerning the possible effects of PM on the photochemical oxidant cycle and eventually on O3.

Table 1 Correlation matrix of O3 with PM10 and PM2.5 concentrations and with their individual components (EC, NO3 and SO4 2−)

The primary organic particulate emissions incorporate in the PM as the black (elemental) carbon core coated by organic material (Mauderly and Chow 2008) and might reduce O3 by 5–8% (Penner et al. 1993) by absorbing solar and terrestrial radiation or as an ingredient of the CCN (Jensen and Toon 1997). In a study by Li et al. 2005, black carbon aerosol reduced the photolysis frequencies, J [O3(1D)] and J [NO2] in the ABL. To validate this, we correlated PM2.5 and PM10 and their common chemical constituents such as EC, NO3 and SO4 2− with O3 and obtained weak negative correlations (Table 1). Following this, simple linear regression analysis was executed to predict O3 values based on the concentrations of EC, NO3 and SO4 2−. The regression equation obtained was used to quantify the influence of these chemical constituents of PM on O3. It was found out that the EC component of PM2.5 and PM10 decreased the O3 values by 9.7% and 16.6%, respectively. Similarly, the NO3 component of PM2.5 and PM10 decreased O3 values by 13.4% and 16.9%, respectively while the SO4 2− component did the same by 23.9% and 17.8%, thus confirming our proposition.

3.3 Relationship of O3 with UV irradiance

Figure 4a represents a column plot between O3 and UV irradiance using their integrated averages to assess the relationship between them. It is evident that UV irradiance begins to take effect around 06:00 h reaching highest value at noon around 12:00 h and its peak coincides with the highest O3 values. This advocates the central role of solar UV flux in O3 production. Both parameters demonstrate a similar diurnal trend, but while UV irradiance begins to plunge after 13:00 h, O3 continues to show high values till 17:00 h. This is due to the persistent release and occurrence of precursor gases, reacting and eventually generating O3. The mean UV irradiance was found to be highest during summers (16.39 Wm−2) followed by monsoon (14.51 Wm−2) and winter (12.27 Wm−2). The hourly averaged UV irradiance values range between <1 Wm−2 to 48.36 Wm−2 during the measurement period. The hourly mean values (06:00–17:00 h) have been calculated for each month followed by average on a seasonal basis (Fig. 4b). It explains the seasonal-diurnal classification of the relation between surface O3 and UV irradiance. It can be observed that high O3 values during summers and monsoon exist for a longer duration between 8:00–15:00 h. During winters, due to higher PM concentration, absorption and scattering of the radiation is expected (Bano et al. 2013). Correlation analysis was attempted between O3 and UV irradiance; summer (r = 0.47), winter (r = 0.53) and monsoon (r = 0.38).

Fig. 4
figure 4

a Average relationship between O3 and UV irradiance with respect to time b Contour plots depicting seasonal relationships between O3 and UV irradiance c Relationship between Temperature, Relative Humidity and O3

3.4 Relationship of O3 with meteorology

Table 2 describes the average mixing ratio of trace gases determined within selected ranges of temperature. The relationship between O3 and temperature is indirect (Fig. 4c), executing itself through higher downward solar radiation. As O3 production is a photochemical reaction involving radicals, high temperature promotes this rate of reaction, the propagation of the radical chains and inevitably O3 formation (Tu et al. 2007; Martins et al. 2012). On the contrary, the O3 precursors show high values within low temperature ranges, probably since low temperatures prohibit their convective dispersal, resulting in their buildup in the ABL. Surface O3 and RH are inversely related; the highest RH values are observed during monsoons when ambient air O3 values are the lowest (Fig. 4c). Likewise, the diurnal pattern of RH indicates elevated levels at midnight and in the early morning, gradually dipping after sunrise contrary to the diurnal profile of O3. This might occur due to loss of O3 due to water vapor (Fig. 1).

Table 2 Average mixing ratio of trace gases determined within selected ranges of temperature

Wind allows the dispersal of the trace pollutant species along horizontal and vertical levels by promoting the mixing and transport across/ in-between the boundary layer and the upper free atmosphere. The plot relating the diurnal profiles of O3 and WS in Fig. 5a signifies that high WS are accountable for the transboundary transport of O3 laden air. High wind speeds during the afternoon are accompanied by higher O3 mixing ratios and low wind speeds during early morning and late night contribute to the O3 sinking process. Also, high WS removes PM and increases the incoming solar radiation and thus boosts O3 formation (Zhang et al. 2015). Figure 5b represents contour plots relating the diurnal and seasonal profiles of WS and WD. It is evident that high wind speeds within the range of 1.4–2.2 ms−1 are a distinguishable feature of summers (which record the highest O3 mixing ratio). This suggests that winds transport pollutant laden gases to the study site and the prevailing meteorological conditions support O3 formation. However, in winters, the low wind speeds within the range of 0.6–1 ms−1 during early morning and late night hours complement the relatively lower O3. Moreover, wind direction (WD) delivers an understanding of the first order advection process and correlating the surface O3 data with WD provides insight into the impact of air mass transport on surface O3 variability (Lin et al. 2008). Specific WD’s are found to influence air pollutants more than other directions since the emission intensities of pollutants in those directions are comparatively higher because of regional transport (Zhang et al. 2015). Figure 5c shows that the specific directions with lower wind speeds support the accumulation of air pollutants and also the average O3 in different wind direction sectors segregated seasonally and it can be observed that the highest values appear in the ESE- SW sectors. A probable reason can be the presence of two coal based thermal power plants (Badarpur and Faridabad) located in this sector, a few kilometers distant from the study site and contribute to the regional transport of pollutant loaded air in the direction of the study site.

Fig. 5
figure 5

a Average diurnal profiles of surface O3 and WS during the observational period at the study site.The error bars indicate one standard deviation b Contour plots demonstrating the diurnal and monthly variation of WS and WD c Bivariate polar plots depicting relationship between WS-WD and O3-WD

3.5 Statistical analysis

To describe the complex relationships between O3 and the many variables that may cause or hinder its production, we first analyzed whether our data was normally distributed so as to understand whether to apply a parametric or a non-parametric correlation test (S2; in supplementary information). We eventually employed a parametric test (Pearson Correlation Test) to understand the relationship between the quantitative variables. An additional advantage of correlation analysis is that study of parameters with different units is achievable using this method (Özbay et al. 2011). Table 3 presents the correlation values among all variables. As observed, O3 negatively correlated with CO, PM2.5 and RH. Amongst these, RH was most strongly correlated and displayed a significant correlation, r (12) = − 0.585 (at p ≤ 0.05). Surface O3 positively correlated with NOX, CH4, NMHCs, PM10, Temp, WS, WD and UV irradiance. Strong positive relationships were observed between the O3 precursors, CH4 and NOX, r (12) = 0.759 and NMHCs and NOX, r (12) = 0.567 signifying their interactions in the troposphere, with hydrocarbons oxidizing to produce O3 in reactions catalyzed by NOX (Fig. 1). A strong correlation coefficient (0.892) between CH4 and NMHCs proposes that these locally emitted pollutants have a key role in active photochemistry (Nishanth et al. 2014). As discussed before, UV irradiance and WS is negatively correlated with PM10 and PM2.5, validating that PM scatters and absorbs radiation while high WS removes PM. Temperature and UV irradiance share strong significant correlation, r (12) = 0.873. The results are compatible with the available literature (Abdul-Wahab et al. 2005; Yerramsetti et al. 2013).

Table 3 Correlations between input variables

Multiple Linear Regression (MLR) analysis was also implemented to find out the relationship between surface O3 and the explanatory variables (precursors) (S3 in supplementary information). We obtained the following equation,

$$ \begin{array}{l}\left[{\mathrm{O}}_3\right]=36.04-1.38\left[{\mathrm{NO}}_{\mathrm{X}}\right]-13.82\left[CO\right]+44.94\left[CH4\right]-21.50\left[\mathrm{NMHCs}\right]\\ {}+0.04\left[{\mathrm{PM}}_{10}\right]+0.01\left[{\mathrm{PM}}_{2.5}\right]+1.31\left[\mathrm{T}\right]-0.68\left[RH\right]-29.85\left[WS\right]-0.08\left[WD\right]\end{array} $$

A multiple correlation coefficient (R) is the measure of the strength of the relationship between the dependent variable and the ten independent variables selected for inclusion in the equation. In this study, R = 0.998 was observed, suggesting an extremely strong linear relationship. By squaring ‘R’, we identified the value of the coefficient of multiple determinations (i.e. R 2). This statistic enables us to determine the amount of explained variation (variance) in Y (O3) from the ten predictors on a range of 0–100% (Ilten and Selici 2008). Thus we are able to say that 99.64% of variation in [O3] was accounted for through the combined linear effects of the predictor variables and that approximately 99.6% points fall on the regression line and fit the model. Another statistic (adjusted R2 value) compares the explanatory power of regression models, unaffected by the number of predictor variables (Dominick et al. 2012). It was calculated to be 0.960.

Principal Component Analysis (PCA) is a dimension reduction technique for restructuring the data where the output is essentially measured by correlation (Yong and Pearce 2013). This technique is useful in this study because it extracts maximum variance from the data set and thus reduces a large number of variables into smaller number of uncorrelated components, thereby facilitating an identification of the most dominant groups of pollutants and meteorological parameters. PCA was applied by using the monthly averaged data for the variables (O3, NOX, CO, CH4, NMHCs, PM10, PM2.5, Temp, RH, WS and WD) over the measurement period using SPSS software. The Kaiser-Meyer-Olkin (KMO) statistic was used to measure the suitability of the factor analysis (Supplementary information S4); the value was 0.622 and so we were confident of the application of factor analysis on our data set. Additionally, for factor analysis to work, we need a significant Bartlett’s test (significance value p < 0.05), so as to justify existence of some relationship between the input variables. The results indicated that the test was highly significant (p < 0.001) and therefore, factor analysis was appropriate.

In this study, the principle components (PCs) were extracted based on Eigen value (greater than 1) as rest are not significant (Kim et al. 2009) and the eleven initial variables were decreased (extracted) to three PCs. While the first PC accounts for an approximately 32% of the total variance, the cumulative contribution of the three PCs is about 73% variability (Table S5; supplementary information). Each PC corresponds to the collective behavior of the original variables and the Eigen values associated with each variable depicts the amount of variance depicted by that linear component. For a better interpretation, factors are rotated; Varimax rotation, a type of orthogonal rotation has been used because it minimizes the number of variables that have high loadings on each factor and works to make small loadings even smaller (Statheropoulos et al. 1998) (Table 4). The output of the component matrix before rotation contains the loadings of each variable onto each factor but since all loadings less than 0.3 were to be suppressed in the output and so there are blank spaces for many of the loadings. The rotated component matrix is the key output of PCA and contains estimates of the correlations between each of the variables and the estimated components. Through PCA, the concentrations of the meteorological parameters were grouped to PC1, the values of O3 and its precursor gases (NOX, CH4, NMHC) were grouped to PC2 while PC3 corresponds to the concentrations of PM (PM10 and PM2.5). Strong correlations have been witnessed between the traffic related pollutants (CO and NOX) in PC1. A strong correlation of NOX with other pollutants was identified in PC2 which is an indication of a collective influence of urban landscape including the traffic movement, local topography, atmospheric chemistry and physical processes (Beckerman et al. 2008).

Table 4 PCA resolved component matrix after Varimax rotation

We have also calculated correlations between O3, NOX and CO with UV irradiance during different times of the day to find out the role of dominant parameters in determining O3 production during the day (06:00–17:00 h). The analysis period has been subdivided into 3 time zones based on the variation in the diurnal trends of the trace gases: 06:00–08:00 h (early morning), 11:00–13:00 h (midday) and 15:00–17:00 h (late day hours) (Table 5). It can be observed that O3 is strongly correlated with UV irradiance (0.58) during early morning hours. NOx is strongly negatively correlated with O3 during complete day length, signifying the critical role of NOx in O3 production at the study site. Like NOx, CO also shows strong negative correlations with O3 during early morning hours indicating that photochemical activity of O3 begins during morning hours due to precursor availability (i.e., NOx and CO). During midday when O3 reaches peak values, correlations between O3 and UV irradiance are moderate (0.30), but, less as compared to morning hours. As expected CO shows insignificant correlation with O3 (−0.15) diminishing its direct role in formation of O3, however, NOx sustain the O3 buildup even after peak hours. The reported wavelength of UV dataset in the present study is in the range of 280–400 nm. The equivalent wavelength for the dissociation energy of NO2 is ~390.3 nm and hence NOX shows weak negative correlation with UV irradiance during early morning hours but is strongly negatively correlated with UV irradiance during the noon time, suggesting that it photochemically converts to O3. The results establish that oxidation of CO with OH to produce peroxy radicals is more significant in O3 chemistry than its relationship with UV irradiance.

Table 5 Correlations between the different parameters during different time of the day

Moreover, we have attempted to establish how the oxidant concentrations ([OX] = O3 + NO2) vary with the level of NOX. This allows us to understand the variability and behavior of oxidant levels, which depend on numerous factors such as photochemical reactions, convective mixing of pollutants, long range transport or the height of the ABL (Notario et al. 2013). The slope of the relationship between [OX] and [NOX] (Fig. 6a) show the local OX contributions, while the intercept is suggestive of the NOX independent regional contribution (Varotsos et al. 2014). The regression lines of the [OX] vs. [NOX] was:

$$ \left[\mathrm{OX}\right]=0.562\kern0.2em \left[{\mathrm{NO}}_{\mathrm{X}}\right]+8.293\kern0.2em \mathrm{with},r=0.66 $$
Fig. 6
figure 6

Variation of a) [NOX] vs. [OX] b) [NOX] vs. [NO2]/[OX] c) [O3] vs. [NMHCs]/[NO]

This basically suggests that the NOX independent regional contribution which is essentially equivalent to the background O3 value is relatively low than the local contribution which is a linear function of the local emissions and largely dependent on the primary pollution. The variation of [NO2] / [OX] with [NOX] was plotted in Fig. 6b to determine the proportion of [OX] in the form of NO2 and an analysis of the plot reveals a linear relationship with the increase in the levels of the ratio [NO2] / [OX] with increasing levels of NOX suggesting that a major proportion of OX is present in the form of NO2. The regression lines of the [NO2] / [OX] vs. [NOX] was:

$$ \left[{\mathrm{NO}}_2\right]/\left[\mathrm{OX}\right]=0.005\kern0.2em \left[{\mathrm{NO}}_{\mathrm{X}}\right]+0.173\kern0.2em \mathrm{with},r=0.63 $$

We can infer that a major proportion of oxidants at the study site are in the form of NO2 and that relative to the background surface O3 values, the local emissions of primary pollutants are high in the ambient air of the city. Moreover, [NMHCs]/[NOx] ratio was calculated to recognize the role of NOX in generation or inhibition of O3. High [NMHCs]/[NOx] ratio (24.57) was observed which suggests that there exists a NOX sensitive regime at the study site, letting NOX to cater to the role of an O3 generator. This ratio differs for a location and for time of day; during nighttime hours, it was nearly two times (35.28) the ratio observed during daytime hours (17.63). Since, NMHCs do not have any secondary sources in the atmosphere and simply the emission processes near the source regions contribute to their abundance, it can be concluded that vehicular emissions, solvent evaporation, industrial releases etc. during nighttime along with the boundary layer processes are accountable for the high nighttime ratio. A scatter plot showing correlation between [O3] and [NMHCs]/[NOX] (Fig. 6c) demonstrates an inversely proportional relationship between the two parameters, validating that O3 increases with increasing NOX (NOX sensitive). Due to possible emission of particular species of NMHCs, the regime changes of O3 dependency on NOX is quite important.

3.6 Analysis of air mass back trajectories

One day backward isentropic air mass trajectories were computed using the NOAA (National Oceanic and Atmospheric Administration) Air Resource Laboratory (ARL) Hybrid Single Particle Lagrangian Integrated Trajectory (HYSPLIT) model with the Global Data Assimilation System (GDAS) data (http://www.arl.noaa.gov) as an input. The trajectories were calculated at 500 m above ground level (AGL) for the sampling duration and have been represented in Fig. 7a. They have been plotted in order to show the origin of the air masses and their transport pathways to the sampling site and it is apparent that they originate from the continental land masses to reach the receptor site and so allow an influx of gaseous and particulate pollutants. These groups of trajectories have been clustered together (Fig. 7b) to represent the major directions of the air mass offering a better conception of the source regions. Cluster analysis splits the ‘n’ number of trajectories into distinct groups (clusters) based on the transport speed and direction, eventually yielding clusters with similar length and curvature (Brankov et al. 1998; Li et al. 2012). To chart the source potentials of the studied geographical region to facilitate identification of the probable source regions contributing to elevated levels of O3, a potential source contribution function (PSCF) analysis was done (Fig. 7c) using air mass trajectories statistics software, TrajStat (version 1.2.2.6). PSCF is the ratio of polluted trajectory segment endpoints falling in a grid cell to the total number of trajectory endpoints passing over that grid (Wang et al. 2009). The PSCF value can be interpreted as the conditional probability that an air parcel which passed through a grid cell during transport to the receptor site had a high concentration upon arrival at the trajectory endpoint (Hopke 2003). The color coding is used to denote the lowest to highest probability grids (in our study, the 60th percentile of O3 mixing ratio was used as the pollution criterion value) to identify cells containing emission sources. The probable source grids span the densely populated and polluted regions of Northern India, spanning the Indo-Gangetic Plain (IGP) including parts of Haryana, Punjab, Rajasthan, Uttar Pradesh as well as parts of Pakistan which contain key oil fields, refineries and major thermal power plants (Naja et al. 2014). Thus, the air masses flowing from these regions are bound to bring pollutant laden gases to the receptor site and in due course generate secondary pollutants such as surface O3.Nonetheless, PSCF method has a problem in distinguishing between strong and moderate sources. Even though it identifies the potential source areas, it is unable to allocate their discrete contribution to the measured concentrations at the receptor site and so the concentration weighted trajectory (CWT) model is used to help define the relative significance of potential sources. In CWT, every concentration is used as a weighting factor for the residence times of all trajectories in each grid cell and then divided by the cumulative residence time from all trajectories (Cheng et al. 2013). CWT analysis also points out the contribution of the IGP towards pollution at the receptor site (Fig. 7d). The strongest contributors remain parts of Punjab followed by Rajasthan, Haryana and Uttar Pradesh. It is known that these regions are greatly urbanized and highly populated and boast of numerous thermal power plants for energy generation. Moreover, the enormous amounts of burning of agricultural crop residues in these fertile farmlands is a critical issue (Jain et al. 2014) that has also been adjudged a contributing factor (Badarinath et al. 2009) to the high values of gaseous and particulate emissions.

Fig. 7
figure 7

a Five day back trajectories b Clustered Trajectory Analysis c PSCF Analysis d) CWT Analysis of surface O3 during January 2012 – December 2014. The black circle represents the study site at New Delhi

4 Conclusion

The study takes account of the measurements of trace gases (O3, NOX, CO, CH4, NMHCs), particulate matter (PM2.5 and PM10) and meteorological parameters (Temp, RH, WS, WD, UV irradiance) during the period of January 2012 to December 2014. Following are the prominent outcomes of the study:

  • Over the 3 years measurement period, the mean mixing ratio of surface O3, NOX, CO, CH4 and NMHCs were calculated to be 29.5 ± 7.3 ppb, 34.7 ± 11.2 ppb, 1.82 ± 0.52 ppm, 3.07 ± 0.37 ppm and 0.53 ± 0.17 ppm, respectively. Their temporal evolution displayed notable variability revealing high values. The pollution indicator ratio (O3max/O3min) exhibited high value (22.9), confirming high O3 pollution at the study site. The O3 production was observed to occur at a rate of 1.12 ppb h−1.

  • The study recognizes that a separate analysis of O3 and PM on a seasonal timescale is ideal given the difference in their high pollution periods. Linear regression analysis was used to interpret the influence of the chemical constituents (EC, NO3 and SO4 2−) of PM (both fine and coarse fractions) on surface O3 values, revealing negative effects.

  • UV irradiance shows strong positive correlation with O3, temperature and WS and strong negative correlations with PM validating its role in O3 generation, and verifies that PM scatters or absorbs radiation. Both the mean UV irradiance and O3 mixing ratio were found to be highest during summers. But, UV irradiance recorded lowest values during the winter season implying the role of other factors and meteorology in determining O3 levels.

  • An evaluation of the role of meteorology on the mixing ratio of O3 verifies that pollution at the study site is an amalgamation of two factors: high emissions of precursor gases and meteorological factors encouraging the formation and buildup of secondary pollutants.

  • Bivariate correlation analysis, MLR analysis and PCA were implemented on the data set to corroborate the observed relationships and the results supported our findings. It was also found out that a major proportion of the total oxidant concentration [OX] at the study site is in the form of NO2 and local emissions of primary pollutants are high in comparison to the regional background O3 value. High [NMHCs]/[NOX] ratio denote NOX sensitive conditions implying that NOX serves to be an O3 generator at the study site. Strong correlation of NOx with other variables during different daytime period implies the influential role of NOX in O3 formation. Backward trajectory, PSCF and CWT analysis established that the major source regions for O3 pollution are located in and around the city.