Introduction

A large number of studies have been and are being focused on characteristics and emission sources of atmospheric aerosols (Pant & Harrison, 2013; Adam et al., 2015; Dall’Osto et al., 2013; Paterson et al., 1999; Pant et al., 2015; Friend et al., 2013; Lang et al., 2015; Guttikonda et al., 2014; Begam et al., 2017; Sharma & Mandal, 2023; Gargava et al., 2014). Still, it is a task for scientists and environmentalists to precisely quantify the sources responsible for the poor air quality or pollution in the urban regions (Pant & Harrison, 2012). Delhi, the capital city of India, i.e. the hot spot for economy, air quality is at higher risk (Gargava et al., 2014). ~ 3000 metric tons of pollutant emission is recorded every day with the contribution mainly from the industrial sector, traffic sector and agriculture sector (Rizwan et al., 2013). Enhancement in these sectors ultimately boosts the scale of atmospheric pollutants like PM10, PM2.5, PM1, SOx, NOx, NH3 and O3 in the atmosphere. Further long-time exposure to such factors leads to diseases like cardiovascular, respiratory, neurological impairments, risk of preterm birth, mortality and morbidity (Rai et al., 2016; Gupta & Elumala, 2017; Shubhankar & Ambade, 2016). The anthropogenic and natural events are accountable for the emissions of pollutants in the atmosphere (Fuzzi et al., 2015). To grasp the consequences of atmospheric pollutants on climate and human health, adherent attention has been believed towards the chemical composition, morphology, aerosol transport and thus the source apportionment (Zeb et al., 2018).

According to the 2018–2019 report released by the Central Pollution Control Board (CPCB), the number of monitoring station over the country is 731 and the concentration of the criterion pollutants is approximately four times the threshold set by National Ambient Air Quality Standards (NAAQS) (CPCB 2018–2019). Thus, to integrate the high concentrations of criterion pollutants, receptor modelling (multivariate statistical techniques) has been accepted for the identification and quantification of atmospheric pollutant (PM10, PM2.5, PM1) sources in a precise way at the sampling site (Hopke et al., 2006; Pant & Harrison, 2012). Receptor model, i.e. chemical mass balance (CMB), potential component analysis (PCA), edge analysis (UNMIX) and positive matrix factorization (PMF) (Banerjee et al., 2015; Belis et al., 2012; Song et al., 2006), relates the source and the receptor (Chow & Watson et al., 2002; Garcia et al., 2006; Olson & Norris, 2008; Gildemeister et al., 2017). This receptor model works on the basic foundation of mass conservation as:

$$X=G\times F+e$$
(1)

As X denotes the input concentration matrix of defined dimension, G is the source contribution matrix and F is the source profile matrix and e the residual matrix (Paatero & Tapper, 1994a, 1994b). Moreover, for the development of a model, the physical constraints to the system must be accepted (Hopke, 2016; Henry & Kim, 1990), i.e. final data must be imitated by the model explaining the observations, expected source composition and contribution must be non-negative as negative mass emission is not acceptable (Hopke, 2016). Previously most of the source apportionment studies in India had espoused the PCA tool (Banerjee et al., 2015), recent studies have notorious the efficiency of the PMF performance, as it incorporates with the mass concentration and the uncertainties. The basic norms for a true solution in the PMF tool embrace the ratio Qtrue/Qrobust (< 1.5); weighted residuals for the species should have a centred normal distribution (e.g. between ± 3), right appraisal of the signal to noise sensitivity for each species for 100 successive iterations from the bootstrap (BS) test and the correlation r2 > 0.6, consequently perceiving the rotational ambiguity from the displacement test (DISP). Studies as Feng et al. (2022) and Soleimani et al. (2022) explored PMF model to PM2.5 organic tracer as polycyclic aromatic hydrocarbon in Shanghai and Isfahan city in Iran. Similarly, Lima et al. (2022) quantified the heavy metal sources in the urban street dust in Brazil using PMF approach.

Scrutinising the model with the meteorological data (wind speed or wind direction) results in a reliable outcome. Conditional Probability Function (CPF) assesses the probability of wind direction coalesced with specific threshold criteria pollutant (75th–90th percentiles) (Masiol et al., 2019). Moreover, the wind speed is allied with the wind direction and the mass concentration for the precise examination of the sources regions, i.e. Conditional Bivariate Probability Function (CBPF). The application of the CBPF study to the atmospheric pollutants is very limited in India (Bapna et al., 2013, Rai et al., 2016).

This work aims the identification and quantification of emission sources and their percentage contribution to the particulate matter (PM10) over National Capital Region of Delhi, India, using the statistical multivariate tools (PCA/APCS-MLR and PMF). Morphological study has also been incorporated to explore the possible emission sources of the PM10 over the NCR-Delhi. Moreover, the programmed CBPF identify the local source regions contributing to the study sites. In the previous years, a very limited study has been performed following the morphology and CBPF over NCR-Delhi.

Methodology

Site description

Increase in the urbanization and industrialization NCR-Delhi in the centre is walled with the Himalayas in the North, Indo-Gangetic plain (IGP) in the east, hot plains in the south and the Thar desert in the west, such geography adds in deteriorating the atmosphere over NCR-Delhi. The periodical sampling (i.e. twice a week at CSIR-NPL, and IGDTUW-Delhi except Faridabad) of coarse particulate matter  (PM10) was performed at the terrace of the urban sites of Faridabad (28°38′N, 77°29°E), Haryana; Indira Gandhi Technological University for Women (IGDTUW-Delhi), Kashmiri-Gate (28°66′N, 77°23′E), Delhi; and CSIR-National Physical Laboratory (CSIR-NPL), Delhi (28°38′N,77°10′E) for the year 2015. Figure 1 imitates the layout for these three urban sampling sites. Moreover, these locations are walled with traffic junctions, small-large scale industries, constructions, forest, residential area etc. Our previous study (Banoo et al., 2020; Sharma et al., 2014) explored the sites and their meteorology in brief.

Fig. 1
figure 1

Map for study sites over NCR

Sample collection

Before sampling the Pallflex quartz micro-fibres (QM-A) filters of an area (20 \(\times\) 25 cm2) were pre-baked in a furnace at a temperature of 550 °C for 5 h to exterminate the organic scums. These pre-baked filters were desiccated for 24 h in a controlled atmosphere. Desiccated filters were initially weighed using M/s. Sartorius microbalance (resolution ± 10 µg). Furthermore, initially weighed filters were carried out for 24-h sampling of PM10 (following the protocol defined by CPCB, Delhi, India) where the assembly of high-volume respirable dust samplers: AAS 212 NL (make: M/s. Ecotech, India; flow rate 1.2 m3/min; accuracy ± 2%) was mounted on the terrace at a height of 10-m above ground level (AGL). The flow metre was calibrated with an airflow calibrator traceable to National Standard. Sampled filters were weighed and thus the gravimetric method was accepted to calculate the mass concentration as:

$$\text{Mass concentration}\left(\mathrm{\mu g} {\text{m}}^{-3}\right)\text{ = }\frac{\left({\text{F}}-{\text{I}}\right)\times {10}^{6}}{\text{V}}$$
(2)

Where F is the final weigh, I is the initial weight and V is the volume of the sampler defined as [V = flow rate × sampling time].

Chemical analysis

Interaction of focused electron beam with the sample surface atoms can backscatter from the elastic collision reflects the information about the surface morphology. The sample is scanned by the focussed electron beam in a raster pattern (Biswas et al, 2021). The sample was prepared by cutting 5 mm × 10 mm area of quartz filter further made adhere to the field emission scanning electron microscope (FE-SEM). This assembly was coated with platinum and positioned under vacuum condition in a spray tank so for good conductivity and reduction of electron charge. After the coating, the sample was placed into the ultra-high resolution Schottky field emission scanning electron microscope (Model: JSM-7610F) accelerating at 5 kV in a high vacuum mode. Thus, the morphology as spherical, irregular, flocculent etc. was imaged at different resolutions. Sample ID with Blank-01, FBD-23, IT-01 and NPL-05 was carried out for morphological study.

For major and trace elements (Al, Ti, Na, Mg, Cr, Mn, Fe, Cu, Zn, Br, Ba, Mo, Pb etc.) a punch of 3.0 cm diameter of the quartz filter was prepared for analysing. Such samples were positioned in WD-XRF (ZSX Primus, Rigaku, Tokyo, Japan), a non-destructive method. The detailed sample analysis procedure, principle/methodology and calibration standards used are discussed in our previous publication (Jain et al., 2017). Mathematical approach for elemental concentration is defined as:

$$\text{Elemental concentration} \left(\upmu {\mathrm{g m}}^{-3}\right)= \frac{\left(I\times {\text{E}}_{\text{a}}\right)}{V}$$
(3)

where I is the instrumentation reading, Ea is the total filter exposed area and V is the volume of the sampler.

Total organic carbon (TOC) was estimated using the assembly TOC-LCPH/CPN (M/s. Shimadzu, Japan). An area of 4 cm2 of each filters was carried out for TOC analysis (Banoo et al., 2020). TOC is calculated as:

$$\mathrm{TOC }\left(\upmu {\mathrm{g m}}^{-3}\right)=\frac{\left(I)\times \left({\text{V}}_{x}\right)\times ({\text{P}}_{\text{a}}\times {\text{T}}_{\text{a}}\right)}{V\times {10}^{3}}$$
(4)

where I is the instrumentation reading, Vx is the extraction volume, Pa is the punched area, Ta is the total area and V is the volume. Thus,

$$\mathrm{WSOC }\left(\upmu {\mathrm{g m}}^{-3}\right)=\mathrm{ TOC}-{\text{OC}}$$
(5)

A punch of ~ 0.56 cm2 of each filter was carried out for the analysis of organic and elemental carbon using the OC/EC analyser (Model: DRI 2001A, Atmoslytic Inc, Calabasas, CA, USA) by US EPA “IMPROVE-A Protocol” (Chow et al., 2004; Sharma et al., 2014; Jain et al., 2017). Where the concentration was estimated as:

$$\mathrm{EC }\&\mathrm{ OC }(\upmu {\mathrm{g m}}^{-3})= \frac{\left(I\right)\times \left({\text{E}}_{\text{a}}\right)}{V}$$
(6)

I is the instrumentation reading, Ea is the total exposed area of the filter and V is the volume of the sampling.

Ion chromatograph with model: DIONEX-ICS-3000, CA, USA was used to estimate the amount of water-soluble inorganic species (NO3, SO42−, Na+, Mg2+, Cl, F, K+). The analytical error (repeatability) was estimated to be 3–7% based on triplicate (n = 3) analysis. Previous publications (Sharma et al., 2014 a: b; Banoo et al., 2020) discussed the brief calibration and the quality assurance/quality check (QA/QC) for the cited instrumentations. The sample blank filters were also analysed for all species and corrections were applied to maintain the QA/QC of the analysis. The method detection limit (MDL) of each chemical species (OC, EC, WSIC and major and trace elements) is calculated as three times of the average standard deviation of 10 replicates of filter blank analysis (Jain et al., 2017).

Enrichment factor (EF)

In order to identify the natural and anthropogenic sources of elements present in the aerosol samples, enrichment factors (EFs) were computed. Enrichment factor specifies the sources corresponding to the metals presented in the aerosol. Sources like crustal, anthropogenic, natural and sea salt generation are attributed to the metals in the aerosols (Taylor & McLennan, 1995; Gaonkar et al., 2020). EF of elements have been calculated using following equation (Taylor & McLennan, 1995):

$${\text{EF}}=\frac{(\frac{{E}_{{\text{sample}}}}{{X}_{{\text{sample}}}})}{(\frac{{E}_{{\text{crust}}}}{{X}_{{\text{crust}}}})}$$
(7)

where elemental concentration (Esample), reference element concentration (Xsample), elemental concentration in the upper continental crust (Ecrust) and reference element concentration in the upper continental crust (Xcrust) are defined accordingly. As (Al, Si, Ti and Fe) are not affected by contamination and are abundant in Earth crust, thus preferred as reference crustal elements (Saad et al., 2018). Al is considered as the reference crustal element in the present study.

Coefficients of divergence (COD)

In order to appreciate the intra-urban mass concentration variability, Wilson et al. (2005) proposed COD approach. It is defined as:

$${\text{COD}}_{\text{ij}}\text{=}\sqrt{\frac{1}{{\text{n}}}\sum_{\text{k=1}}^{\text{n}}{\left(\frac{{X}_{ki }-{X}_{kj}}{{X}_{ki }+{X}_{kj}}\right)}^{2}}$$
(8)

where Xki and Xkj are the kth mass concentration at i and j location, and n represents the number of data points (Krudysz et al., 2009; Pakbin et al., 2010). COD value in the range (0–1) depicts the homogeneity in the sites, i.e. similar pattern of mass concentration. COD > 1 or 1 reflects the inhomogeneity in the sites thus different profiles of mass concentration (Pakbin et al., 2010). Present study showed homogeneity across the sites as Faridabad-IGDTUW-Delhi (COD—0.039), IGDTUW-Delhi-CSIR-NPL (COD—0.031) and among CSIR-NPL-Faridabad (COD—0.026). It was noted that among these sites, COD is ̴ 0 which means the homogeneity between the sites, thus the similarity in PM10 mass concentration. Kong et al. (2010) and Liu et al. (2017) reported the similarity and the dissimilarities in the concentration of polycyclic aromatic hydrocarbons (PAHs) among five cities of Liaoning province and Shanghai, China. The detailed homogeneity and inhomogeneity in the concentration of PM2.5 and PM10 among the 10 sites across Los Angeles were also discussed by Pakbin et al. (2010).

Source apportionment

Positive matrix factorization (PMF)

To accomplish the PM10 source apportionment, present study used the US Environmental protection agency-positive matrix factorization (US-EPA PMF 5.0) software. Like other receptor models, PMF aims to work on chemical mass balance formulation (Paatero, 1997, Paatero & Tapper, 1994a, 1994b) as given in Eq. 1.

Where the weight is envisioned by minimising the function Q (global or local) and the optimal solution as:

$$Q=\sum \left(\frac{\text{e}}{{\text{U}}}\right)$$
(9)

U is the uncertainty matrix of data point, calculated as:

$$U=\left\{\begin{array}{c}\sqrt{{\left({\text{C}}\times {\text{E}}_{\text{f }}\right)}^{2}+{\left(\text{0.5}\times {\text{MDL}}\right)}^{2}}\\ \\ \frac{5}{6}\times {\text{M}}{\text{D}}{\text{L}} if C<MDL \end{array}\right.$$
(10)

As C is the concentration, Ef is the error fraction and MDL is method detection limit (Gianini et al., 2012; Polissar et al., 1998). EPA-PMF 5.0 version encompasses the signal/noise ratio (S/N) for each species and thus the error estimation EE (Brown et al., 2015).

$$\frac{S}{N}=\left\{\begin{array}{c}<0.5 Bad species\\ 0.5<1 weak species \\ >1 strong species\end{array}\right.$$
(11)

The EE, bootstrap (BS) confront the random errors, displacement (DISP) tackles the rotational ambiguity and BS-DISP comprises the rotational ambiguity and random errors effects, thus to precisely comprehend the uncertainties (Brown et al., 2015, Norris et al., 2014).

Principal component analysis–multilinear regression (PCA-MLR)

In this study, PCA-MLR was applied to the species of PM10 using the software IBM SPSS to estimate the percentile contribution of individual sources. The key objective of PCA is to reduce the dimensionality thus get the total variability of the data in a lesser number of factors with maximum variance (Larsen & Baker, 2003; Jolliffe & Cadima, 2016; Paul et al., 2013). Furthermore, to identify the percentile contribution absolute principal component score (APCS), i.e. Xk is coupled with MLR analysis (Larsen & Baker, 2003; Thurston & Spengler, 1985; Prakash et al., 2018).

Where,

$$y \left({\text{PM}}_{10}\right)=\sum_{k=1}^{n}{\text{a}}_{\text{k}}\times {\text{X}}_{\text{K}}+b$$
(12)

is the multi-linear equation with ak as slope and b as intercept, and Xk is the APCS.

The variables are normally standardized as:

$$S= \frac{\mathrm{variable value}-{\text{mean}}}{\mathrm{standard deviation}}$$
(13)

So that, the intercept b = 0, and thus variables are equated through the regression coefficients as:

$$A=\sum_{k=1}^{n}{\text{B}}_{\text{k}}\times {\text{X}}_{\text{k}}$$
(14)

To predict the modelled mass concentration and source contribution, the APCS is multiplied with the regression coefficient and thus obtained the source’s contribution. PCA regenerated matrix error is minimized through APCS (Ogundele et al., 2016).

Conditional bivariate probability function (CBPF)

Conditional probability distribution programme was firmed to scrutinize the influences of sources in various local directions. Furthermore, it gages the probability of a source positioned within a specific wind direction sector, demarcated as:

$$Y = \overline{Y } .\mathrm{sin }\left(\frac{2\pi }{\theta }\right) ; X = \overline{Y } .\mathrm{cos }\left(\frac{2\pi }{\theta }\right)$$
(15)

Y and X are the wind components, with θ and \(\overline{Y}\) as the average wind direction and speed. In CBPF, the individual grid mean mass concentration is calculated and differentiated (coloured) within the reference of wind speed coupled with conditional probability function (CPF) therefore discriminates the emissions source region (Uria-Tellaetxe & Carslaw, 2014).

Mathematically defined as:

$${\text{CBPF}}=\frac{{m}_{\Delta \Theta ,\Delta u}}{{n}_{\Delta \Theta ,\Delta u}}$$
(16)

mΔθ, Δu is the number of events in the wind sector and interval (Δɵ, Δu) with concentration C.

nΔθ, Δu is the total number of events in (Δɵ, Δu).

Open air package in R-studio software is used to program CBPF (http://www.rstudio.com/).

Results and discussions

PM 10 concentration

Table 1 summarizes the annual mass concentration and chemical constituents, i.e. carbonaceous, major-trace elements and the water-soluble ionic components (WSIC) of PM10 over NCR-Delhi (Faridabad, IGDTUW-Delhi, CSIR-NPL). It is studied that for all the three-study locations, the annual mass concentration of PM10 is ̴ 3–4 times higher than the standard defined by National Ambient Air Quality Standard (NAAQS), i.e. annual—60 µg m−3 (Faridabad, 195 ± 121 µg m−3; IGDTUW-Delhi, 275 ± 141 µg m−3; CSIR-NPL, 209 ± 81 µg m−3). Enumerated the comparable mass concentration pattern by the several researchers (Gupta et al., 2018; Jain et al., 2017; Mandal et al., 2014; Perrino et al., 2011; Tiwari et al., 2013) as 161 ± 80 µg m−3; 285 ± 26 µg m−3; 369 ± 220 µg m−3; 238 ± 106 µg m−3 and 183.0 µg m−3, respectively over Delhi. Bawase et al. (2021) recorded PM10 concentration as 260 ± 107 µg m−3 over Delhi-NCR. The high vehicular traffic, intricate pollution source, biomass burning, secondary aerosols, soil dust, manufacturing actions, meteorological conditions etc. might be the prime actions for such higher mass concentration (Sharma et al., 2018, Jain et al., 2020). A brief discussion of the seasonal and annual mass concentration with the carbonaceous constituents (OC, EC, WSOC, POC, SOC) has been already discussed in our previous publication (Banoo et al., 2020).

Table 1 Annual average concentration of the PM10 constituents at the study sites of NCR

Water soluble ionic components (WSIC)

Table 1 represents the analysed nine WSIC (F, Cl, SO42−, NO3, NH4+, Na+, K+, Mg2+, Ca2+) of PM10. The pattern for WSIC was observed as for Faridabad, NO3 (19. ± 13.0 µg m−3) > SO42− (13.3 ± 10.5 µg m−3) > NH4+ (8.1 ± 5.8 µg m−3) > Cl (7.2 ± 5.7 µg m−3) > Na+ (3.2 ± 4.3 µg m−3). For IGDTUW-Delhi, SO42− (21.2 ± 11.0 µg m−3) > NH4+ (18.1 ± 9.6 µg m−3) > NO3 (17.0 ± 10.7 µg m−3) > Cl (12.7 ± 9.1 µg m−3) > K+ (3.7 ± 1.8 µg m−3). For CSIR-NPL, SO42− (20.2 ± 16.6 µg m−3) > NO3 (18.9 ± 21.8 µg m−3) > NH4+ (10.8 ± 9.9 µg m−3) > Ca2+ (9.3 ± 6.5 µg m−3). Bawase et al. (2021) reported an analogous study of secondary inorganic ionic species as SO42− (11.97 ± 6.19 µg m−3), NH4+ (9.95 ± 7.16 µg m−3) and NO3 (9.13 ± 8.30 µg m−3) over Delhi-NCR. WSIC contributed (32.5%, 28% and 38%) to PM10 mass for the site (Faridabad, IGDTUW-Delhi and CSIR-NPL), thus dominated the major-trace elements. Sufficient quantity of ammonia is the base in atmosphere to significantly neutralize the particle fraction of nitrate, sulphate and chloride which could be achieved with the aerosol electro-neutrality relationship between ammonium (Sharma et al., 2015). Figure 2 represents a linear correlation of 2*[SO42−], [NO3], 2*[SO42−] + [NO3] and 2*[SO42−] + [NO3] + [Cl] concentration with NH4+ concentration for all the three sites. A very significant correlations was observed as [NH4+ ] vs 2*[SO42− ] (R2 = 0.51; Faridabad), (R2 = 0.68; IGDTUW-Delhi), (R2 = 0.62; CSIR-NPL). For [NH4+ ] vs [NO3] (R2 = 0.70; Faridabad), (R2 = 0.87; IGDTUW-Delhi), (R2 = 0.68; CSIR-NPL). For [NH4+ ] vs 2*[SO42−  ]+ [NO3] (R2 = 0.84; Faridabad), (R2 = 0.82; IGDTUW-Delhi), (R2 = 0.69; CSIR-NPL). [NH4+] vs 2*[SO42− ] + [NO3] + [Cl] (R2 = 0.83; Faridabad), (R2 = 0.80; IGDTUW-Delhi), (R2 = 0.80; CSIR-NPL). This significantly positive regression reflects the associations of SO42−, NO3 and Cl with NH4+. Jain et al. (2020) reported 21.2% of the secondary aerosol’s contribution to PM10 whereas Sharma et al., (2016) reported ̴ 40% of WSIC contribution.

Fig. 2
figure 2

Charge balance. a [NH4+] vs 2*[SO42−]. b [NH4+] vs [NO3]. c [NH4+] vs 2*[SO42−]+[NO3] . d [NH4+] vs 2*[SO42−] + [NO3] + [Cl] of PM10

Major-trace elements

A total of 15 elemental species (Na, Al, Mg, S, Cl, K, Ca, Ti, Cr, Mn, Fe, Cu, Zn, Pb and Br) species were diagnosed for the locations Faridabad, IGDTUW-Delhi and CSIR-NPL of Delhi-NCR. Statistical values are comprised in Table 1. Over Faridabad, the concentration of Ca (7.4 ± 5.8 µg m−3) was found to be higher followed by Al (5.9 ± 2.3 µg m−3), S (4.5 ± 2.9 µg m−3), K (4.4 ± 4.9 µg m−3), Fe (4.3 ± 2.9 µg m−3) and Cl (4.3 ± 4.2 µg m−3) respectively. For IGDTUW-Delhi, Ca (9.7 ± 6.4 µg m−3) showed the higher concentration with Al (5.3 ± 3.6 µg m−3), Fe (4.9 ± 3.9 µg m−3), S (4.6 ± 3.2 µg m−3) and K (4.0 ± 2.7 µg m−3), respectively. Over CSIR-NPL, the concentration profile for elements were recorded as Ca (7.4 ± 3.6 µg m−3), Cl (6.0 ± 4.2 µg m−3), S (5.0 ± 2.1 µg m−3) and K (4.1 ± 2.2 µg m−3), respectively which reflected the dominancy of the crustal elements over the heavy elements for all the three sites. Moreover, taking the average of the crustal elements (Ca, Mg, K, Mn, Al, Ti, Fe and Na) and the corresponding total mass concentration for each observation site, the crustal elements percentage is calculated, i.e. the crustal elements correspond to Faridabad, IGDTUW-Delhi and CSIR-NPL contributed as 12.2%, ̴ 8% and ̴ 11% to the PM10 concentration. Parallel studies like Jain et al. (2020) and Bawase et al. (2021) reported 14.1% and 11–21% of the crustal elements to PM10. The spectral distribution (intensity vs 2θ) of the elements is enclosed in the supplementary study S1 (Figure S1; in supplementary information). Moreover, Fig. 3 shows a significantly positive correlation among the crustal elements (i.e. Al vs Mg, Ca, Ti and Fe) for Faridabad, IGDTUW-Delhi and CSIR-NPL: Mg vs Al (R2 = 0.72, R2 = 0.86, R2 = 0.89), Ca vs Al (R2 = 0.78, R2 = 0.67, R2 = 0.77), Ti vs Al (R2 = 0.60, R2 = 0.62, R2 = 0.86) and Fe vs Al (R2 = 0.73, R2 = 0.95, R2 = 0.81), respectively, thus emulating the profusion of mineral dust over the sites of Delhi-NCR. Additionally, for the locations Faridabad, IGDTUW-Delhi and CSIR-NPL, the ratios were calculated as Mg/Al (0.21, 0.25, 0.26) with range 0.1–1.0, 0.1–0.5 and 0.1–0.3 which are adjacent to the upper continental crust (UCC) value (Mg/Al, 0.17) (Taylor and McLennan, 1995). Ca/Al (1.0, 1.9,1.2) with range 0.1–1.7, 1.0–6.1 and 0.5–2.1 which are quite higher than the UCC values (Ca/Al, 0.37). Fe/Al (0.6, 0.9, 0.5) with range 0.03–1.1, 0.6–1.4 and 0.01–1.05 which are fairly close to the UCC (Fe/Al, 0.43) (Taylor and McLennan, 1995). The higher values of the ratio Ca/Al and Fe/Al signify that the observation sites are enriched in Ca and Fe. Srinivas et al. (2011) reported Fe/Al, 0.42; Ca/Al, 0.27; and Mg/Al, 0.24 across Bay of Bengal. Sharma et al. (2020) quantified Fe/Al, 1.07 and Ca/Al, 1.36. Fe/Al, 0.59 was reported at Nainital (Kumar & Sarin, 2009). For North Indian Plains, Fe/Al ranged 0.55–0.63 (Sarin et al., 1979). Recently, Sharma and Mandal (2023) also reported the similar sources of particulate matter over Delhi base on long-term monitoring.

Fig. 3
figure 3

Relationships (Al vs Mg, Ca, Ti, Fe) among the crustal/mineral/dust elements of PM10 over the three study sites

Enrichment factor (EF)

To comprehend the root of the above analysed elements (e.g. crustal, natural or anthropogenic), EF is demarcated into ranges; elements with EF < 5 are classified as crustal or natural origination, 5 < EF < 10 are mostly anthropogenic and natural, and elements with EF > 10 are considered as purely anthropogenic nature (Sharma et al., 2014, 2021; Sharma & Mandal, 2023; Jain et al., 2020). The annual EF for the study sites is represented in Fig. 4. Homogeneity in the sites (Faridabad, IGDTUW-Delhi and CSIR-NPL) resulted in analogous pattern of EF over the study sites. The elements (Ca, Mg, K, Mn, Al, Ti, Fe, Na) showed lower EF < 5, which accredited the crustal/soil dust. Elements (Mo, Zn, Cu, Cr, Pb, Ni) reflected EF > 10 or higher EF attributed to the anthropogenic sources. Study over Delhi (Sharma et al., 2021) reported similar pattern of lower EF (Al, Mg, Ti, Fe, Ca and K) and higher EF (Cr, Cu, Zn, Mo, Pb and Ni). The seasonal EF pattern for the three sites is presented in Figure S2 (see the supplementary information).

Fig. 4
figure 4

Enrichment factor of elements in PM10 at  the study sites

Morphology

Backscattered electron from the elastic collision comprehends the surface information (shape and size etc.) (Biswas et al., 2021). Morphological study of PM was done to grasp the idea about the possible sources towards the PM. The measurement scale for FE-SEM instrument is 10 μm for the resolution range × 1100–3000, 1 μm for × 3000–30,000 and 100 nm for greater than × 30,000 resolutions. Figure 5 shows that at a lower resolution (× 1500, 10 µm), the morphology was homogenous and difficult to singularize the shape, and at higher resolution (× 3000, 1 µm), distinct morphology of the PM was observed. Figure S3 represents the morphology corresponding to × 33,000, 100 nm, i.e. 0.1 µm for the three study sites. Blank in Fig. 5 represents the morphology of the quartz filter before sampling (i.e. no deposition). Pragmatic morphologies were analogous across the study sites (Table 2). In Fig. 5, morphologies of the PM at Faridabad and IGDTUW-Delhi were observed as spherical/fly ash, irregular and flocculent shaped, whereas at CSIR-NPL, irregular and flocculent types of morphologies were imaged. Spherical or the coal fired fly ash particulate is generally instigated from the incineration activities, high temperature combustions and biomass burning with the elemental composition (K, Al, Na, Mg etc.) (Zeb et al., 2018). Fe, Mn, Al etc. are the elemental compositions towards the irregular particulates, which are usually initiated through the crustal/soil dust and are typically wind-blown (Pipal et al., 2011). Flocculent particulates are basically cluster of spherical particulates of 30–50-nm size, and they are primarily emanated through vehicles (Yin et al., 2020). Such sorts of particulate morphologies over the study sites are fairly acceptable as the sites are hedged with traffic flow, land fields, incineration activities, small-large-scale industries etc. A study sponsored by Delhi Pollution Control Committee (DPCC) imaged the fly ash, soil dust and aluminosilicate’s particulate across the IIT-Delhi, Okhla, Murthal, Punjabi Bhag and Anand Vihar of Delhi (Shiva Nagendra and Khare, 2019). Pipal et al. (2011) found irregular, fine rod, crystalline and spherical types of particulate morphologies at Agra. Moreover, for the better understanding of the identification and quantification of sources, the statistical approach is performed in the following section, where the two approaches (PMF and PCA) are applied.

Fig. 5
figure 5

SEM lower and higher resolution images reflecting the morphology of PM10 for the three locations of NCR

Table 2 Morphological study at the study sites of NCR

Sources and possible regions of PM 10

In this study, the receptor models PMF (I-approach) and PCA (II-approach) have been applied to the species of PM10 accordingly to categorize the sources and their contribution towards the ambient atmosphere of the locations. Furthermore, CBPF was performed to distinguish the local source regions.

I st -approach (PMF)

Both the concentrations and the uncertainty files were measured as an elementary input to the PMF model. Assorted runs were made with the three-error estimation (EE) methods (BS, DISP, BS-DISP) to accomplish the optimal solution for individual location. PMF to the species of PM10 at Faridabad, IGDTUW-Delhi, CSIR-NPL is summarized in Table 3 and Table S1S2 (in supplementary information). BS was proceeded to comprehend the imitated base solution by mapping BS factor to base solution. In this way, BS resample the input point. In the present study, BS factor presented 100% of mapping with the base solution with model uncertainty (7% for Faridabad, 5% for IGDTUW, 8% for CSIR-NPL), signal to noise ratio (S/N > 2) and regression coefficient (R2 > 0.6). Inflections in the swaps among the factor are vindicated by DISP; swapping of factors means there is exchange of features among the factor which is not considered to define a well-defined solution (Paatero & Tapper, 1994a, 1994b; Brown et al., 2015). Current study performed zero swapping among the factors. BS-DISP pacts with the overall error of mapping, swapping, thus change in Q (Brown et al., 2015). With the consideration of EE, PMF model extracted 6-sources of PM10 for each study site. Figure 6 represents the factor profile for individual site.

Table 3 PMF summary for the PM10 over the study sites
Fig. 6
figure 6figure 6

PMF source profile of PM10 for the observation sites (a Faridabad; b IGDTUW-Delhi; c CSIR-NPL)

Faridabad

  • Source-I (soil dust): The first extracted source is soil dust (SD), as Al, Ti and Fe reflected the higher contribution (Fig. 6). This source contributed 15% to the PM10 mass which agrees with the EF results in Fig. 4 as well the correlations in Fig. 3. Al, Ti, Fe, Ca, Na, Mg and K are framed as the marker components towards soil or road dust (Sharma & Mandal, 2023; Balachandran et al., 2000; Banerjee et al., 2015; Begum et al., 2011). Wind-blown local dust from the open field, constructions, re-suspended dust and the brick kiln stations in the vicinity of site could be the prime reason for such emission. Moreover, the long-range transport dust storm from the regions of Afghanistan and adjacent Pakistan in the north-west region contributes to the mass loading.

  • Source-II (unidentified): The second source is mixed/unidentified, as a single component Mn is contributing 12% of the PM10 concentration. Mn is extant in diesel fuel and brake wear, whereas Mn, Ni and Cu are originated from industrial emissions (Jain et al., 2017). Quantification of source based on single component is quite baffling, thus considered as mixed/unidentified (Fig. 6).

  • Source-III (industrial emissions): Third source is identified as industrial emission (IE); the constituents (Cu, Zn, Br and Pb) contributed 14% of the PM10 mass. Zn, Cu, Pb and Cr are defined as the marker to industries (small-large scaled) (Jain et al., 2020; Sharma et al., 2021; Gupta & Karar, 2007). As there are large- and small-scale manufacturing industries, e.g. steel, cotton, jute based, leather based, plastic and rubber, chemical, electrical machinery, wooden and agrobased are running in the new industrial town of Faridabad. Additionally, it also includes the major exportable manufacturers, i.e. shoes, tractor auto. Thus, emissions from such manufacturers could be the key sources for IE.

  • Source-IV (vehicular emissions): Fourth source is extracted as vehicular emission (VE); species (EC, WSOC, Cr, Mo) showed the higher influence to this factor. Sharma et al. (2021) reported EC, Cr, Zn, Mn and OC as source to vehicular emission. Studies (Gupta & Karar, 2007, Sharma et al., 2018, Sharma et al., 2021) quantified EC emission from diesel vehicle. It contributed around 19% to the particulate mass. Toxic smoke emerging from the vehicles exhaust containing Cr dilutes the ambient air. Furthermore, the emissions from the non-exhaust as brake wear, tyre wear road surface wear and resuspended road dust enhance the VE.

  • Source-V (secondary aerosols): Fifth source, due to high contribution of (SO42−, NO3, NH4+), this source is classified as secondary aerosols (SA). SO42−, NO3 and NH4+ in the atmosphere are ascribed by the gases (SO2, NOx, NH3) to particle conversions stimulated at higher and lower temperatures (Seinfeld, 1998, Sharma et al., 2015, Zheng et al., 2015; Song et al., 2006). SO42−, NO3 and NH4+ are the key markers of secondary aerosols. Oxidation of NOx at lower temperature results in the formation of secondary nitrate, whereas high temperature and the solar radiation favoured the formation of secondary sulphate via photochemical reactions. This source contributed approximately 23% to PM10 mass.

  • Source-VI (sodium magnesium salt): Sodium magnesium salt (SMS) is the sixth source contributing 17% (Cl−, Na+, Mg2+ and Ca2+) in PM10 concentration. Kothai et al. (2008) and Kumar et al. (2001) documented Na+, Cl, Mg2+ and Ca2+ as the precursors to sea salt and SMS. But in the present study, the sites are not in the vicinity of any coastal regions. Therefore, it is obvious to consider Na+, Mg2+ and Ca2+ as the markers of SMS. The resolved sources for the site are pretty conceivable, as the site reflects an urban atmosphere, and its topography includes the land field, walled traffic flux, small-large industries and construction stations.

IGDTUW-Delhi

  • Source-I (soil dust): Al, Ti and Fe framed the first source as soil dust. Approximately 16% of PM10 is affianced with such source. Al, Ca, Mg, Fe and Ni are classified as SD sources (Moreno et al., 2013; Chelani et al., 2008). Ca associates the resuspension of bare soil from the agriculture fields by local winds. Small agricultural field on the bank of Yamuna River, road suspended dust and the long-range transport dust boosts such sources at the site. In India, the elemental tracers Ca, Al, Si, Ti, Pb, Cu, Co, Cr, Mg and Ni have been used for such source identification (Chelani et al., 2008; Sharma et al., 2015). Corresponding EF (Fig. 4) and the correlation (Fig. 3) results strengthen this result.

  • Source-II (sodium magnesium salt): Dominant contribution of Na+, Mg2+, Cl and Ca2+ determined the source as sodium magnesium salt. The precursors Na+ and Mg2+ with Cl and Ca2+ could have defined both the SMS and sea salt (SS) (Kumar et al., 2001), as the sites are not located nearby the coastal area, the SS contribution could be lesser, thus assumed as SMS. This has contributed around 18% to PM10.

  • Source-III (secondary aerosols): High concentration of SO42−, NO3 and NH4+ resulted in secondary aerosol. Around 27% of SA have subsidized to PM10. It has been scrutinized that the secondary aerosols are being produced as (NH4)2SO4 and NH4NO3 from the precursors (NOx, SO2 and NH4) (Jain et al., 2018). Profusion of gases NOx, SO2 and NH3 at Delhi supports such emissions over the site. Indian studies as Pant & Harrison (2012) and Khare & Baruah (2010) considered NH4+ as marker to biomass burning and SO42− as coal combustion marker.

  • Source-IV (vehicular emissions): High values of EC, OC, WSOC and Cr outlined the source as vehicular emission. Pant & Harrison (2012) have considered EC, OC, Cr and Zn emissions from road vehicles. Silva et al. (2015) reported Cr and Zn from brake wear. International domain considers EC as marker to diesel exhaust marker, whereas in India, the VE tracers are generally Cu, Zn, Mn and Pb. As the site is walled with the running traffic roads also the railway tracks, residential areas and market place on the other side could be the possible sources for such emissions. VE have contributed 19% to the ambient PM10 at the site.

  • Source-V (industrial emissions): Species as Zn and Pb classified the source from Industrial emission. This source has 11% of contribution to PM10 at the location. Small-medium scaled industries in the vicinity of the site could be the possible reason for such emission. Jain et al. (2020) have reported Zn, Pb and Cr as industrial markers across Delhi. Such heavy elements might be instigated from the functioning small-medium-scale industries in the direction of study site.

  • Source-VI (vehicular + industrial emissions): Cu and Zn could be considered as the marker to both vehicular and industrial emissions. Cu is instigated through wearing of brake lining. Zn instigated as a tracer to electroplating, metallurgic and galvanising industries. Such amalgamated source has contributed 9% to the PM10. Jain et al. (2017) have measured such species from vehicular, whereas Banerjee et al. (2015) and Sharma et al. (2021) defined the species to industrial emission. We note the small-scale industries and the heavy traffics, i.e. the busy inter-state bus terminal (ISBT), railway tracks, congested busy market place at a distance of few metres (< 100 m) from the site. Such sources could be the foremost reason for the VE and IE kind of emissions at IGDTUW-Delhi.

CSIR-NPL

  • Source-I (soil dust): Profuse of Al, Ti, Fe, Mn and Zn framed the source as soil dust, which has contributed 24% to the ambient PM10. Crustal elements (Al, Ca, Mg, Fe, Si, Ti and Na) are generally used as tracers for soil dust and are generally consistent in the upper continental crust composition (Sharma & Mandal, 2023; Sharma et al., 2021). SD not only associates with the mineral soil, but it also emerges from the activities as road dust, constructions, demolition works and other erosive process. The agricultural field (ICAR-Indian Agricultural  Research Institute), central forest, busy traffic roads and residential areas possibly load the SD at the sampling site. Furthermore, the outcomes from the EF (Fig. 4) and the correlations (Fig. 3) supported this source. Sharma et al. (2016) have derived 20.5% contribution, and Jain et al. (2017) reported 20.5% of contribution to SD of PM10 in Delhi.

  • Source-II (sodium magnesium salt): This source has the 20% of contribution towards PM10, as the tracers are found to be abundant as Na+, Mg2+, Cl, F and Ca2+. Cl, F and Na+ are considered as the sea spray indicators. Long-range air parcel from Bay of Bengal and the Arabian sea could be the probable source, but the topography of the site CSIR-NPL reflects no coastal area in its vicinity. Parallel findings were reported in Sharma et al. (2016) with 21.3% contribution. Therefore, source is classified as SMS.

  • Source-III (vehicle emissions): High abundant of Cu with moderately Al premeditated the source as vehicular emissions. Gasoline and diesel vehicles can endorse the non-exhaust (compression) and exhaust (spark) emissions (Jain et al., 2017). Brake linings of vehicle assure the Cu emission. In Indian cities, the VE contribution may vary as 10–80% in atmospheric particulate (Sharma et al., 2016). This study estimated 15% of the VE contribution to atmospheric particulate (PM10). Al may be originated from the soil dust. However, the contribution is moderate towards PM. Thus, defining the source as purely VE.

  • Source-IV (industrial emissions): Pb and Br inferred the factor as industrial emissions (IE). Approximately 8% of this source have a contribution to PM. Leaded fuel may be the indicator to Pb in atmospheric particulate at the sampling site. Pb is also emitted from acid-lead battery as well as other related industries (Jain et al., 2017).

  • Source-V (vehicular + industrial emissions): Species like Zn, Pb and Cl could be the possible sources for both vehicular and industrial emissions, thus their amalgamation. Twelve percent of the PM mass is portioned to this source. Pb with Cl may be attributed to coal combustion. Zn along with Pb attributes to IE as well VE.

  • Source-VI (secondary aerosol + biomass burning): Species SO42−, NO3, NH4+, EC, OC, WSOC and K+ are the possible markers to SA and BB. Around 21% of this amalgamation has contributed to PM10 at the site. Profusion of gases NOx, SO2 and NH3 in Delhi results in the formation of SO42−, NO3 and NH4+. According to Wu et al. (2006), SO42− along with K+ has been defined as marker to wood, biomass burning. Pant & Harrison (2012) study over Asia and Europe considered K+as BB indicator. EC, OC and K+ attribute to BB (Cesari et al., 2018). Jain et al. (2020) have showed 19% contribution of BB towards coarse mode (PM10) concentration. PMF resolved sources are pretty conceivable for each individual site, as they reflect an urban atmosphere with the topography of land field, walled traffic flux, small-large industries, constructions, incineration activities etc.

IInd-approach (PCA)

PCA–Varimax rotation using the Kaiser normalization (k > 0.6) was modelled to the species of PM10 for the sites Faridabad, IGDTUW-Delhi and CSIR-NPL. The rotational matrix to the principal component is embedded in the Supplementary file S1 (Table 3 (a), (b), (c)).

Faridabad

Figure 7 a represents the principal components (PCs) for 23 species of PM10 in Faridabad, PCA to such species built 5-components. In PC1, Na+ and Mg2+ have showed a strongly positive loading value with a total variance of 35.3%. The species Na+ and Mg2+ have been an active marker of sodium magnesium salt (SMS) as conferred above; thus, PC1 has been classified as SMS source. Moreover, it has contributed 11% to PM10 concentration. The second resolved source (PC2) is secondary aerosol + biomass burning (SA + BB), as the precursor of SA (SO42−, NO3, NH4+) and BB (OC, K+, Cl) has showed a strongly and moderately positive loading values with a total variance of 24.5%. This source contributes 30% to PM10 mass. Cland K+ are the sign for biomass burning (Shridhar et al., 2010; Khare & Baruah, 2010; Robinson et al., 2006; Pant & Harrison, 2012). PC3 is defined as soil dust; this source is supported by the crustal elements (Al, Ti, Fe and Cu) with strong positive loading values and the total variance of 9.4%. In addition, there %age contribution to particulate mass has estimated as approximately 36%. The PC4 has been classified as industrial emission, as Zn, Br and Pb represented strong loading values and Cu showed a moderately positive loading value. Furthermore, it showed 13% of contribution to particulate concentration. The total variance corresponding to this component is 7.9%. Amalgamation of industrial emission and vehicular emission (IE + VE) has been classified as PC5. The total variance to this factor is 5.6%. Mo, Cr and SO42− figured a strong and moderate loading value. This has a contribution of 10% to PM.

Fig. 7
figure 7

Rotational components factors resolved by PCA. a Faridabad. b IGDTUW-Delhi. c CSIR-NPL

IGDTUW-Delhi

PCA model was executed to 19 constituents of PM10 in IGDTUW-Delhi, such execution resulted in 6-components. As shown in Fig. 7 b, PC1 has been defined to SMS with a total variance of 17.18%. Na+, Mg2+ and Cl have been identified with strong positive loading factor (> 0.7). Additionally, it has contributed 19.3% to PM10 concentration. The second component (PC2) has been defined as SA. SO42−, NO3 and NH4+ imitated strong positive loading values with 16.8% of total variance, thus contributed in mass concentration by 4.7%. PC3 has assorted as SD, as Al, Ti and Fe reflected high loading value. The corresponding variance and the mass contributed have framed as 14.7% and 12.1%. High and strong positive loading factor of EC, OC and WSOC has delineated the source as amalgamation of VE and BB. Total variance is 14.3% and the mass contribution is 21.4%. Cu, Zn and Pb with the dominant loading factor outlined PC5 as VE. The last factor (PC6) has been identified as IE with the lowest variance of 8.89%. As the elements Cr and Al pointed moderately positive loading factor. Nearly, 14% of IE has contributed PM10.

CSIR-NPL

Modelled PCA to 20 species of PM10 extracted 5-PCs (Fig. 7 c), with different variances and contributions. PC1 has been classified as SA (SO42−, NO3, NH4+ and Cl), with 21.7% total variance. This species emanates in the range of strongly positive loading values. PC2 has inferred to the amalgamation of SMS and SD as Na+, Mg2+, Ca2+, Al, Ti, Fe and K+ have been ranged to loading factor (> 0.5). Further with the variance of 16.9%, its contribution to PM10 is 23%. Strong loading factor with total variance of 16.29%, Cu, Fe and Zn have demarcated the source as IE (PC3). Moreover, 24% of IE has shared to PM. EC, OC, WSOC and K+ with dominated loading factor resulted in BB source (PC4), with a total 12% contribution. PC5 described the amalgamation of IE and VE with a total variance of 10.4%. Zn and Br are the major loaded species. This amalgamated source has showed 7% of contribution to PM10 mass.

Observed/predicted concentration

Observed data points correspond to the statistical calculated concentration, whereas the predicted data points are the model generated data points based on the input data file and the uncertainty (in PMF model). The observed values of the mass concentration (PM10) have been regressed with the predicted data points for PM10 (Fig. 8). Exploring the PMF regression for the three locations, the observed and the predicted values showed a positive relation with R2 = 0.93, R2 = 0.88 and R2 = 0.94 corresponding to Faridabad, IGDTUW-Delhi and CSIR-NPL respectively. The EE and the diagnoses in PMF were made incorporation with both observed and model generated data points. Likewise, Table 4, 5 and 6 represent the APCS-MLR regression of the observed and predicted concentration for individual species with R2 > 0.6. 186 ± 123; 195 ± 121, 256 ± 117; 275 ± 141 and 195 ± 77; 209 ± 81 µg m−3 are the observed and modelled concentrations of PM10 for Faridabad, IGDTUW-Delhi and CSIR-NPL respectively. Furthermore, Fig. 8 shows the positive correlation between the observed and predicted mass concentration (PM10) with R2 = 0.88, R2 = 0.75 and R2 = 0.87 corresponding to each site (Faridabad, IGDTUW-Delhi, CSIR-NPL). Such significant correlation among the observed and predicted/modelled mass concentration strengthened the outcomes of the model.

Fig. 8
figure 8

Comparison of the modelled and the observed values of PM10 for PCA and PMF receptor model

Table 4 PCA-MLR summary for Faridabad
Table 5 PCA-MLR summary for IGDTUW-Delhi
Table 6 PCA-MLR summary for CSIR-NPL

Outline to PMF and PCA

Both the model extracted the probable sources (SD, SMS, SA, BB, VE and IE) and the amalgamation of these sources (VE + BB, VE + IE, SMS + SD, SA + BB etc.) with different %age contribution to mass concentration of PM10. Such extracted sources are quite obvious for the three urban sites of NCR-Delhi. Sampling sites at Faridabad are surrounded by the small-medium-scale industries, traffic junctions, agricultural/open fields, open incineration activities etc. IGDTUW-Delhi is curbed with the Inter State Bus Terminal (ISBT), the railway junction and the small agricultural filed at the bank of Yamuna rivers. The traffic emission is quite high. Furthermore, it is surrounded with the residential and small-scale industries. CSIR-NPL is fenced with the heavy traffic junction, agricultural filed, forest and the residential areas (Banoo et al., 2020). In Delhi, Jain et al. (2017) made a comparable study using PMF (SA 25%, VE 13%, SD 16%, SMS 4%, IE, 11%). Sharma et al. (2016) identified SA 21.3%, SD 20.5%, VE 19.7%, BB 14.3%, FCC 13.7%, IE 6.2% and SS 4.3% using PMF.

Both PMF and PCA outlined a smooth outcome according to their performance and input requirement. The sources extracted by these methods are approximately similar. However, there is difference in the percentage contribution among the similar sources (Fig. 9). Different working mechanism, procedure or requirements could be the possible reason for such differences. In PCA/APCS, input data points are analysed by a statistical approach (Thurston & Spengler, 1985). PMF outcome is constructed by analysing and normalising each single data point (USEPA-PMF 2014). Values below detection limit (BDL) and the missing data point are prudently taken into account; along with the concentration file, it contemplates the uncertainty file (Jain et al., 2017). PMF perceives the rotational ambiguity and random error (USEPA-PMF 2014), which strengthened the outcomes of PMF model as compare to PCA (Banerjee et al., 2015; Jain et al., 2017).

Fig. 9
figure 9

Source percentage contribution profile of PM10 extracted by PCA and PMF for the study sites of NCR

CBPF

Long-range transport of air parcel towards Delhi is dominated from North, North-West, North-East, South-West and South-East, i.e. sources in the region of Indo-Gangetic plain (IGP), Arabian sea and Bay of Bengal, enhances the loading of atmospheric pollutants in Delhi (Banoo et al., 2020; Jain et al., 2020; Shivani et al., 2019). To grasp the directionality of the local source, CBPF plot is programmed as Fig. 10. The mass concentration of PM10 is incorporated with the wind speed (ws) and the wind direction (wd) (Banoo et al., 2021). The dotted radial edges the ws. At Faridabad, the mass concentration (> 75th percentile), i.e. 266 µg m−3, was programmed with the ws (0.4–3.7 ms−1), which advocated the local source in the North-West (N-W), West-South (S-W) and the South-East (S-E) regions of the receptor site aid in enhancing the mass concentration at the site. Moreover, the highly dense residential area, the agricultural field and the trivial traffic influx in the N-W and W-S including the national highway in the S-E direction of the receptor site could be the possible source emitter towards the site (Fig. 10 a). With (> 75th percentile) of mass concentration (378 µg m−3) and the ws ̴ (0.1–1.0 ms−1), CBPF for IGDTUW-Delhi directs the source in the local regions. W-S and S-E could be conceivable cause for mass loading at the site (Fig. 10 b). As the railway tracks, the dense fluxed traffic junctions (ISBT) including the dense residential area in the W-S (Banoo et al., 2020), Yamuna River bank (Gupta et al., 2018) and the small agricultural field in the S-W of the site could be the probable sources for the mass loading at the site. At CSIR-NPL, programmed ws ̴ (0.1–1.5 ms−1) along with the mass concentration (> 75th, 252 µg m−3) proposed the sources in the N, N-E and S-W regions of the receptor site subsidize the mass concentration (Fig. 10 c). This contribution could be from the traffic, as the site is fenced with traffic junctions (Patel Nagar, Rajendar Place, Shadipur) in the N and N-W regions, agriculture field (ICAR-Indian Agricultural Research Institute) and forest (central ridge forest); moreover, the site is fenced with the residential areas (Banoo et al., 2021).

Fig. 10
figure 10

CBPF plot of PM10 for the study sites. a Faridabad. b IGDTUW-Delhi. c CSIR-NPL

Conclusion

In this study, a whole array of chemical species (OC, EC, WOSC, Al, Ti, Na, Mg, Cr, Mn, Fe, Cu, Zn, Br, Ba, Mo Pb, F, Cl, SO42−, NO3, NH4+, Na+, K+, Mg2+, Ca2+) were analysed and used for the identification and quantification of PM10 source over the three urban regions of NCR-Delhi using different approaches (EFs, morphology, PMF and PCA/APCS-MLR). Lastly, framed the CBPF for the directionality of local sources. The conclusions are enumerated as:

  • It was observed that for over the observation sites, the mass concentrations of PM10 were quite higher than the NAAQS of India. The mass concentrations of PM10 were calculated as 195 ± 121 µg m−3 over Faridabad, 275 ± 141 µg m−3 for IGDTUW-Delhi and 209 ± 81 µg m−3 over CSIR-NPL.

  • Different types of morphological images (spherical, flocculent, irregular etc.) defined the possible source of PM10 as biomass burning, road/soil dust, industrial and vehicular emission.

  • Application of PMF to the particulate species over Faridabad extracted the factor SD (15%), IE (14%), VE (19%), SA (23%), SMS (17%) and unidentified source (12%). 6-sources were resolved for IGDTUW-Delhi, SD (16%), SMS (18%), SA (27%), VE (19%), IE (11%) and VE + IE (9%). Similarly, six factors correspond to CSIR-NPL, SD (24%), SMS (20%), VE (15%), IE (8%), VE + IE (12%) and SA + BB (21%). The observed and the predicted values showed a positive relation with R2 = 0.93, R2 = 0.88 and R2 = 0.94 corresponding to Faridabad, IGDTUW-Delhi and CSIR-NPL, respectively.

  • PCA/APCS-MLR resolved 5-sources at Faridabad, SMS (11%), SD (36%), IE (13%), SA + BB (30%) and IE + VE (10%) with different percentage of total variance. 6-sources at IGDTUW-Delhi, SMS (19%), SA (95%), SD (12%), VE (28%), IE (14%) and VE + BB (21%). At CSIR-NPL, it extracted 5-sources, SA (34%), SMS + SD (23%), IE (24%), BB (12%) and IE + VE (7%). Such sorts of sources are fairly acceptable for the three urban sites. Observed and the modelled concentration showed a positive correlation with R2 = 0.88, R2 = 0.75 and R2 = 0.87 corresponding to each site (Faridabad, IGDTUW-Delhi, CSIR-NPL), respectively.

  • CBPF programmed to Faridabad advocated the local source in the North-West (N-W), West-South (S-W) and the South- East (S-E) regions of the receptor site which aids in enhancing the mass concentration at the site. Source in the local regions of W-S and S-E could be conceivable cause for mass loading at the site IGDTUW-Delhi. At CSIR-NPL, the sources in the N, N-E and S-W regions subsidize the mass concentration at the site.