Introduction

Non-aqueous phase liquids (NAPLs) exhibit low solubility in water and are characterized by complex migration modes in heterogeneous aquifers. NAPLs can be divided into two categories: light non-aqueous phase liquids (LNAPLs), e.g., BTEX group, and dense non-aqueous phase liquids (DNAPLs), e.g., halohydrocarbon. These liquids are widely applied in industrial and chemical processes, but can pose serious environmental challenges and causes toxicity to ecosystems when they leak into aquifers1,2,3. LNAPLs, which are affected by spills, may either be retained in the vadose zone or infiltrate further down and contaminate groundwater4. NAPL-contaminated sites typically have two main zones, i.e., the source zone and the plume zone. The source zone is commonly characterized by the portion of contaminant trapped in the pore space (i.e., residual phase), and by NAPLs which can be transported through a hydraulic gradient (i.e., free-phase)5. On the other hand, a small fraction of the NAPLs migrate away from the source zone due to water infiltration and groundwater movement, forming the plume zone, which includes the dissolved, adsorbed, or volatilized constituents along with free phase NAPLs6,7,8. For example, the lateral spread of BTEX group above the water table is influenced by hydraulic gradients due to its low solubility.

Precise characterization of contamination plumes is critical, and is often achieved through drilling and sampling to directly access contamination. However, these invasive methods provide only point information rather than a continuous distribution of the contaminant plume. Furthermore, drilling and chemical sampling not only are costly, but also increase the risk of vertical spreading of contaminants9,10. In contrast, the geophysical surveys, including ground penetrating radar (GPR)11,12, electrical resistivity (ERT)13,14, and electromagnetic methods (EM)15,16 are often cost effective and non-invasive techniques for obtaining continuous information on soil properties over a large area at contaminated sites. Accordingly, electrical or EM methods are promising techniques for detecting LNAPLs in the subsurface, as these substances are electrically insulating and therefore cause a decrease in conductivity when they replace water in the pore space. The conductivity anomaly caused by LNAPLs mainly depend on the following factors, including type of LNAPLs and release history, plume location, hydrologic processes, biological processes, etc7,17. Nonetheless, the complex distribution of LNAPLs in soil presents challenges in the field, where the age and degradation state of contaminants can alter groundwater chemistry, i.e., ions are released during biodegradation, or weathering and dissolution of minerals caused by bacteria resulting in organic acids can lead to an increase in groundwater conductivity18. Thus, relying entirely on electrical conductivity for contaminant characterization might lead to the uncertainty in the delineation of LNAPL extent19,20. For example, the 2D ERT method was used by Godio and Naldi21 to identify the low-resistivity anomalies at hydrocarbon contaminants in an industrial site, whereas Rosales et al.22 classified hydrocarbon contaminated soils as high-resistivity anomalies in a former petrol stations survey, underscoring the importance and inclusion of more parameters that should be considered to enhance the interpretation accuracy.

One possible solution to address the limitations of LNAPLs survey with resistivity or conductivity alone is to expand the scope of measurement to include induced polarization (IP). As an extension of resistivity method, IP method can provide information on both the conductive and capacitive properties of the subsurface at the same time, which has a great potential to enable the detection of LNAPLs. Initially, effectiveness of IP in detecting organic contaminants within soils containing clay minerals was observed in the laboratory23 and field measurements24. The results of Vanhala et al.25 reported that soil and organic mixtures were also polarizable even with low clay mineral content, and extended the applications of IP method in characterizing LNAPL contaminated soil. Slater and Lesmes26 further discussed and summarized the effects of different survey situations on IP data at laboratory and field scales. Recently, several laboratory measurements have demonstrated that the changes in chemical, biological and geological conditions can affect the distribution of LNAPLs within the pores27. Moreover, the sensitivity of IP method to geochemical processes occurring at the pore scale has led to its gradual application in delineating the distribution of contamination plume28,29.

However, conflicting IP results were found for both field and laboratory scale measurements. Some laboratory measurements indicate that the presence of NAPLs typically reduces the IP effect30, while others suggest the opposite31. Similarly, contradictory results have also been reported in the field surveys as well. For example, the presence of free-phase NAPLs in an unconfined aquifer has caused a decrease in chargeability8, whereas high chargeability has been associated with the signal of LNAPLs in clay in another field investigation32. Consequently, different interpretations have been proposed to explain the IP responses to LNAPL plume33,34. Despite the more frequent use of IP methods in investigating LNAPL contaminated sites, these methods are not as widely used as ERT method. In addition, field scale IP measurements are often complex and challenging due to various factors such as geological background and contaminant compositions35.

This study presents the application of IP method at a former chemical plant to obtain a reliable delineation of LNAPL plume. Field data obtained from time domain IP surveys are analyzed in relation to the interpolation of sampling data. Specifically, two main zones were characterized, including the source zone and the plume zone. Additionally, a comparison between geophysical imaging and the BTEX concentration underscores the validity and rationality of inversion results. Finally, the polarization mechanisms of LNAPLs in source and plume zones are explored based on the IP response characteristics.

Theory of induced polarization

The IP method, also known as complex resistivity or complex conductivity, is based on injecting electric current through a pair of electrodes, while other pairs measure the resulting voltages. The parameters obtained from these measurements reflect the conductive and capacitive properties of the subsurface, with the latter being associated with the polarization of the ions within the electrical double layer (EDL) formed at the interface between fluid and grain26. The EDL is often assumed to consist of two different layers, i.e., the Stern layer and the diffuse layer. The Stern layer is a stationary layer consisting of counter ions on the grain surface, while the diffuse layer consists of ions weakly attracted electrostatically near the grain/water interface36.

Stern layer polarization and membrane polarization are the two main mechanisms that control the low-frequency polarization of metal-free porous media. The Stern layer polarization mechanism is strongly linked to the ion composition and physical properties of soil, such as grain size, ion concentration and the surface charge density of the Stern layer, which have been described in detail in previous work26,37,38. The presence of an additional electric field causes the generation of IP phenomenon by the EDLs, i.e., both the Stern layer and the diffusion layer generate a secondary field in the opposite direction to the electric field, which gradually returns to its original state once the electric field is removed. It is widely accepted that polarization of the Stern layer is much stronger than that of the diffuse layer26. Conversely, the membrane polarization mechanism is associated with the diffuse layer and pore structure, depicting the accumulation of charge at the diffusion layer or pore throats, facilitating the formation of counter-diffusion through the pores39. Our focus in this study is mainly on the Stern layer polarization and membrane polarization, and we examine these mechanisms in conjunction with the IP imaging results in Section "Results and discussion".

IP measurements can be conducted in both frequency and time domains using different equipment, with time domain IP typically utilizing direct current and frequency IP employing alternating current. In frequency domain, the amplitude (\(\left| \sigma \right|\)) and phase (\(\varphi\)) of an alternating current passing through a soil volume are measured. The total complex conductivity \(\sigma^{*}\) can be calculated as follows:

$$ \sigma^{*} \left( \omega \right) = \left| \sigma \right|e^{i\varphi } = \sigma^{\prime } + i\sigma^{\prime \prime } , $$
(1)

where \({\upomega } = 2{\uppi }f\) is angular frequency (rad), f is frequency (Hz), \(i = \sqrt { - 1}\) represents the pure imaginary number. In addition, \(\left| \sigma \right|\) is the magnitude of electrical conductivity (S/m), \(\varphi\) is phase (rad), \(\sigma^{\prime }\) and \(\sigma^{\prime \prime }\) are the real and imaginary components (S/m) of the complex conductivity \( \sigma^{*}\).

In this study, we will focus on TDIP measurements. In time domain, resistivity or conductivity is measured during the period of current pulse injection. On the other hand, chargeability is recorded when the current is turned off. The intrinsic chargeability m is described as follows41:

$$ m = V_{s} /V_{p} , $$
(2)

where \(V_{s}\) (V) is the maximum voltage measured immediately after the interruption of the current pulse and \(V_{p}\) (V) is the primary voltage of the transmitted direct current (DC). In fact, chargeability m is usually measured as the integral of the decay curve during a definite time window:

$$ m = \frac{1}{{V_{p} }}\mathop \smallint \limits_{{t_{1} }}^{{t_{2} }} V\left( t \right) \cdot dt, $$
(3)

where \(V\left( t \right)\) (V) is the residual voltage at time t after the current interruption.

There is a linear correlation between phase \(\varphi\) and chargeability m. Time domain equivalence with the quadrature conductivity \(\sigma^{\prime \prime }\) is called normalized chargeability \(M_{n}\), which is usually calculated to represent the surface chemical characteristics of porous media26. It can be approximately computed by40,42

$$ M_{n} = k \cdot m \cdot \left| \sigma \right| \cong \varphi \cdot \left| \sigma \right| \cong \varphi \cdot \sigma^{\prime } \cong \sigma^{\prime \prime } , $$
(4)

where k is the linear coefficient (k = 1.3)42. The normalized chargeability can be calculated from the multiplication of chargeability and conductivity, or the ratio of chargeability to resistivity (Eq. 4).

Field applications frequently involve time-domain IP (TDIP) survey43. In this method, conductivity is measured during the on-time Ton, and the secondary voltage decay is recorded during the off-time Toff, which is called 50% duty cycle method (Fig. 1b). A limitation of this method is that conductivity and chargeability measurement are conducted for only half of the acquisition time. Advancements in time-domain hardware allows for a faster survey mode with 100% duty cycle method44, where the current is injected continuously and the IP responses are also recorded during the on-time Ton (Fig. 1a). This advancement substantially increases the signal-to-noise ratio by up to 100% with minimal information loss45. Additionally, another pronounced advantage of this survey mode is the reduction of acquisition time by up to 50%, which is mostly preferred for the field surveys.

Figure 1
figure 1

Illustration of 100% duty and 50% duty cycles time-domain induced polarization survey methods. With 100% duty cycle survey, negative current injection period –Ton immediately follows the positive current injection + Ton. In the 50% duty cycle survey, after current injection during Ton period, current is turned off during Toff period for secondary voltage measurements.

Mechanistic models have been developed to understand the low and high frequency conductivity of sands46,47. The CPA model is suitable if the phase shift variation is negligible in the induced polarization data48. In cases where soils or rocks exhibits little or no frequency-dependent phase shift, a simpler two-parameter CPA model can be used to fit the IP spectrum. The CPA model can be represented as follows:

$$ \sigma^{*} \left( \omega \right) = \sigma_{n} \left( {\frac{i\omega }{{\omega_{0} }}} \right)^{b} , $$
(5)
$$ \varphi = \frac{\pi }{2}b, $$
(6)

where \(\sigma_{n}\) is the magnitude of the complex conductivity measured at some low frequency \(\omega_{0}\) and b is a parameter describing a power-law dispersion in the spectral response.

Materials and method

The following sub-sections demonstrate information about the study area and previous sampling results in the study region, followed by the geophysical survey conducted to measure IP data.

Field site description

The study site is located in a former chemical plant on the outskirts of Beijing, China. The chemical plant was shut down in 2012, and the original buildings in the plant were all demolished. During its three decades of operation, the investigation site was used for chemical tank storage. Note that several tracks used to transport raw materials across the plant have also been dismantled (see black dashed lines in Fig. 2). Previous geological drilling revealed that the shallow subsurface is predominantly occupied by Quaternary alluvial deposits. These deposits form a shallow unconfined aquifer characterized by four heterogeneous units. The backfill layer extended to 2 ~ 4 m below ground surface (bgs) and consisted of silt, sand and gravel materials. Below this layer, it was the main water bearing sandy layer, with occurrence of discontinuous silt and silt clay layers. The main contaminants at the site were characterized as the BTEX group (benzene, ethylbenzene and xylene). The released contaminants are located in this unconfined aquifer, which is confined by a continuous silt clay layer as the bottom impervious boundary at approximately 15 ~ 20 m bgs. The water table at this site fluctuated within the range of 2.9 ~ 5.2 m bgs. Contour mapping of groundwater levels (Fig. 2) indicated that the overall groundwater flow direction was from southwest to northeast, and the water level at the time of measurement was approximately 5 m bgs.

Figure 2
figure 2

Illustration of field survey and interpolation of contaminant concentration. A total of 14 soil and 2 groundwater samples locations were identified within the survey site. The interpolated BTEX surface indicates the primary area of contamination marked in red dashed line (BTEX concentration ≥ 180 mg/kg). On the other hand, blue dashed line represents the source zone (BTEX concentration ≥ 450 mg/kg). Six time-domain induced polarization survey lines were established, each featuring 32 electrodes with 2 m electrode spacing. The water table in the study region is denoted by solid gray lines and numbers against blue background. In addition, the black dashed lines indicate the dismantled track lines.

Previous soil and groundwater sampling

As an integral part of the investigation strategy, both soil and groundwater were sampled to characterize the contamination (see details in Fig. 2). A total of seventeen direct-push boreholes were drilled with Geoprobe, reaching a maximum depth of 6 m. Soil samples were collected from each direct push borehole at 0 ~ 2 m bgs and 2 to 6 m bgs, aiming to target the backfill layer and sandy layer, respectively. Apart from the direct push boreholes, two groundwater sampling wells were drilled to a maximum depth of 10 m. Groundwater samples were taken from screen intervals of 8 to 10 m bgs, with soil sampled at 8 m bgs and 10 m bgs during drilling. Samples were collected/processed and analyzed in strict accordance with the relevant standard49.

The concentrations dataset of soil and groundwater samples collected from boreholes is presented in Table 1. In general, soil with BTEX concentrations exceeding 180 mg/kg were mainly located in areas encompassing locations S3, S19, S20, S21, S24 and S25 at depth of 2–5 m bgs. The total contaminant concentration ranges from 220.4 mg/kg to 853 mg/kg, with the highest concentration samples collected from S21, and the lowest concentration samples obtained from S25. Among these four positions, the highest benzene concentration of 352 mg/kg was recorded in borehole S20, while the lowest was measured in borehole S25 with a concentration of 4.4 mg/kg. For ethylbenzene, the highest concentration of 361 mg/kg was detected in borehole S21, and the lowest concentration of 81 mg/kg was detected in borehole S25. Moreover, excessive xylene was observed only in borehole S21, which had a concentration of 186 mg/kg. In addition, no contaminants were found in soil and groundwater samples collected from boreholes G1 and G2.

Table 1 Concentrations of BTEX group (benzene, ethylbenzene, and xylene) in soil (mg/kg) and groundwater (mg/L) samples. The threshold values are indicated in bold, which are adopted following the relevant standard50.

TDIP survey

Time-domain induced polarization survey was subsequently conducted on chemical measurements in the same area to improve the accuracy of localizing the contaminant distribution using borehole information. Figure 2 illustrates the deployment of six parallel survey lines with a spacing of 20 m to cover the site. To encompass areas with varying contaminant concentrations, all lines were oriented nearly perpendicular to local groundwater flow direction. Each survey line was equipped with 32 electrodes, spaced 2 m apart. Stainless steel electrodes were used for both current injection and potential measurements43. A gradient measurement protocol was adopted, resulting in 294 data points per survey line. A 100% duty cycle with 1 s on-time of current injection and 3 full stacks was used for the survey. Data measurements were conducted using ABEM LS2 (www.guidelinegeo.com), where full-wave form polarization data were measured at a sampling rate of 1 kHz. All contact resistances were kept well below 1 kΩ.

Full-wave IP data were inverted with Aarhus workbench51 by assuming the CPA model52. The CPA model is displayed in the form of inversion parameters (\(\sigma\), \(\varphi\)) and the normalized chargeability (\(M_{n}\)). Every decay curve was process with the method described by Olsson et al.45, and decay curves with all negative chargeability were all discarded. In the end, 274, 269, 248, 256, 259 and 251 data points were retained for the induced polarization inversion, i.e., approximately 88% of the decay curves were retained for each survey line, which indicated a good quality of measured data. Inversion errors were less than 5% for all profiles, with an average pseudodepth of 10 m.

Results and discussion

The subsequent sections include the results from IP inversion representing the delineation of potentially contaminated areas, followed by the integration of IP imaging and chemical results to further delineate the source and plume zones. Finally, we discuss the polarization mechanisms of LNAPLs in different zones based on IP responses.

LNAPL plume characterization with IP images

Prior to the description of IP inversion profiles, the distribution of BTEX is interpolated in Fig. 2, which presents the distributions of total benzene, ethylbenzene and xylene concentrations derived from the soil samples collected at the depths of 2–6 m bgs. Since geophysical methods are not capable of separating contaminant types, the contaminant concentration in this study represents the sum of benzene, ethylbenzene and xylene concentrations, which is similar to findings from previous studies reported by Flores Orozco et al.8 and Xia et al.13. The interpolation results indicate the areas with BTEX concentration greater than 180 mg/kg, delineated by the red dashed line, which illustrates the main contamination location. Specifically, the area with concentrations greater than 450 mg/kg represents the contaminated source zone (marked by the blue dashed line, e.g., S20 and S21), while the area with lower concentrations (i.e., below 450 mg/kg) represents the plume zone, e.g., S24 and S25. Note that sampling results provide specific information on contaminated site.

Figure 3 shows the inversion results for all six profiles based on the CPA model. The conductivity distribution (Fig. 3a) illustrates that the area above the groundwater table in profiles L1-L4 indicates a low conductivity response of less than 5 mS/m. Figure 3b shows that a high phase layer (> 20 mrad) occupies the shallower subsurface with discontinuities in survey lines L1 to L4. The normalized chargeability shown in Fig. 4c demonstrates the contribution of induced polarization effect to conductivity. Specifically, three survey lines, i.e., L2, L3 and L5, are chosen as examples for detailed analysis, and geological boreholes overlap on the geophysical profile as well. Notably, survey lines L2 and L3 cut through the contaminated zone, while L5 served as a background reference with lower BTEX concentration.

Figure 3
figure 3

Time-domain distribution of induced polarization results of six profiles; (a) conductivity, (b) phase, and (c) normalized chargeability.

Figure 4
figure 4

Inversion results of LNAPL contaminant in profile L2; (a) conductivity, (b) phase, (c) normalized chargeability. The black solid lines represent the position of water table. The low conductivity anomaly (< 5 mS/m) marked in black dashed lines indicates the contaminated area consistent with the sampling data. The phase values corresponding to the high resistance anomaly are divided into the source zone marked in red dashed lines (< 20 mrad) and the plume zone marked in blue dashed lines (> 20 mrad). Moreover, the normalized chargeability below 80 mS/m marked in orange dashed lines represents the decrease of polarization within the source zone.

Stratigraphic information obtained from boreholes S21 and S24 close to L2 are consistent, i.e., the vadose zone is mainly composed of backfill, silt and fine sand. However, the IP responses are not identical for different areas with similar geological features. Specifically, a significantly low-conductivity zone (less than 5 mS/m) is observed between depths of 2 and 6 m bgs (Fig. 4a). The intrusion of electrically insulating BTEX leads to a low conductivity response, which is consistent with the sampling results. Although various factors, such as the soil stratum and groundwater level, can affect the electrical response of subsurface media, soil types and moisture content seem to have minimal impacts on conductivity compared to the contaminant intrusion.

At the same depth, the phase continuation is interrupted (Fig. 4b). A lower phase value of less than 20 mrad is marked in the center part by red dashed lines. In contrast, the high phase anomalies above 20 mrad are more evident on the left, as shown by the blue dashed lines (Fig. 4). Interestingly, the corresponding conductivity responses of both areas exhibit low anomalies. The sampling results indicate that these two areas are all contaminated zones, e.g., the BTEX concentration of S21, which corresponds to the red dashed line is 853 mg/kg, while that of S24, which corresponds to the blue dashed line is 243 mg/kg. Therefore, low conductivity accompanied by low phase values signifies a contaminated source zone, while high phase with relatively high conductivity indicates the plume zone. These observations align perfectly with the findings of Johansson et al.53, who reported no obvious phase anomalies in the contaminated source zone, while a relatively strong phase response was detected in the plume area. It is observed that residual free-phase NAPLs could be anticipated from chemical data. This hypothesis is also consistent with measurements from several laboratories18,27,33, suggesting that IP responses are related to the variations in pore properties due to the addition of LNAPLs, and can be explained by the Stern layer polarization or membrane polarization models. Furthermore, normalized chargeability quantifies the magnitude of surface polarization. For porous media comprised of nonmetallic materials, normalized chargeability is related to the characteristics of pore space and surface chemistry, and it can be considered as a direct measurement of the IP effect26. The normalized chargeability (\(M_{n}\)) for the source zone is less than 80 mS/m (marked in orange dashed lines in Fig. 4c), and we speculate that the polarization response significantly decreases due to the high concentration of LNAPLs occupying the pore space within the source zone.

Although conductivity can delineate contaminated areas, it is difficult to distinguish between source and plume zones. This is particularly evident in profile L3 (Fig. 5a). Nevertheless, similar to the results from survey line L2, phase value obtained from time domain IP measurement can be used to effectively distinguish between these two zones. Specifically, phase values corresponding to the low conductivity anomalies less than 5 mS/m are divided into two types: phase less than 20 mrad within the red dashed lines, and phase greater than 20 mrad within the blue dashed lines (Fig. 5b). Soil samples taken from boreholes S20 (675 mg/kg) and S25 (220.4 mg/kg) along profile L3, corresponding to the red and blue dashed lines, are classified as the source zone and the plume zone, respectively. Additionally, the normalized chargeability below 80 mS/m marked in orange dashed lines further verifies the decrease of polarization within the source zone and refines the location of source zone (Fig. 5c).

Figure 5
figure 5

Inversion results of LNAPL contaminated in profile L3; (a) conductivity, (b) phase, (c) normalized chargeability. Legends and description are the same as those in Fig. 4.

In the non-contaminated area, the conductivity and phase values show a different pattern. The estimated conductivity shows a smooth low magnitude of approximately 8 mS/m, and the conductivity values significantly increase in the region below the water table (Fig. 6a). The estimated phase values are generally high exceeding 20 mrad (Fig. 6b), with no abrupt phase decrease in the profile. Note that the high normalized chargeability anomaly in S27 may be related to the silt layer in the vadose zone. Similar research can be referred to Johansson et al.53 and Revil et al.54.

Figure 6
figure 6

Inversion results of non-contaminated in profile L5; (a) conductivity, (b) phase, (c) normalized chargeability. Legends are the same as those in Fig. 4.

As shown in Figs. 46, it is difficult to accurately classify the contaminated areas and different zones based on the phase alone. Especially for the plume zones, the phase response of media invaded by small amounts of contaminants may be similar to that of the clean silt layer, such as the high phase anomaly (> 20 mrad) shown in profiles L3 and L5. Nevertheless, changes in conductivity response due to contaminant intrusion are more pronounced than differences in geological factors, i.e., the conductivity response (< 5 mS/m) is similar in areas with different geological features but similar contaminant concentrations (> 180 mg/kg). Therefore, to accurately classify the contaminated areas and different zones, the phase distribution must be analyzed in combination with conductivity.

Comparison between geophysical imaging and concentration measurements

In this section, we use the estimated induced polarization parameters to mark the scope of the LNAPL contaminated area. Figure 7 indicates the scatter plots between BTEX concentrations and IP parameters extracted from profiles L1–L6. Note that the geophysical data are taken from average cell values at depths of 2–6 m bgs close to the boreholes. Figure 7a indicates the results for 13 boreholes at depths of 2–6 m bgs (S1, S3, S17, S19, S20, S21, S22, S23, S24, S25, S26, S27 and S28 shown in Fig. 2) versus IP parameters, while Fig. 7b,c show the results for 6 boreholes (S3, S19, S20, S21, S24 and S25) within the contaminated area versus IP parameters. The samples analysis segregates conductivity into two distinct regions, i.e., non-contaminated area and contaminated area (Fig. 7a). In particular, the conductivity values at the sampling points corresponding to the BTEX concentrations exceeding 180 mg/kg are below 5 mS/m. Therefore, the threshold conductivity is identified based on the relationship between BTEX concentration and conductivity, which is 5 mS/m. However, it is challenging to distinguish between the source zone and plume zone within contaminated area based solely on conductivity. For example, the conductivity corresponding to source zone with BTEX of 675 mg/kg is 2.62 mS/m, while the conductivity corresponding to plume zone with BTEX of 243 mg/kg is 2.57 mS/m, thus the two zones cannot be effectively distinguished by conductivity alone.

Figure 7
figure 7

BTEX as a function of (a) conductivity, (b) phase, (c) normalized chargeability. Although the conductivity is linearly related to BTEX, the plume and source zones within the contaminated area cannot be directly distinguished. For contaminated area, the maximum difference in phase between plume and source zones is 20 mard (~ 65%), while the maximum difference in normalized chargeability between two zones is more evident (~ 93%, ~ 150 mS/m).

However, a noticeable difference in phase between the source zone and plume zone is observed (Fig. 7b). Within the contaminated area, there is a linear negative relationship between BTEX and phase (slope < 0, R2 = 0.76), where a plume zone with a low concentration corresponds to a high phase, while a source zone with a high concentration corresponds to a low phase. The threshold phase value corresponding to the critical concentration (450 mg/kg) is 20 mrad. In addition, the normalized chargeability threshold for contaminated source zone is 80 mS/m. Since normalized parameters are more sensitive to the chemical properties of material surface compared to conductivity and phase, the difference in normalized chargeability between these two zones is more evident. Specifically, the maximum difference in phase between plume and source zones is ~ 20 mrad, which varies by ~ 65% ((max–min)/max), correspondingly, the maximum difference in normalized chargeability is ~ 150 mS/m, which varies by ~ 93%. On the other hand, the slope of decreasing fit line in Fig. 7c (− 0.2) is smaller than the value in Fig. 7b (− 0.03), which demonstrates a more distinct difference in the normalized chargeability corresponding to the two zones. Nevertheless, the correlation coefficient of the linear fit between BTEX concentration and normalized chargeability (0.88) is greater than that of phase (0.76). Accordingly, as mentioned above, the relationship between IP parameters and BTEX further emphasizes the rationality of inversion results for zone delineation and interpretation. Note that we focused primarily on IP effects in the source and plume zones within the contaminated area delineated by the conductivity distribution, rather than comparing the differences in IP effects between the plume zone and the uncontaminated area. For the contaminated area, the IP effect is significantly greater in the plume zone than that in the source zone, which is verified by the BTEX concentration from the borehole sampling.

A similar numerical simulation study was conducted by Almpanis et al.55, who developed a novel DNAPL-DCIP numerical model and calculated the corresponding resistivity and chargeability responses. The simulation results illustrated that the resistivity distribution effectively delineated the DNAPL source zone, while the chargeability and normalized chargeability distributions accurately delineated the soil variability. It is worth mentioning that in contrast to results from the ideal numerical modelling conditions, the variations in IP response within contaminated area in this study are hypothesized not to be caused by variations in soil but rather by the different geometrical configuration of LNAPL in the pore space. This can be verified from the strata information and contaminant concentration results obtained from the boreholes. For example, despite the consistency in strata information obtained from S21 and S24, the BTEX concentrations measured at these two locations differ and, consequently result in distinct IP responses (\(M_{n}\) and \(\varphi\)) as shown in Fig. 4.

Moreover, based on the chemical sampling analysis, LNAPLs were mainly distributed at depths between 2 m bgs and 6 m bgs, and the averaged geophysical results of six profiles are shown in Fig. 8. The black dashed line with conductivity values less than 5 mS/m reflects the contaminated area (Fig. 8a). The contaminated source zone, characterized by a normalized chargeability less than 80 mS/m and a corresponding phase value less than 20 mrad, is marked by an orange dashed line, (Fig. 8b,c). Sampling points with BTEX concentrations above 450 mg/kg are within the orange dashed lines indicating the source zone, while sampling points with BTEX concentrations above 180 mg/kg are enclosed by the black dashed lines representing the contaminated area.

Figure 8
figure 8

Averaged inversion results between 2 m bgs to 6 m bgs for all six profiles in (a) conductivity, (b) phase, (c) normalized chargeability. Conductivity less 5 mS/m interprets contaminated area enclosed with black dashed lines, and orange dashed lines indicate source zone with normalized chargeability below 80 mS/m, which corresponds to a phase below 20 mrad. The red dots represent the contaminated sampling locations between 2 m bgs to 6 m bgs, and the blue dots represent non-contaminated sampling locations.

Geophysical survey and borehole sampling have distinct requirements for information collection, and can complement and validate each other effectively. Figure 9 demonstrates the difference between interpolation results of geophysical data and interpolation results of chemical sampling analysis. It should be noted that the geophysical results present a more accurate delineation of contaminated area (black dashed lines) than the interpolation (red dashed lines) from the chemical sampling information alone. On the other hand, the characterized source zone based on IP parameters has a different shape and appears to be larger than that inferred from sampling interpolation. The continuous information on the source or plume zone within the contaminated area from geophysical surveys seems to be more convincing than the limited point concentrations from boreholes. However, it is challenging to design the location of the survey lines rationally without prior sampling information. Therefore, a good balance between these two methods facilitates the acquisition of accurate information on the delineation of contaminated zones.

Figure 9
figure 9

Delineation of contaminated areas based on geophysical data. Readers are referred to Fig. 2 and Fig. 8 for the descriptions of red and blue dashed lines, and black and orange lines, respectively.

In addition, by utilizing prior information of the contaminated site, contamination release history is confirmed through the utilization of IP imaging representations. By integrating contaminant and stratum information obtained from boreholes, the anomalies measured on L1-L4 delineate a continuous layer with low conductivity and normalized chargeability, indicating potential contamination originated from multiple locations. No buildings have ever existed on the site other than the tracks used to transport raw materials. Therefore, subsurface contamination may be attributed to the leakage of raw material at rail tracks interchange during transportation.

Induced polarization between plume and source zones

Our results demonstrate that the variations of phase and normalized chargeability related to the BTEX concentration changes are significantly greater than those in conductivity for the contaminated area. The concentration and distribution of LNAPLs in pore space are critical because they can change the micro geometrical characteristics at the pore scale and the current paths through the soil.

For the plume zone where the soil is unsaturated with LNAPLs, there are several possible distributions of free-phase LNAPLs. The distribution of free-phase products within pore space is controlled by the physical characteristics of NAPL (e.g., density and viscosity), and the non-homogeneity of the aquifer parameters (e.g., permeability and storage coefficients)56. In this study, IP response in the plume zone exhibit enhanced polarization characteristics, e.g., phase > 20 mrad or normalized chargeability > 80 mS/m. We focus on the conceptual model in which LNAPLs are present as isolated spheres within the plume zone53, indicating that small droplets are confined to specific soil pores (Fig. 10). This conceptual model of plume zone is similar to the water-wet distribution of LNAPLs in the pore space described by Schmutz et al.33 and Revil et al.18. The increased phase can be attributed to the Stern layer or membrane polarization. Specifically, the intrusion of LNAPL leading to the formation of a water–oil interface within the medium, attracting counter ions similar to the EDL formed at the soil grain-water interface, thus enhancing the IP response of the soil system through the formation of a polarizable EDL57. On the other hand, according to the membrane polarization, the pore space between particles and LNAPLs droplets can be defined as the ion-selective zone, while pore throat constitutes the non-selective zone, i.e., in contrast to the case of uncontaminated media, the pore throat constitutes the selective zone and the non-selective zone consists of pores. A similar process has been described by Titov et al.57. Polarization is enhanced in the presence of LNAPLs as the ion transparency between these zones increases53.

Figure 10
figure 10

Illustration of LNAPL distributions in pore spaces (modified from Flores Orozco et al.8). In source zone with high contaminant concentration, LNAPLs are arranged as a continuous phase in the pore space. With lower concentration next to source zone, residual LNAPL droplets exists in pore space. There is still a complete formation of electrical double layer on the grain surface. In most cases, the decrease in IP effect is due to the free-phase products replacing the pore water required for the formation of the EDLs in which the polarization occurs.

In contrast, laboratory measurements by Shefer et al.34 indicated that the imaginary conductivity decreased for decane contaminated soil under variably saturated conditions. In his hypothesis, the addition of decane may have changed the position or size of anion/cation selective zones, thus causing a variation in the concentration gradient and a decrease in polarization. However, comparing results from previous laboratory measurements is not always straightforward, as there may be differences in experimental conditions and types of NAPLs58. Moreover, under field site condition, residual LNAPLs within soil pores are not all formed in a certain way, i.e., one or more combinations of polarization mechanisms determine the polarization response.

When the soil is predominantly saturated with LNAPLs, such as in the contaminated source zone shown in Fig. 10, there is an interconnected condition in the pore space, with a continuous distribution of LNAPLs. The conceptual model of source zone is similar to the arrangement of NAPL in nearly fully saturated pore space reported by Johansson et al.53. Regardless of the extent of saturation, for source zone with high LNAPLs concentration, insulated LNAPLs that completely occupy soil pores usually prevent current injection into the porous medium and lead to the absence of EDLs, making it impossible to obain IP responses. The decrease in phase response is mainly attributed to the suppression of membrane and Stern layer polarization caused by the substitution of pore water with LNAPLs. Similar results were reported by Cassiani et al.58 and Schmutz et al.33, where experimental measurements of low clay sediments showed that the IP effect decreases with increasing oil content. Consequently, performing IP measurements under field conditions may reveal, a decrease in electrical conductivity (< 5 mS/m) and a decrease in phase responses (< 20 mrad for phase or < 80 mS/m for normalized chargeability) in the source zone within contaminated area.

Overall, the field results in this study show that source zone with high BTEX concentrations in soil correspond to low IP responses, while plume zone with low concentrations enhance IP effect. Similarly, Flores Orozco et al.8 observed that the IP effects (\(M_{n}\) and \(\varphi\)) increase with increasing BTEX concentration below saturation concentration (< 1.7 g/L), while the polarization responses (\(M_{n}\) and \(\varphi\)) show a decreasing trend when the BTEX concentration is higher than the saturation concentration (> 1.7 g/L), which may be caused by the presence of free-phase products replacing pore water.

Conclusions

This study illustrates a typical approach for investigating LNAPLs contamination by employing the induced polarization (IP) imaging method at a former chemical plant. The results illustrate that the variations in phase and normalized chargeability caused by BTEX can be used to effectively distinguish between the source and plume zones within the contaminated area delineated by conductivity.

The interpolation of sampling data indicates that the area with BTEX concentrations exceeding 450 mg/kg represents the source zone within contaminated area. Moreover, IP measurements aid in comprehending the distribution of LNAPLs. The results show that the threshold conductivity value of 5 mS/m defined by the relationship between BTEX concentration and conductivity represent the contaminated area. Although conductivity can delineate the contaminated area, it is difficult to distinguish the source zone from the plume zone. However, phase corresponding to the low conductivity anomaly are divided into two zones, i.e., source and plume zones, which is consistent with the sampling information. Specifically, the threshold phase value corresponding to the critical concentration (450 mg/kg) is 20 mrad. Furthermore, the normalized chargeability below 80 mS/m further verifies the suppression of polarization within source zone and refines the location of source zone. Consequently, geophysical results present a more accurate delineation of contaminated area than the interpolation from the sampling information alone.

Field results further demonstrate that source zone with high BTEX in soil correspond to low IP responses, while plume zone with low concentrations enhance IP effect. In unsaturated soil with LNAPLs within the plume zone, the increased phase may be attributed to the Stern layer or membrane polarization. Despite the different results in previous studies, the enhanced IP response in this case is related to the combination of multiple polarization mechanisms. In the source zone where the soil is essentially saturated with LNAPLs, the insulated LNAPLs fully occupy the pores, which usually leads to the absence of EDLs, resulting in a low phase in the source zone. The different IP responses in two zones is are attributed to the concentration and distribution of LNAPLs within the pore space. Overall, this study suggests that IP is a valuable tool for mapping and characterizing different LNAPLs contaminated zones, and can facilitate the optimization of drillings and monitoring wells for further site assessment and remediation.