Introduction

Pressure transient signals of underground porous media, including oil, gas, gas condensate, and water reservoirs, are valuable information sources. Despite the high importance of pressure transient response (PTRs), the characterization of fluid flow in porous media from the PTR is challenging. Like all other actual signals, the PTR is often poisoned by noisy data that are difficult to either be distinguished or removed (Moosavi et al. 2018a; Su et al. 2019; Chen et al. 2021). The primary objectives of reservoir fluid flow characterization are to improve the reservoir modeling and simulation accuracy, enhance oil recovery, provide production-performance predictions of different reservoir scenarios, and help determine remaining recoverable oil reserves (Cheng et al. 2016). Generic reservoir characterization utilizes multiple data sources, including well logging, PTR, geology, core analysis, and seismic. Such information helps to elucidate the drive mechanisms, reservoir and wellbore geometry, fluid types, rock properties, saturations, and dominant flow directions (Landa et al. 2000; Abdel-Fattah et al. 2015; Khan et al. 2016; Ciftci 2018; Yang et al. 2020). Some researchers focused on different aspects of facilities used to transport oil and gas from the reservoir to consumers (He et al. 2018; Liu et al. 2019b). A multi-dimensional mixed-integer nonlinear optimization model based on modified particle swarm optimization (MPSO) algorithm is proposed for a large-scale oil and gas gathering system (Liu et al. 2019b).

Pressure transient signals make a key contribution to overall reservoir characterization.

This signal typically represents the reservoir pressure drop during changes the flow rates over specified periods. The PTR is long established as one of the best techniques for predicting well and reservoir properties by matching observed pressure responses with some ideal reservoir flow models (Huang et al. 2018; Nategh et al. 2019).

Naturally and hydraulic fracture reservoirs pose some additional challenges of the uneven distribution and orientation of their fractures and typically more significant heterogeneity than homogeneous reservoirs (Li et al. 2020). Consequently, although such reservoirs are known to contain significant in-place oil resources worldwide (Saidi 1983), their oil recovery factors tend to be small. This circumstance highlights the need for better reservoir characterization to improve fluid flow simulation models leading to more efficient drainage of these complex reservoirs. Variation of fracture propagation by the in situ stress, natural fissure development, temporary plugging, and treatment parameters was studied (Guo et al. 2020b). Fluid flow theory was first applied to naturally fractured reservoirs (NFR) by two research groups (Barenblatt and Zheltov 1960; Warren and Root 1963). Their studies develop a dual-porosity model for fractured reservoirs with two different key parts, i.e., a fracture with high fluid transmissibility and low storage capacity, and the matrix with low fluid transmissibility and high storage capacity. These specific characteristics of dual-porosity reservoirs can be quantified by deriving two useful metrics: interporosity flow coefficient (λ) and storativity ratio (ω). It is an earlier measure interaction between matrix and fractured sections, while the latter represents an amount of fluid stored in the fractured section relative to the total fluid held in the whole reservoir (Egya et al. 2017).

Although the different values of λ and ω produce no unique characteristic shapes in the PTR, they produce entirely different patterns in the pressure derivative (PD) graphs. This feature of pressure derivative plots makes it possible to apply various machine learning algorithms to estimate λ and ω using the characteristic shapes of such plots. Similar to wide applications of machine learning techniques in different fields of science and engineering (Zhang et al. 2018; Yang et al. 2019; Shi et al. 2020), they are also employed to predict reservoir underlying models (Vaferi et al. 2011, 2016; Ghaffarian et al. 2014), while some researches focused on predicting specific values for key reservoir parameters (Eslamloueyan et al. 2010; Şahin and Çiftçi 2016).

Most models proposed to predict reservoir fluid flow metrics using machine learning algorithms have applied artificial neural networks (ANN). Alajmi and Ertekin developed an ANN model applied to NFR (Alajmi and Ertekin 2007). Their model involved a 4th-degree polynomial fit to the semi-log relationship between well-test pressure and time data derived via a simulation model (Sierra 1986). Well-test data combined with rock properties and fluid compositions are considered the input variables for their ANN model. Well-test pressure derivative graphs show much greater sensitivity to λ and ω variability for dual-porosity NFR. Eslamloueyan et al. developed their ANN model so that the digitized pressure derivative graph roles as input for characterization of NFR system through prediction both λ and ω (Eslamloueyan et al. 2010). Deep convolutional neural network (Xu et al. 2018; Li et al. 2019; Chen et al. 2020a; Lv and Qiao 2020; Qian et al. 2020a, b) as a new generation of artificial intelligence techniques is recently applied for reservoir characterization from well-testing data (Daolun et al. 2020; Liu et al. 2020).

This study’s main objective is to develop a powerful smart strategy, named multi-output least squares support vector regression (MLS-SVR), for simultaneous prediction of λ and ω from digitized PD graphs. For this purpose, a dataset including 500 simulated pressure-time patterns is generated for a NFR system drilled by a horizontal well with the infinite acting condition. This dataset is converted to PD graphs and then used to train a multi-output least squares support vector regression model to predict λ and ω. The proposed method does not rely upon any graphical or correlation between variables and requires no prior information about the NFR system and its associated flow regimes. Both simplicity and accuracy of the designed MLS-SVR model are among key advantages of this technique. It saves time in analyzing of pressure response of NFR systems drilled by the horizontal well.

Methods

Pressure transient responses of NFR

In a drawdown test, the flow rate is held approximately constant while the bottom-hole pressure is continuously measured during the production period. Figure 1 depicts a typical pressure transient curve for a NFR system drilled with a horizontal well for a drawdown test. This pressure transient data is generated through a solution of the governing equation for the NFR drilled with a horizontal well by the finite element method. The NFR governing equations have been developed using the Warren-Root approach with some basic assumptions applied, including the uniform thickness of the reservoir with impermeable lower and upper boundaries, radial flow, isothermal, single-phase, and slightly compressible fluid with constant viscosity and rock properties. In this model, only fracture-fed wells are considered, and it is further assumed that each continuum (fracture and matrix) is homogenous and separate. Moreover, as mentioned, constant rock and fluid properties are also assumed.

Fig. 1
figure 1

Schematic of variation of pressure a horizontal well drilled in a naturally fractured reservoir during production period

Radial change in the reservoir pressure results in a fluid movement towards the production well. During the early production period, the bottom-hole pressure changes rapidly due to wellbore storage effects and the limited radius of investigation. Subsequently, as time progresses and the larger volume of the reservoir contributes to production, the bottom-hole pressure changes more slowly. Finally, the bottom-hole pressure represents the behavior of the outer boundaries.

Naturally fractured reservoirs

Naturally fractured reservoirs have two key parameters more than homogeneous ones. Therefore, it is useful to detail how these two parameters, i.e., interporosity flow coefficient and storativity ratio, influence on characteristic shapes of the NFRs. It is evident that they have an essential role in distinguishing pressure transient responses of the NRF from homogeneous reservoirs.

Storativity ratio

The storativity ratio is defined as the fraction of the total pore volume associated with one of the two porosities in a dual-porosity reservoir. Specifically, in NRF, ω refers to the volume fraction of total reserves contained within the fractures. It is defined as follows (Moosavi et al. 2018b):

$$ \omega =\frac{{\left(\varphi\;V\;{c}_t\right)}_f}{{\left(\varphi\;V\;{c}_t\right)}_f+{\left(\varphi\;V\;{c}_t\right)}_m} $$
(1)

where φ and c represent porosity and compressibility factor, respectively. Subscript f and m stand for fracture and matrix, respectively.

The storativity ratio has a significant effect on the short-term fluid production deliverability of a reservoir. The ratio is typically between 0.01 and 0.1, and dual-porosity reservoir analysis is used in conjunction with the interporosity flow coefficient.

Interporosity flow coefficient

Interporosity flow coefficient is directly correlated with the permeability of the matrix segment of the reservoir, but inversely correlated with the permeability of the fractured segment of the reservoir. For dual-porosity reservoirs, λ typically exists in the value range 10−4 to 10−8. This parameter can be mathematically expressed by Eq. (2).

$$ \lambda =\alpha\;\frac{k_m}{k_f\;}\;{L}^2 $$
(2)

where k is the permeability, α shows the interporosity shape factor related to the block’s matrix of the NFR, and L stands for half-length of the horizontal well.

Semi-log graph for NFR

As Fig. 2 shows, the dual-porosity components of a NFR generate two parallel straight lines on a semi-log graph. The semi-log is made up in the detail of three distinct components: (1) transient radial flow period, an early straight line component representing the homogeneous flow of the fractured segment of the reservoir before the matrix segment of the reservoir makes a noticeable contribution, (2) interporosity flow onset period—a transition component joining the two straight line components reflecting the first noticeable contribution from the matrix segment of the reservoir, and (3) composite homogeneous flow period—a later stage straight line component reflecting the combined contributions from the fractured segment and the matrix segment, beginning when fluids in the fracture segment are consistently replenished by fluids flowing from the matrix segment of the reservoir.

Fig. 2
figure 2

Drawdown semi-log plot for a horizontal well drilled in a NFR

Effects of variation of interporosity flow coefficient and storativity ratio on a semi-log plot of a NFR drilled by a horizontal well are shown in Figs. 3 and 4, respectively. In Fig. 3, it is noticeable that as the interporosity flow coefficient increases, the first straight line component becomes shorter, and the transition component occurs sooner. Since the second zone on a semi-log plot (i.e., the transition component) involves some fluid flow from the matrix segment to the fractured segment of the NFR, any decrease in the permeability of the fractured segment or any increase in the permeability of the matrix segment will cause the fracture storage contribution to flow to be depleted more rapidly and the interporosity flow onset period to commence earlier.

Fig. 3
figure 3

Effect of interporosity flow coefficient on semi-log response of a horizontal well in a NFR (ω=0.055)

Fig. 4
figure 4

Effect of storativity ratio on semi-log response of a horizontal well in a NFR (λ=1 × 10−8)

As the storativity ratio increases, the first straight line component becomes shorter, the transition component commences earlier, and the transition to the composite homogeneous flow period also commences earlier.

Traditional methods for estimation of ω and λ

Mathematical models are widely used to interpret PTR to establish reservoir characterization purposes. A traditional approach employs semi-log and log-log plots to provide a detailed analysis of pressure transient information. To establish meaningfully accurate reservoir interpretations, these traditional methods need to be constrained in certain ways. For example, distinct flow regimes applicable to specific time intervals must be clearly delineated on both pressure and pressure derivative plots. If the flow regimes applicable to the specific time elapsed intervals are unknown, type curve matching methods fail to provide unique solutions. In such situations, a trial-and-error approach is required to apply various flow regimes to find the best fits to the semi-log curves. The direct synthesis technique avoids type curve matching to derive reservoir metrics from pressure transient data (Tiab 1989). It simultaneously uses pressure and PD curves to analyze vertically (Tiab 1994) and naturally fractured reservoirs (Engler and Tiab 1996) by exploiting analytical and empirical correlations.

Pressure derivative for NFR system

Remarkably, the pressure derivative technique is a powerful and well-established interpretation tool in providing meaningful analysis of pressure transient signals (Bourdet et al. 1989; Escobar et al. 2018). As expressed by Eq. (3), this technique uses three points (i.e., pressure drop versus superposition time) for calculation of pressure derivative at a given point.

$$ {(PD)}_k=\frac{\frac{{\left(\Delta p\right)}_k-{\left(\Delta p\right)}_{k-1}}{\ln\;{\left(\Delta t\right)}_k-\ln\;{\left(\Delta t\right)}_{k-1}}\;\left[\ln\;{\left(\Delta t\right)}_{k+1}-\ln\;{\left(\Delta t\right)}_k\right]+\frac{{\left(\Delta p\right)}_{k+1}-{\left(\Delta p\right)}_k}{\ln\;{\left(\Delta t\right)}_{k+1}-\ln\;{\left(\Delta t\right)}_k}\;\left[\ln\;{\left(\Delta t\right)}_k-\ln\;{\left(\Delta t\right)}_{k-1}\right]}{\ln\;{\left(\Delta t\right)}_{k+1}-\ln\;{\left(\Delta t\right)}_{k-1}} $$
(3)

These PD plots are more useful for parameter estimation and model detection than traditional ones (Eslamloueyan et al. 2010; Vaferi et al. 2011; Wang 2016). The PD technique specifies the real interpretation model of fluid using pattern matching of observed signal with some standardized type curves. After that, it is possible to quantify reservoir properties’ values (Bourdet et al. 1989; Bourdet 2002; Tiab and Donaldson 2015).

Theoretically, the dual-porosity behavior of NFR usually appears as three distinct flow regimes in pressure derivative graphs: (1) transient radial flow period, an early straight line component associated with production from the fractured segment of the reservoir, (2) interporosity flow onset period, commencing in the middle period of the test representing the first noticeable production contribution from the matrix segment of the reservoir into the fractured segment, and (3) composite homogeneous flow period, the final test period during which simultaneous production from matrix segment and the fractured segment of the reservoir both contribute. The transition phase from the first flow regime to the second one appears as a distinct hump on the pressure derivative curve.

Figure 5 highlights the effect of changing the storativity ratio on the pressure derivative curves of NFR having horizontal well. As Fig. 5 clearly shows, by decreasing the storativity ratio, the first flow region finishes earlier. The transitional hump between first and second flow regimes moves to the left on the log-log plot. Also, the trough representing part of flow regime 2 progressively appears earlier as ω decreases. This results in the third flow regime commencing earlier as ω decreases.

Fig. 5
figure 5

Effects of variations of ω on the pressure derivative curve of a horizontal well in a naturally fractured reservoir

Figure 6 presents the effect of variations in λ on the characteristic shape of the pressure derivative curve of NFR system. The earliest hump to appear in the pressure derivative curve is related to production from the fractured segment of the NFR. A decrease in λ can be induced by either decreasing the permeability of the matrix segment of the reservoir or increasing the permeability of the fractured segment of the reservoir. Such a decrease in λ will cause decreases in the pressure decline and the pressure derivative, which will lead to the earliest hump in the pressure derivative curve being more pronounced by extending over a more extended period. The second hump in the pressure derivative curve in Fig. 6 reflects fluid flow from the matrix segment to the fractured (i.e., third flow regime). An increase in λ can be induced by either decreasing the fracture permeability or increasing the matrix permeability. Such an increase in λ results in storage in the fractured segment of the reservoir being depleted earlier, leading to the third flow regime commencing earlier.

Fig. 6
figure 6

Effects of variation of λ on pressure derivative plot of a horizontal well pressure-drawdown test in a NFR

Results and discussion

Interpretation model for NFR systems

In this study, the datasets, including 500 pressure transient signals for NFR system drilled by horizontal wells, are generated using an analytical solution of the governing equation of fluid flow using the finite element method. For generation these PTR, all reservoir properties except storativity ratio and interporosity flow coefficient are considered constant. The typical properties of NFR system for fluid, rock, and wellbore geometry are listed in Table 1.

Table 1 Fluid and rock properties and wellbore geometry assumptions applied to the reservoir flow models evaluated in this study

Only the two-final metrics in Table 1, λ and ω, were allowed to vary while generating the PTR dataset. Although the general shape of all of the generated PTR (i.e., each with unique λ and ω values) are similar, their pressure derivative curves are quite distinct. Indeed, the MLS-SVR approach discriminates among these reservoir flow systems based on the difference in the characteristic shapes of their pressure derivative curves caused by interporosity flow coefficient and storativity ratio.

Accuracy of the MLS-SVR model

Three commonly used statistical measures of accuracy, i.e., correlation coefficients (R2), absolute average relative deviations (AARD%), and mean square error (MSE), were used to the numerical evaluation of the performance of the developed MLS-SVR model. These statistical indices are mathematically expressed by Eqs. (4) through (6), respectively.

$$ {R}^2=\frac{\sum \limits_{i=1}^N{\left({PV}_i^{act.}-\overline{\Delta PV}\right)}^2-\sum \limits_{i=1}^N{\left({PV}_i^{act.}-\Delta {PV}_i^{cal.}\right)}^2}{\sum \limits_{i=1}^N{\left({PV}_i^{act.}-\overline{\Delta {PV}^{act.}}\right)}^2} $$
(4)
$$ AARD\%=\frac{100}{N}\sum \limits_{i=1}^N\left(\left|\frac{PV_i^{act.}-{PV}_i^{cal.}}{PV_i^{act.}}\right|\right) $$
(5)
$$ MSE=\frac{1}{N}\;\sum \limits_{i=1}^N{\left({PV}_i^{act.}-{PV}_i^{cal.}\right)}^2 $$
(6)

where N is the number of data records, PVact. and PVcal. indicate actual values of ω or λ and their associated predicted values by MLS-SVR model, and \( \overline{\Delta PV} \) shows the average value of real data for ω and λ.

Table 2 displays the results of sensitivity analysis for the accuracy measures relating to ω and λ predictions for four different distributions of the training and testing subsets. All the divisions between training and testing subsets display high degrees of accuracy, but using the 90% of databank as training provided the best statistical indices for ω and λ in terms of AARD%, MSE, and R2. The relatively large percentage errors for the AARD% relating to λ are due to the very low values (10−4 to 10−8) of that metric. Indeed, even a very small difference between actual and calculated values for λ tends to produce a high level of error.

Table 2 Sensitivity analysis results for proposed MLS-SVR model to training and testing subset divisions of the data records

For a better presentation of these results and providing visual observation, the variation of the accuracy of the MLS-SVR models with training percent for both storativity ratio and interporosity flow coefficient is illustrated in Figs. 7 and 8, respectively.

Fig. 7
figure 7

Sensitivity of the MLS-SVR model in prediction of ω to the distribution of training and testing subsets

Fig. 8
figure 8

Sensitivity of the MLS-SVR model in prediction of λ to the distribution of training and testing subsets

Based on the results of sensitivity analysis presented in Table 2, Figs. 7 and 8, 90% (450 PD graphs) of the available databank is selected for the training of the MLS-SVR model.

Figures 9 and 10 depict actual values of ω and λ as a function of their associated predicted values by MLS-SVR model for the training and testing data subsets, respectively. The negligible difference between actual and predicted values confirms the highest degree of accuracy achieved by the MLS-SVR model for both storativity ratio and interporosity flow coefficient.

Fig. 9
figure 9

Comparison of actual values of ω versus predicted values by the MLS-SVR model

Fig. 10
figure 10

Comparison of actual values of λ versus predicted values by the MLS-SVR model

3.2. Performance of the developed MLS-SVR model for noisy signal

Unlike the generated PTR from a solution of governing equation which is typically smooth, signals associate with real systems typically has a level of noise and defects (Moosavi et al. 2020; Yue et al. 2020). Therefore, it is crucial to assess the performance of the MLS-SVR model when confronted with noisy data records. For this purpose, normal artificial noise was added to the ten different PD graphs of the NFR system drilled by a horizontal well. Figure 11 illustrates typical smooth as well as noisy pressure derivative records for a NFR flow regime with unique ω and λ values.

Fig. 11
figure 11

Pressure derivative curve for a noisy record data from the dual-porosity NFR

This figure clearly shows that a relatively high level of noise is added to the original curve. The developed MLS-SVR is provided with ten noisy PD graphs, and the obtained results are reported in Table 3. This table provides a comparison between actual values of ω and λ and their associated predicted values by the MLS-SVR model for noisy PD signals. This comparison reveals that noise in the data records does not significantly degrade the prediction performance of the MLS-SVR model, and it tolerates a relatively high level of uncertainty in independent variables.

Table 3 The prediction performance of the MLS-SVR model applied to example noisy data records of well-test pressure derivative curves for a dual-porosity NFR

Outlier detection

The presence of outlier and defect in the data records (Chen et al. 2020b; Guo et al. 2020a) making up the dataset influences the achievable accuracy by a model. As a large dataset has been generated and analyzed to develop the MLS-SVR model, some data record error is expected, resulting in some outliers in the predicted versus actual values. A useful statistical algorithm to detect outliers in datasets is the Leverage method enabling outlier points to be detected (Rousseeuw and Leroy 2005). This method determines standardized residuals between predicted and real values for each record in the dataset.

The distribution of standard residual has a mean of 0 and a standard deviation of 1. The leverage method then creates a Hat matrix for those standardized residuals defining a leverage index determined by Eq. (7):

$$ \mathrm{H}=\mathrm{X}{\left({\mathrm{X}}^{\mathrm{T}}\mathrm{X}\right)}^{-1}{\mathrm{X}}^{\mathrm{T}} $$
(7)

where X is a matrix of standardized residuals with p × q elements and Xt is its transpose. p and q represent the number of data records (500 in the NFR dataset generated) and the number of model parameters (30 data points used to define each pressure derivative curve), respectively.

The diagonal elements of the H matrix establish the leverage index. A William’s diagram cross plotting the standardized residual and the leverage index calculated by Eq. (7) for each data record then reveals suspected outlying data records (Gramatica 2007).

Figures 12 and 13 display William’s plots for ω and λ values obtained by the MLS-SVR model, respectively.

Fig. 12
figure 12

William’s plot for detection of suspected outlier predictions of ω by the MLS-SVR model. The dashed vertical line represents the H* value calculated using Eq. (8)

Fig. 13
figure 13

William’s plot for detection of suspected outlier predictions of λ by the MLS-SVR model for the dual-porosity NFR dataset analyzed. The dashed vertical line represents the H* value calculated using Eq. (8)

In these plots, the warning leverage value (H*) is displayed as a vertical dashed line calculated using Eq. (8):

$$ {\mathrm{H}}^{\ast }=3\left(\mathrm{n}+1\right)/\mathrm{m} $$
(8)

Here, n is the number of model parameters, m shows the number of data records. The cutoff value of 3 is typically applied in Eq. (8), representing a range of plus or minus three standard deviations from the mean value of the dataset metric evaluated.

If a model is robust in its predictions, and the dataset does not involve excessive error measurements, most of the data records should be located on a William’s diagram within the following limits:

$$ {\displaystyle \begin{array}{c}0\le \mathrm{H}\le 0.1725\\ {}-3\le \mathrm{SR}\le 3\end{array}} $$

It is obvious in Figs. 12 and 13 that most of the data points (~94%–95%) for both ω and λ fall within the feasible regions. There are only 28 data records identified as possible outliers for ω, and 32 data records identified as possible outliers for λ. This observation indicates high levels of statistical confidence in the predictions generated by the MLS-SVR model.

Performance of the developed MLS-SVR model applied to a published signal

Figure 14 presents a pressure derivative graph for a drawdown test performed on a horizontal well in a dual-porosity reservoir (Cheng 2004). The values of storativity ratio and interporosity flow coefficient for this signal are reported in that work as 0.1 and 1 × 10−6, respectively.

Fig. 14
figure 14

Pressure derivative graph from literature (Cheng 2004)

The developed MLS-SVR approach is applied to this signal. and predicts storativity ratio and interporosity flow coefficient as 0.08419 and 1.16 × 10−6. The MLS-SVR model is clearly able to identify and predict that signal accurately.

Conclusion

In this study, a multi-output least squares support vector regression model developed and successfully applied to predict two key parameters of naturally fractured reservoirs, i.e., interporosity flow coefficient and storativity ratio. A huge databank having 500 PTRs is generated using an analytical solution of partial differential equations defining fluid flow in a fractured reservoir. These PRT then converted to the pressure derivative curves to facilitate the MLS-SVR for discrimination among different values of ω and λ. AARD of 0.177% and 2.755% were obtained for trained ω and λ predictions by the MLS-SVR, respectively. The leverage method identified that up to about 6% of the predictions made by the method were potential outliers. These results suggest that the MLS-SVR method developed and applied in this study to a dataset of pressure derivative curves provides a useful alternative to conventional PTA methods, mainly when applied to more complex reservoirs dual-porosity natural fractured reservoir flow system evaluated.