1 Introduction

Porous media are widely found in soils, rocks, plants, animals and other materials, and seepage is a common natural phenomenon within them. Seepage mechanics studies the motion patterns and laws of fluid flow in porous media (Okabe and Blunt 2005; Singh and Mohanty 2000), which depend not only on the properties of the fluids themselves but also on the internal structures of the porous media, since the topological structure and geometric characteristics of the pore space directly affect the flow. The study of these structures is therefore essential for research on fluid flow, but the extreme complexity of pore space poses a great challenge for accurately acquiring or reconstructing such structures. To address this issue, physical experimental methods and numerical reconstruction methods have been developed to obtain or reconstruct the structures of porous media (Zhang et al. 2016).

The principle of physical experimental methods is to scan a sample of porous media with a high-precision scanning instrument to obtain 2D or 3D data. Typical physical experimental methods include sequence slice imaging (Lymberopoulos and Payatakes 1992), focused ion beam (FIB) (Fredrich and Lindquist 1997), computed tomography (CT) (Hou et al. 2007), etc. Sequence slice imaging can reasonably divide the pore space into pores and throats and yield the distributions of pore diameters and throat sizes and lengths. However, its applicability is limited because it easily destroys the pore structure (Vogel and Roth 2001). Fredrich and Lindquist (1997) used FIB to reconstruct 3D digital cores and preliminarily studied the relationship between porosity and permeability of sandstone of different volumes. Digital core reconstruction based on micro- and nano-CT can obtain high-resolution (HR) accurate images, but the experimental equipment is expensive and the preparation of experimental samples is complicated (Hou et al. 2007).

Physical experimental methods can accurately describe the real pore structures of porous media, but they cannot achieve a proper balance between HR and the large field of view (FOV) due to the limited size of experimental samples and equipment. Therefore, as complementary tools for physical experimental methods, numerical reconstruction methods have been extensively studied, which analyze and obtain large-scale digital pore data in a more cost-efficient way. Hazlett (1997) used the simulated annealing method to make the reconstructed model closer to the real results by obtaining more statistical information. Helene and Didier (2015) proposed the truncated Gaussian simulation method to improve reconstruction quality. Okabe and Blunt (2004, 2005) used multi-point statistics (MPS) for digital core reconstruction. Zhang et al. (2016) used isometric mapping to achieve nonlinear dimensionality reduction of training images (TIs) to decrease the redundant data of porous media, accelerating the simulation process.

However, the above numerical reconstruction methods, such as MPS, tend to rely on the statistical information extracted during each simulation, which is stored only in memory rather than in files on the hard disk. Hence, when the statistics in memory are cleared (e.g., when the computer is turned off), these data are lost and cannot be reused. At present, the rapid development of deep learning is supported by improvements in GPU computing and the optimization of neural networks (He et al. 2016); deep learning has been widely used for prediction and holds potential for the reconstruction of porous media. Deep learning techniques have proved successful in enhancing and accelerating modeling beyond the limitations of previous physical modeling, providing a new idea and broad prospects for porous media reconstruction. Besides, deep learning methods can reuse their parameters and models for subsequent simulations or reconstructions, an evident advantage over traditional numerical reconstruction methods.

Generative adversarial network (GAN) (Goodfellow et al. 2014) is an important generative model, or unsupervised generation method, in deep learning, whose basic idea derives from the two-person zero-sum game in game theory. The purpose of GAN is to estimate the distribution of real data samples and generate new data samples similar to the real ones. GAN was first applied to the reconstruction of porous media by Mosser et al. (2017). Based on its powerful feature extraction capability for TIs and the parallel GPU framework, GAN is able to speed up the reconstruction process and obtain favorable reconstructions. With the development of GAN and its variants, multiple GAN-based methods have been proposed for the reconstruction of porous media. Feng et al. (2019) reconstructed porous media from small sub-areas to a complete image using CGAN (conditional generative adversarial network). Valsecchi et al. (2020) presented a reconstruction method using GAN to reconstruct 3D porous media from 2D images.

One of GAN's disadvantages is its requirement for a large amount of training data, a condition that is sometimes difficult to meet in practice. To address this issue, a GAN variant, single-image GAN (SinGAN) (Shaham et al. 2019), uses only a single TI and a multistage model to learn the detailed and general information of the TI at different training stages. Then, a concurrent SinGAN (ConSinGAN) (Tobias et al. 2020) was proposed to accelerate the training process by allowing multiple neighboring stages to train concurrently.

Super-resolution (SR) reconstruction can reproduce the statistics of high-frequency components from low-resolution (LR) images and obtain HR images from LR images by learning the implicit redundancy in LR images. SR algorithms based on deep learning establish a nonlinear end-to-end mapping between input and output through multi-layer convolutional neural networks (CNNs). Dong et al. (2014) first proposed the super-resolution CNN (SRCNN), a simple shallow CNN whose reconstruction quality was significantly improved compared to other SR reconstruction algorithms. Kim et al. (2016) proposed very deep convolutional networks (VDSR), introducing residual networks (ResNets) into SR. Ledig et al. (2017) proposed the super-resolution generative adversarial network (SRGAN), using perceptual loss and adversarial loss to enhance the sense of reality of SR images. By improving SRGAN through an enhanced residual network with a simpler structure, Lim et al. (2017) proposed the enhanced deep residual network (EDSR). However, the reconstruction results of these methods are relatively fixed, since they cannot produce a variety of stochastic reconstructions.

SR reconstruction is essentially an ill-posed problem in numerical analysis, in which the output is very sensitive to the input: a small error in the input can produce a very large relative error in the output. In SR reconstruction, the input LR images are often affected by noise and other interference. In this case, multiple stochastically reconstructed SR images with features similar to the TI can be viewed as equal-probability stochastic models; their differences reflect the uncertainty in practical applications, indicating the range of possible results and supporting prediction from incomplete data.

To obtain 3D SR stochastic reconstructions of porous media, this paper proposes an SR concurrent single-image GAN (SRCSGAN), whose stochastic reconstructions can be larger than the original TI, so the reconstruction scale is not restricted by the size of the TI. In addition, SRCSGAN needs only a single TI, and its network combines ConSinGAN with residual blocks. Compared to some other typical reconstruction methods, SRCSGAN shows certain advantages in reconstruction quality.

2 Related Work

2.1 GAN

The basic structure of GAN consists of two networks: a generator G and a discriminator D. Noise z obeying a certain distribution (e.g., uniform or Gaussian) is input to G, which tries to generate a fake image G(z) similar to the TI. D is a binary classifier that estimates the probability that its input comes from a real image x rather than a fake image G(z). In the end, the fake images cannot be distinguished from the real data, meaning that the probability of the discriminator correctly identifying a fake image is about 50%. The specific structure of GAN is shown in Fig. 1.

Fig. 1 The structure of GAN

The optimization of GAN can be regarded as a minimax game whose ultimate goal is to make the discriminator D unable to distinguish real images from fake ones. The optimization function is as follows:

$$\min_{G} \max_{D} V(G,D) = E_{x \sim p_{\text{data}}(x)}\left[\log D(x)\right] + E_{z \sim p_{z}(z)}\left[\log \left(1 - D(G(z))\right)\right],$$
(1)

where \(p_{\text{data}}(x)\) is the distribution of the real image x; \(p_{z}(z)\) is the probability distribution of the noise z; \(D(x)\) is the probability that D assigns to x being a real image; \(D(G(z))\) is the probability that D classifies the generated image G(z) as real; E is the mathematical expectation.
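
As a concrete illustration of Eq. (1), a minimal adversarial training step might be sketched in PyTorch as follows. The toy fully connected networks, optimizer settings and the non-saturating generator update are illustrative assumptions, not details of any network in this paper.

```python
import torch
import torch.nn as nn

latent_dim, data_dim = 64, 128
G = nn.Sequential(nn.Linear(latent_dim, 256), nn.ReLU(),
                  nn.Linear(256, data_dim))
D = nn.Sequential(nn.Linear(data_dim, 256), nn.LeakyReLU(0.2),
                  nn.Linear(256, 1), nn.Sigmoid())
opt_g = torch.optim.Adam(G.parameters(), lr=2e-4)
opt_d = torch.optim.Adam(D.parameters(), lr=2e-4)
bce = nn.BCELoss()

def train_step(x_real):
    b = x_real.size(0)
    z = torch.randn(b, latent_dim)                      # noise z ~ p_z(z)
    # Maximize log D(x) + log(1 - D(G(z))) over D.
    opt_d.zero_grad()
    loss_d = (bce(D(x_real), torch.ones(b, 1))
              + bce(D(G(z).detach()), torch.zeros(b, 1)))
    loss_d.backward()
    opt_d.step()
    # Update G; the non-saturating form maximizes log D(G(z)).
    opt_g.zero_grad()
    loss_g = bce(D(G(z)), torch.ones(b, 1))
    loss_g.backward()
    opt_g.step()
    return loss_d.item(), loss_g.item()

train_step(torch.randn(8, data_dim))                    # toy "real" batch
```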

2.2 Residual Blocks

Generally, a deep network can improve performance by learning more features of TIs through its many layers and is thus usually better than a shallower one, but such a deep network is difficult to train. As the network deepens, the accuracy of the model first improves continuously until it saturates. Then, as depth increases further, the accuracy decreases significantly, which is called the "degradation problem."

Residual networks (He et al. 2016) were proposed to solve the degradation problem of deep networks. Their main idea is to use identity mapping to directly connect different layers of the network through "shortcut" connections, by which a residual network can be considered an ensemble of multiple shallow networks. Training can then be carried out more easily than for traditional deep networks.

A residual network is composed of a number of residual blocks. Making the output of a residual block equal to its input is called identity mapping. Assume a traditional learning block and a residual block have each achieved the identity mapping H(x) = x, as shown in Fig. 2, in which \(x\) is the input; \(F(x)\) is the residual term; \(H(x)\) is the expected output; relu is an activation function; "\(\oplus\)" is an adding operation. The main difference between a traditional learning block and a residual one lies in the "shortcut" connection. In Fig. 2a, the relation is H(x) = x, while in Fig. 2b this relation is changed by the "shortcut" and can be expressed as:

$$H\left( x \right) = F\left( x \right) + x.$$
(2)
Fig. 2 A traditional learning block and a residual block both achieving identity mapping: a a traditional learning block, b a residual block

As shown in Fig. 2a, the input x passes through each weight layer and the activation function relu until the expected output H(x) = x is obtained. At the end of the residual block in Fig. 2b, the input x is directly transferred to the output by a "shortcut" connection, giving H(x) = F(x) + x. Learning the identity mapping H(x) = x is thus transformed into learning the residual term F(x) = H(x) − x, so the identity mapping in Fig. 2b is achieved as long as F(x) = 0.

"Shortcut" connections add no extra parameters or computational complexity during training; they can be trained by the stochastic gradient descent (SGD) method in backpropagation and are easily realized in common software libraries [e.g., Caffe (Jia et al. 2014)]. Therefore, compared to the traditional learning block, the residual block simplifies the learning process and learns the identity mapping more easily and quickly, greatly alleviating the risk of the degradation problem.
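
As an illustration, the residual block of Fig. 2b and Eq. (2) can be written in a few lines of PyTorch; the 3D convolutions, kernel size and channel count below are our assumptions, chosen to match the 3D setting of the later sections.

```python
import torch
import torch.nn as nn

class ResidualBlock(nn.Module):
    """Two weight layers form the residual term F(x); the shortcut adds the
    input x, giving H(x) = F(x) + x as in Eq. (2)."""
    def __init__(self, channels: int):
        super().__init__()
        self.f = nn.Sequential(
            nn.Conv3d(channels, channels, kernel_size=3, padding=1),
            nn.ReLU(inplace=True),
            nn.Conv3d(channels, channels, kernel_size=3, padding=1),
        )
        self.relu = nn.ReLU(inplace=True)

    def forward(self, x):
        return self.relu(self.f(x) + x)     # "shortcut" connection

x = torch.randn(1, 16, 8, 8, 8)             # (batch, channels, D, H, W)
assert ResidualBlock(16)(x).shape == x.shape
```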

2.3 ConSinGAN

For GAN, it is almost impossible to directly capture all the features (e.g., global structures and local details) of a TI from only a single image due to its limited receptive field. Therefore, the training of GAN is usually based on a large number of training samples. However, if the available training data are insufficient, a model trained on a small number of images, or even a single image, is often not convincing. SinGAN (Shaham et al. 2019) solves this problem by using a convolutional pyramid structure to capture the statistics of a single TI from global structures to local details. The original single TI is downsampled to different stages to form a multistage training sequence, and the generator and discriminator of SinGAN are trained in multiple stages at different resolutions. The image generated at the previous stage is input to the current generator, but the generator at each stage is trained separately, so the trained weights of the previous generator cannot be reused by the current one.

In SinGAN, each stage is relatively isolated, so the corresponding training processes are unrelated, which greatly limits the inherent connection between stages. Instead, ConSinGAN (Tobias et al. 2020) designs a concurrent training process for the generator, in which multiple stages can be trained together. For example, the training process of a six-stage ConSinGAN is shown in Fig. 3. The whole training process starts at stage 0 and ends at stage 5. When the network at stage 0 completes its iterations, stage 1 joins the generator G0 for concurrent training, but the discriminator D remains unchanged. The generator at stage 1 now includes G0 and G1, in which G0 "inherits" from stage 0 and initially reuses its trained parameters. The parameters of the newly added generator G1 are initialized from scratch, while the discriminator D at stage 1 reuses its parameters from stage 0 instead of being reinitialized. The following stages repeat this operation until stage 5 is reached. These reused parameters are closer to the final values than completely randomly initialized parameters would be at each stage, so less parameter adjustment and training are needed, accelerating convergence.

Fig. 3 An example of the training process in ConSinGAN (the total stage number is 6; Max_Con = 3)

In addition, to avoid overfitting, only part of the generators are trained simultaneously. A maximum concurrent stage number (called Max_Con for convenience) controls the number of concurrently trained stages. In Fig. 3, at most three generators are trained together at one stage, i.e., Max_Con is set to 3.

2.4 MSPGAN

Another SR method for porous media reconstruction, called multi-scale pattern generative adversarial network (MSPGAN) (Zhang et al. 2022), will be compared with SRCSGAN in the following experiments, so it is briefly introduced in this section. MSPGAN only needs a single 3D TI to reconstruct high-quality images by capturing and visualizing the distribution information of patches from the TI. The scaling transformation added to the generator enables MSPGAN to reconstruct images of various scales, especially to perform the SR reconstruction. MSPGAN’s multi-scale discriminator can effectively optimize the reconstructed images and guarantee reconstruction quality. In the lower scales, the discriminator mainly discriminates the reconstruction of local detailed information, while in the higher scales, the discriminator pays more attention to the global structural information.

3 Methodology

3.1 The Architecture of SRCSGAN

As mentioned above, ConSinGAN uses concurrent training to obtain global image information and local details at the same time, addressing the limited receptive field of GAN. But if the internal features of TIs are difficult to capture, a proper way of extracting more details is to increase the number of convolutional layers in each stage of ConSinGAN. A deeper network understands and characterizes the features of TIs better, but it also increases the risk of the degradation problem. To overcome this problem, SRCSGAN integrates residual blocks with the idea of concurrent training.

In addition, to accelerate the training process, the features generated at the previous stage are upsampled and then input to the current stage. Upsampling enlarges the original image so that it can be displayed at a higher resolution. To reuse details generated at the previous stage, they are combined with the newly learned information of the current stage by residual connections, defined as follows:

$$\tilde{x}_{n} = \tilde{x}_{n - 1} \uparrow + \varphi_{n} (z_{n} + \tilde{x}_{n - 1} \uparrow ),\quad n = 1, \ldots ,N,$$
(3)

where \(\tilde{x}_{n}\) is the result generated at stage n; \(\tilde{x}_{n - 1} \uparrow\) is the upsampled result from stage n − 1; \(z_{n}\) is the noise added at stage n to increase the diversity of results; \(\varphi_{n} \left( {z_{n} + \tilde{x}_{n - 1} \uparrow } \right)\) is the new information learned by the convolutional network at stage n.
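
A minimal sketch of Eq. (3) as a forward pass is given below, assuming \(\varphi_{n}\) (phi_n) is a shape-preserving convolutional module and treating the noise amplitude as a free hyper-parameter (our assumption).

```python
import torch
import torch.nn.functional as F

def stage_forward(phi_n, x_prev, out_size, noise_amp=0.1):
    """x_prev: result of stage n-1; out_size: (D, H, W) at stage n."""
    x_up = F.interpolate(x_prev, size=out_size,
                         mode='trilinear', align_corners=False)  # upsampling
    z_n = noise_amp * torch.randn_like(x_up)   # noise z_n for diversity
    return x_up + phi_n(z_n + x_up)            # Eq. (3)

phi = torch.nn.Conv3d(1, 1, 3, padding=1)      # stand-in for phi_n
x1 = stage_forward(phi, torch.rand(1, 1, 16, 16, 16), out_size=(20, 20, 20))
```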

The structure of SRCSGAN is shown in Fig. 4. At stage 0, multiple residual blocks are used to alleviate the risk of degradation. When the training of stage 0 converges, stage 1 is added to the network and training continues. Residual connections unify stage 0 and stage 1, and the noise injected before each stage increases the diversity of results. At this point, stage 1 only needs to slightly adjust the parameters obtained at stage 0 instead of reinitializing them. Similarly, at stage 2 and subsequent stages, the network repeats this process until the final stage N.

Fig. 4 The architecture of SRCSGAN

Since the parameters of one stage are shared with the next stage, the overall parameter adjustments are relatively small, greatly reducing the training time. Except for the initial stage 0, the input of each stage includes random noise and the upsampled image features obtained from previous stages. The generator G and the discriminator D are trained separately. When the loss of one stage reaches the termination condition (a fixed epoch number per stage), the generator of the next stage is appended to form a training sequence for concurrent training, while the discriminator at each stage remains unchanged. However, too many concurrent stages easily cause overfitting, so a maximum concurrent stage number (Max_Con, set to 3 in this paper) is used in SRCSGAN, meaning at most three stages participate in concurrent training simultaneously. If the current concurrent stage number exceeds Max_Con, the first stage in the training sequence is dropped; otherwise, the next stage is simply added to the training sequence. This process is repeated until the desired stage N is reached, at which point the final reconstructed result (i.e., reconstruction N in Fig. 4) is obtained. At each stage, the discriminator tries to estimate the probability that a sample comes from the real image rather than a fake image. \(L_{{{\text{all}}}} \left( {G_{n} ,D_{n} } \right)\) is the loss function of SRCSGAN, which is discussed in Sect. 3.4.
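
The sliding training window controlled by Max_Con might be realized as below, where dropped stages are kept in the model but frozen; this freezing-based sketch is one plausible implementation, not the authors' code.

```python
import torch.nn as nn

MAX_CON = 3   # maximum concurrent stage number used in this paper

def set_trainable(generators, n):
    """Train only the last MAX_CON stages of the sequence ending at stage n;
    earlier stages keep their parameters but are no longer updated."""
    lo = max(0, n - MAX_CON + 1)
    for i, g in enumerate(generators):
        for p in g.parameters():
            p.requires_grad_(lo <= i <= n)

gens = [nn.Linear(4, 4) for _ in range(6)]   # six toy "stages"
set_trainable(gens, n=4)                     # stages 2-4 trainable, 0-1 frozen
```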

3.2 The Parameter Settings of SRCSGAN

In real experiments, the main parameters of the generator and discriminator are shown in Tables 1 and 2, in which some functions are described below.

1. Conv3d: a 3D convolution operation for extracting local features of 3D data.

2. BatchNorm3d: a normalization function used to normalize data, alleviating the problem of scattered feature distributions and making the network model more stable.

3. LeakyReLU: an activation function that alleviates the vanishing gradient problem.

4. Tanh: an activation function that normalizes the generator output to [−1, 1].

5. Upsample: a 3D deconvolution operation that performs upsampling.

6. residual_block: the building block of a residual network.

7. residual_block × 16: a total of 16 residual blocks.

Table 1 Main parameters used in the generator G
Table 2 Main parameters used in the discriminator D
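
To illustrate how the operations listed above might compose into a single generator stage, consider the following sketch; the channel count (32), kernel sizes and layer ordering are our assumptions rather than the exact values of Tables 1 and 2.

```python
import torch
import torch.nn as nn

class ResBlock3d(nn.Module):
    def __init__(self, c):
        super().__init__()
        self.f = nn.Sequential(
            nn.Conv3d(c, c, 3, padding=1), nn.BatchNorm3d(c),
            nn.LeakyReLU(0.2, inplace=True),
            nn.Conv3d(c, c, 3, padding=1), nn.BatchNorm3d(c))
    def forward(self, x):
        return x + self.f(x)                 # residual_block

class GeneratorStage(nn.Module):
    def __init__(self, c=32, n_res=16):      # residual_block x 16
        super().__init__()
        self.head = nn.Sequential(nn.Conv3d(1, c, 3, padding=1),
                                  nn.BatchNorm3d(c),
                                  nn.LeakyReLU(0.2, inplace=True))
        self.body = nn.Sequential(*[ResBlock3d(c) for _ in range(n_res)])
        self.tail = nn.Sequential(nn.Conv3d(c, 1, 3, padding=1), nn.Tanh())
    def forward(self, x):
        return self.tail(self.body(self.head(x)))   # output in [-1, 1]

y = GeneratorStage()(torch.randn(1, 1, 16, 16, 16))
```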

3.3 The Size of Input Real Image at Each Stage

In SRCSGAN, the original TI is downsampled to smaller images or upsampled to bigger ones, which are used as the input real images at each stage. SRCSGAN needs small input images at lower stages to reflect global information and larger input images at higher stages to represent detailed information under a fixed receptive field. These input images serve as the TIs at each stage; therefore, their sizes need to be designed before downsampling or upsampling.

Let \(y_{n} = \left[ {L_{n} ,W_{n} ,H_{n} } \right]\) represent the size of a 3D input image at stage n, where \(L_{n}\), \(W_{n}\) and \(H_{n}\) are the length, width and height of the input image at stage n, respectively. The sizes at stage 0 and stage N, i.e., \(y_{0} = \left[ {L_{0} ,W_{0} ,H_{0} } \right]\) and \(y_{N} = \left[ {L_{N} ,W_{N} ,H_{N} } \right]\), are specified in advance. The size of the input image at each intermediate stage is defined as:

$$y_{n} = y_{N} \times r^{\left((N-1)/\log (N)\right)\log (N-n)+1},\quad n = 1,\ldots,N-1,$$
(4)

where the scaling factor \(r\) is:

$$r = \left(\min (L_{0},W_{0},H_{0})/\min (L_{N},W_{N},H_{N})\right)^{\frac{1}{N}}.$$
(5)

Note that the generated image has the same size as the input image at each stage.
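
Equations (4) and (5) translate directly into code. Using natural logarithms and ceiling rounding (our assumptions), the function below reproduces the stage sizes 16, 20, 25, 34, 58 and 80 reported in Sect. 4.1.3.

```python
import math

def stage_sizes(y0, yN, N):
    """y0, yN: (L, W, H) at stage 0 and stage N; returns sizes for stages 0..N."""
    r = (min(y0) / min(yN)) ** (1.0 / N)                       # Eq. (5)
    sizes = [tuple(y0)]
    for n in range(1, N):                                      # n = 1, ..., N-1
        e = ((N - 1) / math.log(N)) * math.log(N - n) + 1      # exponent in Eq. (4)
        sizes.append(tuple(math.ceil(d * r ** e) for d in yN))
    sizes.append(tuple(yN))
    return sizes

print(stage_sizes((16, 16, 16), (80, 80, 80), N=5))
# [(16, 16, 16), (20, 20, 20), (25, 25, 25), (34, 34, 34), (58, 58, 58), (80, 80, 80)]
```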

3.4 The Loss Functions of SRCSGAN

In the SRCSGAN model, each generator \(G_{n}\) is coupled with a discriminator \(D_{n}\). For any stage n \(\in\) {0, 1, …, N}, the overall loss function \(L_{{{\text{all}}}} \left( {G_{n} ,D_{n} } \right)\) at stage n consists of the adversarial loss and the reconstruction loss, and is defined as:

$$L_{\text{all}}\left(G_{n},D_{n}\right) = \min_{G_{n}} \max_{D_{n}} \left\{ L_{\text{adv}}(G_{n},D_{n}) + \phi L_{\text{rec}}(G_{n}) \right\},$$
(6)

where \(L_{{{\text{adv}}}} \left( {G_{n} ,D_{n} } \right)\) represents the adversarial loss; \(L_{{{\text{rec}}}} \left( {G_{n} } \right)\) is the reconstruction loss; \(\phi\) is the weight of \(L_{{{\text{rec}}}} \left( {G_{n} } \right)\) and is set to 10 by default.

In GAN and its variants, the discriminator D tries to distinguish real input images from generated images, and the generator G tries to deceive the discriminator by generating images similar to the real ones. The original GAN uses the sigmoid cross-entropy loss function shown in Eq. (1), which easily causes vanishing gradients, making the training of G insufficient. To make the generated samples fit the distribution of the real samples as well as possible, the WGAN-GP (Gulrajani et al. 2017) loss function is used in this paper:

$$L_{{{\text{adv}}}} \left( {G_{n} ,D_{n} } \right) = E_{{\tilde{I}_{n} \sim P_{g} }} \left[ {D\left( {\tilde{I}_{n} } \right)} \right] - E_{{I_{n} \sim P_{r} }} \left[ {D\left( {I_{n} } \right)} \right] + \varphi GP,$$
(7)
$$GP = E_{{I_{n} \sim \hat{I}_{n} }} \left[ {\left( {\left| {\left| {\nabla_{{\hat{I}_{n} }} D\left( {\hat{I}_{n} } \right)} \right|} \right|_{2} - 1} \right)^{2} } \right],$$
(8)
$$\tilde{I}_{n} = \left\{ {\begin{array}{*{20}l} {G\left( {z_{n} } \right),} \hfill & {n = 0} \hfill \\ {G\left( {z_{n} ,\tilde{I}_{n - 1} } \right),} \hfill & {n = 1, \ldots ,N} \hfill \\ \end{array} } \right.,$$
(9)

where \(\tilde{I}_{n}\) is the fake image generated at stage n and \(I_{n}\) is the input real image at stage n; \(P_{{\text{r}}}\) is the real data distribution and \(P_{{\text{g}}}\) is the generator distribution; \(I_{n} \sim \hat{I}_{n}\) means sampling uniformly along straight lines between pairs of points sampled from \(P_{{\text{r}}}\) and \(P_{{\text{g}}}\) (Gulrajani et al. 2017); \(\nabla_{{\hat{I}_{n} }} D\left( {\hat{I}_{n} } \right)\) is the gradient with respect to \(\hat{I}_{n}\); \(\tilde{I}_{n - 1}\) is the upsampled result from stage n − 1 to stage n; \(\tilde{I}_{n}\) (except at stage 0) is calculated from the random noise \(z_{n}\) at stage n and \(\tilde{I}_{n - 1}\); \(E_{{\tilde{I}_{n} \sim P_{g} }} \left[ {D\left( {\tilde{I}_{n} } \right)} \right]\) is the mathematical expectation when a generated sample is the input of the discriminator; \(E_{{I_{n} \sim P_{r} }} \left[ {D\left( {I_{n} } \right)} \right]\) is the mathematical expectation when the input of the discriminator is a real sample; \(\varphi\) is the weight of GP (gradient penalty) and is set to 0.1 in our experiments.

The main innovation of WGAN-GP lies in the GP term (i.e., Eq. (8)) in the loss function. WGAN-GP uses the Wasserstein distance to measure the distance between the distributions of generated and real data, which theoretically solves the problems of mode collapse and unstable training (e.g., vanishing and exploding gradients). More detailed explanations can be found in Gulrajani et al. (2017).

The reconstruction loss, added as a constraint, is defined in Eq. (10):

$$L_{{{\text{rec}}}} \left( {G_{n} } \right) = \left\| {G_{n} \left( {\tilde{I}_{n} } \right) - I_{n} } \right\|_{2}^{2} ,$$
(10)

where \(G_{n} \left( {\tilde{I}_{n} } \right)\) represents the reconstructed image at stage n.
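
A sketch of the losses in Eqs. (6)-(10) is given below; the 5D tensor shapes (batch, channel, depth, height, width) and the mean-based form of the reconstruction term are our simplifying assumptions.

```python
import torch

def gradient_penalty(D, real, fake):
    """GP of Eq. (8), sampling uniformly between real and fake samples."""
    eps = torch.rand(real.size(0), 1, 1, 1, 1, device=real.device)
    x_hat = (eps * real + (1 - eps) * fake).requires_grad_(True)
    grads, = torch.autograd.grad(D(x_hat).sum(), x_hat, create_graph=True)
    return ((grads.flatten(1).norm(2, dim=1) - 1) ** 2).mean()

def d_loss(D, real, fake, gp_weight=0.1):          # varphi = 0.1, Eq. (7)
    fake = fake.detach()                           # D update does not reach G
    return (D(fake).mean() - D(real).mean()
            + gp_weight * gradient_penalty(D, real, fake))

def g_loss(D, fake, rec, real, rec_weight=10.0):   # phi = 10, Eq. (6)
    adv = -D(fake).mean()                          # generator side of Eq. (7)
    l_rec = ((rec - real) ** 2).mean()             # Eq. (10), mean-squared form
    return adv + rec_weight * l_rec
```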

3.5 Implementation

The procedure of SRCSGAN is displayed in Fig. 5.

  • Step 1 Initialize the training model, including setting parameters, the maximum stage number and the maximum concurrent stage number (i.e., Max_Con).

  • Step 2 Input the image of porous media as the original TI.

  • Step 3 Calculate the sizes of the input images at different stages according to Eq. (4); then upsample or downsample the TI to the above sizes.

  • Step 4 If the current stage number has not exceeded the maximum stage number, train the model and update parameters according to the loss function Eq. (6) until the termination condition (a fixed epoch number for each stage, e.g., 3000) is reached.

  • Step 5 For each stage, when the number of current concurrent stages is smaller than Max_Con, the next stage starts training; otherwise, the first stage in the training sequence is dropped and the next stage is added to the training sequence for concurrent training.

  • Step 6 If all the stages finish training, go to step 7; otherwise, go to step 4.

  • Step 7 Save the training model and the parameters; output the reconstructed results of porous media.

Fig. 5 The implementation of SRCSGAN

4 Experimental Results and Analyses

4.1 Experimental Data

4.1.1 Sandstone

As a typical kind of porous media, sandstone is widely distributed around the world and is suitable for evaluating the reconstruction quality of our method. The real sandstone data used in the experiments were extracted from a cylindrical sandstone sample (porosity around 19%) with a diameter of 3 mm, whose grayscale slice images were obtained by micro-CT scanning (resolution 10.9 μm) at the Beijing Synchrotron Microtomography Morphology Station, China. After denoising by mean filtering (Anindita et al. 2020) and binary segmentation by the Otsu method (Otsu 1979), 500 binary images (500² pixels) were obtained. By stacking these 500 binary 2D images, a 3D 500³ sandstone sample was obtained, from which a 3D cube (80³ voxels, porosity = 0.180) called HR_TI_SA was extracted for the following experiments.
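
The preprocessing described above (mean filtering, Otsu segmentation, slice stacking) might be sketched as follows; the filter size, the toy input and the pore/grain polarity are assumptions.

```python
import numpy as np
from scipy.ndimage import uniform_filter          # mean filter
from skimage.filters import threshold_otsu

def binarize_slices(gray_slices, filt=3):
    """Stack denoised, Otsu-thresholded 2D slices into a 3D binary volume
    (1 = pore; the polarity may need inverting for a given scan)."""
    binary = []
    for s in gray_slices:
        s = uniform_filter(s.astype(float), size=filt)   # denoise
        binary.append((s > threshold_otsu(s)).astype(np.uint8))
    return np.stack(binary, axis=0)

# Toy demonstration; the paper stacks 500 slices of 500^2 pixels into a
# 500^3 volume and then extracts an 80^3 sub-cube (HR_TI_SA).
volume = binarize_slices(np.random.rand(8, 64, 64))
```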

4.1.2 Shale

Different porous media exhibit different pore structure features. To make our method more convincing, shale is also used to evaluate the reconstruction performance. A shale sample (porosity around 10%) was scanned by nano-CT (resolution 64 nm) at the Langfang Petroleum Research Institute, Hebei Province, China. Similar to the sandstone sample, a 3D 500³ shale sample was obtained after denoising and binary segmentation, from which a 3D cubic shale image (80³ voxels, porosity = 0.097) called HR_TI_SH was extracted.

4.1.3 Preparation of Multiple-Stage Experimental Data

Trilinear interpolation (Kim et al. 2005) is used to lower the resolutions of HR_TI_SA and HR_TI_SH, yielding their lower-resolution versions, called LR_TI_SA and LR_TI_SH, both with 40³ voxels. The porosities of LR_TI_SA and LR_TI_SH are 0.178 and 0.096, respectively, quite close to those of HR_TI_SA and HR_TI_SH. In the following experiments, all reconstructed HR sandstone images are reconstructed from LR_TI_SA and evaluated against the HR ground truth image HR_TI_SA; similarly, all reconstructed HR shale images are reconstructed from LR_TI_SH and evaluated against HR_TI_SH. Therefore, evaluating the reconstruction quality requires the ground truth HR images, the downsampled LR images and the reconstructed HR images.
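
A sketch of the trilinear downsampling used to produce the LR training images, assuming the binary volumes are stored as 5D PyTorch tensors; rebinarizing by rounding is our assumption.

```python
import torch
import torch.nn.functional as F

hr = torch.rand(1, 1, 80, 80, 80).round()      # stand-in for HR_TI_SA (80^3)
lr = F.interpolate(hr, size=(40, 40, 40),
                   mode='trilinear', align_corners=False)
lr_binary = lr.round()                          # LR_TI_SA-like volume (40^3)
porosity = lr_binary.mean().item()              # fraction of pore voxels
```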

Specifically, in the following experiments, sandstone and shale samples are used to evaluate SRCSGAN. The deep learning platform is PyTorch 1.7.0. The learning rate is 0.0001 and the epoch number of each stage is set to 3000. The maximum concurrent stage number (Max_Con) is set to 3. For the reconstruction of both sandstone and shale, six training stages (stages 0–5) are designed in SRCSGAN, whose real input data for the discriminator are LR_TI_SA (sandstone) and LR_TI_SH (shale), respectively. LR_TI_SA and LR_TI_SH are then upsampled or downsampled by trilinear interpolation (Kim et al. 2005) to the stage sizes given by Eq. (4). Since the total stage number is six, there are six real input images, one per stage: stage 0 (16³ voxels), stage 1 (20³ voxels), stage 2 (25³ voxels), stage 3 (34³ voxels), stage 4 (58³ voxels) and stage 5 (80³ voxels). As shown in Fig. 6, a fixed receptive field (11³ voxels) is used to scan these real input images at each stage to capture different features. At lower stages, the model mainly learns the overall information, while at higher stages the focus is on learning the details of the image.

Fig. 6 A fixed receptive field (11³ voxels) is used to scan the real input images at each stage to capture different features

4.2 The Reconstruction of Sandstone

4.2.1 The Determination of an REV

Since only a limited-size sandstone sample is used as training data in our experiments, the representative elementary volume (REV) (Costanza et al. 2011) of the sandstone should be determined first. The REV is the minimum volume that can reflect the statistically averaged features of a material. When the size or volume of a sample is smaller than the REV, its properties change with the sample size, showing obvious fluctuations. Therefore, the study of a sample is meaningful only when it is larger than the REV, because its parameters are then stable enough to reflect the average features of the material.

The REV in our experiments is determined by the multi-point connectivity (MPC) (Krishnan and Journel 2003; Strebelle 2002), which measures the connectivity between multiple points and is defined as follows:

$${\text{MPC}}\left( n \right) = E\left\{ {I\left( u \right) \cdot I\left( {u + h} \right) \cdot \ldots \cdot I\left( {u + \left( {n - 1} \right)h} \right)} \right\} = E\left\{ {\mathop \prod \limits_{j = 0}^{n - 1} I\left( {u + jh} \right)} \right\},$$
(11)

where n is the number of pixels in one direction; u is the position of a certain pixel (in 2D cases) or voxel (in 3D cases) in space; h is the “lag distance” between neighboring pixels or voxels. The index variable I(u) has two attribute values for porous media: when a specific position in the image is pore space, I(u) = 1; otherwise, I(u) = 0.
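
An empirical estimator of Eq. (11) along one axis can be written as follows, where vectorized slicing over all valid positions u replaces the expectation.

```python
import numpy as np

def mpc(img, n, h, axis=0):
    """MPC(n): probability that n voxels spaced h apart along the given
    axis are all pore (img binary, 1 = pore)."""
    L = img.shape[axis]
    span = (n - 1) * h
    if span >= L:
        raise ValueError("lag span exceeds image size")
    take = lambda a, b: img.take(np.arange(a, b), axis=axis)
    prod = np.ones(take(0, L - span).shape)
    for j in range(n):                     # product of I(u + j*h), j = 0..n-1
        prod = prod * take(j * h, L - span + j * h)
    return float(prod.mean())              # expectation over all positions u

toy = np.random.randint(0, 2, (40, 40, 40))
curve = [mpc(toy, n=2, h=h) for h in range(1, 20)]   # MPC versus lag h
```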

The MPC curves of the 500³ sandstone sample (described in Sect. 4.1.1) are computed and displayed in Fig. 7a–c. The curves tend to become stable at h = 76, 70 and 72 (the positions marked by red dotted lines in Fig. 7) in the X, Y and Z directions, respectively, meaning the REV for our experiments should be at least 76 × 70 × 72 voxels. Since our training sample, the HR ground truth image HR_TI_SA (80³ voxels), is larger than the REV, the validity of our experiments is assured.

Fig. 7 The MPC curves of the 500³ sandstone sample (500³ voxels)

4.2.2 Reconstructed Images Using SNESIM, MSPGAN, ConSinGAN and SRCSGAN

Several methods were chosen for the comparison of reconstruction quality with SRCSGAN. Single normal equation simulation (SNESIM) (Strebelle 2002) is a typical MPS method that has been widely used for the reconstruction of porous media (Okabe and Blunt 2004, 2005); it needs only one TI and can reconstruct images much larger than the original TI. As mentioned in Sect. 2.4, MSPGAN can obtain 3D SR reconstructions of porous media using only a single TI. Hence, MSPGAN and SNESIM are used as comparison methods. Besides, since our method (SRCSGAN) is derived from ConSinGAN, ConSinGAN is also included in the comparison.

To be specific, all four methods use LR_TI_SA (40³ voxels) as the only TI and stochastically reconstruct 3D sandstone images of 80³ voxels, which are compared with the HR ground truth image HR_TI_SA (80³ voxels) by the metrics of MPC, porosity, absolute permeability, autocorrelation, etc. HR_TI_SA and the reconstructed images (80³ voxels) using SNESIM, MSPGAN, ConSinGAN and SRCSGAN are shown in Fig. 8.

Fig. 8 HR_TI_SA and the reconstructed images (80³ voxels) using SNESIM, MSPGAN, ConSinGAN and SRCSGAN

4.2.3 Comparison of Pore Space

The pore space in HR_TI_SA and the reconstructions can be divided into connected and isolated pore space, but only the connected pore space contributes to permeability. The total, connected and isolated pore space of HR_TI_SA and of the reconstructions using SNESIM, MSPGAN, ConSinGAN and SRCSGAN in Fig. 8 are extracted using the software Avizo (Avizo 2015) and displayed in Fig. 9. To obtain average statistics, another ten reconstructed results were generated by each of SNESIM, MSPGAN, ConSinGAN and SRCSGAN. The average proportions of the total, connected and isolated pore space calculated from these results, together with the proportions of HR_TI_SA, are shown in Table 3, where bold numbers indicate the best result for each metric. It is seen that SRCSGAN performs better than the other methods due to its higher similarity with HR_TI_SA.

Fig. 9 The total pore space, connected pore space and isolated pore space of HR_TI_SA and reconstructions using SNESIM, MSPGAN, ConSinGAN and SRCSGAN (in Fig. 8)

Table 3 The average proportions of the total pore space, connected pore space and isolated pore space calculated from another ten reconstructed results, respectively, by SNESIM, MSPGAN, ConSinGAN and SRCSGAN and the proportion of the HR_TI_SA

4.2.4 Comparison of Autocorrelation Functions

The autocorrelation function of a 3D microstructure is widely used as an evaluation tool in statistical reconstruction. Suppose \(\vec{r}\) is any point in the microstructure. When the microstructure is defined by the following binary phase function \(Z\left( {\vec{r}} \right)\):

$$Z\left( {\vec{r}} \right) = \left\{ {\begin{array}{*{20}c} {1,} & {\vec{r} \in {\text{pore}}\,{\text{space}}} \\ {0,} & {\vec{r} \notin {\text{pore}}\,{\text{space}}} \\ \end{array} } \right.,$$
(12)

the autocorrelation function \(R_{z} \left( {\vec{u}} \right)\) is defined as:

$$\varphi = \overline{{Z\left( {\vec{r}} \right)}} ,$$
(13)
$$R_{z} \left( {\vec{u}} \right) = \frac{{\overline{{\left[ {Z\left( {\vec{r}} \right) - \varphi } \right]\left[ {Z\left( {\vec{r} + \vec{u}} \right) - \varphi } \right]}} }}{{\left[ {\varphi - \varphi^{2} } \right]}},$$
(14)

where overbars denote statistical averages; \(\varphi\) is the porosity; \(\vec{u}\) is the lag vector between two points.
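
Equations (12)-(14) can be estimated empirically as follows for a lag of u voxels along one axis of a binary image.

```python
import numpy as np

def autocorrelation(Z, u, axis=0):
    """R_z(u) of Eq. (14); Z is the binary phase function (1 = pore)."""
    phi = Z.mean()                                    # porosity, Eq. (13)
    a = Z.take(np.arange(0, Z.shape[axis] - u), axis=axis) - phi
    b = Z.take(np.arange(u, Z.shape[axis]), axis=axis) - phi
    return float((a * b).mean() / (phi - phi ** 2))   # Eq. (14)

Z = np.random.randint(0, 2, (40, 40, 40))
print([round(autocorrelation(Z, u), 3) for u in range(0, 5)])  # starts at 1.0
```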

Figure 10 shows the autocorrelation functions of HR_TI_SA and reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN, averaged over three orthogonal directions. It seems that the autocorrelation function of SRCSGAN is more similar to that of HR_TI_SA.

Fig. 10 The autocorrelation functions of HR_TI_SA and reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN, averaged over three orthogonal directions

4.2.5 Comparison of Absolute Permeability

The absolute permeability indicates the magnitude of single-phase fluid flow (Zhang et al. 2016). HR_TI_SA and the reconstructions are used as input files to calculate the absolute permeabilities of those models using the software Avizo (Avizo 2015). The absolute permeabilities of HR_TI_SA and the average absolute permeabilities of ten reconstructed images using SNESIM, MSPGAN, ConSinGAN and SRCSGAN in three directions are shown in Table 4. The absolute permeabilities of SRCSGAN are closest to those of HR_TI_SA in the Y and Z directions and only slightly worse than MSPGAN in the X direction.

Table 4 The absolute permeabilities of HR_TI_SA and the average absolute permeabilities of ten reconstructed images using SNESIM, MSPGAN, ConSinGAN and SRCSGAN in three directions

4.2.6 Comparison of MPC Curves

Figure 11 shows the MPC curves of HR_TI_SA and the average MPC curves of ten reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN in the X, Y and Z directions. However, the MPC differences between HR_TI_SA and ConSinGAN and those between HR_TI_SA and SRCSGAN are quite close in the X and Y directions. To quantitatively measure the MPC differences between HR_TI_SA and the reconstructions, a difference degree (DD) is defined as:

$${\text{DD}}\left( {{\text{HR}}\_{\text{TI}},{\text{re}}} \right) = \mathop \sum \limits_{h = 1}^{n} \left( {X_{h} - x_{h} } \right)^{2} ,$$
(15)

where HR_TI is the HR ground truth image; \({\text{re}}\) is the reconstruction result; Xh and xh are the MPC values of HR_TI and the reconstructed image at a distance h, respectively. The smaller the DD, the higher the similarity between HR_TI and the reconstruction. Note that in this paper HR_TI refers to HR_TI_SA or HR_TI_SH, as appropriate. The DDs between HR_TI_SA and the reconstructions by the four methods, measured by MPC (Fig. 11) in the X, Y and Z directions, are displayed in Table 5. The reconstructions by SRCSGAN are closest to HR_TI_SA, since their DDs are smallest in all three directions.
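
Equation (15) is a direct sum of squared differences between two MPC curves sampled at the same lags:

```python
import numpy as np

def difference_degree(mpc_hr, mpc_re):
    """DD of Eq. (15); mpc_hr and mpc_re hold X_h and x_h over the same h."""
    X, x = np.asarray(mpc_hr), np.asarray(mpc_re)
    return float(((X - x) ** 2).sum())
```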

Fig. 11 The MPC curves of HR_TI_SA and average MPC curves of ten reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN in the X, Y and Z directions

Table 5 Difference degrees (DDs) between HR_TI_SA and the reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN measured by MPC (as shown in Fig. 11) in the X, Y and Z directions

4.2.7 Comparison of Pore Network Models

The pore network model (PNM) has a topological structure equivalent to that of the pore space of the digitalized porous media, reflecting the topology of the real pore space. The maximal ball method (Dong and Blunt 2009) was used to extract the PNM: the whole pore space is divided into pores and throats, in which a pore is represented by a sphere (a relatively wide pore space) and a throat by a cylinder (a narrow and long pore space).

The comparisons of PNMs were performed using Avizo (Avizo 2015) to analyze the pore distributions of HR_TI_SA and the reconstructions, including pore numbers, throat numbers, pore diameters, throat diameters and coordination numbers. The pore and throat numbers of HR_TI_SA and the averages of ten reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN are shown in Table 6. The reconstruction by SRCSGAN is closest to HR_TI_SA in both pore number and throat number.

Table 6 The pore number and throat number of HR_TI_SA and the average pore numbers and throat numbers of ten reconstructions, respectively, using SNESIM, MSPGAN, ConSinGAN and SRCSGAN

The ratios of pore diameters, throat diameters and coordination numbers are shown in Figs. 12, 13 and 14. DDs are calculated to quantitatively show the differences between HR_TI_SA and the reconstructions, as shown in Table 7. It can be seen that SRCSGAN performs best.

Fig. 12 Average pore diameter ratios of HR_TI_SA and reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN

Fig. 13 Average throat diameter ratios of HR_TI_SA and reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN

Fig. 14 Average coordination number ratios of HR_TI_SA and reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN

Table 7 Difference degrees (DDs) between HR_TI_SA and the reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN measured by the ratio curves of pore diameters, throat diameters and coordination numbers

4.2.8 The Diversity of Reconstruction Results

As mentioned above, the WGAN-GP loss function can prevent mode collapse (Gulrajani et al. 2017). Since SRCSGAN uses the same loss function as WGAN-GP, the mode collapse problem is avoided and enough diversity is generated in the reconstructions. To prove this point, an exclusive OR (XOR) operation is defined to quantitatively measure the similarity between the reconstruction results and HR_TI_SA. For two binary values a and b, if the two values are the same, then XOR(a, b) = 0; otherwise, XOR(a, b) = 1. The proportion of 1s in the XOR result (i.e., the fraction of differing voxels) is used as the similarity degree, named "SD." An SD close to 0 means that two images are nearly identical, while an SD close to 1 means they are totally different. Table 8 shows the SDs between HR_TI_SA and each reconstruction (called Rn, n = 1, …, 10) by SRCSGAN. Note that Table 8 is symmetric about the diagonal and all diagonal SDs are equal to 0. All off-diagonal SDs fluctuate between 0.45 and 0.62, displaying enough diversity between HR_TI_SA and each reconstruction, meaning that SRCSGAN generates reasonable diversity in the reconstruction results.
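
The voxel-wise XOR similarity degree can be computed as below; it returns the fraction of differing voxels, so 0 indicates identical images and larger values indicate greater diversity.

```python
import numpy as np

def similarity_degree(a, b):
    """SD between two binary images of the same shape."""
    return float(np.not_equal(a, b).mean())   # proportion of XOR = 1
```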

Table 8 Similarity degrees (SDs) between HR_TI_SA and each reconstruction by SRCSGAN

4.2.9 The Settings of the Maximum Concurrent Stage Number

In actual experiments, it is found that a large maximum concurrent stage number (Max_Con) can shorten the training time, but it also leads to overfitting. Therefore, Max_Con must be selected reasonably.

Max_Con is set to 1, 3 and 5, respectively, to evaluate the overfitting problem. Note that when Max_Con is 1, the training process is actually not concurrent. For each setting of Max_Con (1, 3 and 5), ten reconstructions were generated to obtain average statistics. From the comparison of the MPC curves in Fig. 15 and the DDs in Table 9, it is found that when Max_Con = 5, the reconstructions are highly similar to HR_TI_SA.

Fig. 15 MPC curves of HR_TI_SA and reconstructions with different maximum concurrent numbers (Max_Con = 1, 3, 5) by SRCSGAN in the X, Y and Z directions

Table 9 Difference degrees (DDs) between HR_TI_SA and the SRCSGAN reconstructions with different maximum concurrent number (Max_Con = 1, 3 and 5) measured by MPC curves in the X, Y and Z directions

The XOR operation is also performed between HR_TI_SA and the reconstructions (Max_Con = 1, 3 and 5). When Max_Con = 1, the XOR operation between HR_TI_SA and the ten reconstructions R1, R2, …, R10 (expressed as XOR(HR_TI_SA, Ri), i = 1, 2, …, 10) yields an average SD of 0.683. Similarly, the average SDs for Max_Con = 3 and 5 are 0.544 and 0.335, respectively. Although the reconstruction quality is best when Max_Con = 5, its SD (0.335) indicates a relatively high similarity between the reconstructions and HR_TI_SA, i.e., a high degree of overfitting. Meanwhile, when Max_Con = 1, the reconstruction quality is unsatisfactory. Therefore, setting Max_Con to 3 balances the overfitting risk and the reconstruction quality.

4.2.10 The Settings of the Total Stage Number

Since the total stage number is variable, its influence on the reconstructed results is discussed in this section. To facilitate the discussion, the other parameters are kept unchanged in the experiments. Table 10 shows the reconstruction times for SRCSGAN with 5, 6 and 7 stages, respectively. It can be seen that the reconstruction time increases with the number of training stages.

Table 10 The reconstruction times using different training stages of SRCSGAN

Figure 16 shows the MPC curves of HR_TI_SA and of the reconstructed images with different numbers of training stages by SRCSGAN in the X, Y and Z directions. Increasing the number of stages normally improves reconstruction quality at first, but once the quality reaches a plateau, more stages bring no evident improvement and may even cause fluctuations in quality. Meanwhile, the reconstruction time surely increases with the number of stages, so reconstruction time and quality should be balanced in real simulations. Table 11 lists the DDs between HR_TI_SA and the SRCSGAN reconstructions with 5, 6 and 7 stages, measured by the MPC curves in the X, Y and Z directions. Judging from Fig. 16 and Table 11, a 6-stage SRCSGAN is a proper balance between reconstruction quality and time in our experiments.

Fig. 16 MPC curves of HR_TI_SA and reconstructions with different training stages (5, 6 and 7 stages) by SRCSGAN in the X, Y and Z directions

Table 11 Difference degrees (DDs) between HR_TI_SA and the SRCSGAN reconstructions (5, 6 and 7 stages) measured by MPC curves in the X, Y and Z directions

4.2.11 The Comparison of Reconstruction Time and CPU/GPU/Memory Usage

Since the reconstruction is quite computation-intensive and normally costs a large amount of memory, the average reconstruction time and the CPU, GPU and memory usage of SNESIM, MSPGAN, ConSinGAN and SRCSGAN are evaluated in this section. All experiments in this paper were performed on an AMD R9 3900X 3.8 GHz CPU with 16 GB memory and a GeForce RTX 3080 GPU with 10 GB video memory. In our experiments, SNESIM is CPU-based, while MSPGAN, ConSinGAN and SRCSGAN use both CPU and GPU. Table 12 shows the reconstruction times and the average CPU, GPU and memory usage of SNESIM, MSPGAN, ConSinGAN and SRCSGAN for ten sandstone reconstructions (80³ voxels). Note that the "reconstruction times" include both the training and generation times.

Table 12 The reconstruction times and usage of CPU, GPU and memory by SNESIM, MSPGAN, ConSinGAN and SRCSGAN for ten reconstructions

The whole reconstruction time consists of "the first reconstruction time" and "the other nine reconstruction times." As can be seen from Table 12, although MSPGAN, ConSinGAN and SRCSGAN take much longer for the first reconstruction, the subsequent nine reconstructions are carried out extremely fast by reusing the existing models and parameters, which is an advantage of deep learning methods over traditional methods such as SNESIM.

4.2.12 The Influence of the Size of the LR Training Image

The size of the LR image LR_TI_SA has an evident impact on the reconstruction time, especially in 3D reconstruction. Suppose the sizes of LR_TI_SA are 32³ and 40³ voxels; correspondingly, the sizes of HR_TI_SA are set to 64³ and 80³ voxels, respectively. The reconstruction times using different-sized LR_TI_SA to reconstruct different-sized HR_TI_SA by SRCSGAN are shown in Table 13. The reconstruction time for 40³ is nearly double that for 32³, so even a small increase in the size of the LR image may lead to a sharp increase in reconstruction time.

Table 13 The reconstruction times using different-sized LR_TI_SA to reconstruct different-sized HR_TI_SA by SRCSGAN

4.3 The Reconstruction of Shale

4.3.1 The Determination of an REV

Similar to the sandstone sample, the MPC curves of the 500³ shale sample (500³ voxels) are computed and displayed in Fig. 17. The curves tend to become stable at h = 60, 30 and 50 (the positions marked by red dotted lines in Fig. 17) in the X, Y and Z directions, respectively, meaning the REV for our experiments should be at least 60 × 30 × 50 voxels. Since the shale sample HR_TI_SH (80³ voxels) is larger than the REV, the validity of our experiments is assured.

Fig. 17 The MPC curves of the 500³ shale sample (500³ voxels)

4.3.2 Reconstructed Images Using SNESIM, MSPGAN, ConSinGAN and SRCSGAN

Similar to the sandstone reconstruction, the reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN are compared. All four methods use LR_TI_SH (40³ voxels) as the only TI and stochastically reconstruct 3D shale images of 80³ voxels, which are compared with the ground truth 3D shale image HR_TI_SH (80³ voxels) by the metrics of MPC, porosity, absolute permeability, etc. HR_TI_SH and the reconstructed images (80³ voxels) using SNESIM, MSPGAN, ConSinGAN and SRCSGAN are shown in Fig. 18.

Fig. 18 HR_TI_SH and the reconstructed images (80³ voxels) using SNESIM, MSPGAN, ConSinGAN and SRCSGAN

4.3.3 Comparison of Pore Space

The total, connected and isolated pore space of HR_TI_SH and of the reconstructions using SNESIM, MSPGAN, ConSinGAN and SRCSGAN in Fig. 18 are extracted using the software Avizo (Avizo 2015) and displayed in Fig. 19. To obtain average statistics, another ten reconstructed results were generated by each of SNESIM, MSPGAN, ConSinGAN and SRCSGAN. The average proportions of the total, connected and isolated pore space calculated from these results, together with the proportions of HR_TI_SH, are shown in Table 14. It is seen that SRCSGAN has better overall performance than the other methods due to its higher similarity with HR_TI_SH in the proportions of the connected and isolated pore space.

Fig. 19 The total pore space, connected pore space and isolated pore space of HR_TI_SH and reconstructions using SNESIM, MSPGAN, ConSinGAN and SRCSGAN (in Fig. 18)

Table 14 The average proportions of the total pore space, connected pore space and isolated pore space calculated from another ten reconstructed results, respectively, by SNESIM, MSPGAN, ConSinGAN and SRCSGAN and the proportion of the HR_TI_SH

4.3.4 Comparison of Autocorrelation Functions

Figure 20 shows the autocorrelation functions of HR_TI_SH and of the reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN, averaged over three orthogonal directions. The autocorrelation curves of SRCSGAN and HR_TI_SH almost overlap, indicating a high similarity between them.

Fig. 20 The autocorrelation functions of HR_TI_SH and reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN, averaged over three orthogonal directions

4.3.5 Comparison of Absolute Permeability

The absolute permeabilities of HR_TI_SH and the average absolute permeabilities of ten reconstructed images using SNESIM, MSPGAN, ConSinGAN and SRCSGAN in three directions are shown in Table 15, from which it is seen that the absolute permeabilities of SRCSGAN are closest to those of HR_TI_SH in all three directions, displaying favorable reconstruction quality.

Table 15 The absolute permeabilities of HR_TI_SH and the average absolute permeabilities of ten reconstructed images using SNESIM, MSPGAN, ConSinGAN and SRCSGAN in three directions

4.3.6 Comparison of MPC Curves

Figure 21 shows the MPC curves of HR_TI_SH and the average MPC curves of ten reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN in the X, Y and Z directions. The DDs between HR_TI_SH and the reconstructions, measured by MPC in the three directions, are displayed in Table 16. It is seen that SRCSGAN performs best.

Fig. 21 The MPC curves of HR_TI_SH and average MPC curves of ten reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN in the X, Y and Z directions

Table 16 Difference degrees (DDs) between HR_TI_SH and the reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN measured by MPC (as shown in Fig. 21) in the X, Y and Z directions

4.3.7 Comparison of Pore Network Models

The pore number and throat number of HR_TI_SH and the average of ten reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN are shown in Table 17. It can be seen that the reconstruction by SRCSGAN is closest to HR_TI_SH in the pore number and throat number.

Table 17 The pore number and throat number of HR_TI_SH and the average pore numbers and throat numbers of ten reconstructions, respectively, using SNESIM, MSPGAN, ConSinGAN and SRCSGAN

The ratios of pore diameters, throat diameters and coordination numbers are shown in Figs. 22, 23 and 24. DDs are calculated to quantitatively show the differences between HR_TI_SH and the average of reconstructions measured by the ratio curves (Figs. 22, 23, 24) of pore diameters, throat diameters and coordination numbers, as shown in Table 18. It can be seen that SRCSGAN performs best.

Fig. 22 Average pore diameter ratios of HR_TI_SH and reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN

Fig. 23 Average throat diameter ratios of HR_TI_SH and reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN

Fig. 24 Average coordination number ratios of HR_TI_SH and reconstructed images by SNESIM, MSPGAN, ConSinGAN and SRCSGAN

Table 18 Difference degrees (DDs) between HR_TI_SH and the reconstructions by SNESIM, MSPGAN, ConSinGAN and SRCSGAN measured by the ratio curves (Figs. 22, 23 and 24) of pore diameters, throat diameters and coordination numbers

4.3.8 The Diversity of Reconstruction Results

Table 19 shows the similarity degrees (SDs) between HR_TI_SH and ten reconstructions (called Rn, n = 1, …, 10) by SRCSGAN. Note that Table 19 is symmetric about the diagonal and all diagonal SDs are equal to 0. The off-diagonal SDs fluctuate between 0.43 and 0.70, displaying enough diversity between HR_TI_SH and each reconstruction.

Table 19 Similarity degrees (SDs) between HR_TI_SH and ten reconstructions by SRCSGAN

4.3.9 The Settings of the Maximum Concurrent Stage Number

Similar to the sandstone reconstruction, Max_Con is set to 1, 3 and 5, respectively, to evaluate the overfitting problem. For each setting of Max_Con (1, 3 and 5), ten reconstructions were generated to obtain average statistics. From the comparison of the MPC curves in Fig. 25 and the DDs in Table 20, it is found that when Max_Con = 5, the reconstructions are highly similar to HR_TI_SH.

Fig. 25 MPC curves of HR_TI_SH and reconstructions with different maximum concurrent numbers (Max_Con = 1, 3, 5) by SRCSGAN in the X, Y and Z directions

Table 20 Difference degrees (DDs) between HR_TI_SH and the SRCSGAN reconstructions with different maximum concurrent numbers (Max_Con = 1, 3, 5) measured by MPC curves in the X, Y and Z directions

The XOR operation is also performed between HR_TI_SH and the reconstructions (Max_Con = 1, 3 and 5). For Max_Con = 1, XOR(HR_TI_SH, Ri), i = 1, 2, …, 10, yields an average SD of 0.728. Similarly, the average SDs for Max_Con = 3 and 5 are 0.537 and 0.265, respectively. It is concluded that when Max_Con = 5, a high degree of overfitting occurs, similar to the sandstone reconstruction in Sect. 4.2.9. Meanwhile, when Max_Con = 1, the reconstruction quality is worse than that for Max_Con = 3. Therefore, setting Max_Con to 3 is a proper choice to balance overfitting and reconstruction quality.

4.3.10 The Settings of the Total Stage Number

Similar to the sandstone reconstruction, Table 21 shows the reconstruction times for SRCSGAN with 5, 6 and 7 stages; the reconstruction time increases with the number of training stages. Figure 26 shows the MPC curves of HR_TI_SH and of the reconstructed images with different numbers of training stages by SRCSGAN in the X, Y and Z directions, and Table 22 displays the corresponding DDs between HR_TI_SH and the SRCSGAN reconstructions with 5, 6 and 7 stages. Although the 7-stage reconstruction has the best quality, it also needs the longest reconstruction time. Based on the reconstruction times in Table 21, the MPC curves in Fig. 26 and the DDs in Table 22, a 6-stage SRCSGAN is a proper balance between reconstruction quality and time in our experiments, similar to the sandstone case discussed in Sect. 4.2.10.

Table 21 The reconstruction times using different training stages of SRCSGAN
Fig. 26 MPC curves of HR_TI_SH and reconstructions with different training stages (5, 6 and 7 stages) by SRCSGAN in the X, Y and Z directions

Table 22 Difference degrees (DDs) between HR_TI_SH and the SRCSGAN reconstructions (5, 6 and 7 stages) measured by MPC curves in the X, Y and Z directions

4.3.11 The Comparison of Reconstruction Time and CPU/GPU/Memory Usage

Table 23 shows the reconstruction times and the average CPU, GPU and memory usage of SNESIM, MSPGAN, ConSinGAN and SRCSGAN for ten shale reconstructions. Similar to the sandstone reconstruction, MSPGAN, ConSinGAN and SRCSGAN are much slower than SNESIM in the first reconstruction but extremely fast in the remaining nine by reusing the parameters and models established in the first reconstruction.

Table 23 The reconstruction times and usage of CPU, GPU and memory by SNESIM, MSPGAN, ConSinGAN and SRCSGAN for ten shale reconstructions

4.3.12 The Influence of the Size of the LR Training Image

Suppose the sizes of LR_TI_SH are 32³ and 40³ voxels; correspondingly, the sizes of HR_TI_SH are set to 64³ and 80³ voxels, respectively. The reconstruction times using different-sized LR_TI_SH to reconstruct different-sized HR_TI_SH by SRCSGAN are shown in Table 24. The reconstruction time for 40³ is much longer than that for 32³. Hence, a small increase in the size of the LR image causes a sharp increase in reconstruction time, similar to the sandstone reconstruction.

Table 24 The reconstruction times using different-sized LR_TI_SH to reconstruct different-sized HR_TI_SH by SRCSGAN

5 Conclusion

With the development of deep learning, some GAN-based variants have been proposed for the 3D reconstruction of porous media, but they often need abundant images as training data. Meanwhile, due to the limitations of hardware equipment and the real-time requirements of information transmission and processing, the image data in practical applications are often LR images and their quantity is possibly insufficient for the training tasks.

To address the above issues, a 3D reconstruction method, SRCSGAN, based on ConSinGAN and residual networks is proposed for the SR reconstruction of porous media, which realizes SR stochastic reconstruction from LR images when training data are insufficient. The experimental results show that SRCSGAN achieves favorable reconstruction quality; its advantages are summarized as follows:

1. Compared to SNESIM, MSPGAN and ConSinGAN, SRCSGAN has proved its accuracy in the SR reconstruction of porous media on sandstone and shale samples in the metrics of pore space, permeability, autocorrelation functions, MPC curves and pore network models.

2. SRCSGAN can save and reuse the training model after the first training, so subsequent reconstructions can be finished in a very short time when a large number of reconstruction tasks need to be performed, showing high efficiency in multiple reconstruction tasks.

The main disadvantage of SRCSGAN is the lengthy training time of the first-round reconstruction, a common challenge for most deep learning methods; the problem becomes more serious when the reconstruction size is large. Improved hardware and an optimized network architecture are two possible remedies. Our future work will focus on optimizing the architecture of SRCSGAN to shorten the training time of the first-round reconstruction.