CN113484908B

CN113484908B - Partial Convolution and Attention Mechanism Fusion Deep Learning Network for Missing Seismic Data Reconstruction

Info

Publication number: CN113484908B
Application number: CN202110980357.0A
Authority: CN
Inventors: 陈学华; 冯永基; 张�杰; 吕丙南; 姜晓敏; 刘俊杰
Original assignee: Chengdu Univeristy of Technology
Current assignee: Chengdu Univeristy of Technology
Priority date: 2021-08-25
Filing date: 2021-08-25
Publication date: 2023-07-14
Anticipated expiration: 2041-08-25
Also published as: CN113484908A

Abstract

The invention provides a data processing method for realizing the reconstruction of missing seismic data by fusing partial convolution and an attention mechanism into a deep learning network. The method can be used for adapting to irregular and large-space data missing situations in the seismic acquisition data, and realizing more accurate and reliable seismic data reconstruction.

Description

Missing seismic data reconstruction method for deep learning network by combining partial convolution and attention mechanism

Technical Field

The invention belongs to the field of petroleum seismic exploration data processing, and relates to a seismic data processing method for reconstructing and recovering data when seismic data are missing or damaged.

Background

In the field acquisition of seismic exploration data, the acquired seismic data is often irregular, undersampled or has data missing due to the limitations of surface topography, geological conditions, data acquisition equipment, environmental factors and the like. In the process of seismic data processing, the deletion of seismic data can be caused by removing bad tracks, cutting off specific data segments or filtering and the like. The integrity of the original seismic data is a key foundation for subsequent seismic data processing and interpretation and reliable subsurface geologic structure and reservoir imaging, so that the reconstruction of the missing seismic data becomes one of key technologies for seismic data processing. The existing seismic data reconstruction method mainly comprises an artificial intelligence method based on wave equation, prediction filtering, transform domain reconstruction, compressed sensing theory and machine learning and deep learning.

In recent years, a method for introducing artificial intelligence into seismic data processing has become a trend, and a deep learning model represented by a GAN countermeasure generation network and a DNCNN deep convolution network has good effects on seismic data reconstruction processing. However, the output result of the ordinary GAN deep learning model often has problems such as deformation of the seismic reflection structure, blurring of effective signals, aliasing, and the like. In particular, in the case of seismic data with large-distance data channel missing, the seismic data channel reconstructed by the common deep learning method is often uncoordinated with the characteristics of the surrounding existing seismic data channels. An important reason for this is that the size of the convolution kernel is limited in such data reconstruction algorithms, so that the convolution result is mainly affected by the data in the convolution kernel, and the effect and influence of effective information at a far position outside the convolution kernel cannot be obtained.

The attention mechanism in the deep learning is used as an additional module in the deep learning network, and can be used for solving the problem of smaller network perception range caused by the limitation of the size of a convolution kernel.

Disclosure of Invention

The invention aims to overcome the defects of the existing seismic data reconstruction method and technology, and develops a missing seismic data reconstruction method of a deep learning network by fusing partial convolution and a attention mechanism, wherein the method comprises the following main steps:

the method comprises the steps of establishing seismic data slices with uniform sizes as a training set;

and secondly, generating an countermeasure network by utilizing the data input of the training set, so that masks M with the same number and size as the seismic data slices in the training set are randomly generated by the countermeasure network. The mask M randomly divides the rectangular missing regions marked with 0 and the known regions marked with 1 of different sizes, respectively. When the mask is multiplied by the number of the data slices, a missing region can be formed on the original data slice;

firstly, the input data I passes through a part of convolution layers in the generation network G, the part of convolution layers judges whether effective data exist or not according to the position of a convolution kernel, convolution operation is selectively executed, and then the input data I' is obtained after being processed by a downsampling layer in the generation network G and is used for further processing by an attention module;

the attention module in the generating network G calculates cosine distances between each point in the missing area and all points in the known area, and the calculation method is as follows:

wherein,,

representing cosine similarity at two points (x, y) and (x ', y') obtained through standardized inner product operation, mapping a calculation result into probability models through a softmax function, wherein the number of the probability models is the same as the number of points in a missing area;

fifthly, multiplying the known region data with the obtained different probability model numbers respectively, wherein the probability model numbers are as follows:

a new value of a point in the missing region corresponding to the probability model can be obtained, after the processing, the value in the missing region is fused with the data characteristics in the known region, and the obtained new data matrix is recorded as

Sixth step of comparing I' with

After being spliced together in the channel dimension, the channel dimension is subjected to pixel convolution operation and is conveyed to an up-sampling layer in a generating network G for subsequent processing;

and after the up-sampling layer in the generation network G processes the data, outputting a plurality of results, calculating the average value of the results, then transmitting the results to the discrimination network D, comparing the reconstructed result with the samples of the training set by the discrimination network D, and feeding back the comparison result to the generation network G, wherein the two networks are mutually influenced, so that the network performance is jointly improved, the constructed loss function is ensured to obtain the minimum value, and the reconstructed seismic data is output.

Drawings

FIG. 1 is a schematic diagram of a deep learning network model incorporating a partial convolution layer and an attention module in accordance with the present invention. The input data sequentially passes through a partial convolution layer in the generation network G, a downsampling layer, a attention module and an upsampling layer and then is input into the discrimination network D.

FIG. 2 is a plot of seismic profiles before and after processing, after random loss of trace data, for a fully acquired single shot trace set in accordance with an embodiment of the present invention. Wherein: FIG. 2 (a) is a cross section of a completely acquired, undelayed seismic trace data, with trace number on the abscissa and time on the ordinate, in seconds(s); FIG. 2 (b) is a cross section of FIG. 2 (a) with trace numbers randomly missing, time on the abscissa, and seconds(s) on the ordinate; FIG. 2 (c) is a cross section of the seismic data reconstruction method of the invention, obtained by processing FIG. 2 (b), with the abscissa being the trace number and the ordinate being time in seconds(s); fig. 2 (d) is a residual profile obtained by subtracting the data of fig. 2 (a) and 2 (c), the abscissa is the trace number, the ordinate is time, and the unit is seconds(s).

Detailed Description

The invention discloses a structure and a flow of a deep learning network for realizing the reconstruction of missing seismic data, which are shown in a figure 1, and the structure and the flow of the deep learning network are integrated with a part of convolution layers and a processing module of an attention mechanism, and the concrete implementation flow of the reconstruction processing of the missing seismic data is as follows:

firstly, using seismic data slices with uniform sizes as a training set;

and when a batch of training data is input to generate an countermeasure network, the network can randomly generate masks M with the same number and size as the data slices. Rectangular missing areas and known areas with different sizes are randomly divided on the mask, data in the known areas in the mask are marked as 1, and the missing areas are marked as 0. When the mask is multiplied by the number of the data slices, a missing region can be manufactured on the original data slice;

and thirdly, the input data I is processed by a partial convolution layer and a downsampling layer to obtain I'. I' is not directly input into the upsampling layer, but enters the attention module for further processing; wherein a portion of the convolution layers selectively convolve based on whether valid data is present in the location of the convolution kernel.

The attention module calculates cosine distances between each point in the missing region and all points in the known region by the following calculation method:

wherein the method comprises the steps of

Representing cosine similarity at two points (x, y), (x ', y') obtained by a normalized inner product operation.

Mapping the calculation result into probability models through a softmax function, and marking the probability models as score, wherein the number of the probability models is the same as the number of points in the missing area; however, with iterations of the neural network, score is not the only fixed. So define the final score after i iterations as score ⁱ ，score ⁱ And score ^i-1 The relationship of (2) is as follows:

score ⁱ ＝λsocre′ ⁱ +(1-λ)score ^i-1

in socre' ⁱ Representing the fraction generated by the softmax function during the ith iteration. Lambda is a parameter that can be learned during the iteration process.

And fifthly, multiplying the known region data with the obtained different probability model numbers respectively, wherein the formula is as follows:

obtaining the missing region corresponding to the probability modelNew values for points in the inner. After such processing, the values in the missing region fuse the data features in the known region, and the new data matrix is recorded as

Sixth step of comparing I' with

After being spliced together in the channel dimension, the pixel convolution is carried out and the pixel convolution is transmitted to the next up-sampling layer;

the data of the user is output a plurality of results through the up-sampling layer, and the average value of the results is taken and then is transmitted to a discrimination network; the loss function sampled by the model is designed as:

wherein the loss function in the generated network G is:

the loss function in the discrimination network D is:

where M represents the mask and ". Sur represents the dot product operation. Loss of deleted region L _hole Representing the difference between the reconstructed result and the target sample. Known area loss is L _valid The loss function may control the transformation of the known region before and after processing. Input sample is I _input The network output result is I _out 。L _tv For the smoothing loss, P represents the hole area after expansion by 1 pixel. The smoothing loss represents the L1 loss for one pixel in the hole area and the right and lower pixels of that pixel. The smoothing loss measures the difference between adjacent data in the horizontal and vertical directions. L (L) _perceptual For perceived loss, it is a pretrained VGG-16 network.

Characterization of the ith pooled layer map in VGG-16 network, H _i W _i C _i The height, width and channel size of the ith feature map are represented. The perceived loss may be a deeper feature than the network output data and the real data. The output of the discrimination network D is 0 or 1. The output of the discrimination network D is 0 or 1.

When the complete single shot seismic trace data in fig. 2 (a) has serious data missing, as shown in the cross section in fig. 2 (b), the data missing of large space and small space simultaneously occurs, wherein the missing space of the largest missing region reaches 50 channels, the missing space of the smallest missing region is 6 channels, 150 channels of seismic data are missing in the cross section in fig. 2 (b), and the data missing rate is 58%. After the processing of the method of the invention is adopted, the data reconstruction section as shown in the figure 2 (c) is obtained, the original area with missing data is clear and continuous seismic reflection in-phase axis, compared with the figure 2 (a), the seismic reflection characteristics of the reconstructed seismic section are the same as the complete single shot seismic trace set, in order to illustrate the effect of data reconstruction, the section data in the figure 2 (a) and the figure 2 (c) are subtracted, the residual seismic sections of the two are obtained, as shown in the figure 2 (d), the effective seismic reflection signals with almost no obvious energy are visible from the residual seismic sections, and the effect of the invention on the reconstruction of the missing seismic data is quite reliable and superior.

The invention has the advantages that: (1) For a foreground data feature, the attention mechanism first calculates its similarity to all background data features, and then converts the similarity to an attention score by softmax. Finally, calculating an average data characteristic by using the background data characteristic and the attention score, wherein the average data characteristic contains useful information needed to be used according to the attention score; (2) After fusing the result with the original data characteristic pixels, carrying out convolution, so that the limitation of the convolution kernel size can be broken through in the convolution process, and the data outside the convolution kernel can also act on the convolution result, thereby reducing the defects of deformation, blurring, aliasing and the like of the stratum reflection structure in the reconstruction result; (3) The method can be suitable for effectively reconstructing irregular and random missing data in the seismic acquisition data.

The foregoing embodiments are merely illustrative of the present invention, and various implementation steps of the method may be changed, and all equivalent changes and modifications performed on the basis of the technical solutions of the present invention should not be excluded from the protection scope of the present invention.

Claims

1. A partial convolution and attention mechanism fusion deep learning network missing seismic data reconstruction method comprises the following main steps:

(1) Establishing seismic data slices with uniform sizes as a training set;

(2) Generating an countermeasure network by using training set data input, wherein the countermeasure network consists of a generation network G and a discrimination network D; enabling the countermeasure network to randomly generate masks M with the same number and same size as the seismic data slices in the training set; the mask M randomly divides the rectangular missing area marked with 0 and the known area marked with 1 with different sizes respectively; when the mask is multiplied by the number of the data slices, a missing region can be formed on the original data slice;

(3) The input data I firstly passes through a part of convolution layers in the generation network G, the part of convolution layers judges whether effective data exist or not according to the position of a convolution kernel, then selectively carries out convolution operation, and then obtains I' after being processed by a downsampling layer in the generation network G for further processing by an attention module;

(4) The attention module in the generation network G calculates cosine distances between each point in the missing area and all points in the known area by the following method:

wherein,,

representing cosine similarity at two points (x, y), (x ', y') obtained by normalized inner product operation, will be calculatedThe calculation result is mapped into probability models through a softmax function, and the number of the probability models is the same as the number of points in the missing area;

(5) Multiplying the known region data with the obtained different probability model numbers respectively, wherein the following formula is as follows:

(6) And I' is combined with

(7) After the up-sampling layer in the network G is generated to process data, a plurality of results are output, the average value of the results is calculated and then is transmitted to the discrimination network D, the discrimination network D compares the reconstructed result with samples of the training set, the comparison result is fed back to the generation network G, the two networks are mutually influenced, the network performance is jointly improved, the constructed loss function is ensured to obtain the minimum value, and therefore the reconstructed seismic data is output.