CN108596327B

CN108596327B - Seismic velocity spectrum artificial intelligence picking method based on deep learning

Info

Publication number: CN108596327B
Application number: CN201810260667.3A
Authority: CN
Inventors: 张昊; 朱培民
Original assignee: China University of Geosciences
Current assignee: China University of Geosciences
Priority date: 2018-03-27
Filing date: 2018-03-27
Publication date: 2021-08-03
Anticipated expiration: 2038-03-27
Also published as: CN108596327A

Abstract

The invention provides an artificial intelligence picking method of an earthquake velocity spectrum based on deep learning, which is characterized in that a velocity spectrum has images, the characteristics of energy clusters in the velocity spectrum are extracted through a convolutional neural network, and the prediction result of a time-velocity pair sequence corresponding to the energy clusters in the velocity spectrum is calculated from the extracted characteristics; and extracting the relation characteristics of the time and the speed of the prediction result by using a recurrent neural network based on the time sequence characteristics of the speed spectrum, and correcting the prediction result according to the relation characteristics to finally obtain an accurate time-speed pair sequence. The invention has the beneficial effects that: the degree of automation is high, the time of seismic data processing is reduced, and the intelligence and physical labor of seismic data processing personnel can be basically and completely liberated.

Description

Seismic velocity spectrum artificial intelligence picking method based on deep learning

Technical Field

The invention relates to the technical field of seismic data processing of seismic exploration, in particular to a seismic velocity spectrum artificial intelligence picking method based on deep learning.

Background

Velocity analysis is the basis for conventional seismic data processing. In the conventional velocity analysis method, a series of velocities at the same interval are given to scan, and a velocity spectrum (fig. 1) is produced with the superposition energy or the similarity coefficient or the like as a criterion for velocity analysis. Energy boluses of the velocity spectrum correspond to strong reflection information and can provide correct kinetic correction velocity. The accuracy of the speed pick-up can directly influence the results of dynamic correction, offset, AVO and time-depth conversion in oil and gas exploration.

The current velocity spectrum picking mode is completed manually, the working efficiency is low, the time consumption is high, and human errors are easy to occur. Therefore, with the continuous development of seismic exploration and the continuous expansion of exploration fields, the number of seismic data to be processed is increased, and higher requirements on the picking efficiency and precision of velocity spectrum in velocity analysis are also provided.

Through the search of the prior art, the Chinese patent with the publication number of CN105445788A provides an automatic velocity spectrum interpretation method based on a model and global optimization, the method has low picking efficiency, the result is often random, and the method is easy to make mistakes.

Another chinese patent, publication No. CN105572733A, provides an automatic seismic velocity spectrum picking method. The method comprises the steps of firstly converting seismic data into two-dimensional grid data, then setting a calculation time window, determining the position of a maximum value in the area of the calculation time window, then defining a speed search threshold value, recording the position of the maximum value of a speed spectrum, connecting a speed curve, and finally carrying out interpolation on the speed curve. The method needs to manually set reasonable calculation time window and speed search threshold according to prior knowledge, and cannot realize intellectualization and automation.

Disclosure of Invention

In view of this, the embodiment of the present invention provides an artificial intelligence picking method for seismic velocity spectrum based on deep learning.

The embodiment of the invention provides an artificial intelligence picking method of a seismic velocity spectrum based on deep learning, which is characterized in that based on the characteristic that the velocity spectrum has images, the characteristics of energy clusters in the velocity spectrum are extracted through a convolutional neural network, and the prediction result of a time-velocity pair sequence corresponding to the energy clusters in the velocity spectrum is calculated from the extracted characteristics; and extracting the relation characteristics of the time and the speed of the prediction result by using a recurrent neural network based on the time sequence characteristics of the speed spectrum, and correcting the prediction result according to the relation characteristics to finally obtain an accurate time-speed pair sequence.

Further, the method comprises the following steps of:

a1: taking the preprocessed original velocity spectrum and the artificial extraction value of the time-velocity pair sequence corresponding to the original velocity spectrum as training samples, training the convolutional neural network, and obtaining a convolutional neural network training model capable of predicting the prediction result;

a2: taking the prediction result and the manually extracted value of the time-speed pair sequence of the original speed spectrum as a new training sample set, training the recurrent neural network, and obtaining a recurrent neural network training model capable of obtaining an accurate time-speed pair sequence;

a3: and sequentially inputting the original velocity spectrum into the convolutional neural network training model and the cyclic neural network training model to obtain an accurate time-velocity pair sequence.

Furthermore, the convolutional neural network comprises a plurality of convolutional layers and a plurality of pooling layers arranged in a cross way with the convolutional layers, each pooling layer is positioned behind the convolutional layer corresponding to the pooling layer, the cyclic neural network further comprises a plurality of full-connection layers positioned behind the last pooling layer, the full-connection layers are arranged in front and at the back, and the last full-connection layer corresponds to the output of the neural network.

Further, the recurrent neural network includes an input layer, an output layer, and a hidden layer between the input layer and the output layer, the input layer corresponding to an output of the convolutional neural network.

Further, the step of refining of step a1 includes:

a1.1: carrying out whitening pretreatment on an original velocity spectrum to obtain a velocity spectrum, wherein the velocity spectrum and a corresponding time-velocity pair sequence thereof form a training data set of a convolutional neural network;

a1.2: randomly shuffling a training data set of the convolutional neural network, grouping the data in the training set, wherein the number of the data in each group is consistent, and sequentially inputting each group into the convolutional neural network, wherein each group of data sequentially passes through a plurality of convolutional layers;

a1.3: extracting and fusing the features on one convolution layer by using a sliding window to obtain convolution feature maps on the convolution layer, wherein each convolution feature map is related to the weight of the convolution layer corresponding to the convolution feature map;

a1.4: inputting the convolution characteristic map into a subsequent pooling layer adjacent to the convolution characteristic map, wherein the pooling layer adopts a maximum pooling algorithm to obtain a pooling characteristic map, and the pooling characteristic map enters another subsequent convolution layer adjacent to the pooling characteristic map;

a1.5: repeating the steps A1.3 and A1.4 until data enters the last convolutional layer, wherein a Dropout layer is inserted in the last convolutional layer, the Dropout layer is used for randomly leading the value of the pooling feature map passing through the Dropout layer to be zero so as to reduce the phenomenon of network overfitting, then the convolutional layer generates a last convolutional feature map, and the convolutional feature map is finally pooled by the last pooling layer to generate a pooling feature map;

a1.6: performing multiple classification identification on the region position of the energy mass of the velocity spectrum based on the pooling characteristic maps of different pooling layers, screening out the real position of the energy mass, and performing multiple detection and inference on the peak value of the energy mass by using a full-connected layer to obtain the actual position of the peak value of the energy mass, namely the prediction result of the time-velocity pair sequence of the velocity spectrum;

a1.7: calculating a cost function of the convolutional neural network, judging whether the cost function of the last full-connection layer is converged, finishing the training of the convolutional neural network if the cost function of the last full-connection layer is converged, and outputting a prediction result of the time-speed pair sequence, otherwise, performing the next step;

a1.8: calculating gradient values of cost functions of all layers in the convolutional neural network by using a back conduction algorithm, calculating error sensitivity of all layers of the convolutional neural network according to the obtained gradient of each layer by using a momentum-based random gradient descent algorithm, and updating the weight of each layer by using the error sensitivity of each layer;

a1.8: and returning to the step 1.2.

Further, the step of refining of step a2 includes:

a2.1: the prediction result and the artificially extracted value of the time-speed pair sequence of the original speed spectrum form a training sample set of the recurrent neural network, and the training sample set of the recurrent neural network is input into the recurrent neural network;

a2.2: the recurrent neural network extracts the relation characteristic of time and speed in the prediction result through the neurons of the recurrent neural network, the relation characteristic is related to the weight of the corresponding layer, and a more accurate time-speed pair sequence is obtained by utilizing the relation characteristic;

a2.3: calculating a cost function of the convolutional neural network, judging whether the cost function of the output layer is converged, if so, finishing the training of the convolutional neural network, and if not, performing the next step;

a2.4: calculating gradient values of cost functions of all layers in the cyclic neural network by using a reverse conduction algorithm, calculating error sensitivity of all layers in the cyclic neural network according to the gradient values of all layers by using an Adam learning rate self-adaptive algorithm, and updating the weight of each layer by using the error sensitivity of each layer;

a2.5: repeat step A2.2.

Further, the training sample set of the recurrent neural network is formed in an input layer of the recurrent neural network, a hidden layer of the recurrent neural network is used for correcting the prediction result, an output layer of the recurrent neural network is used for outputting a more accurate time-speed pair sequence, and the recurrent neural network is a long-short term memory network structure which comprises a forgetting gate, an input gate, an output gate and a cell state.

Further, the step of refining of step a3 includes:

a3.1: carrying out whitening pretreatment on the original velocity spectrum;

a3.2: inputting the preprocessed velocity spectrum into the convolutional neural network training model, wherein the position corresponding to the energy mass in the processed velocity spectrum is the prediction result of the time-speed pair sequence of the processed velocity spectrum;

a3.3: inputting the prediction result of the processed time-speed pair sequence of the velocity spectrum into the recurrent neural network training model, correcting the prediction result by the recurrent neural network training model, and finally outputting an accurate time-speed pair sequence;

a3.4: and combining a plurality of velocity spectrum intelligent pickup results of the same measuring line at different positions, and forming a superimposed velocity profile through interpolation.

The technical scheme provided by the embodiment of the invention has the following beneficial effects: the seismic velocity spectrum artificial intelligence picking method based on the deep learning can (1) carry out artificial intelligence picking on a seismic velocity spectrum by adopting the seismic velocity spectrum artificial intelligence picking method based on the deep learning, and corresponding original seismic velocity spectrum data are input into a convolutional neural network training model and a cyclic neural network training model obtained after training, so that an accurate time-velocity pair sequence can be obtained without any artificial intervention in the period, and the degree of automation is high; (2) the invention needs to spend a lot of time when training the model; after training is finished, the time-velocity curve can be automatically and quickly obtained by inputting the original velocity spectrum data into the training model, the time for processing the seismic data is favorably shortened, and the intelligence and physical labor of seismic data processing personnel can be basically and completely liberated.

Drawings

FIG. 1 is a prior art seismic velocity spectrum;

FIG. 2 is a block diagram of a convolutional neural network and a cyclic neural network of the present invention;

FIG. 3 is a comparison schematic of the velocity spectrum pickup of the present invention;

FIG. 4 is a frame diagram of the seismic velocity spectrum artificial intelligence picking method based on deep learning of the invention.

Detailed Description

In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be further described with reference to the accompanying drawings.

Referring to fig. 2 and 4, an embodiment of the present invention provides an artificial intelligence picking method for seismic velocity spectrum based on deep learning, which mainly includes two stages of learning and intelligent picking, where the learning stage mainly trains a convolutional neural network to obtain a convolutional neural network training model and trains a cyclic neural network to obtain a cyclic neural network training model; in the intelligent picking stage, the accurate time-velocity pair sequence in the original velocity spectrum is obtained by using the convolutional neural network training model and the cyclic neural network training model.

The seismic velocity spectrum artificial intelligence picking method based on deep learning is characterized in that a velocity spectrum has the characteristics of an image, the characteristics of energy clusters in the velocity spectrum are extracted through the convolutional neural network training model, and the prediction result of a time-velocity pair sequence corresponding to the energy clusters in the velocity spectrum is calculated from the extracted characteristics; and extracting the relation characteristics of the time and the speed of the prediction result by utilizing the recurrent neural network training model based on the time sequence characteristics of the speed spectrum, and correcting the prediction result according to the relation characteristics to finally obtain an accurate time-speed pair sequence.

Specifically, the learning phase includes:

a1: and taking the preprocessed original velocity spectrum and the artificial extraction value of the time-velocity pair sequence corresponding to the original velocity spectrum as training samples, training the convolutional neural network, and obtaining a convolutional neural network training model capable of predicting the prediction result.

Referring to fig. 2, the convolutional neural network includes a plurality of convolutional layers and a plurality of pooling layers arranged to intersect with the convolutional layers, each pooling layer is located behind the convolutional layer corresponding to the convolutional layer, the convolutional neural network further includes a plurality of fully-connected layers located behind the last pooling layer, the plurality of fully-connected layers are arranged in front of and behind each other, and the last fully-connected layer corresponds to an output of the neural network.

In this embodiment, the convolutional neural network at least includes 4 convolutional layers, 4 pooling layers, and 3 fully-connected layers.

The convolutional neural network has the specific structure as follows:

the first layer is a convolution layer, 16 convolution kernels are used in the convolution layer, the window size of the convolution kernels is 3 x 3 grids, the moving step length is set to be 2, the activation function is set to be a ReLU function, and 16 convolution characteristic graphs are output;

the second layer is a pooling layer, the kernel window size of the second layer is 2 multiplied by 2 grids, the moving step length is set to be 2, the size of the obtained pooling characteristic graph is changed to be one fourth of the original size after the convolution characteristic graph passes through the pooling layer, the number of the characteristic graphs is not changed, and 16 pooling characteristic graphs are output;

the third layer is a convolution layer, the layer uses 32 convolution kernels, the window size of the convolution kernels is 3 x 3 grids, the moving step length is set to be 2, the activation function is set to be a ReLU function, and 32 convolution characteristic graphs are output;

the fourth layer is a pooling layer, the size of a kernel window of the pooling layer is 2 multiplied by 2 grids, the moving step length is set to be 2, and 32 pooling feature maps are output;

the fifth layer is a convolution layer, the convolution layer uses 64 convolution kernels, the window size of the convolution kernels is 3 x 3 grids, the moving step length is set to be 2, the activation function is set to be a ReLU function, and 64 convolution characteristic graphs are output;

the sixth layer is a pooling layer, the size of a kernel window of the pooling layer is 2 multiplied by 2 grids, the moving step length is set to be 2, and 64 pooling feature maps are output;

the seventh layer is a convolution layer, the layer uses 256 convolution kernels, the window size of the convolution kernels is 3 x 3 grids, the moving step size is set to be 2, the activation function is set to be a ReLU function, meanwhile, a Dropout layer is added, and 256 convolution characteristic graphs are output;

the eighth layer is a pooling layer, the size of a kernel window of the layer is 2 multiplied by 2 grids, the moving step length is set to be 2, and 256 pooling characteristic graphs are output;

the ninth layer is a full connection layer, and 256 neurons are arranged in the layer;

the tenth layer is a full connection layer, and 1024 neurons are arranged in the layer;

the eleventh layer is a fully connected layer, and the number of neurons in the layer corresponds to the number of time-velocity pairs and energy boluses in the velocity spectrum.

The process of training the convolutional neural network is as follows:

a1.1: and performing whitening pretreatment on the original velocity spectrum to obtain a velocity spectrum, wherein the velocity spectrum and a corresponding time-velocity pair sequence form a training data set of the convolutional neural network.

The process of whitening preprocessing the raw velocity spectrum is: an amplitude threshold value of the original velocity spectrum grid points is set, and when the amplitude value of one of the grid points is smaller than the threshold value, the amplitude value of the grid point is set to zero, thereby obtaining a preprocessed velocity spectrum. And manually extracting a time-speed pair sequence of the processed velocity spectrum, wherein the time-speed pair sequence is an estimated value and has relatively low accuracy.

A1.2: randomly shuffling the training data set of the convolutional neural network, grouping the data in the training set, wherein the number of the data in each group is consistent, and sequentially inputting each group into the convolutional neural network, wherein each group of data sequentially passes through a plurality of convolutional layers.

A1.3: and extracting and fusing the features on one convolution layer by using a sliding window to obtain convolution feature maps on the convolution layer, wherein each convolution feature map is related to the weight of the convolution layer corresponding to the convolution feature map.

The forward implementation process of the convolutional layer is as follows:

a_i，k＝f(x_i*w_k+b_k) (I)

wherein, a_i，kIs the ith tensor velocity spectral data x of the training set input via the kth convolution kernel in the convolutional layer_iConvolution characteristic graph obtained by convolution processing, representing convolution operation, W_kWeight representing the kth convolution, b_kFor the threshold corresponding to the kth convolution kernel, f (-) is the ReLU-type activation function:

f(x)＝max(0，x) (2)

as can be seen from formula (2), when x is less than or equal to zero, the function value is zero; when x is greater than zero, the function value is still x.

A1.4: and inputting the convolution characteristic map into the subsequent pooling layer adjacent to the convolution characteristic map, wherein the pooling layer adopts a maximum pooling algorithm to obtain a pooling characteristic map, and the pooling characteristic map enters another subsequent convolution layer adjacent to the pooling characteristic map.

And inputting the convolution characteristic diagram generated by the convolution layer into the pooling layer, wherein the pooling layer adopts maximum pooling, the pooling dimension is set to be 2, the moving step length is 2, the size of the pooling characteristic diagram obtained after the convolution characteristic diagram passes through the pooling layer becomes one fourth of the original size, and the number of the characteristic diagrams is unchanged. The maximum pooling uses the following formula:

c_j＝maxpool(a_i，k) (3)

wherein, c_jA jth pooling profile generated for the pooling layer.

A1.5: repeating the steps A1.3 and A1.4 until the data enters the last convolution layer, wherein a Dropout layer is inserted in the last convolution layer and used for randomly enabling the value of the pooling feature map passing through the Dropout layer to be zero so as to reduce the phenomenon of network overfitting, and then the convolution layer generates a last convolution feature map which is finally pooled by the last pooling layer to generate a pooling feature map.

The calculation process for randomly letting the pooling profile across the Dropout layer be set to zero is:

Dropout(x)＝RandomZero(p)×x (4)

randomzero (p) shows that the value in the data matrix x of the layer is set to be 0 according to the set probability p, and Dropout (x) shows the data matrix obtained after the training stage passes through the Dropout layer.

A1.6: and performing multiple classification and identification on the region position of the energy mass of the velocity spectrum based on the pooling characteristic maps of different pooling layers, screening out the real position of the energy mass, and performing multiple detection and inference on the peak value of the energy mass by using the full-connected layers to obtain the actual position of the peak value of the energy mass, namely the prediction result of the time-velocity pair sequence of the velocity spectrum.

Wherein the forward calculation process of the full connection layer is as follows:

a_i，l＝f(W_la_t，t-1+b_i) (5)

in the formula, a_i，lAnd the characteristic vector represents the ith tensile velocity spectrum data after passing through the full connecting layer, and the characteristic vector contains the actual position of the energy mass peak.

A1.7: and calculating a cost function of the convolutional neural network, judging whether the cost function of the last full-connection layer is converged, finishing the training of the convolutional neural network if the cost function of the last full-connection layer is converged, and outputting a prediction result of the time-speed pair sequence, otherwise, performing the next step.

The calculation formula of the cost function of the convolutional neural network is as follows:

wherein, y_iDenotes x_iThe corresponding objective, f (-) represents the function of the forward computation of all convolutional layers, L (-) represents the loss function, m is the number of input vectors, and θ represents the parameter of the convolutional neural network.

A1.8: calculating gradient values of cost functions of all layers in the convolutional neural network by using a back conduction algorithm, calculating error sensitivity of all layers of the convolutional neural network according to the obtained gradient of each layer by using a momentum-based random gradient descent algorithm, and updating the weight of each layer by using the error sensitivity of each layer.

If the current l layer is a fully connected layer, the residual calculation formula of the current l layer is as follows:

δ_l＝(W_i)⁷δ_l+1f^l(a_t) (7)

parameter W_lThe gradient of (a) is calculated as:

the gradient calculation formula of the parameter b is as follows:

wherein, delta_l+1Is the residual error of the (l + 1) th layer in the network, J (W, b; x, y) is the cost function, W, b, x, y respectively represent the weight, the threshold, the training input and the corresponding target, and T is the matrix transposition.

If the current layer is a convolutional layer, the residual calculation formula of the current layer is as follows:

δ_i＝δ_l+1*rot180(W_l)f^l(a_l) (10)

wherein, rot180 (W)_l) Represents W_lA rotation of 180 degrees is required.

If the current l layer is a pooling layer, the residual calculation formula of the current l layer is as follows:

δ_l＝upsample((W_l)⁷δ_l+1)f^l(a_l) (11)

where the upsamplle (-) function represents a function of the amplification and error redistribution of the pooled error matrix.

The update formula of the momentum random gradient descent algorithm is as follows:

wherein the content of the first and second substances,

representing the gradient of a parameter theta calculated from x, y in the training set, alpha being the learning rate, gamma being the momentum parameter, v_tIs the current velocity vector, v_t-1Is the velocity vector in the previous iteration. Alpha is initially set to 0.1, v_tThe initial setting is 0, gamma is between the intervals (0, 1), gamma is set to 0.5 in the initial training stage, and becomes larger as the number of iterations increases.

A1.8: and returning to the step 1.2.

A2: and taking the prediction result and the manually extracted value of the time-speed pair sequence of the original speed spectrum as a new training sample set, training the recurrent neural network, and obtaining a recurrent neural network training model capable of obtaining an accurate time-speed pair sequence.

The recurrent neural network comprises an input layer, an output layer and a hidden layer positioned between the input layer and the output layer, wherein the input layer corresponds to the output of the convolutional neural network. The input layer vector dimensions correspond to the output of a convolutional neural network; the hidden layer uses at least 20 neurons; the output layer is a time-speed pair and has the same number as the neurons of the eleventh layer of the convolutional neural network.

The process of training the recurrent neural network is as follows:

a2.1: and (3) forming a training sample set of the recurrent neural network by the prediction result and the artificially extracted value of the time-velocity pair sequence of the original velocity spectrum, and inputting the training sample set of the recurrent neural network into the recurrent neural network.

A2.2: and the recurrent neural network extracts the relation characteristic of time and speed in the prediction result through the neurons of the recurrent neural network, the relation characteristic is related to the weight of the corresponding layer, and a more accurate time-speed pair sequence is obtained by utilizing the relation characteristic.

The recurrent neural network is a long-short term memory (LSTM) network structure, which comprises a forgetting gate, an input gate, an output gate and a cell state, and the forward calculation formula is as follows:

f^(t)＝σ(W_fh^(t-1)+U_fx^(z)+b_f) (13)

i^(t)＝σ(W_ih^(t-1)+U_ix^(t)+b_i) (14)

a^(t)＝tanh(W_ah^(t-1)+U_ax^(t)+b_a) (15)

C^(t)＝C^(t-1)⊙f^(t)+i^(t)⊙a^(t) (16)

o^(t)＝σ(W_oh^(t-1)+U_ox^(t)+b_o) (17)

h^(t)＝o^(t)⊙tanh(C^(t)) (18)

wherein, the formula (13) is a mathematical expression of the LSTM forgetting door, f^(t)Indicates the probability of forgetting the state of the previous layer of hidden cells, h^(t-1)Representing the hidden state of the previous sequence, x^(t)Representing the input of the current sequence, W_f，U_f，b_fRespectively, a weight matrix and an offset vector, and sigma is a sigmoid activation function.

The LSTM input gate is composed of two parts, and the expressions (14) and (15) are mathematical expressions of the two parts respectively. i.e. i^(t)，a^(t)Respectively representing the outputs of the two parts. W_iAnd U_iWeight matrix representing input gates, b_iRepresenting the offset vector of the input gate, W_aAnd U_aWeight matrix representing input gates, b_aRepresenting the offset vector of the input gate.

Equation (16) is a mathematical expression for updating the state of cells, where C^(t)Indicating the current cell state, c^(t-1)Indicates the cell state of the previous sequence, which indicates the Hadamard product.

Equations (17), (18) represent two-part mathematical expressions for the output gate, the first part being o^(t)I.e. a hidden neuron which is hidden by the previous sequence of hidden states h^(t-1)And the input vector of the current sequence and the activation function sigmoid, and the second part h^(t)Indicating the current hidden state, which is represented by the current cell state C^(t)Tan h activation function and o^(t)And (4) forming. Hidden neurons and hidden states hide velocity, time information.

Equation (19) is a mathematical expression for updating the prediction output of the current sequence.

For the prediction output of the current sequence, V and c represent the weight matrix and the offset vector of the prediction output, respectively.

A2.3: and calculating a cost function of the convolutional neural network, judging whether the cost function of the output layer is converged, finishing the training of the convolutional neural network if the cost function of the output layer is converged, and otherwise, performing the next step.

A2.4: and calculating gradient values of cost functions of all layers in the cyclic neural network by using a reverse conduction algorithm, calculating error sensitivity of all layers in the cyclic neural network according to the gradient values of all layers by using an Adam learning rate self-adaptive algorithm, and updating the weight of each layer by using the error sensitivity of each layer.

The gradient value update formula is as follows:

equation (20) represents the hidden state h^(t)The expression of the gradient calculation of (a),

in a hidden state h^(t)L represents a cost function,formula (21) represents cell State C^(t)The expression of the gradient calculation of (a),

is in a cellular state C^(t)The gradient value of (a). Equation (22) represents the weight matrix W in the forgetting gate_fThe gradient update expression of (1):

where τ denotes an index value of the current sequence, and the gradient update expression of other parameters is similar to expression (22).

The updating formula of the Adam learning rate self-adaptive algorithm is as follows:

where η represents the step size, set to 0.001,. epsilon.a constant for numerical stability, set to 10-8,

denotes v_tThe correction value for the first moment estimate,

represents m_tCorrection values of second-order moment estimates, v_tAnd m_tThe following were used:

v_t＝ρ₁ v_t-1+(1-ρ₁)L

m_t＝ρ₂m_t-1+(1-ρ₂)L⊙L

where ρ is₁Exponential decay Rate, ρ, representing an estimate of the first moment₂Exponential decay Rate, ρ, representing an estimate of the second moment₁And ρ₂In the interval [0,1), L represents a cost function, and L represents a Hadamard product.

A2.5: repeat step A2.2.

The intelligent pick-up phase comprises:

The method specifically comprises the following steps:

a3.1: the raw velocity spectrum is subjected to a whitening pre-processing.

A3.2: and inputting the preprocessed velocity spectrum into the convolutional neural network training model, wherein the position corresponding to the energy mass in the processed velocity spectrum is the prediction result of the time-speed pair sequence of the processed velocity spectrum.

A3.3: and inputting the prediction result of the processed time-speed pair sequence of the velocity spectrum into the recurrent neural network training model, correcting the prediction result by the recurrent neural network training model, and finally outputting an accurate time-speed pair sequence.

FIG. 1 is a prior art seismic velocity spectrum and FIG. 3 is a comparison schematic of the velocity spectrum pickup of the present invention.

In this document, the terms front, back, upper and lower are used to define the components in the drawings and the positions of the components relative to each other, and are used for clarity and convenience of the technical solution. It is to be understood that the use of the directional terms should not be taken to limit the scope of the claims.

The features of the embodiments and embodiments described herein above may be combined with each other without conflict.

The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims

1. An artificial intelligence picking method of seismic velocity spectrum based on deep learning is characterized in that: extracting the characteristics of energy clusters in the velocity spectrum through a convolutional neural network based on the characteristics of the velocity spectrum with images, and calculating the prediction result of a time-velocity pair sequence corresponding to the energy clusters in the velocity spectrum from the extracted characteristics; based on the time sequence characteristics of the velocity spectrum, extracting the relation characteristics of time and velocity of the prediction result by using a recurrent neural network, and correcting the prediction result according to the relation characteristics to finally obtain an accurate time-velocity pair sequence;

the method comprises the following refining steps:

the step of refining step a1 includes:

a1.4: inputting the convolution characteristic graph into a subsequent pooling layer adjacent to the convolution characteristic graph, wherein the pooling layer adopts a maximum pooling algorithm to obtain a pooling characteristic graph which enters another subsequent convolution layer adjacent to the pooling characteristic graph;

a1.9: and returning to the step 1.2.

2. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 1, characterized in that: further comprising the following refining steps:

3. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 2, characterized in that: the convolutional neural network comprises a plurality of convolutional layers and a plurality of pooling layers which are arranged in a crossed mode with the convolutional layers, each pooling layer is located behind the convolutional layer corresponding to the pooling layer, the convolutional neural network further comprises a plurality of full-connection layers located behind the last pooling layer, the full-connection layers are arranged in a front-back mode, and the last full-connection layer corresponds to the output of the neural network.

4. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 3, characterized in that: the recurrent neural network comprises an input layer, an output layer and a hidden layer positioned between the input layer and the output layer, wherein the input layer corresponds to the output of the convolutional neural network.

5. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 4, characterized in that: the step of refining step a2 includes:

a2.5: repeat step A2.2.

6. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 5, characterized in that: the training sample set of the recurrent neural network is formed on an input layer of the recurrent neural network, a hidden layer of the recurrent neural network is used for correcting the prediction result, an output layer of the recurrent neural network is used for outputting a more accurate time-speed pair sequence, and the recurrent neural network is a long-short term memory network structure which comprises a forgetting gate, an input gate, an output gate and a cell state.

7. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 4, characterized in that: the step of refining step a3 includes:

a3.1: carrying out whitening pretreatment on the original velocity spectrum;