CN108596327B - Seismic velocity spectrum artificial intelligence picking method based on deep learning - Google Patents

Seismic velocity spectrum artificial intelligence picking method based on deep learning Download PDF

Info

Publication number
CN108596327B
CN108596327B CN201810260667.3A CN201810260667A CN108596327B CN 108596327 B CN108596327 B CN 108596327B CN 201810260667 A CN201810260667 A CN 201810260667A CN 108596327 B CN108596327 B CN 108596327B
Authority
CN
China
Prior art keywords
neural network
layer
velocity spectrum
velocity
convolutional
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810260667.3A
Other languages
Chinese (zh)
Other versions
CN108596327A (en
Inventor
张昊
朱培民
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China University of Geosciences
Original Assignee
China University of Geosciences
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China University of Geosciences filed Critical China University of Geosciences
Priority to CN201810260667.3A priority Critical patent/CN108596327B/en
Publication of CN108596327A publication Critical patent/CN108596327A/en
Application granted granted Critical
Publication of CN108596327B publication Critical patent/CN108596327B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Abstract

The invention provides an artificial intelligence picking method of an earthquake velocity spectrum based on deep learning, which is characterized in that a velocity spectrum has images, the characteristics of energy clusters in the velocity spectrum are extracted through a convolutional neural network, and the prediction result of a time-velocity pair sequence corresponding to the energy clusters in the velocity spectrum is calculated from the extracted characteristics; and extracting the relation characteristics of the time and the speed of the prediction result by using a recurrent neural network based on the time sequence characteristics of the speed spectrum, and correcting the prediction result according to the relation characteristics to finally obtain an accurate time-speed pair sequence. The invention has the beneficial effects that: the degree of automation is high, the time of seismic data processing is reduced, and the intelligence and physical labor of seismic data processing personnel can be basically and completely liberated.

Description

Seismic velocity spectrum artificial intelligence picking method based on deep learning
Technical Field
The invention relates to the technical field of seismic data processing of seismic exploration, in particular to a seismic velocity spectrum artificial intelligence picking method based on deep learning.
Background
Velocity analysis is the basis for conventional seismic data processing. In the conventional velocity analysis method, a series of velocities at the same interval are given to scan, and a velocity spectrum (fig. 1) is produced with the superposition energy or the similarity coefficient or the like as a criterion for velocity analysis. Energy boluses of the velocity spectrum correspond to strong reflection information and can provide correct kinetic correction velocity. The accuracy of the speed pick-up can directly influence the results of dynamic correction, offset, AVO and time-depth conversion in oil and gas exploration.
The current velocity spectrum picking mode is completed manually, the working efficiency is low, the time consumption is high, and human errors are easy to occur. Therefore, with the continuous development of seismic exploration and the continuous expansion of exploration fields, the number of seismic data to be processed is increased, and higher requirements on the picking efficiency and precision of velocity spectrum in velocity analysis are also provided.
Through the search of the prior art, the Chinese patent with the publication number of CN105445788A provides an automatic velocity spectrum interpretation method based on a model and global optimization, the method has low picking efficiency, the result is often random, and the method is easy to make mistakes.
Another chinese patent, publication No. CN105572733A, provides an automatic seismic velocity spectrum picking method. The method comprises the steps of firstly converting seismic data into two-dimensional grid data, then setting a calculation time window, determining the position of a maximum value in the area of the calculation time window, then defining a speed search threshold value, recording the position of the maximum value of a speed spectrum, connecting a speed curve, and finally carrying out interpolation on the speed curve. The method needs to manually set reasonable calculation time window and speed search threshold according to prior knowledge, and cannot realize intellectualization and automation.
Disclosure of Invention
In view of this, the embodiment of the present invention provides an artificial intelligence picking method for seismic velocity spectrum based on deep learning.
The embodiment of the invention provides an artificial intelligence picking method of a seismic velocity spectrum based on deep learning, which is characterized in that based on the characteristic that the velocity spectrum has images, the characteristics of energy clusters in the velocity spectrum are extracted through a convolutional neural network, and the prediction result of a time-velocity pair sequence corresponding to the energy clusters in the velocity spectrum is calculated from the extracted characteristics; and extracting the relation characteristics of the time and the speed of the prediction result by using a recurrent neural network based on the time sequence characteristics of the speed spectrum, and correcting the prediction result according to the relation characteristics to finally obtain an accurate time-speed pair sequence.
Further, the method comprises the following steps of:
a1: taking the preprocessed original velocity spectrum and the artificial extraction value of the time-velocity pair sequence corresponding to the original velocity spectrum as training samples, training the convolutional neural network, and obtaining a convolutional neural network training model capable of predicting the prediction result;
a2: taking the prediction result and the manually extracted value of the time-speed pair sequence of the original speed spectrum as a new training sample set, training the recurrent neural network, and obtaining a recurrent neural network training model capable of obtaining an accurate time-speed pair sequence;
a3: and sequentially inputting the original velocity spectrum into the convolutional neural network training model and the cyclic neural network training model to obtain an accurate time-velocity pair sequence.
Furthermore, the convolutional neural network comprises a plurality of convolutional layers and a plurality of pooling layers arranged in a cross way with the convolutional layers, each pooling layer is positioned behind the convolutional layer corresponding to the pooling layer, the cyclic neural network further comprises a plurality of full-connection layers positioned behind the last pooling layer, the full-connection layers are arranged in front and at the back, and the last full-connection layer corresponds to the output of the neural network.
Further, the recurrent neural network includes an input layer, an output layer, and a hidden layer between the input layer and the output layer, the input layer corresponding to an output of the convolutional neural network.
Further, the step of refining of step a1 includes:
a1.1: carrying out whitening pretreatment on an original velocity spectrum to obtain a velocity spectrum, wherein the velocity spectrum and a corresponding time-velocity pair sequence thereof form a training data set of a convolutional neural network;
a1.2: randomly shuffling a training data set of the convolutional neural network, grouping the data in the training set, wherein the number of the data in each group is consistent, and sequentially inputting each group into the convolutional neural network, wherein each group of data sequentially passes through a plurality of convolutional layers;
a1.3: extracting and fusing the features on one convolution layer by using a sliding window to obtain convolution feature maps on the convolution layer, wherein each convolution feature map is related to the weight of the convolution layer corresponding to the convolution feature map;
a1.4: inputting the convolution characteristic map into a subsequent pooling layer adjacent to the convolution characteristic map, wherein the pooling layer adopts a maximum pooling algorithm to obtain a pooling characteristic map, and the pooling characteristic map enters another subsequent convolution layer adjacent to the pooling characteristic map;
a1.5: repeating the steps A1.3 and A1.4 until data enters the last convolutional layer, wherein a Dropout layer is inserted in the last convolutional layer, the Dropout layer is used for randomly leading the value of the pooling feature map passing through the Dropout layer to be zero so as to reduce the phenomenon of network overfitting, then the convolutional layer generates a last convolutional feature map, and the convolutional feature map is finally pooled by the last pooling layer to generate a pooling feature map;
a1.6: performing multiple classification identification on the region position of the energy mass of the velocity spectrum based on the pooling characteristic maps of different pooling layers, screening out the real position of the energy mass, and performing multiple detection and inference on the peak value of the energy mass by using a full-connected layer to obtain the actual position of the peak value of the energy mass, namely the prediction result of the time-velocity pair sequence of the velocity spectrum;
a1.7: calculating a cost function of the convolutional neural network, judging whether the cost function of the last full-connection layer is converged, finishing the training of the convolutional neural network if the cost function of the last full-connection layer is converged, and outputting a prediction result of the time-speed pair sequence, otherwise, performing the next step;
a1.8: calculating gradient values of cost functions of all layers in the convolutional neural network by using a back conduction algorithm, calculating error sensitivity of all layers of the convolutional neural network according to the obtained gradient of each layer by using a momentum-based random gradient descent algorithm, and updating the weight of each layer by using the error sensitivity of each layer;
a1.8: and returning to the step 1.2.
Further, the step of refining of step a2 includes:
a2.1: the prediction result and the artificially extracted value of the time-speed pair sequence of the original speed spectrum form a training sample set of the recurrent neural network, and the training sample set of the recurrent neural network is input into the recurrent neural network;
a2.2: the recurrent neural network extracts the relation characteristic of time and speed in the prediction result through the neurons of the recurrent neural network, the relation characteristic is related to the weight of the corresponding layer, and a more accurate time-speed pair sequence is obtained by utilizing the relation characteristic;
a2.3: calculating a cost function of the convolutional neural network, judging whether the cost function of the output layer is converged, if so, finishing the training of the convolutional neural network, and if not, performing the next step;
a2.4: calculating gradient values of cost functions of all layers in the cyclic neural network by using a reverse conduction algorithm, calculating error sensitivity of all layers in the cyclic neural network according to the gradient values of all layers by using an Adam learning rate self-adaptive algorithm, and updating the weight of each layer by using the error sensitivity of each layer;
a2.5: repeat step A2.2.
Further, the training sample set of the recurrent neural network is formed in an input layer of the recurrent neural network, a hidden layer of the recurrent neural network is used for correcting the prediction result, an output layer of the recurrent neural network is used for outputting a more accurate time-speed pair sequence, and the recurrent neural network is a long-short term memory network structure which comprises a forgetting gate, an input gate, an output gate and a cell state.
Further, the step of refining of step a3 includes:
a3.1: carrying out whitening pretreatment on the original velocity spectrum;
a3.2: inputting the preprocessed velocity spectrum into the convolutional neural network training model, wherein the position corresponding to the energy mass in the processed velocity spectrum is the prediction result of the time-speed pair sequence of the processed velocity spectrum;
a3.3: inputting the prediction result of the processed time-speed pair sequence of the velocity spectrum into the recurrent neural network training model, correcting the prediction result by the recurrent neural network training model, and finally outputting an accurate time-speed pair sequence;
a3.4: and combining a plurality of velocity spectrum intelligent pickup results of the same measuring line at different positions, and forming a superimposed velocity profile through interpolation.
The technical scheme provided by the embodiment of the invention has the following beneficial effects: the seismic velocity spectrum artificial intelligence picking method based on the deep learning can (1) carry out artificial intelligence picking on a seismic velocity spectrum by adopting the seismic velocity spectrum artificial intelligence picking method based on the deep learning, and corresponding original seismic velocity spectrum data are input into a convolutional neural network training model and a cyclic neural network training model obtained after training, so that an accurate time-velocity pair sequence can be obtained without any artificial intervention in the period, and the degree of automation is high; (2) the invention needs to spend a lot of time when training the model; after training is finished, the time-velocity curve can be automatically and quickly obtained by inputting the original velocity spectrum data into the training model, the time for processing the seismic data is favorably shortened, and the intelligence and physical labor of seismic data processing personnel can be basically and completely liberated.
Drawings
FIG. 1 is a prior art seismic velocity spectrum;
FIG. 2 is a block diagram of a convolutional neural network and a cyclic neural network of the present invention;
FIG. 3 is a comparison schematic of the velocity spectrum pickup of the present invention;
FIG. 4 is a frame diagram of the seismic velocity spectrum artificial intelligence picking method based on deep learning of the invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, embodiments of the present invention will be further described with reference to the accompanying drawings.
Referring to fig. 2 and 4, an embodiment of the present invention provides an artificial intelligence picking method for seismic velocity spectrum based on deep learning, which mainly includes two stages of learning and intelligent picking, where the learning stage mainly trains a convolutional neural network to obtain a convolutional neural network training model and trains a cyclic neural network to obtain a cyclic neural network training model; in the intelligent picking stage, the accurate time-velocity pair sequence in the original velocity spectrum is obtained by using the convolutional neural network training model and the cyclic neural network training model.
The seismic velocity spectrum artificial intelligence picking method based on deep learning is characterized in that a velocity spectrum has the characteristics of an image, the characteristics of energy clusters in the velocity spectrum are extracted through the convolutional neural network training model, and the prediction result of a time-velocity pair sequence corresponding to the energy clusters in the velocity spectrum is calculated from the extracted characteristics; and extracting the relation characteristics of the time and the speed of the prediction result by utilizing the recurrent neural network training model based on the time sequence characteristics of the speed spectrum, and correcting the prediction result according to the relation characteristics to finally obtain an accurate time-speed pair sequence.
Specifically, the learning phase includes:
a1: and taking the preprocessed original velocity spectrum and the artificial extraction value of the time-velocity pair sequence corresponding to the original velocity spectrum as training samples, training the convolutional neural network, and obtaining a convolutional neural network training model capable of predicting the prediction result.
Referring to fig. 2, the convolutional neural network includes a plurality of convolutional layers and a plurality of pooling layers arranged to intersect with the convolutional layers, each pooling layer is located behind the convolutional layer corresponding to the convolutional layer, the convolutional neural network further includes a plurality of fully-connected layers located behind the last pooling layer, the plurality of fully-connected layers are arranged in front of and behind each other, and the last fully-connected layer corresponds to an output of the neural network.
In this embodiment, the convolutional neural network at least includes 4 convolutional layers, 4 pooling layers, and 3 fully-connected layers.
The convolutional neural network has the specific structure as follows:
the first layer is a convolution layer, 16 convolution kernels are used in the convolution layer, the window size of the convolution kernels is 3 x 3 grids, the moving step length is set to be 2, the activation function is set to be a ReLU function, and 16 convolution characteristic graphs are output;
the second layer is a pooling layer, the kernel window size of the second layer is 2 multiplied by 2 grids, the moving step length is set to be 2, the size of the obtained pooling characteristic graph is changed to be one fourth of the original size after the convolution characteristic graph passes through the pooling layer, the number of the characteristic graphs is not changed, and 16 pooling characteristic graphs are output;
the third layer is a convolution layer, the layer uses 32 convolution kernels, the window size of the convolution kernels is 3 x 3 grids, the moving step length is set to be 2, the activation function is set to be a ReLU function, and 32 convolution characteristic graphs are output;
the fourth layer is a pooling layer, the size of a kernel window of the pooling layer is 2 multiplied by 2 grids, the moving step length is set to be 2, and 32 pooling feature maps are output;
the fifth layer is a convolution layer, the convolution layer uses 64 convolution kernels, the window size of the convolution kernels is 3 x 3 grids, the moving step length is set to be 2, the activation function is set to be a ReLU function, and 64 convolution characteristic graphs are output;
the sixth layer is a pooling layer, the size of a kernel window of the pooling layer is 2 multiplied by 2 grids, the moving step length is set to be 2, and 64 pooling feature maps are output;
the seventh layer is a convolution layer, the layer uses 256 convolution kernels, the window size of the convolution kernels is 3 x 3 grids, the moving step size is set to be 2, the activation function is set to be a ReLU function, meanwhile, a Dropout layer is added, and 256 convolution characteristic graphs are output;
the eighth layer is a pooling layer, the size of a kernel window of the layer is 2 multiplied by 2 grids, the moving step length is set to be 2, and 256 pooling characteristic graphs are output;
the ninth layer is a full connection layer, and 256 neurons are arranged in the layer;
the tenth layer is a full connection layer, and 1024 neurons are arranged in the layer;
the eleventh layer is a fully connected layer, and the number of neurons in the layer corresponds to the number of time-velocity pairs and energy boluses in the velocity spectrum.
The process of training the convolutional neural network is as follows:
a1.1: and performing whitening pretreatment on the original velocity spectrum to obtain a velocity spectrum, wherein the velocity spectrum and a corresponding time-velocity pair sequence form a training data set of the convolutional neural network.
The process of whitening preprocessing the raw velocity spectrum is: an amplitude threshold value of the original velocity spectrum grid points is set, and when the amplitude value of one of the grid points is smaller than the threshold value, the amplitude value of the grid point is set to zero, thereby obtaining a preprocessed velocity spectrum. And manually extracting a time-speed pair sequence of the processed velocity spectrum, wherein the time-speed pair sequence is an estimated value and has relatively low accuracy.
A1.2: randomly shuffling the training data set of the convolutional neural network, grouping the data in the training set, wherein the number of the data in each group is consistent, and sequentially inputting each group into the convolutional neural network, wherein each group of data sequentially passes through a plurality of convolutional layers.
A1.3: and extracting and fusing the features on one convolution layer by using a sliding window to obtain convolution feature maps on the convolution layer, wherein each convolution feature map is related to the weight of the convolution layer corresponding to the convolution feature map.
The forward implementation process of the convolutional layer is as follows:
ai,k=f(xi*wk+bk) (I)
wherein, ai,kIs the ith tensor velocity spectral data x of the training set input via the kth convolution kernel in the convolutional layeriConvolution characteristic graph obtained by convolution processing, representing convolution operation, WkWeight representing the kth convolution, bkFor the threshold corresponding to the kth convolution kernel, f (-) is the ReLU-type activation function:
f(x)=max(0,x) (2)
as can be seen from formula (2), when x is less than or equal to zero, the function value is zero; when x is greater than zero, the function value is still x.
A1.4: and inputting the convolution characteristic map into the subsequent pooling layer adjacent to the convolution characteristic map, wherein the pooling layer adopts a maximum pooling algorithm to obtain a pooling characteristic map, and the pooling characteristic map enters another subsequent convolution layer adjacent to the pooling characteristic map.
And inputting the convolution characteristic diagram generated by the convolution layer into the pooling layer, wherein the pooling layer adopts maximum pooling, the pooling dimension is set to be 2, the moving step length is 2, the size of the pooling characteristic diagram obtained after the convolution characteristic diagram passes through the pooling layer becomes one fourth of the original size, and the number of the characteristic diagrams is unchanged. The maximum pooling uses the following formula:
cj=maxpool(ai,k) (3)
wherein, cjA jth pooling profile generated for the pooling layer.
A1.5: repeating the steps A1.3 and A1.4 until the data enters the last convolution layer, wherein a Dropout layer is inserted in the last convolution layer and used for randomly enabling the value of the pooling feature map passing through the Dropout layer to be zero so as to reduce the phenomenon of network overfitting, and then the convolution layer generates a last convolution feature map which is finally pooled by the last pooling layer to generate a pooling feature map.
The calculation process for randomly letting the pooling profile across the Dropout layer be set to zero is:
Dropout(x)=RandomZero(p)×x (4)
randomzero (p) shows that the value in the data matrix x of the layer is set to be 0 according to the set probability p, and Dropout (x) shows the data matrix obtained after the training stage passes through the Dropout layer.
A1.6: and performing multiple classification and identification on the region position of the energy mass of the velocity spectrum based on the pooling characteristic maps of different pooling layers, screening out the real position of the energy mass, and performing multiple detection and inference on the peak value of the energy mass by using the full-connected layers to obtain the actual position of the peak value of the energy mass, namely the prediction result of the time-velocity pair sequence of the velocity spectrum.
Wherein the forward calculation process of the full connection layer is as follows:
ai,l=f(Wlat,t-1+bi) (5)
in the formula, ai,lAnd the characteristic vector represents the ith tensile velocity spectrum data after passing through the full connecting layer, and the characteristic vector contains the actual position of the energy mass peak.
A1.7: and calculating a cost function of the convolutional neural network, judging whether the cost function of the last full-connection layer is converged, finishing the training of the convolutional neural network if the cost function of the last full-connection layer is converged, and outputting a prediction result of the time-speed pair sequence, otherwise, performing the next step.
The calculation formula of the cost function of the convolutional neural network is as follows:
Figure GDA0003067190590000101
wherein, yiDenotes xiThe corresponding objective, f (-) represents the function of the forward computation of all convolutional layers, L (-) represents the loss function, m is the number of input vectors, and θ represents the parameter of the convolutional neural network.
A1.8: calculating gradient values of cost functions of all layers in the convolutional neural network by using a back conduction algorithm, calculating error sensitivity of all layers of the convolutional neural network according to the obtained gradient of each layer by using a momentum-based random gradient descent algorithm, and updating the weight of each layer by using the error sensitivity of each layer.
If the current l layer is a fully connected layer, the residual calculation formula of the current l layer is as follows:
δl=(Wi)7δl+1fl(at) (7)
parameter WlThe gradient of (a) is calculated as:
Figure GDA0003067190590000102
the gradient calculation formula of the parameter b is as follows:
Figure GDA0003067190590000111
wherein, deltal+1Is the residual error of the (l + 1) th layer in the network, J (W, b; x, y) is the cost function, W, b, x, y respectively represent the weight, the threshold, the training input and the corresponding target, and T is the matrix transposition.
If the current layer is a convolutional layer, the residual calculation formula of the current layer is as follows:
δi=δl+1*rot180(Wl)fl(al) (10)
wherein, rot180 (W)l) Represents WlA rotation of 180 degrees is required.
If the current l layer is a pooling layer, the residual calculation formula of the current l layer is as follows:
δl=upsample((Wl)7δl+1)fl(al) (11)
where the upsamplle (-) function represents a function of the amplification and error redistribution of the pooled error matrix.
The update formula of the momentum random gradient descent algorithm is as follows:
Figure GDA0003067190590000112
wherein the content of the first and second substances,
Figure GDA0003067190590000113
representing the gradient of a parameter theta calculated from x, y in the training set, alpha being the learning rate, gamma being the momentum parameter, vtIs the current velocity vector, vt-1Is the velocity vector in the previous iteration. Alpha is initially set to 0.1, vtThe initial setting is 0, gamma is between the intervals (0, 1), gamma is set to 0.5 in the initial training stage, and becomes larger as the number of iterations increases.
A1.8: and returning to the step 1.2.
A2: and taking the prediction result and the manually extracted value of the time-speed pair sequence of the original speed spectrum as a new training sample set, training the recurrent neural network, and obtaining a recurrent neural network training model capable of obtaining an accurate time-speed pair sequence.
The recurrent neural network comprises an input layer, an output layer and a hidden layer positioned between the input layer and the output layer, wherein the input layer corresponds to the output of the convolutional neural network. The input layer vector dimensions correspond to the output of a convolutional neural network; the hidden layer uses at least 20 neurons; the output layer is a time-speed pair and has the same number as the neurons of the eleventh layer of the convolutional neural network.
The process of training the recurrent neural network is as follows:
a2.1: and (3) forming a training sample set of the recurrent neural network by the prediction result and the artificially extracted value of the time-velocity pair sequence of the original velocity spectrum, and inputting the training sample set of the recurrent neural network into the recurrent neural network.
A2.2: and the recurrent neural network extracts the relation characteristic of time and speed in the prediction result through the neurons of the recurrent neural network, the relation characteristic is related to the weight of the corresponding layer, and a more accurate time-speed pair sequence is obtained by utilizing the relation characteristic.
The recurrent neural network is a long-short term memory (LSTM) network structure, which comprises a forgetting gate, an input gate, an output gate and a cell state, and the forward calculation formula is as follows:
f(t)=σ(Wfh(t-1)+Ufx(z)+bf) (13)
i(t)=σ(Wih(t-1)+Uix(t)+bi) (14)
a(t)=tanh(Wah(t-1)+Uax(t)+ba) (15)
C(t)=C(t-1)⊙f(t)+i(t)⊙a(t) (16)
o(t)=σ(Woh(t-1)+Uox(t)+bo) (17)
h(t)=o(t)⊙tanh(C(t)) (18)
Figure GDA0003067190590000121
wherein, the formula (13) is a mathematical expression of the LSTM forgetting door, f(t)Indicates the probability of forgetting the state of the previous layer of hidden cells, h(t-1)Representing the hidden state of the previous sequence, x(t)Representing the input of the current sequence, Wf,Uf,bfRespectively, a weight matrix and an offset vector, and sigma is a sigmoid activation function.
The LSTM input gate is composed of two parts, and the expressions (14) and (15) are mathematical expressions of the two parts respectively. i.e. i(t),a(t)Respectively representing the outputs of the two parts. WiAnd UiWeight matrix representing input gates, biRepresenting the offset vector of the input gate, WaAnd UaWeight matrix representing input gates, baRepresenting the offset vector of the input gate.
Equation (16) is a mathematical expression for updating the state of cells, where C(t)Indicating the current cell state, c(t-1)Indicates the cell state of the previous sequence, which indicates the Hadamard product.
Equations (17), (18) represent two-part mathematical expressions for the output gate, the first part being o(t)I.e. a hidden neuron which is hidden by the previous sequence of hidden states h(t-1)And the input vector of the current sequence and the activation function sigmoid, and the second part h(t)Indicating the current hidden state, which is represented by the current cell state C(t)Tan h activation function and o(t)And (4) forming. Hidden neurons and hidden states hide velocity, time information.
Equation (19) is a mathematical expression for updating the prediction output of the current sequence.
Figure GDA0003067190590000131
For the prediction output of the current sequence, V and c represent the weight matrix and the offset vector of the prediction output, respectively.
A2.3: and calculating a cost function of the convolutional neural network, judging whether the cost function of the output layer is converged, finishing the training of the convolutional neural network if the cost function of the output layer is converged, and otherwise, performing the next step.
A2.4: and calculating gradient values of cost functions of all layers in the cyclic neural network by using a reverse conduction algorithm, calculating error sensitivity of all layers in the cyclic neural network according to the gradient values of all layers by using an Adam learning rate self-adaptive algorithm, and updating the weight of each layer by using the error sensitivity of each layer.
The gradient value update formula is as follows:
Figure GDA0003067190590000132
Figure GDA0003067190590000133
equation (20) represents the hidden state h(t)The expression of the gradient calculation of (a),
Figure GDA0003067190590000134
in a hidden state h(t)L represents a cost function,formula (21) represents cell State C(t)The expression of the gradient calculation of (a),
Figure GDA0003067190590000135
is in a cellular state C(t)The gradient value of (a). Equation (22) represents the weight matrix W in the forgetting gatefThe gradient update expression of (1):
Figure GDA0003067190590000136
where τ denotes an index value of the current sequence, and the gradient update expression of other parameters is similar to expression (22).
The updating formula of the Adam learning rate self-adaptive algorithm is as follows:
Figure GDA0003067190590000141
where η represents the step size, set to 0.001,. epsilon.a constant for numerical stability, set to 10-8,
Figure GDA0003067190590000142
denotes vtThe correction value for the first moment estimate,
Figure GDA0003067190590000143
represents mtCorrection values of second-order moment estimates, vtAnd mtThe following were used:
vt=ρ1 vt-1+(1-ρ1)L
mt=ρ2mt-1+(1-ρ2)L⊙L
where ρ is1Exponential decay Rate, ρ, representing an estimate of the first moment2Exponential decay Rate, ρ, representing an estimate of the second moment1And ρ2In the interval [0,1), L represents a cost function, and L represents a Hadamard product.
A2.5: repeat step A2.2.
The intelligent pick-up phase comprises:
a3: and sequentially inputting the original velocity spectrum into the convolutional neural network training model and the cyclic neural network training model to obtain an accurate time-velocity pair sequence.
The method specifically comprises the following steps:
a3.1: the raw velocity spectrum is subjected to a whitening pre-processing.
A3.2: and inputting the preprocessed velocity spectrum into the convolutional neural network training model, wherein the position corresponding to the energy mass in the processed velocity spectrum is the prediction result of the time-speed pair sequence of the processed velocity spectrum.
A3.3: and inputting the prediction result of the processed time-speed pair sequence of the velocity spectrum into the recurrent neural network training model, correcting the prediction result by the recurrent neural network training model, and finally outputting an accurate time-speed pair sequence.
A3.4: and combining a plurality of velocity spectrum intelligent pickup results of the same measuring line at different positions, and forming a superimposed velocity profile through interpolation.
FIG. 1 is a prior art seismic velocity spectrum and FIG. 3 is a comparison schematic of the velocity spectrum pickup of the present invention.
In this document, the terms front, back, upper and lower are used to define the components in the drawings and the positions of the components relative to each other, and are used for clarity and convenience of the technical solution. It is to be understood that the use of the directional terms should not be taken to limit the scope of the claims.
The features of the embodiments and embodiments described herein above may be combined with each other without conflict.
The above description is only for the purpose of illustrating the preferred embodiments of the present invention and is not to be construed as limiting the invention, and any modifications, equivalents, improvements and the like that fall within the spirit and principle of the present invention are intended to be included therein.

Claims (7)

1. An artificial intelligence picking method of seismic velocity spectrum based on deep learning is characterized in that: extracting the characteristics of energy clusters in the velocity spectrum through a convolutional neural network based on the characteristics of the velocity spectrum with images, and calculating the prediction result of a time-velocity pair sequence corresponding to the energy clusters in the velocity spectrum from the extracted characteristics; based on the time sequence characteristics of the velocity spectrum, extracting the relation characteristics of time and velocity of the prediction result by using a recurrent neural network, and correcting the prediction result according to the relation characteristics to finally obtain an accurate time-velocity pair sequence;
the method comprises the following refining steps:
a1: taking the preprocessed original velocity spectrum and the artificial extraction value of the time-velocity pair sequence corresponding to the original velocity spectrum as training samples, training the convolutional neural network, and obtaining a convolutional neural network training model capable of predicting the prediction result;
the step of refining step a1 includes:
a1.1: carrying out whitening pretreatment on an original velocity spectrum to obtain a velocity spectrum, wherein the velocity spectrum and a corresponding time-velocity pair sequence thereof form a training data set of a convolutional neural network;
a1.2: randomly shuffling a training data set of the convolutional neural network, grouping the data in the training set, wherein the number of the data in each group is consistent, and sequentially inputting each group into the convolutional neural network, wherein each group of data sequentially passes through a plurality of convolutional layers;
a1.3: extracting and fusing the features on one convolution layer by using a sliding window to obtain convolution feature maps on the convolution layer, wherein each convolution feature map is related to the weight of the convolution layer corresponding to the convolution feature map;
a1.4: inputting the convolution characteristic graph into a subsequent pooling layer adjacent to the convolution characteristic graph, wherein the pooling layer adopts a maximum pooling algorithm to obtain a pooling characteristic graph which enters another subsequent convolution layer adjacent to the pooling characteristic graph;
a1.5: repeating the steps A1.3 and A1.4 until data enters the last convolutional layer, wherein a Dropout layer is inserted in the last convolutional layer, the Dropout layer is used for randomly leading the value of the pooling feature map passing through the Dropout layer to be zero so as to reduce the phenomenon of network overfitting, then the convolutional layer generates a last convolutional feature map, and the convolutional feature map is finally pooled by the last pooling layer to generate a pooling feature map;
a1.6: performing multiple classification identification on the region position of the energy mass of the velocity spectrum based on the pooling characteristic maps of different pooling layers, screening out the real position of the energy mass, and performing multiple detection and inference on the peak value of the energy mass by using a full-connected layer to obtain the actual position of the peak value of the energy mass, namely the prediction result of the time-velocity pair sequence of the velocity spectrum;
a1.7: calculating a cost function of the convolutional neural network, judging whether the cost function of the last full-connection layer is converged, finishing the training of the convolutional neural network if the cost function of the last full-connection layer is converged, and outputting a prediction result of the time-speed pair sequence, otherwise, performing the next step;
a1.8: calculating gradient values of cost functions of all layers in the convolutional neural network by using a back conduction algorithm, calculating error sensitivity of all layers of the convolutional neural network according to the obtained gradient of each layer by using a momentum-based random gradient descent algorithm, and updating the weight of each layer by using the error sensitivity of each layer;
a1.9: and returning to the step 1.2.
2. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 1, characterized in that: further comprising the following refining steps:
a2: taking the prediction result and the manually extracted value of the time-speed pair sequence of the original speed spectrum as a new training sample set, training the recurrent neural network, and obtaining a recurrent neural network training model capable of obtaining an accurate time-speed pair sequence;
a3: and sequentially inputting the original velocity spectrum into the convolutional neural network training model and the cyclic neural network training model to obtain an accurate time-velocity pair sequence.
3. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 2, characterized in that: the convolutional neural network comprises a plurality of convolutional layers and a plurality of pooling layers which are arranged in a crossed mode with the convolutional layers, each pooling layer is located behind the convolutional layer corresponding to the pooling layer, the convolutional neural network further comprises a plurality of full-connection layers located behind the last pooling layer, the full-connection layers are arranged in a front-back mode, and the last full-connection layer corresponds to the output of the neural network.
4. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 3, characterized in that: the recurrent neural network comprises an input layer, an output layer and a hidden layer positioned between the input layer and the output layer, wherein the input layer corresponds to the output of the convolutional neural network.
5. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 4, characterized in that: the step of refining step a2 includes:
a2.1: the prediction result and the artificially extracted value of the time-speed pair sequence of the original speed spectrum form a training sample set of the recurrent neural network, and the training sample set of the recurrent neural network is input into the recurrent neural network;
a2.2: the recurrent neural network extracts the relation characteristic of time and speed in the prediction result through the neurons of the recurrent neural network, the relation characteristic is related to the weight of the corresponding layer, and a more accurate time-speed pair sequence is obtained by utilizing the relation characteristic;
a2.3: calculating a cost function of the convolutional neural network, judging whether the cost function of the output layer is converged, if so, finishing the training of the convolutional neural network, and if not, performing the next step;
a2.4: calculating gradient values of cost functions of all layers in the cyclic neural network by using a reverse conduction algorithm, calculating error sensitivity of all layers in the cyclic neural network according to the gradient values of all layers by using an Adam learning rate self-adaptive algorithm, and updating the weight of each layer by using the error sensitivity of each layer;
a2.5: repeat step A2.2.
6. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 5, characterized in that: the training sample set of the recurrent neural network is formed on an input layer of the recurrent neural network, a hidden layer of the recurrent neural network is used for correcting the prediction result, an output layer of the recurrent neural network is used for outputting a more accurate time-speed pair sequence, and the recurrent neural network is a long-short term memory network structure which comprises a forgetting gate, an input gate, an output gate and a cell state.
7. The seismic velocity spectrum artificial intelligence picking method based on deep learning of claim 4, characterized in that: the step of refining step a3 includes:
a3.1: carrying out whitening pretreatment on the original velocity spectrum;
a3.2: inputting the preprocessed velocity spectrum into the convolutional neural network training model, wherein the position corresponding to the energy mass in the processed velocity spectrum is the prediction result of the time-speed pair sequence of the processed velocity spectrum;
a3.3: inputting the prediction result of the processed time-speed pair sequence of the velocity spectrum into the recurrent neural network training model, correcting the prediction result by the recurrent neural network training model, and finally outputting an accurate time-speed pair sequence;
a3.4: and combining a plurality of velocity spectrum intelligent pickup results of the same measuring line at different positions, and forming a superimposed velocity profile through interpolation.
CN201810260667.3A 2018-03-27 2018-03-27 Seismic velocity spectrum artificial intelligence picking method based on deep learning Active CN108596327B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810260667.3A CN108596327B (en) 2018-03-27 2018-03-27 Seismic velocity spectrum artificial intelligence picking method based on deep learning

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810260667.3A CN108596327B (en) 2018-03-27 2018-03-27 Seismic velocity spectrum artificial intelligence picking method based on deep learning

Publications (2)

Publication Number Publication Date
CN108596327A CN108596327A (en) 2018-09-28
CN108596327B true CN108596327B (en) 2021-08-03

Family

ID=63624703

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810260667.3A Active CN108596327B (en) 2018-03-27 2018-03-27 Seismic velocity spectrum artificial intelligence picking method based on deep learning

Country Status (1)

Country Link
CN (1) CN108596327B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110007366B (en) * 2019-03-04 2020-08-25 中国科学院深圳先进技术研究院 Life searching method and system based on multi-sensor fusion
CN110471111B (en) * 2019-09-06 2020-04-14 中国海洋大学 Velocity spectrum automatic picking method based on convolutional neural network
CN112540404B (en) * 2019-09-20 2024-04-12 中国石油化工股份有限公司 Automatic speed analysis method and system based on deep learning
CN110826691A (en) * 2019-10-14 2020-02-21 中国地质大学(武汉) Intelligent seismic velocity spectrum pickup method based on YOLO and LSTM
CN110688722B (en) * 2019-10-17 2023-08-08 深制科技(苏州)有限公司 Automatic generation method of part attribute matrix based on deep learning
CN110889494A (en) * 2019-11-29 2020-03-17 北京龙田华远科技有限公司 Coal mining machine positioning method and device
CN111257934B (en) * 2020-01-17 2022-03-11 哈尔滨工业大学 Seismic oscillation peak acceleration prediction method based on second-order neuron deep neural network
CN111461172B (en) * 2020-03-04 2023-05-30 哈尔滨工业大学 Lightweight characteristic fusion method of hyperspectral remote sensing data based on two-dimensional point group convolution
CN112285776B (en) * 2020-10-23 2021-07-13 中国矿业大学(北京) Seismic velocity automatic picking method based on deep learning
CN113705084A (en) * 2021-08-02 2021-11-26 北京工业大学 Seismic simulation vibration table closed-loop control method based on deep learning
CN116047604B (en) * 2023-02-23 2023-10-17 成都理工大学 Deep seismic phase rapid pickup method based on amplitude statistics and time-frequency analysis

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102073064A (en) * 2009-11-25 2011-05-25 中国石油天然气集团公司 Method for improving velocity spectrum resolution by using phase information
CN102606151A (en) * 2012-04-01 2012-07-25 中国石油大学(北京) Method and device for predicting rock drillability of wildcat well before drilling
CN103675900A (en) * 2012-09-04 2014-03-26 中国石油天然气集团公司 Method for determining optimum velocity profile of converted-wave pre-stack time migration during seismic data processing process

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7447115B2 (en) * 2006-12-05 2008-11-04 Westerngeco L.L.C. Processing seismic data using interferometry techniques

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102073064A (en) * 2009-11-25 2011-05-25 中国石油天然气集团公司 Method for improving velocity spectrum resolution by using phase information
CN102606151A (en) * 2012-04-01 2012-07-25 中国石油大学(北京) Method and device for predicting rock drillability of wildcat well before drilling
CN103675900A (en) * 2012-09-04 2014-03-26 中国石油天然气集团公司 Method for determining optimum velocity profile of converted-wave pre-stack time migration during seismic data processing process

Also Published As

Publication number Publication date
CN108596327A (en) 2018-09-28

Similar Documents

Publication Publication Date Title
CN108596327B (en) Seismic velocity spectrum artificial intelligence picking method based on deep learning
CN109508655B (en) SAR target recognition method based on incomplete training set of twin network
CN109492822B (en) Air pollutant concentration time-space domain correlation prediction method
CN110321603B (en) Depth calculation model for gas path fault diagnosis of aircraft engine
CN114092832B (en) High-resolution remote sensing image classification method based on parallel hybrid convolutional network
CN106022954B (en) Multiple BP neural network load prediction method based on grey correlation degree
CN112949828A (en) Graph convolution neural network traffic prediction method and system based on graph learning
CN110728698A (en) Multi-target tracking model based on composite cyclic neural network system
CN107832789B (en) Feature weighting K nearest neighbor fault diagnosis method based on average influence value data transformation
CN110766060B (en) Time series similarity calculation method, system and medium based on deep learning
CN115758212A (en) Mechanical equipment fault diagnosis method based on parallel network and transfer learning
CN110455512B (en) Rotary mechanical multi-integration fault diagnosis method based on depth self-encoder DAE
CN113869563A (en) Method for predicting remaining life of aviation turbofan engine based on fault feature migration
CN114548591A (en) Time sequence data prediction method and system based on hybrid deep learning model and Stacking
CN115689008A (en) CNN-BilSTM short-term photovoltaic power prediction method and system based on ensemble empirical mode decomposition
CN111723523B (en) Estuary surplus water level prediction method based on cascade neural network
CN113887342A (en) Equipment fault diagnosis method based on multi-source signals and deep learning
CN110223342B (en) Space target size estimation method based on deep neural network
CN114166509A (en) Motor bearing fault prediction method
CN115290326A (en) Rolling bearing fault intelligent diagnosis method
CN114357372A (en) Aircraft fault diagnosis model generation method based on multi-sensor data driving
CN114596726A (en) Parking position prediction method based on interpretable space-time attention mechanism
CN115761654B (en) Vehicle re-identification method
CN115206455B (en) Deep neural network-based rare earth element component content prediction method and system
CN115083511A (en) Peripheral gene regulation and control feature extraction method based on graph representation learning and attention

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant