CN115660221A - Oil and gas reservoir economic recoverable reserve assessment method and system based on hybrid neural network - Google Patents

Oil and gas reservoir economic recoverable reserve assessment method and system based on hybrid neural network Download PDF

Info

Publication number
CN115660221A
CN115660221A CN202211560212.6A CN202211560212A CN115660221A CN 115660221 A CN115660221 A CN 115660221A CN 202211560212 A CN202211560212 A CN 202211560212A CN 115660221 A CN115660221 A CN 115660221A
Authority
CN
China
Prior art keywords
characteristic
data
neural network
feature
variables
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202211560212.6A
Other languages
Chinese (zh)
Other versions
CN115660221B (en
Inventor
张剑
郝翱枭
杨云
李坤
盛行
李梓涵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Southwest Petroleum University
Original Assignee
Southwest Petroleum University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Southwest Petroleum University filed Critical Southwest Petroleum University
Priority to CN202211560212.6A priority Critical patent/CN115660221B/en
Publication of CN115660221A publication Critical patent/CN115660221A/en
Application granted granted Critical
Publication of CN115660221B publication Critical patent/CN115660221B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A10/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE at coastal zones; at river basins
    • Y02A10/40Controlling or monitoring, e.g. of flood or hurricane; Forecasting, e.g. risk assessment or mapping

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses an oil and gas reservoir economic recoverable reserve assessment method and system based on a hybrid neural network, wherein the method comprises the steps of obtaining original data in the development process of an oil and gas reservoir; respectively preprocessing the acquired original data according to the data types; analyzing the importance of each type of feature data set by adopting a neural network model; constructing a hybrid neural network model, and performing model training on the hybrid neural network model by taking main characteristic variables and composite characteristic variables of various characteristic data sets as input characteristic variables; and predicting the economic recoverable reserves of the oil and gas reservoir according to the trained mixed neural network model. According to the method, various types of characteristic parameters are taken as influence factors, characteristic selection and characteristic extraction are carried out in a mode suitable for different types of data, and various types of data are obtained through characteristic fusion, so that the accuracy of the reserve evaluation result can be effectively improved.

Description

Oil-gas reservoir economic recoverable reserve evaluation method and system based on hybrid neural network
Technical Field
The invention relates to the technical field of economic recoverable reserve assessment of oil and gas reservoirs, in particular to an oil and gas reservoir economic recoverable reserve assessment method and system based on a hybrid neural network.
Background
Reserve assessment is a very important task in the development and management of oil and gas reservoirs, and particularly, economic recoverable reserves directly affect the economic benefit and profitability of oil field companies. The method is particularly important for scientifically and accurately evaluating the economical and collectable reserves, and has important practical significance for making medium-long term development plans, improving development benefits and ensuring sustainable development.
Economic reserves refer to reserves recognized by economic evaluation as having commercial benefits over a period of time. The method is generally characterized in that the evaluation is carried out according to the international oil and gas prices issued by famous oil and gas fields with similar oil and gas properties and the current market conditions in the evaluation period, the recoverable reserves are confirmed to be feasible in mining technology, reasonable in economy, allowed by other conditions such as environment and the like, and the reserves and the gains can meet the requirement of investment return in the evaluation period.
Reserve assessment is similar to yield prediction, and conventional methods mainly include statistical analysis methods such as yield decreasing curve method and water drive curve method, including conventional Arps decreasing prediction model and decreasing analysis model improved on the basis of the conventional Arps decreasing prediction model. However, the curve fitting method is performed on the basis of many assumptions and experiences, has more subjective factors, does not consider the influence of various objective conditions, and the obtained result lacks stability and generalization along with the change of experience or human factors, and the result may have deviation when applied to different oil wells. In addition, the oil reservoir numerical simulation method is a main method for predicting oil reservoir development indexes at home and abroad at present, is a typical physical driving data analysis method, can consider more factors more carefully, and has a more objective prediction result than an oil reservoir engineering. In addition, another type of method for economically recoverable reserves is a formula method that considers only economic influencing factors, such as the cash flow method, the economic limit method, the well pattern density method, the marginal cost method, the analog method, and the like. Although these methods have some feasibility, they are subjectively influenced by human subjects and have differences in the applicability of different reservoirs, which limits their applicability to problems involving mass data sets.
Machine learning and neural network technology are the mainstream development direction in the field of artificial intelligence, and the analysis of oil and gas data by using related algorithms becomes a research hotspot. Although some algorithms have been successfully applied to the fields of lithology identification, well log interpretation, oil price prediction, etc., the application of such algorithms to economic recoverable reserve assessment has been inadequate and typically only a single type of data is considered as input. The economic recoverable reserve evaluation is a complex multivariable nonlinear system, has multiple data types and large data volume, relates to parameters in multiple aspects such as oil reservoir geological characteristics, mining processes, historical yield, economic factors and the like, and a single prediction system may have poor effect.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides an oil and gas reservoir economic recoverable reserve evaluation method and system based on a hybrid neural network.
In order to achieve the purpose of the invention, the invention adopts the technical scheme that:
in a first aspect, the invention provides a method for evaluating the economic recoverable reserves of an oil and gas reservoir based on a hybrid neural network, which comprises the following steps:
s1, acquiring original data in an oil and gas reservoir development process;
s2, respectively preprocessing the acquired original data according to data types to obtain various feature data sets;
s3, analyzing the importance of each type of feature data set by adopting a neural network model, dividing feature variables in each type of feature data set into a main feature variable, a secondary feature variable and an invalid feature variable according to the importance, and selecting the secondary feature variable to construct a composite feature variable;
s4, constructing a hybrid neural network model, and performing model training on the hybrid neural network model by using main characteristic variables and composite characteristic variables of various characteristic data sets as input characteristic variables;
and S5, predicting the economic recoverable reserves of the oil and gas reservoirs according to the trained mixed neural network model.
Optionally, the raw data comprises:
geological parameter data, process parameter data, yield history data, economic parameter data and logging curve data.
Optionally, step S2 specifically includes the following sub-steps:
s2-1, performing data cleaning, data filling and standardization processing on numerical data of geological parameter data and process parameter data, coding classification labels of the geological parameter data and the process parameter data by adopting one-hot coding, and combining the classification labels with an economic and collectable reserve value to obtain a first-class characteristic data set;
s2-2, performing data cleaning, data filling and standardization processing on the yield historical data and the economic parameter data, and combining the data with the economic recoverable reserve value to obtain a second type characteristic data set;
and S2-3, carrying out data cleaning, data filling and standardization processing on the logging curve data, and combining the data with the economic recoverable reserve value to obtain a third type characteristic data set.
Optionally, step S3 specifically includes the following sub-steps:
s3-1, constructing a full-connection neural network model, setting hyper-parameters, and training and testing the full-connection neural network model by using a first class characteristic data set to obtain model parameters when the iteration of the full-connection neural network model is finished;
s3-2, calculating the importance characteristic value of each characteristic variable according to the model parameters obtained in the step S3-1;
s3-3, dividing all the characteristic variables into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to the importance characteristic value calculated in the step S3-2, and constructing a composite characteristic variable according to the secondary characteristic variable;
and S3-4, processing the economic parameters in the second type characteristic data set by adopting the methods from the steps S3-1 to S3-3 to obtain main characteristic variables and composite characteristic variables corresponding to the economic parameters, taking the yield historical data in the second type characteristic data set and the main characteristic variables corresponding to the economic parameters as the main characteristic variables of the second type characteristic data set, and taking the composite characteristic variables corresponding to the economic parameters as the composite characteristic variables of the second type characteristic data set.
Optionally, the model parameters include:
an input layer-hidden layer connection weight, a hidden layer-output layer connection weight, a value of an input layer neuron, an output value of a hidden layer neuron, an output value of an output layer neuron.
Optionally, step S3-2 specifically includes the following sub-steps:
s3-2-1, selecting a first number of fully-connected neural network models with optimal performance from test results of the fully-connected neural network models;
s3-2-2, calculating a total connection weight value corresponding to each characteristic variable according to the input layer-hidden layer connection weight and the hidden layer-output layer connection weight corresponding to each characteristic variable
Figure 612791DEST_PATH_IMAGE001
S3-2-3, calculating the connection weight product corresponding to each characteristic variable according to the input layer-hidden layer connection weight and the hidden layer-output layer connection weight corresponding to each characteristic variable
Figure 248172DEST_PATH_IMAGE002
S3-2-4, connecting weights of the input layer and the hidden layer according to the input layer and the hidden layer corresponding to each characteristic variable
Figure 669926DEST_PATH_IMAGE003
Input layer neuron values
Figure 732560DEST_PATH_IMAGE004
And output values of hidden layer neurons
Figure 872554DEST_PATH_IMAGE005
Calculate input layer numberiThe first neuron to the hidden layerjInfluence value of individual neuron
Figure 311625DEST_PATH_IMAGE006
S3-2-5, connecting weights of hidden layers and output layers according to the hidden layer-output layer corresponding to each characteristic variable
Figure 587886DEST_PATH_IMAGE007
Hidden layer neuron output values
Figure 588465DEST_PATH_IMAGE005
And output values of output layer neurons
Figure 950177DEST_PATH_IMAGE008
Computing hidden layer numberjInfluence value of individual neurons on output layer neurons
Figure 927360DEST_PATH_IMAGE009
S3-2-6, calculating the input layer number corresponding to each characteristic variable by adopting the following formulaiInfluence value of each neuron on the output value:
Figure 323706DEST_PATH_IMAGE010
wherein the content of the first and second substances,Mthe number of neurons in the hidden layer;
s3-2-7, connecting the absolute value of the weight according to the input layer-hidden layer corresponding to each characteristic variable
Figure 728143DEST_PATH_IMAGE011
Absolute value of hidden layer-output layer connection weight
Figure 842729DEST_PATH_IMAGE012
Calculating the product thereof
Figure 358024DEST_PATH_IMAGE013
S3-2-8, calculating the importance of each characteristic variable to each neuron of the hidden layer based on the weight absolute value by adopting the following formula
Figure 608877DEST_PATH_IMAGE014
Wherein, in the step (A),Nthe number of neurons in the input layer;
s3-2-9, calculating the importance characteristic value of each characteristic variable to the output layer based on the weight absolute value by adopting the following formula:
Figure 449794DEST_PATH_IMAGE015
wherein the content of the first and second substances,Mthe number of hidden layer neurons.
Optionally, step S3-3 specifically includes the following sub-steps:
s3-3-1, respectively calculating according to the calculation results of the step S3-2
Figure 786097DEST_PATH_IMAGE016
Figure 370662DEST_PATH_IMAGE017
Figure 974557DEST_PATH_IMAGE018
Figure 720796DEST_PATH_IMAGE019
Average value of (2)
Figure 544396DEST_PATH_IMAGE020
Figure 667072DEST_PATH_IMAGE021
Figure 626938DEST_PATH_IMAGE022
Figure 809658DEST_PATH_IMAGE023
S3-3-2, respectively calculating the relative importance characteristic value of each characteristic variable by adopting the following formula:
Figure 120553DEST_PATH_IMAGE024
Figure 46921DEST_PATH_IMAGE025
Figure 861293DEST_PATH_IMAGE026
Figure 480493DEST_PATH_IMAGE027
wherein the content of the first and second substances,nis a characteristic variable serial number;
s3-3-3, respectively pairing all characteristic variables
Figure 13106DEST_PATH_IMAGE028
Figure 477585DEST_PATH_IMAGE029
Figure 913508DEST_PATH_IMAGE030
Figure 438030DEST_PATH_IMAGE031
Four times of sorting is carried out according to the sizes to generate four relative importance sorting tables;
s3-3-4, dividing the four relative importance ranking tables into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to a threshold value;
and S3-3-5, calculating the mean value of the four relative importance characteristic values of each secondary characteristic variable and the weight mean value proportion of the four relative importance characteristic values in all the secondary characteristic variables, and performing weighted summation on all the secondary characteristic variables to construct a composite characteristic variable.
Optionally, the hybrid neural network model specifically includes:
the device comprises a parallel feature extraction channel and a full-connection neural network, wherein the parallel feature extraction channel consists of a first feature extraction channel, a second feature extraction channel and a third feature extraction channel;
the first feature extraction channel is used for inputting main feature variables and composite feature variables of the first-class feature data set, and extracting numerical values of geological parameters and process parameters and feature vectors of classification labels;
the second feature extraction channel is used for inputting main feature variables and composite feature variables of a second type of feature data set and extracting feature vectors of economic parameters and yield historical data;
the third feature extraction channel is used for inputting a third type of feature data set and extracting feature vectors of logging curve data;
and the fully-connected neural network is used for performing characteristic cascade on the characteristic vectors extracted by the parallel characteristic extraction channel connection to serve as input characteristic vectors and predicting the economic recoverable reserves of the oil and gas reservoirs.
Optionally, the hybrid neural network model is iteratively trained by using an optimized adaptive momentum method, where an iterative update formula is:
Figure 723518DEST_PATH_IMAGE032
Figure 991688DEST_PATH_IMAGE033
wherein the content of the first and second substances,
Figure 780653DEST_PATH_IMAGE034
is the network weight at the next iteration,
Figure 741656DEST_PATH_IMAGE035
for the network weight of the current iteration,
Figure 983281DEST_PATH_IMAGE036
Figure 789563DEST_PATH_IMAGE037
in order to adapt the hyper-parameters adaptively,
Figure 698613DEST_PATH_IMAGE038
is a matrix of partial derivatives of the network error against the weights,
Figure 564938DEST_PATH_IMAGE040
is that
Figure 559439DEST_PATH_IMAGE042
The transpose matrix of (a) is,
Figure 667947DEST_PATH_IMAGE043
is a scale factor, m is a penalty factor,Iis an identity matrix, E is a network error vector,
Figure 431504DEST_PATH_IMAGE044
is the network weight at the last iteration,kis the current iteration number.
In a second aspect, the present invention provides a system for evaluating economic recoverable reserves of hydrocarbon reservoirs based on a hybrid neural network, which applies the method described above, and comprises:
the data acquisition module is used for acquiring original data in the development process of the oil and gas reservoir;
the data preprocessing module is used for respectively preprocessing the acquired original data according to data types to obtain various feature data sets;
the characteristic extraction module is used for analyzing the importance of each type of characteristic data set by adopting a neural network model, dividing characteristic variables in each type of characteristic data set into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to the importance, and selecting the secondary characteristic variable to construct a composite characteristic variable;
the model training module is used for constructing a hybrid neural network model and performing model training on the hybrid neural network model by taking the main characteristic variables and the composite characteristic variables of various characteristic data sets as input characteristic variables;
and the data prediction module is used for predicting the economic recoverable reserves of the oil and gas reservoir according to the trained hybrid neural network model.
The invention has the following beneficial effects:
according to the method, various types of characteristic parameters are taken as influence factors, characteristic selection and characteristic extraction are carried out in a mode suitable for different types of data, and various types of data are obtained through characteristic fusion, so that the accuracy of the reserve evaluation result can be effectively improved.
Drawings
FIG. 1 is a schematic flow chart of a method for evaluating the economic recoverable reserves of a hydrocarbon reservoir based on a hybrid neural network in embodiment 1 of the present invention;
FIG. 2 is a schematic structural diagram of a hybrid neural network model in embodiment 1 of the present invention;
fig. 3 is a schematic structural diagram of a system for evaluating an economic recoverable reserve of a hydrocarbon reservoir based on a hybrid neural network in embodiment 2 of the present invention.
Detailed Description
The following description of the embodiments of the present invention is provided to facilitate the understanding of the present invention by those skilled in the art, but it should be understood that the present invention is not limited to the scope of the embodiments, and it will be apparent to those skilled in the art that various changes may be made without departing from the spirit and scope of the invention as defined and defined in the appended claims, and all matters produced by the invention using the inventive concept are protected.
Example 1
As shown in fig. 1, an embodiment of the present invention provides a method for evaluating an economically recoverable reserve of a hydrocarbon reservoir based on a hybrid neural network, including the following steps S1 to S5:
s1, acquiring original data in an oil and gas reservoir development process;
in an alternative embodiment of the present invention, in considering the problem of economic recoverable reserve evaluation, it is desirable to determine the characteristics of the data relevant to the evaluation, and how to select and use the data. New sensor technologies are able to stream large amounts, multi-scale and high dimensional reservoir data into databases in real time. The data types are extremely diverse, the data are learned from multi-source and multi-type data, more details are included, the corresponding relation among various data can be obtained, in order to embody all '7V' characteristics of the big data, namely quantity, speed, diversity, variability, authenticity, visualization and value, how to combine the various types of data, how to process different levels of noise and how to process lost data, the capacity of representing the data in a meaningful mode is important for the problem of economic and recoverable reserve assessment, and different data processing modes are required for different types of data.
The invention uses various sources and various types of mass data sets in the economic recoverable reserves evaluation problem, relates to parameters in multiple aspects such as oil deposit geological characteristics, mining process, historical yield, economic factors and the like, is comprehensive and objective, considers various types of characteristic parameters as input, extracts characteristics in a mode suitable for different types of data, and then performs characteristic fusion, and various types of data contribute to the accuracy of reserves evaluation results.
The initial characteristic selection is based on the oil well production theory and factors which most possibly influence the oil well production, the real reserve related data of a certain oil field is obtained, the reserve related data comprises reserve data of a plurality of evaluation units, and the characteristic variables collected by each oil well unit are divided into the following three types of corresponding input characteristic data.
The first type is: values and classification labels for geological and process parameters. The method comprises information such as well blocks, horizons, porosity, permeability, original formation pressure, crude oil viscosity, oil-containing area, oil-containing saturation, crude oil geological storage, the number of oil production wells put into production in the current year, production days, reservoir lithology, physical property classification, production mode and the like. Such data is discrete and represents the overall effect of the reservoir.
The second type: production history data and economic parameters. In the actual production process of an oil field, a lot of time series data are generated, and the data record one or more variable quantities at the past moment. The historical data of the yield is counted according to the month and comprises the information of oil yield, gas yield, water injection amount, well opening number and the like of each month; the economic parameters comprise information such as operation cost, fixed cost ratio, oil price, gas price, exchange rate, special income, tax, well construction cost, ton oil cost, input-output ratio, net cash flow, internal income rate, financial net present value, recovery period, million tons of capacity investment and the like.
In the third category: and (3) a logging curve needs to be extracted according to the depth range of each reservoir, and according to the limitation of actual measurement, only a part of the measurement is accepted as input. In the normal section of the borehole, a good mapping relation exists between each logging curve and the formation characteristics, the detection depth is deep, the more curves which are not influenced by the collapse of the borehole are, and the better the modeling effect is. The normal section of the well diameter is selected as sample data, and the well logging data quality is high to carry out experiments. The correspondence between these log data is correct. And the logging curve comprises information such as acoustic time difference, natural gamma, resistivity, density, neutrons, natural potential and the like. The structured data of the well logging curve contains more reservoir details and has dual information of time and space.
S2, respectively preprocessing the acquired original data according to data types to obtain various feature data sets;
in an optional embodiment of the present invention, step S2 specifically includes the following sub-steps:
s2-1, carrying out data cleaning, data filling and standardization processing on numerical data of geological parameter data and technological parameter data, coding classification labels of the geological parameter data and the technological parameter data by adopting one-hot coding, and combining the classification labels with an economic recoverable reserve value to obtain a first-class characteristic data set;
s2-2, performing data cleaning, data filling and standardization processing on the yield historical data and the economic parameter data, and combining the data with the economic recoverable reserve value to obtain a second type characteristic data set;
and S2-3, carrying out data cleaning, data filling and standardization processing on the logging curve data, and combining the data with the economic recoverable reserve value to obtain a third type characteristic data set.
Specifically, due to different data collection modes or different emergency situations of collection personnel, certain redundant data, default values and abnormal values of the data of the oil field can occur. For the repeated data, on the basis of the known content of the repeated data, one record is taken out from each repeated data and is reserved, and other repeated data is deleted. Abnormal values such as zero values and bad values are processed by a data cleaning technology and filled by a local mean value or a mode, so that the instability of a prediction result caused by the abnormal values is reduced to the maximum extent.
Data preprocessing is crucial to deep learning, and requires conversion of a prepared sampled data set into a data format that can be input into a hybrid neural network model. Different types of data are measurements of different properties, with different ranges of values. In order to eliminate the influence caused by different dimensions of different variables, the variables need to be converted into the same dimension, and the data is standardized in consideration of certain outliers of the sampled data. The normalization process formula is as follows:
Figure 468730DEST_PATH_IMAGE045
wherein the content of the first and second substances,
Figure 950527DEST_PATH_IMAGE046
is the average of some kind of input data,
Figure 364191DEST_PATH_IMAGE047
is the standard deviation of some input data. To increase the convergence rate of the model, the present document deals with the output data (i.e., the economic recoverable reserve)The numerical value of (b) is also subjected to standardization, the reserve predicted by the model belongs to the standardized range, and the predicted value of the model also needs to be subjected to anti-standardization to obtain the real economical recoverable reserve.
It should be noted that the normalization process is used for data of numerical type, so that the numerical values of geological parameters and process parameters, yield history data and economic parameters, well logs can and need to be normalized, and the classification labels of the geological parameters and process parameters cannot be digitized, so that the normalization process cannot be adopted, and the classification labels of the geological parameters and process parameters are converted into a data format in which the model can perform learning and feature extraction by adopting a one-hot coding process. The advantage of the one-hot coding is that the classification value is mapped to the integer value, the non-continuous numerical characteristic can be processed, the one-hot coding is used for the discrete characteristic, and the distance calculation between the characteristics is more reasonable.
One-hot coding mode:
for example, physical property classification: [ "low permeability", "medium permeability", "high permeability" ], "low permeability" is mapped to 100, "medium permeability" is mapped to 010, "high permeability" is mapped to 001;
such as land-sea classification: [ "land", "beach" ], "land" is mapped to 10, and "beach" is mapped to 01;
therefore, when a sample is "medium penetration", "beach" ], the complete feature digitization results as: [0,1,0,0,1].
The invention combines the preprocessed characteristic values with the economically collectable and storable values to form a data set required by an experiment. When the hybrid neural network model is constructed and trained, a training set, a verification set and a test set are divided according to the proportion of 8. The training set is used for training the model, the verification set is used for calculating a loss function in the training process, and the test set is used for evaluating the prediction effect of the model.
S3, analyzing the importance of each type of feature data set by adopting a neural network model, dividing feature variables in each type of feature data set into a main feature variable, a secondary feature variable and an invalid feature variable according to the importance, and selecting the secondary feature variables to construct a composite feature variable;
in an optional embodiment of the present invention, step S3 specifically includes the following sub-steps:
s3-1, constructing a full-connection neural network model, setting hyper-parameters, and training and testing the full-connection neural network model by using a first class of feature data set to obtain model parameters when the iteration of the full-connection neural network model is finished; wherein the model parameters include: the input layer-hidden layer connection weight, the hidden layer-output layer connection weight, the value of the input layer neuron, the output value of the hidden layer neuron and the output value of the output layer neuron;
s3-2, calculating the importance characteristic value of each characteristic variable according to the model parameters obtained in the step S3-1; the method specifically comprises the following steps:
s3-2-1, selecting a first number of fully-connected neural network models with optimal performance from test results of the fully-connected neural network models;
s3-2-2, calculating a connection weight total value corresponding to each characteristic variable according to the input layer-hidden layer connection weight and the hidden layer-output layer connection weight corresponding to each characteristic variable
Figure 982254DEST_PATH_IMAGE001
S3-2-3, calculating the connection weight product corresponding to each characteristic variable according to the input layer-hidden layer connection weight and the hidden layer-output layer connection weight corresponding to each characteristic variable
Figure 190381DEST_PATH_IMAGE002
S3-2-4, connecting weights according to the input layer-hidden layer corresponding to each characteristic variable
Figure 159474DEST_PATH_IMAGE003
Input layer neuron values
Figure 845670DEST_PATH_IMAGE004
And output values of hidden layer neurons
Figure 849399DEST_PATH_IMAGE005
Calculate input layer numberiThe first neuron pair of hidden layerjInfluence value of individual neuron
Figure 962848DEST_PATH_IMAGE006
S3-2-5, connecting weights according to hidden layer-output layer corresponding to each characteristic variable
Figure 655123DEST_PATH_IMAGE007
Hidden layer neuron output values
Figure 410589DEST_PATH_IMAGE005
And output values of output layer neurons
Figure 3245DEST_PATH_IMAGE008
Computing hidden layer numberjInfluence value of individual neurons on output layer neurons
Figure 553175DEST_PATH_IMAGE009
S3-2-6, calculating the input layer number corresponding to each characteristic variable by adopting the following formulaiInfluence value of each neuron on the output value:
Figure 496860DEST_PATH_IMAGE010
wherein, the first and the second end of the pipe are connected with each other,Mthe number of hidden layer neurons;
s3-2-7, connecting the absolute value of the weight according to the input layer-hidden layer corresponding to each characteristic variable
Figure 524859DEST_PATH_IMAGE011
Absolute value of hidden layer-output layer connection weight
Figure 237600DEST_PATH_IMAGE012
Calculating the product thereof
Figure 958431DEST_PATH_IMAGE013
S3-2-8, calculating each characteristic variable by adopting the following formulaImportance of each neuron of hidden layer based on weight absolute value
Figure 389412DEST_PATH_IMAGE048
Wherein, in the process,Nthe number of neurons in the input layer;
s3-2-9, calculating the importance characteristic value of each characteristic variable to the output layer based on the weight absolute value by adopting the following formula:
Figure 221102DEST_PATH_IMAGE015
wherein the content of the first and second substances,Mthe number of hidden layer neurons;
s3-3, dividing all the characteristic variables into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to the importance characteristic value calculated in the step S3-2, and constructing a composite characteristic variable according to the secondary characteristic variable; the method specifically comprises the following steps:
s3-3-1, respectively calculating according to the calculation results of the step S3-2
Figure 522771DEST_PATH_IMAGE016
Figure 178618DEST_PATH_IMAGE017
Figure 831316DEST_PATH_IMAGE018
Figure 201117DEST_PATH_IMAGE019
Average value of (2)
Figure 888451DEST_PATH_IMAGE020
Figure 951084DEST_PATH_IMAGE021
Figure 825500DEST_PATH_IMAGE022
Figure 264571DEST_PATH_IMAGE023
S3-3-2, respectively calculating the relative importance characteristic value of each characteristic variable by adopting the following formula:
Figure 806411DEST_PATH_IMAGE024
Figure 39946DEST_PATH_IMAGE025
Figure 667237DEST_PATH_IMAGE026
Figure 378841DEST_PATH_IMAGE027
wherein the content of the first and second substances,nserial number of the characteristic variable;
s3-3-3, respectively pairing all characteristic variables
Figure 775187DEST_PATH_IMAGE028
Figure 946668DEST_PATH_IMAGE029
Figure 795675DEST_PATH_IMAGE030
Figure 576549DEST_PATH_IMAGE031
Four times of sorting is carried out according to the sizes to generate four relative importance sorting tables;
s3-3-4, dividing the four relative importance ranking tables into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to a threshold value;
s3-3-5, calculating the mean value of the four relative importance characteristic values of each secondary characteristic variable and the weight mean value proportion of the mean value in all the secondary characteristic variables, and performing weighted summation on all the secondary characteristic variables to construct a composite characteristic variable;
and S3-4, processing the economic parameters in the second type characteristic data set by adopting the methods from the steps S3-1 to S3-3 to obtain main characteristic variables and composite characteristic variables corresponding to the economic parameters, taking the yield historical data in the second type characteristic data set and the main characteristic variables corresponding to the economic parameters as the main characteristic variables of the second type characteristic data set, and taking the composite characteristic variables corresponding to the economic parameters as the composite characteristic variables of the second type characteristic data set.
Specifically, since the hybrid neural network has a complex structure and many parameters, multi-source and multi-type data are not only required to be classified and respectively processed, importance analysis and feature selection are also required to be performed in each type of feature variable, a fully-connected neural network is used for predicting a target value to obtain a weight under an optimal result, the total value of the connection weight of each variable, the product of the connection weights, an influence value on an output value, and the importance comprehensive analysis based on the weight of an output layer are performed, so that variables with high relative importance are retained, variables with low relative importance are weakened and integrated, and variables with extremely low relative importance are removed, thereby improving the prediction accuracy.
When the model hyper-parameters are set, the value intervals of the model learning rate are set to be {0.001,0.003,0.005,0.01,0.03,0.05 and 0.1}, the value intervals of the hidden layer unit number are {5,6,7, \43, 35}, the data set randomly selects the ratio value intervals of {50%,55%,60%,65%,70%,75%,80%,85%,90%,95% and 100% }, the loss function is a Root Mean Square Error (RMSE) loss function, the weight initialization method is a Uniform distribution initialization method (Random uniformity), the maximum epoch number is 100, and the iteration is ended in advance when the continuous 5-time Error is less than 0.001.
In neural networks, the connection weights between neurons are the connection between input and output, and thus the connection between problem and solution. The relative importance of the independent variable characteristics to the prediction output of the neural network mainly depends on the magnitude and direction of the connection weight. However, in deep learning, processing high-dimensional vectors can cause a great consumption of computing resources and even cause a dimensional disaster problem. Therefore, there is a need to find ways to convert high-dimensional vectors to low-dimensional vectors without losing data features. In the conventional oil reservoir engineering, the characteristics are preliminarily analyzed, and the characteristics are reduced by using intuition and experience of the petroleum production engineering, but the method has strong subjectivity. The invention provides a feature importance analysis method which is used for counting and evaluating the contribution of input variables in a neural network, improving the resolving power of mass feature parameters, identifying variables which have important contribution to network prediction and further selecting the feature variables.
Firstly, importance analysis is carried out on numerical values and classification labels of first-class characteristic variables, geological parameters and process parameters, only the first-class characteristic variables are used as input, economic and recoverable reserves are used as output, a three-layer full-connection neural network is constructed, and training and testing are carried out. Since the network weight matrix initialization is random, the model training has some randomness, and if a particular variable in the dataset performs poorly, it is possible that, although this variable does not appear to be important in the experiment, it may become more important as the experiment hyper-parameter set or dataset selection changes, thus multiple random experiments are used. The invention sets value ranges of a plurality of hyper-parameters, constructs a plurality of three-layer fully-connected neural networks, and randomly selects and pairs the learning rate, the number of hidden layer units and the random selection proportion of data sets. This is repeated a plurality of times, for example 100 times. Because each connection weight in the network has a magnitude and a direction, in order to more comprehensively evaluate the importance variable, when the fully-connected neural network of each experiment reaches a critical condition or an optimal iteration state, the following values are recorded: (1) Input layer-hidden layer connection weights
Figure 827402DEST_PATH_IMAGE049
(ii) a (2) Hidden layer-to-output layer connection weights
Figure 668319DEST_PATH_IMAGE050
(ii) a (3) Input layer neuron i values
Figure 4622DEST_PATH_IMAGE051
(ii) a (4) Output value for hidden layer neuron j
Figure 323608DEST_PATH_IMAGE005
(ii) a Wherein
Figure 694547DEST_PATH_IMAGE052
The value of 1,2 is selected,
Figure 440786DEST_PATH_IMAGE053
n, the number of input layer neurons; value of j
Figure 998806DEST_PATH_IMAGE054
I.e. the number of hidden layer neurons; (5) Output values for output layer neurons
Figure 387062DEST_PATH_IMAGE008
After multiple randomization experiments are finished, 10 fully-connected neural networks with the best performance are selected, and the learning rate (1) and the hidden layer unit number (2) of the fully-connected neural networks are recorded. Their following values were calculated: (1) The total value of the connection weights, which includes the sum of all the connection weights of each variable from the input layer to the hidden layer and then to the output layer, is
Figure 346928DEST_PATH_IMAGE001
(ii) a (2) Connecting the weight product including the connection weight of each variable input layer and hidden layer
Figure 51620DEST_PATH_IMAGE055
Connecting weight value with hidden layer and output layer of each hidden layer neuron
Figure 362516DEST_PATH_IMAGE007
The sum of the products of (a) is made to be
Figure 23304DEST_PATH_IMAGE056
(ii) a (3) Influence value of ith neuron of input layer on jth neuron of hidden layer
Figure 103256DEST_PATH_IMAGE057
Influence value of j-th neuron of hidden layer on neuron of output layer
Figure 456877DEST_PATH_IMAGE009
(ii) a Calculating the output value of the ith neuron pair of the input layer
Figure 989489DEST_PATH_IMAGE008
Influence value of
Figure 453969DEST_PATH_IMAGE058
(ii) a (4) Absolute value of connection weight of each variable input layer and hidden layer
Figure 388426DEST_PATH_IMAGE011
Absolute value of connected weights of hidden layer and output layer for each hidden layer neuron
Figure 178528DEST_PATH_IMAGE012
Calculating the product thereof
Figure 198437DEST_PATH_IMAGE059
Then calculating the importance of each variable to each neuron of the hidden layer based on the weight absolute value
Figure 466607DEST_PATH_IMAGE014
Importance characteristic value of each variable to output layer based on weight absolute value
Figure 757036DEST_PATH_IMAGE060
. Wherein i takes on a value
Figure 452460DEST_PATH_IMAGE061
I.e. the number of input layer neurons; value of j
Figure 959664DEST_PATH_IMAGE054
I.e., the number of hidden layer neurons. This in turn allows the statistical significance of the importance of each variable to be derived.
Screening out 10 full-connection gods with optimal performance in model experimentVia a network model. Find the 10
Figure 765946DEST_PATH_IMAGE016
Figure 674997DEST_PATH_IMAGE017
Figure 541321DEST_PATH_IMAGE018
Figure 535822DEST_PATH_IMAGE019
Average value of (2)
Figure 145795DEST_PATH_IMAGE020
Figure 909352DEST_PATH_IMAGE021
Figure 946578DEST_PATH_IMAGE022
Figure 428375DEST_PATH_IMAGE023
Calculating four relative importance characteristic values of each variable
Figure 576459DEST_PATH_IMAGE062
Figure 958637DEST_PATH_IMAGE063
Figure 901185DEST_PATH_IMAGE064
Figure 870278DEST_PATH_IMAGE027
Comprehensively analyzing the influence of the connection weight on the variables and the output from four angles, and respectively making all the variables pair
Figure 822054DEST_PATH_IMAGE028
Figure 294623DEST_PATH_IMAGE029
Figure 673652DEST_PATH_IMAGE030
Figure 130041DEST_PATH_IMAGE031
Four sorts by size and generates four sort tables of relative importance.
All variables are classified into primary characteristic variables (i.e., variables of higher importance), secondary characteristic variables (i.e., variables of lower importance), and null characteristic variables (i.e., variables of lowest importance) according to a certain criterion.
Since the output variable value is the result of the common influence of all the characteristic variables, the secondary characteristic variables contribute less, but not completely useless, with respect to the primary variable. Therefore, the invention does not abandon all the secondary characteristic variables, but weakens the connection weight of the secondary characteristic variables, and more clearly analyzes the influence of each input characteristic variable on the output variable.
Two percentage thresholds are set, one larger and one smaller. In four sorting processes, all the characteristic variables with relative importance exceeding a larger threshold are taken out and four sets are formed
Figure 619928DEST_PATH_IMAGE065
The intersection of the four sets
Figure 478163DEST_PATH_IMAGE066
All the variables in (2) are defined as main characteristic variables, all the characteristic variables with relative importance lower than a smaller threshold value are taken out, and four sets are formed
Figure 28093DEST_PATH_IMAGE067
The intersection of the four sets
Figure 440620DEST_PATH_IMAGE068
All the feature variables in (1) are defined as invalid feature variables, and the rest of feature variables are all defined as secondary feature variables. Or set a percentage thresholdThe feature variable reaching this threshold 2 to 4 times is defined as a primary feature variable, the feature variable reaching this threshold 1 time is defined as a secondary feature variable, and the feature variable reaching this threshold 0 time is defined as an invalid feature variable.
Averaging the four relative importance of each sub-feature variable
Figure 501242DEST_PATH_IMAGE069
And their weight-mean ratios in all sub-feature variables
Figure 948404DEST_PATH_IMAGE070
And L is the number of the secondary characteristic variables. And according to the weight proportion, carrying out weighted summation on the secondary characteristic variables and integrating the secondary characteristic variables into a new composite characteristic variable. The weight proportion of each secondary characteristic variable in the composite characteristic variables is determined, and the weights are not trained in the subsequent mixed neural network model training process. And taking the composite characteristic variable and the main characteristic variable together as input characteristic variables of the hybrid neural network model. Too many feature variable inputs complicate the model and increase the parameters of the neural network. In order to simplify the model, more data types are effectively utilized while the prediction precision is ensured, so that not only can all collected characteristic data be more fully considered to exert the value of each characteristic variable, but also the connection between low-importance characteristic variables and a neural network is reduced, the number of parameters during model training is reduced, and the model structure is optimized.
According to the invention, the main characteristic variables and the new characteristic variables obtained by analyzing the importance of the first-class characteristic data set are defined as the first-class main characteristic variables and the first-class composite characteristic variables.
And (4) performing importance analysis on the second type characteristic data set, the yield historical data and the economic parameters, wherein the yield historical data and the reserves have the most direct relationship and are all reserved, so that only the second type characteristic variables are taken as economic parameter input and the economically obtainable reserves are output, the method is similar to the steps S3-1 to S3-3, and finally the second type main characteristic variables (the yield historical data and the economic parameters with higher relative importance) and the second type composite characteristic variables (the composite variables of the economic parameters with lower relative importance) are obtained.
For the third type of characteristic data set, because the logging curve sequence is easy to have more missing values in practice, several curves with high logging data quality are screened out in the data preprocessing stage, and the curves comprise information such as acoustic wave time difference, natural gamma, resistivity, density, neutron, natural potential and the like, so that the importance analysis is not performed on the data, and all the curves are defined as third type main characteristic variables.
S4, constructing a hybrid neural network model, and performing model training on the hybrid neural network model by using main characteristic variables and composite characteristic variables of various characteristic data sets as input characteristic variables;
in an optional embodiment of the invention, a hybrid neural network (HDNN) model is provided for economic recoverable reserve evaluation in consideration of various types of data such as geological parameters, technological parameters, economic parameters, yield historical data and the like of a single well in the development process of a hydrocarbon reservoir.
The hybrid neural network model constructed by the invention specifically comprises the following steps:
the system comprises a first feature extraction channel, a second feature extraction channel, a third feature extraction channel, a full-connection neural network and a second feature extraction channel, wherein the first feature extraction channel, the second feature extraction channel and the third feature extraction channel form a parallel feature extraction channel;
the first feature extraction channel is used for inputting main feature variables and composite feature variables of the first-class feature data set, and extracting numerical values of geological parameters and process parameters and feature vectors of classification labels;
the second feature extraction channel is used for inputting main feature variables and composite feature variables of a second type of feature data set and extracting feature vectors of economic parameters and yield historical data;
the third feature extraction channel is used for inputting a third type of feature data set and extracting feature vectors of logging curve data;
and the fully-connected neural network is used for performing characteristic cascade on the characteristic vectors extracted by the parallel characteristic extraction channel connection to serve as input characteristic vectors and predicting the economic recoverable reserves of the oil and gas reservoirs.
The hybrid neural network model of the present invention processes each input type separately with a suitable network for feature learning. All learned features are then concatenated and integrated into a feature set. This integration contains valid information from different inputs, which are then input to subsequent neural networks for overall target learning.
The input data can be continuous or discrete, it is inconvenient to input all the data into one network, and in order to process mixed input, the invention applies various types of neural networks to different data types and integrates the data types.
Aiming at some geological parameters which are numerical values such as porosity, permeability, original formation pressure, crude oil viscosity, oil saturation and the like, and some geological parameters which are classification labels such as reservoir lithology, physical property classification and the like; some technological parameters also include the number of oil wells put into production in the current year, the number of days in production and other numerical values, and the classification labels such as the production mode and the like. The values and classification labels are discrete and represent the overall effect of the reservoir. Therefore, the invention uses a Fully Connected Neural Network (FCNN) to perform feature processing on the numerical values of the geological parameters and the technological parameters and the input of the classification labels.
The historical data of production and economic parameters are important data of the oil field for a long time, the data contain time series information, and the data record one or more variable quantities at the past moment, so the invention uses a Gated Current Unit (GRU) to extract the characteristics and mine the relationship of the historical data of economic parameters and production.
For some structural measurements, such as well logs, that contain more detail, a variety of sequence curves can be provided, reflecting cumulative effects in the reservoir, including both temporal and spatial information. Therefore, the invention uses a one-dimensional Convolutional Neural Network (CNN) which comprises a Conv1D Convolutional layer and a pooling layer, and a cyclic Convolutional Neural network combined with a Gated Recurrent Unit (GRU) to process the logging curve and perform feature extraction and relationship mining.
The method respectively performs characteristic learning on data with different formats, and then performs characteristic cascade on the output of the FCNN, the GRU and the cyclic convolution neural network to realize final evaluation on the economical and recoverable reserves.
Converting numerical values and classification labels of geological parameters and process parameters, economic parameters and yield historical data and logging curves into a characteristic vector A, a characteristic vector B and a characteristic vector C, inputting the characteristic vectors into a nonlinear mapping equation of learning characteristics, and respectively expressing the three data as follows:
Figure 669235DEST_PATH_IMAGE071
Figure 100216DEST_PATH_IMAGE072
Figure 931906DEST_PATH_IMAGE073
wherein, the first and the second end of the pipe are connected with each other,
Figure 499154DEST_PATH_IMAGE074
in order to be the FCNN,
Figure 390886DEST_PATH_IMAGE075
in order to gate the circulation network, the control circuit is provided with a control circuit,
Figure 778005DEST_PATH_IMAGE076
in order to be a circular convolutional neural network,
Figure 678965DEST_PATH_IMAGE077
Figure 100719DEST_PATH_IMAGE078
Figure 163353DEST_PATH_IMAGE079
numerical and classification labels for geological and technological parameters, respectivelyThe features learned in the FCNN, the economic parameters and the production history data are input into the GRU, and the logging curves are input into the cyclic convolution neural network.
Performing feature cascade on all learned features to obtain an integrated feature, combining several features according to specific data types, and then representing the integrated feature as the input of a final economic recoverable reserve assessment model
Figure 37768DEST_PATH_IMAGE080
Wherein Y represents the value of the economically recoverable reserve obtained by the final model,
Figure 975375DEST_PATH_IMAGE081
is a layer of fully connected network.
On the basis of the characteristic extraction, the hybrid neural network (HDNN) model provided by the invention realizes an end-to-end deep learning model expressed as
Figure 251636DEST_PATH_IMAGE082
Wherein F represents the nonlinear mapping from multi-source multi-class input to target value, that is, the HDNN model provided by the invention is formed by connecting FCNN, GRU and cyclic convolution neural network in parallel and then connecting with the target value
Figure 750750DEST_PATH_IMAGE081
And (4) combining models after series connection. When the HDNN model is integrally trained, the weight W, the weight V and the threshold of the model are continuously optimized through a neural network training algorithm, and the model is trained to be the model with the optimal effect.
When the model training is carried out, the learning rate is set as the average value of the learning rate values of 10 fully-connected neural networks with optimal performance in the random result of the value range of the set model learning rate
Figure 112461DEST_PATH_IMAGE083
And the average value of learning rate values of 10 fully-connected neural networks with optimal performance when the importance analysis is carried out on the economic parameters in the second class of characteristic variables
Figure 89645DEST_PATH_IMAGE084
Average value of
Figure DEST_PATH_IMAGE085
(ii) a The number of the hidden layer units at the FCNN end is the average value (rounded down if decimal) of the number of the hidden layer units of 10 fully-connected neural networks with optimal performance in the random result of the value interval of the number of the hidden layer units, and the number of the hidden layer units at the GRU end is the average value (rounded down if decimal) of the number of the hidden layer units of 10 fully-connected neural networks with optimal performance when the importance analysis is carried out on the economic parameters in the second type of characteristic variables; the loss function is Root Mean Square Error (RMSE) loss function, the weight initialization method is a Uniform distribution initialization method (Random uniformity), the maximum epoch number is 100, and iteration is ended in advance when the Error is less than 0.001 for 5 consecutive times.
In order to avoid the over-fitting condition and increase the probability that the network model converges to the global optimal value, the invention adopts an optimized adaptive momentum method and carries out model training by introducing simple weight attenuation regularization. The variation of the weights is confined to the hyperellipse. The conjugacy is also maintained between all subsequent descent directions, where only the descent direction that sufficiently reduces the objective function is accepted, and the iterative update formula is expressed as:
Figure 17149DEST_PATH_IMAGE032
Figure 421586DEST_PATH_IMAGE033
wherein the content of the first and second substances,
Figure 5014DEST_PATH_IMAGE034
is the network weight at the next iteration,
Figure 51467DEST_PATH_IMAGE035
for the network weight of the current iteration,
Figure 302320DEST_PATH_IMAGE036
Figure 143237DEST_PATH_IMAGE037
in order to adapt the hyper-parameters adaptively,
Figure 715426DEST_PATH_IMAGE038
is a matrix of partial derivatives of the network error against the weights,
Figure 34412DEST_PATH_IMAGE040
is that
Figure 405351DEST_PATH_IMAGE042
The transpose matrix of (a) is,
Figure 151590DEST_PATH_IMAGE043
is a scale factor, m is a penalty factor,Iis an identity matrix, E is a network error vector,
Figure 240768DEST_PATH_IMAGE044
is the network weight at the last iteration,kis the current iteration number.
When model training is carried out, a first class main characteristic variable and a first class composite characteristic variable are input to an FCNN end in the proposed HDNN, a second class main characteristic variable and a second class composite characteristic variable are input to a GRU end in the proposed HDNN, a third class main characteristic variable is input to a circular convolution neural network end in the proposed HDNN, and a model architecture is shown in figure 2. Training by adopting an optimized adaptive momentum method (OLMAM) until the maximum iteration number is reached or the error requirement is reached; economic recoverable reserve assessment is a regression problem, and therefore Root Mean Square Error (RMSE) loss functions are used in the training process as criteria for evaluating the predicted effect during network training. The root mean square error is an index used to describe the "systematic error", and the smaller the value, the closer to 0, the better the performance of the model. The root mean square error loss function is calculated as follows:
Figure 97866DEST_PATH_IMAGE086
wherein, the total number of samples is T,
Figure 323311DEST_PATH_IMAGE087
is the first onetThe actual value of the economically exploitable reserves of an individual sample,
Figure DEST_PATH_IMAGE088
is the firsttAnd predicting the economic recoverable reserves of the samples.
When the model test is carried out, the test data set is input into the trained model, and the performance of the model is evaluated. Because all data is normalized, the root mean square error is calculated during the training process using the normalized data. Therefore, when the performance of the model is evaluated, the actual prediction effect of the model is reflected, firstly, the output of the model is subjected to anti-standardization processing to obtain a predicted economic recoverable reserve value, and then, the root mean square error is calculated by using the predicted value and the actual value.
And S5, predicting the economic recoverable reserves of the oil and gas reservoirs according to the trained mixed neural network model.
Example 2
As shown in fig. 3, the present invention provides a hybrid neural network-based economic recoverable reservoir assessment system applying the above method based on the hybrid neural network-based economic recoverable reservoir assessment method described in embodiment 1, including:
the data acquisition module is used for acquiring original data in the development process of the oil and gas reservoir;
the data preprocessing module is used for respectively preprocessing the acquired original data according to data types to obtain various feature data sets;
the characteristic extraction module is used for analyzing the importance of each type of characteristic data set by adopting a neural network model, dividing characteristic variables in each type of characteristic data set into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to the importance, and selecting the secondary characteristic variable to construct a composite characteristic variable;
the model training module is used for constructing a hybrid neural network model and performing model training on the hybrid neural network model by taking the main characteristic variables and the composite characteristic variables of various characteristic data sets as input characteristic variables;
and the data prediction module is used for predicting the economic recoverable reserves of the oil and gas reservoir according to the trained mixed neural network model.
The present invention has been described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
The principle and the implementation mode of the invention are explained by applying specific embodiments in the invention, and the description of the embodiments is only used for helping to understand the method and the core idea of the invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.
It will be appreciated by those of ordinary skill in the art that the embodiments described herein are intended to assist the reader in understanding the principles of the invention and are to be construed as being without limitation to such specifically recited embodiments and examples. Those skilled in the art can make various other specific changes and combinations based on the teachings of the present invention without departing from the spirit of the invention, and these changes and combinations are within the scope of the invention.

Claims (10)

1. A method for evaluating the economic recoverable reserves of oil and gas reservoirs based on a hybrid neural network is characterized by comprising the following steps:
s1, acquiring original data in an oil and gas reservoir development process;
s2, respectively preprocessing the acquired original data according to data types to obtain various feature data sets;
s3, analyzing the importance of each type of feature data set by adopting a neural network model, dividing feature variables in each type of feature data set into a main feature variable, a secondary feature variable and an invalid feature variable according to the importance, and selecting the secondary feature variable to construct a composite feature variable;
s4, constructing a hybrid neural network model, and performing model training on the hybrid neural network model by using the main characteristic variables and the composite characteristic variables of various characteristic data sets as input characteristic variables;
and S5, predicting the economic recoverable reserves of the oil and gas reservoirs according to the trained mixed neural network model.
2. The method of claim 1, wherein the raw data comprises:
geological parameter data, process parameter data, yield history data, economic parameter data and logging curve data.
3. The method for evaluating the economic recoverable reserves of the oil and gas reservoirs based on the hybrid neural network as claimed in claim 2, wherein the step S2 comprises the following sub-steps:
s2-1, performing data cleaning, data filling and standardization processing on numerical data of geological parameter data and process parameter data, coding classification labels of the geological parameter data and the process parameter data by adopting one-hot coding, and combining the classification labels with an economic and collectable reserve value to obtain a first-class characteristic data set;
s2-2, performing data cleaning, data filling and standardization processing on the yield historical data and the economic parameter data, and combining the data with the economic recoverable reserve value to obtain a second type characteristic data set;
and S2-3, carrying out data cleaning, data filling and standardization processing on the logging curve data, and combining the data with the economic recoverable reserve value to obtain a third type characteristic data set.
4. The method for evaluating the economic recoverable reserves of the oil and gas reservoirs based on the hybrid neural network as claimed in claim 3, wherein the step S3 comprises the following sub-steps:
s3-1, constructing a full-connection neural network model, setting hyper-parameters, and training and testing the full-connection neural network model by using a first class of feature data set to obtain model parameters when the iteration of the full-connection neural network model is finished;
s3-2, calculating the importance characteristic value of each characteristic variable according to the model parameters obtained in the step S3-1;
s3-3, dividing all the characteristic variables into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to the importance characteristic value calculated in the step S3-2, and constructing a composite characteristic variable according to the secondary characteristic variable;
and S3-4, processing the economic parameters in the second type characteristic data set by adopting the methods from the steps S3-1 to S3-3 to obtain main characteristic variables and composite characteristic variables corresponding to the economic parameters, taking the yield historical data in the second type characteristic data set and the main characteristic variables corresponding to the economic parameters as the main characteristic variables of the second type characteristic data set, and taking the composite characteristic variables corresponding to the economic parameters as the composite characteristic variables of the second type characteristic data set.
5. The method of claim 4, wherein the model parameters comprise:
an input layer-hidden layer connection weight, a hidden layer-output layer connection weight, a value of an input layer neuron, an output value of a hidden layer neuron, an output value of an output layer neuron.
6. The method for evaluating the economic recoverable reserves of the oil and gas reservoirs based on the hybrid neural network as claimed in claim 5, wherein the step S3-2 comprises the following sub-steps:
s3-2-1, selecting a first number of fully-connected neural network models with optimal performance from test results of the fully-connected neural network models;
s3-2-2, calculating a total connection weight value corresponding to each characteristic variable according to the input layer-hidden layer connection weight and the hidden layer-output layer connection weight corresponding to each characteristic variable
Figure 731686DEST_PATH_IMAGE001
S3-2-3, calculating the connection weight product corresponding to each characteristic variable according to the input layer-hidden layer connection weight and the hidden layer-output layer connection weight corresponding to each characteristic variable
Figure 117668DEST_PATH_IMAGE002
S3-2-4, connecting weights of the input layer and the hidden layer according to the input layer and the hidden layer corresponding to each characteristic variable
Figure 631826DEST_PATH_IMAGE003
Input layer neuron values
Figure 495877DEST_PATH_IMAGE004
And output values of hidden layer neurons
Figure 543205DEST_PATH_IMAGE005
Calculate input layer numberiThe first neuron to the hidden layerjInfluence value of individual neuron
Figure 834509DEST_PATH_IMAGE006
S3-2-5, connecting weights according to hidden layer-output layer corresponding to each characteristic variable
Figure 835963DEST_PATH_IMAGE007
Hidden layer neuron output values
Figure 769284DEST_PATH_IMAGE008
And output values of output layer neurons
Figure 907004DEST_PATH_IMAGE009
Computing hidden layer numberjInfluence value of individual neurons on output layer neurons
Figure 634789DEST_PATH_IMAGE010
S3-2-6, calculating the input layer number corresponding to each characteristic variable by adopting the following formulaiInfluence value of each neuron on the output value:
Figure 857960DEST_PATH_IMAGE011
wherein the content of the first and second substances,Mthe number of neurons in the hidden layer;
s3-2-7, connecting the absolute value of the weight according to the input layer-hidden layer corresponding to each characteristic variable
Figure 329392DEST_PATH_IMAGE012
Absolute value of hidden layer-output layer connection weight
Figure 587198DEST_PATH_IMAGE013
Calculating the product thereof
Figure 485884DEST_PATH_IMAGE014
S3-2-8, calculating the importance of each characteristic variable to each neuron of the hidden layer based on the weight absolute value by adopting the following formula
Figure 461930DEST_PATH_IMAGE015
Wherein, in the step (A),Nthe number of neurons in the input layer;
s3-2-9, calculating the importance characteristic value of each characteristic variable to the output layer based on the weight absolute value by adopting the following formula:
Figure 471475DEST_PATH_IMAGE016
wherein the content of the first and second substances,Mthe number of hidden layer neurons.
7. The method for evaluating the economic recoverable reserves of the oil and gas reservoirs based on the hybrid neural network as claimed in claim 6, wherein the step S3-3 comprises the following sub-steps:
s3-3-1, respectively calculating according to the calculation results of the step S3-2
Figure 82322DEST_PATH_IMAGE017
Figure 151909DEST_PATH_IMAGE018
Figure 615252DEST_PATH_IMAGE019
Figure 162908DEST_PATH_IMAGE020
Average value of (2)
Figure 395306DEST_PATH_IMAGE021
Figure 635795DEST_PATH_IMAGE022
Figure 320854DEST_PATH_IMAGE023
Figure 937780DEST_PATH_IMAGE024
S3-3-2, respectively calculating the relative importance characteristic value of each characteristic variable by adopting the following formula:
Figure 759105DEST_PATH_IMAGE025
Figure 436074DEST_PATH_IMAGE026
Figure 608430DEST_PATH_IMAGE027
Figure 996423DEST_PATH_IMAGE028
wherein the content of the first and second substances,nserial number of the characteristic variable;
s3-3-3, respectively pairing all characteristic variables
Figure 203414DEST_PATH_IMAGE029
Figure 51284DEST_PATH_IMAGE030
Figure 445356DEST_PATH_IMAGE031
Figure 404085DEST_PATH_IMAGE032
Four times of sorting is carried out according to the sizes to generate four relative importance sorting tables;
s3-3-4, dividing the four relative importance ranking tables into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to a threshold value;
and 3-3-5, calculating the mean value of the four relative importance characteristic values of each secondary characteristic variable and the weight-mean ratio of the mean value in all the secondary characteristic variables, and performing weighted summation on all the secondary characteristic variables to construct and obtain a composite characteristic variable.
8. The method for evaluating the economic recoverable reserves of the oil and gas reservoirs based on the hybrid neural network as claimed in claim 7, wherein the model of the hybrid neural network specifically comprises:
the device comprises a parallel feature extraction channel and a full-connection neural network, wherein the parallel feature extraction channel consists of a first feature extraction channel, a second feature extraction channel and a third feature extraction channel;
the first feature extraction channel is used for inputting main feature variables and composite feature variables of the first class of feature data sets, and extracting numerical values of geological parameters and technological parameters and feature vectors of classification labels;
the second feature extraction channel is used for inputting main feature variables and composite feature variables of a second type of feature data set and extracting feature vectors of economic parameters and yield historical data;
the third feature extraction channel is used for inputting a third type of feature data set and extracting feature vectors of logging curve data;
and the fully-connected neural network is used for performing characteristic cascade on the characteristic vectors extracted by the parallel characteristic extraction channel connection to serve as input characteristic vectors and predicting the economic recoverable reserves of the oil and gas reservoirs.
9. The method for evaluating the economic recoverable reserves of the oil and gas reservoirs based on the hybrid neural network as claimed in claim 8, wherein the hybrid neural network model adopts an optimized adaptive momentum method for iterative training, wherein an iterative updating formula is as follows:
Figure 200003DEST_PATH_IMAGE033
Figure 953195DEST_PATH_IMAGE034
wherein, the first and the second end of the pipe are connected with each other,
Figure 100143DEST_PATH_IMAGE035
is the network weight at the next iteration,
Figure 862562DEST_PATH_IMAGE036
is the network weight for the current iteration,
Figure 512987DEST_PATH_IMAGE037
Figure 437080DEST_PATH_IMAGE038
in order to adapt the hyper-parameters adaptively,
Figure 71324DEST_PATH_IMAGE039
is a matrix of partial derivatives of the network error against the weights,
Figure 628249DEST_PATH_IMAGE041
is that
Figure 664338DEST_PATH_IMAGE043
The transpose matrix of (a) is,
Figure DEST_PATH_IMAGE044
is a scale factor, m is a penalty factor,Iis an identity matrix, E is a network error vector,
Figure 228174DEST_PATH_IMAGE045
is the network weight at the last iteration,kis the current iteration number.
10. A hybrid neural network-based reservoir economic recoverable reserve evaluation system employing the method of any one of claims 1 to 9, comprising:
the data acquisition module is used for acquiring original data in the development process of the oil and gas reservoir;
the data preprocessing module is used for respectively preprocessing the acquired original data according to data types to obtain various feature data sets;
the characteristic extraction module is used for analyzing the importance of each type of characteristic data set by adopting a neural network model, dividing characteristic variables in each type of characteristic data set into a main characteristic variable, a secondary characteristic variable and an invalid characteristic variable according to the importance, and selecting the secondary characteristic variable to construct a composite characteristic variable;
the model training module is used for constructing a hybrid neural network model and performing model training on the hybrid neural network model by taking the main characteristic variables and the composite characteristic variables of various characteristic data sets as input characteristic variables;
and the data prediction module is used for predicting the economic recoverable reserves of the oil and gas reservoir according to the trained mixed neural network model.
CN202211560212.6A 2022-12-07 2022-12-07 Oil and gas reservoir economic recoverable reserve assessment method and system based on hybrid neural network Active CN115660221B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211560212.6A CN115660221B (en) 2022-12-07 2022-12-07 Oil and gas reservoir economic recoverable reserve assessment method and system based on hybrid neural network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211560212.6A CN115660221B (en) 2022-12-07 2022-12-07 Oil and gas reservoir economic recoverable reserve assessment method and system based on hybrid neural network

Publications (2)

Publication Number Publication Date
CN115660221A true CN115660221A (en) 2023-01-31
CN115660221B CN115660221B (en) 2023-03-21

Family

ID=85017351

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211560212.6A Active CN115660221B (en) 2022-12-07 2022-12-07 Oil and gas reservoir economic recoverable reserve assessment method and system based on hybrid neural network

Country Status (1)

Country Link
CN (1) CN115660221B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117910538A (en) * 2024-03-19 2024-04-19 成都三一能源环保技术有限公司 Downhole drilling instrument running state monitoring system based on machine learning

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108573320A (en) * 2018-03-08 2018-09-25 中国石油大学(北京) The computational methods and system of shale gas reservoir ultimate recoverable reserves
CN109057786A (en) * 2018-07-27 2018-12-21 中国石油大学(华东) A kind of adaptive model for the estimation of untraditional reservoir recoverable reserves
US20200364593A1 (en) * 2019-05-13 2020-11-19 Saudi Arabian Oil Company Calcium carbonate scale prediction and inhibition in hydrocarbon wells using machine learning
CN112989708A (en) * 2021-04-08 2021-06-18 重庆科技学院 Well logging lithology identification method and system based on LSTM neural network
CN113947198A (en) * 2021-12-21 2022-01-18 西南石油大学 Logging curve reconstruction method based on nonlinear autoregressive neural network model
CN114662414A (en) * 2022-05-25 2022-06-24 中国石油大学(华东) Oil reservoir production prediction method based on graph wavelet neural network model
CN114925623A (en) * 2022-07-22 2022-08-19 中国地质大学(北京) Oil and gas reservoir yield prediction method and system
CN115186729A (en) * 2022-05-30 2022-10-14 西安石油大学 Stratum identification and classification method based on cuckoo algorithm optimized elastic BP neural network

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108573320A (en) * 2018-03-08 2018-09-25 中国石油大学(北京) The computational methods and system of shale gas reservoir ultimate recoverable reserves
CN109057786A (en) * 2018-07-27 2018-12-21 中国石油大学(华东) A kind of adaptive model for the estimation of untraditional reservoir recoverable reserves
US20200364593A1 (en) * 2019-05-13 2020-11-19 Saudi Arabian Oil Company Calcium carbonate scale prediction and inhibition in hydrocarbon wells using machine learning
CN112989708A (en) * 2021-04-08 2021-06-18 重庆科技学院 Well logging lithology identification method and system based on LSTM neural network
CN113947198A (en) * 2021-12-21 2022-01-18 西南石油大学 Logging curve reconstruction method based on nonlinear autoregressive neural network model
CN114662414A (en) * 2022-05-25 2022-06-24 中国石油大学(华东) Oil reservoir production prediction method based on graph wavelet neural network model
CN115186729A (en) * 2022-05-30 2022-10-14 西安石油大学 Stratum identification and classification method based on cuckoo algorithm optimized elastic BP neural network
CN114925623A (en) * 2022-07-22 2022-08-19 中国地质大学(北京) Oil and gas reservoir yield prediction method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
刘巍;刘威;谷建伟;: "基于机器学习方法的油井日产油量预测" *
王化增;迟国泰;程砚秋;: "基于BP神经网络的油气储量价值等级划分" *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117910538A (en) * 2024-03-19 2024-04-19 成都三一能源环保技术有限公司 Downhole drilling instrument running state monitoring system based on machine learning

Also Published As

Publication number Publication date
CN115660221B (en) 2023-03-21

Similar Documents

Publication Publication Date Title
CN109657947A (en) A kind of method for detecting abnormality towards enterprises ' industry classification
CN109829543A (en) A kind of online method for detecting abnormality of data flow based on integrated study
CN110968069B (en) Fault prediction method of wind generating set, corresponding device and electronic equipment
CN116108758B (en) Landslide susceptibility evaluation method
Fang et al. Fuzzy modelling and the prediction of porosity and permeability from the compositional and textural attributes of sandstone
CN106202489A (en) A kind of agricultural pest intelligent diagnosis system based on big data
CN104809230A (en) Cigarette sensory quality evaluation method based on multi-classifier integration
CN114186641B (en) Landslide susceptibility evaluation method based on deep learning
CN115660221B (en) Oil and gas reservoir economic recoverable reserve assessment method and system based on hybrid neural network
CN116316599A (en) Intelligent electricity load prediction method
CN115470962A (en) LightGBM-based enterprise confidence loss risk prediction model construction method
CN117272841B (en) Shale gas dessert prediction method based on hybrid neural network
CN117314266B (en) Novel intelligent scientific and technological talent evaluation method based on hypergraph attention mechanism
CN117540303A (en) Landslide susceptibility assessment method and system based on cross semi-supervised machine learning algorithm
CN115481841A (en) Material demand prediction method based on feature extraction and improved random forest
CN112329804A (en) Naive Bayes lithofacies classification integrated learning method and device based on feature randomness
Elwakil Knowledge discovery based simulation system in construction
CN116720743A (en) Carbon emission measuring and calculating method based on data clustering and machine learning
CN113505850A (en) Boiler fault prediction method based on deep learning
Goncalves et al. Macroscopic rock texture image classification using an hierarchical neuro-fuzzy system
Feder Machine-learning approach determines spatial variation in shale decline curves
CN116467572B (en) Coal-rock gas composite dynamic disaster prediction method based on deep learning
Chengwei Research on Performance Evaluation System of Manufacturing Listed Companies Based on CNN
Dong et al. Application and Research of One-dimensional convolutional neural network in Intelligent Stratification in Oilfield
Guo et al. Simulation Research on Artificial Intelligence Design Decision Model Based on Deep Convolution Neural Network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant