CN115828130A - Clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system - Google Patents

Clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system Download PDF

Info

Publication number
CN115828130A
CN115828130A CN202310107756.5A CN202310107756A CN115828130A CN 115828130 A CN115828130 A CN 115828130A CN 202310107756 A CN202310107756 A CN 202310107756A CN 115828130 A CN115828130 A CN 115828130A
Authority
CN
China
Prior art keywords
sample
center point
water flow
flow channel
principal component
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310107756.5A
Other languages
Chinese (zh)
Inventor
张吉群
李欣
贾德利
王利明
常军华
李夏宁
吴丽
崔丽宁
闫林
张洋
王全宾
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Petrochina Co Ltd
Original Assignee
Petrochina Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Petrochina Co Ltd filed Critical Petrochina Co Ltd
Priority to CN202310107756.5A priority Critical patent/CN115828130A/en
Publication of CN115828130A publication Critical patent/CN115828130A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02ATECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE
    • Y02A10/00TECHNOLOGIES FOR ADAPTATION TO CLIMATE CHANGE at coastal zones; at river basins
    • Y02A10/40Controlling or monitoring, e.g. of flood or hurricane; Forecasting, e.g. risk assessment or mapping

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a method and a system for automatically identifying a multi-parameter dominant water flow channel based on a clustering algorithm, which relate to the technical field of oil exploitation, and comprise the following steps: step S1: collecting parameters of an oil-water well layer of an oil field by taking the layer as a unit; step S2: obtaining a principal component by using a principal component analysis method based on the interlayer parameter of the oil-water well; and step S3: and clustering the main components by using a K-Means clustering algorithm to identify a dominant water flow channel. The invention overcomes the limitations that other methods need to depend on the experience of engineers, the time consumption and the like. A more effective dominant water flow channel identification method is established, the research period is shortened, the identification precision is improved, judgment which is more and more consistent with the actual production is obtained, and the method has wide trial.

Description

Clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system
Technical Field
The invention relates to the technical field of oil exploitation, in particular to a multi-parameter dominant water flow channel automatic identification method and system based on a clustering algorithm.
Background
Most of oil fields in China are heterogeneous and multi-reservoir sandstone oil fields deposited on continental facies, and the reservoir heterogeneity is relatively serious. Water injection is an important means for maintaining pressure, increasing oil recovery rate and recovery ratio in oil field development. However, in the later period of high water content, interlayer contradiction, plane contradiction and in-layer contradiction are increasingly prominent, so that a serious water injection invalid circulation phenomenon appears, great difficulty is caused to oil stabilization and water control of the oil field, and the water injection development effect and the economic benefit are directly influenced. And only when the position of the dominant water flow channel is found, the feasible measures can be taken for the secondary development deep profile control and flooding work of the old oil field.
The existing dominant water flow channel identification method mainly comprises a well logging method, a well testing method and other mathematical methods, and each method at the present stage has certain limitations, wherein the well testing method needs field operation, is high in cost, only part of wells have data, and influences analysis results; the well logging method and other mathematical methods mostly focus on partial parameters for analysis, and the accuracy is relatively low and the method is not suitable for wide trial use.
Disclosure of Invention
The invention aims to provide a clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system, and overcomes the limitations that other methods need to depend on the experience of engineers, the time consumption and the like. A more effective dominant water flow channel identification method is established, the research period is shortened, the identification precision is improved, judgment which is more and more consistent with the actual production is obtained, and the method has wide trial. In order to achieve the purpose, the invention provides the following technical scheme:
according to one aspect of the disclosure, a method for automatically identifying a multi-parameter dominant water flow channel based on a clustering algorithm is provided, the method comprising the following steps:
step S1: collecting parameters of an oil-water well stratum with stratum as a unit in an oil field;
step S2: obtaining a principal component by using a principal component analysis method based on the interlayer parameter of the oil-water well;
and step S3: and clustering the main components by using a K-Means clustering algorithm to identify a dominant water flow channel.
In one possible embodiment, the parameters include: the accumulated water injection amount, the scouring time, the instantaneous water injection amount, the water injection speed, the water consumption rate, the water-flooding oil amount, the water-flooding liquid amount and the water injection strength among wells.
In a possible implementation, the step S2: based on the oil-water well interlayer parameters, obtaining principal components by using a principal component analysis method, wherein the principal components comprise:
step S21: constructing a sample data matrix X of n X m according to the sample data and the parameters of each oil-water well layer, wherein n represents the number of samples, and m represents the number of the parameters of the oil-water well layer;
step S22: carrying out standardization processing on the sample data matrix X;
step S23: calculating a covariance matrix of the sample data matrix X after the standardization processing;
step S24: calculating an eigenvalue and an eigenvector of the covariance matrix;
step S25: calculating a principal component contribution rate and an accumulated contribution rate based on the eigenvalues and the eigenvectors;
step S26: based on the principal component contribution rate and the cumulative contribution rate, p principal components are obtained.
In a possible implementation, the step S3: clustering the principal components by using a K-Means clustering algorithm to identify a dominant water flow channel, comprising:
step S31: randomly selecting 3 data from samples taking a layer as a unit as a central point;
step S32: respectively calculating the distance from each sample point to the selected 3 central points, and classifying the sample points according to the distances;
step S33: calculating each sample point in the classified sample points, and recalculating the average value as a new central point;
step S34: if the new center point calculated again is the same as the original center point, finishing clustering; if the new center point calculated again is different from the original center point, the new center point is assigned to the original center point, and the step S32 is continuously repeated until the calculation is finished;
step S35: according to the finally obtained clusters, the clusters can be divided into three categories: the strong water flow is used for identifying each dominant water flow channel.
In a possible implementation, the step S32: respectively calculating the distance from each sample point to the selected 3 central points, and classifying the sample points according to the distances, wherein the method comprises the following steps:
step 321: calculating the distance from each sample point to 3 central points according to the following formula:
Figure SMS_1
in the formula (I), the compound is shown in the specification,
Figure SMS_2
represents the distance from sample i to the center point 1 at time t;
Figure SMS_3
represents the distance of sample i to the center point 2 at time t;
Figure SMS_4
represents the distance of sample i to the center point 3 at time t;
Figure SMS_5
a value of a principal component k representing the center point 1;
Figure SMS_6
a value of principal component k representing the center point 2;
Figure SMS_7
a value of a principal component k representing the center point 3;
Figure SMS_8
a value representing a principal component k of a sample i; i denotes the sample number and k denotes the principal component number.
Step 322: sequentially comparing the distance from each sample point to each center, dividing the sample object into the clusters of the centers closest to each other, and obtaining 3 new cluster types
Figure SMS_9
Figure SMS_10
Figure SMS_11
In one possible embodiment, the new center point is calculated as follows:
Figure SMS_12
in the formula (I), the compound is shown in the specification,
Figure SMS_13
represents the center point 1 at time t + 1;
Figure SMS_17
represents the center point 2 at time t + 1;
Figure SMS_19
represents the center point 3 at time t + 1;
Figure SMS_15
represents the distance from sample i to the center point 1 at time t;
Figure SMS_18
represents the distance of sample i to the center point 2 at time t;
Figure SMS_20
represents the distance of sample i to the center point 3 at time t;
Figure SMS_21
is a first cluster;
Figure SMS_14
is a second type cluster;
Figure SMS_16
is a third type cluster.
According to one aspect of the present disclosure, there is provided a system for automatically identifying a multi-parameter dominant water flow channel based on a clustering algorithm, the system comprising: the device comprises a collecting unit, a principal component analyzing unit and a clustering algorithm unit; wherein the content of the first and second substances,
the acquisition unit is used for acquiring the parameters of the oil-water well layer by layer in the oil field;
the principal component analysis unit is used for obtaining principal components by using a principal component analysis method based on the parameters of the oil-water well interlayer;
and the clustering algorithm unit is used for clustering the main components by using a K-Means clustering algorithm and identifying a dominant water flow channel.
In a possible embodiment, the parameters in the acquisition unit include: the accumulated water injection amount, the scouring time, the instantaneous water injection amount, the water injection speed, the water consumption rate, the water-flooding oil amount, the water-flooding liquid amount and the water injection strength among wells.
In one possible embodiment, the principal component analysis unit includes: the device comprises a construction module, a standardization processing module, a first calculation module, a second calculation module, a third calculation module and an acquisition module; wherein the content of the first and second substances,
the construction module is used for constructing a sample data matrix X of n X m according to the sample data and the parameters of each oil-water well interlayer, wherein n is the number of samples, and m is the number of the parameters of the oil-water well interlayer;
the standardization processing module is used for carrying out standardization processing on the sample data matrix X;
the first calculation module is used for calculating a covariance matrix of the sample data matrix X after the standardization processing;
the second calculation module is used for calculating an eigenvalue and an eigenvector of the covariance matrix;
the third calculation module is used for calculating the principal component contribution rate and the accumulated contribution rate based on the characteristic value and the characteristic vector;
and the acquisition module is used for acquiring p principal components based on the principal component contribution rate and the accumulated contribution rate.
In a possible implementation, the clustering algorithm unit includes: the device comprises a random selection module, a classification module, a fourth calculation module, a judgment module and an identification module; wherein the content of the first and second substances,
the random selection module is used for randomly selecting 3 data in a sample with a layer as a unit as a central point;
the classification module is used for respectively calculating the distance from each sample point to the selected 3 central points and classifying the sample points according to the distances;
the fourth calculation module is used for calculating each sample point in the classified data, and recalculating the average value as a new central point;
the judging module is used for finishing clustering if the newly calculated central point is the same as the original central point; if the new center point calculated again is different from the original center point, the new center point is assigned to the original center point, and the step S32 is continuously repeated until the calculation is finished;
the identification module is used for dividing the clusters into three categories according to the finally obtained clusters: the strong water flow is the channel for identifying each dominant water flow.
The invention has the technical effects and advantages that:
through the collected parameters of each oil-water well interlayer, principal components are preferably selected by using a principal component analysis method, and then the dominant water flow channel of each layer is automatically identified by using a clustering algorithm. The main influence factors can be selected from the multiple parameters, the influence of human factors is reduced while the fast recognition is carried out, the recognition precision and the prediction speed are improved, and the water injection development effect of the oil field can be effectively guided.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by the practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
FIG. 1 is a flow chart of a method for automatically identifying a multi-parameter dominant water flow channel based on a clustering algorithm according to an exemplary embodiment of the invention;
FIG. 2 is a flow chart of a principal component analysis method according to an exemplary embodiment of the present invention;
FIG. 3 is a flowchart of a K-means clustering algorithm according to an exemplary embodiment of the present invention;
fig. 4 is a schematic diagram of an automatic identification system for a multi-parameter dominant water flow channel based on a clustering algorithm according to an exemplary embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Fig. 1 is a flowchart of an automatic identification method for a multi-parameter dominant water flow channel based on a clustering algorithm according to an exemplary embodiment of the present invention, and as shown in fig. 1, an automatic identification method for a multi-parameter dominant water flow channel based on a clustering algorithm according to an exemplary embodiment of the present invention includes the following steps:
step S1: collecting parameters of an oil-water well layer of an oil field by taking the layer as a unit;
step S2: obtaining a principal component by using a principal component analysis method based on the interlayer parameter of the oil-water well;
and step S3: and clustering the main components by using a K-Means clustering algorithm to identify a dominant water flow channel.
In step S1 of the present invention, parameters of the oil-water well interval in each interval of the oil field are collected, including data of the inter-well accumulated water injection amount, the flushing time, the instantaneous water injection amount, the water injection speed, the water consumption rate, the water flooding oil amount, the water flooding liquid amount, the water injection strength, etc., and the parameter data of each small interval and all time points are sorted out, and a data table is shown in table 1.
TABLE 1 Collection of parameters between oil and water well layers in oil field
Figure SMS_22
In step S2 of the present invention, a principal component analysis method is used to determine a main influence parameter, and fig. 2 is a flowchart of a principal component analysis method according to an exemplary embodiment of the present invention, as shown in fig. 2, including the following steps:
step S21: firstly, combining the number of samples (n) and the interlayer parameter (m) of the oil-water well, constructing a sample data matrix X of n X m, wherein the expression is as follows:
Figure SMS_23
in the formula (I), the compound is shown in the specification,
Figure SMS_24
represents the value of parameter 1 for sample 1;
Figure SMS_28
represents the value of parameter 2 for sample 1;
Figure SMS_31
represents the value of the parameter m for sample 1;
Figure SMS_27
a value representing parameter 1 of sample 2;
Figure SMS_30
represents the value of parameter 2 of sample 2;
Figure SMS_33
represents the value of the parameter m for sample 2;
Figure SMS_34
a value representing the sample n parameter 1;
Figure SMS_25
a value representing a sample n parameter 2;
Figure SMS_29
represents the value of the parameter m for the sample n;
Figure SMS_32
a matrix of n sample values representing parameter 1;
Figure SMS_35
a matrix of n sample values representing parameter 2;
Figure SMS_26
a matrix of n sample values representing a parameter m; n represents the number of samples; m represents the number of parameters between oil-water well layers.
Step S22: calculate mean by column
Figure SMS_36
Difference of sum and sample
Figure SMS_37
Normalizing data
Figure SMS_38
And carrying out standardization processing on the sample data matrix X to obtain:
Figure SMS_39
in the formula (I), the compound is shown in the specification,
Figure SMS_42
represents the normalized value of parameter 1 for sample 1;
Figure SMS_44
represents the normalized value of parameter 2 for sample 1;
Figure SMS_48
represents the normalized value of the parameter m of sample 1;
Figure SMS_43
represents the normalized value of parameter 1 for sample 2;
Figure SMS_45
represents the normalized value of parameter 2 for sample 2;
Figure SMS_49
represents the normalized value of the parameter m of sample 2;
Figure SMS_52
represents the normalized value of the parameter 1 of the sample n;
Figure SMS_40
represents the normalized value of the sample n parameter 2;
Figure SMS_46
represents the normalized value of the parameter m of the sample n;
Figure SMS_51
a matrix of normalized values of n samples representing parameter 1;
Figure SMS_53
a matrix of normalized values of n samples representing parameter 2;
Figure SMS_41
a matrix of normalized values of n samples representing the parameter m; n represents the number of samples; m represents the number of parameters between oil-water well layers;
Figure SMS_47
represents the value of the parameter j of the sample i;
Figure SMS_50
the normalized value of the parameter j of the sample i is shown.
Step S23: and (3) calculating a covariance matrix according to the sample data matrix after the standardization treatment:
Figure SMS_54
wherein
Figure SMS_55
Step S24: and (3) calculating an eigenvalue and an eigenvector of the covariance matrix:
characteristic value:
Figure SMS_56
feature vector:
Figure SMS_57
step S25: calculating the contribution rate and the cumulative contribution rate of each component:
contribution rate:
Figure SMS_58
cumulative contribution rate:
Figure SMS_59
step S26: and acquiring p principal components according to the accumulated contribution rate.
Step S261: taking the first, second, and pth (p is less than or equal to m) principal components corresponding to the characteristic values with the cumulative contribution rate of more than 80%;
Figure SMS_60
ith principal component:
Figure SMS_61
step S262: so far, m influence indexes are converted into p main components by using a main component analysis method
Figure SMS_62
In step S3 of the present invention, a K-Means clustering algorithm is used to cluster the principal components and identify the dominant water flow channel, fig. 3 is a flow chart of the K-Means clustering algorithm according to the exemplary embodiment of the present invention, as shown in fig. 3, including the following steps:
step S31: randomly selecting 3 data from samples taking a layer as a unit as a central point Z;
Figure SMS_63
in the formula (I), the compound is shown in the specification,
Figure SMS_65
a center point 1 representing time t;
Figure SMS_70
a center point 2 representing time t;
Figure SMS_71
a center point 3 representing time t;
Figure SMS_67
a value of principal component 1 representing the center point 1;
Figure SMS_68
a value representing the principal component 2 of the center point 1;
Figure SMS_72
a value representing the principal component p of the center point 1;
Figure SMS_74
a value representing principal component 1 of center point 2;
Figure SMS_64
a value of principal component 2 representing the center point 2;
Figure SMS_69
a value representing the principal component p of the center point 2;
Figure SMS_73
a value of principal component 1 representing the center point 3;
Figure SMS_75
a value of the principal component 2 representing the center point 3;
Figure SMS_66
representing the value of the principal component p of the center point 3.
Step S32: respectively calculating the distance from each sample point D to the selected 3 central points, and classifying the sample points according to the distances;
step S321: calculating the distance from each sample point to 3 central points;
Figure SMS_76
in the formula (I), the compound is shown in the specification,
Figure SMS_77
represents the distance from sample i to the center point 1 at time t;
Figure SMS_78
represents the distance of sample i to the center point 2 at time t;
Figure SMS_79
represents the distance of sample i to the center point 3 at time t;
Figure SMS_80
a value of a principal component k representing the center point 1;
Figure SMS_81
a value of principal component k representing the center point 2;
Figure SMS_82
a value of a principal component k representing the center point 3;
Figure SMS_83
a value representing a principal component k of a sample i; i denotes the sample number and k denotes the principal component number.
Step S322: sequentially comparing the distance from each sample point to each center, dividing the sample objects into clusters with the nearest centers to obtain 3 new cluster types
Figure SMS_84
Figure SMS_85
Figure SMS_86
Step S33: calculating each sample point in the classified sample points, and recalculating the average value as a new central point;
Figure SMS_87
in the formula (I), the compound is shown in the specification,
Figure SMS_90
represents the center point 1 at time t + 1;
Figure SMS_93
represents the center point 2 at time t + 1;
Figure SMS_95
represents the center point 3 at time t + 1;
Figure SMS_89
represents the distance from sample i to the center point 1 at time t;
Figure SMS_91
to representThe distance from sample i to the center point 2 at time t;
Figure SMS_94
represents the distance of sample i to the center point 3 at time t;
Figure SMS_96
is a first cluster;
Figure SMS_88
is a second type cluster;
Figure SMS_92
is a third type cluster.
Step S34: if the central point of the step t +1 is recalculated to be the same as the central point of the step t, finishing clustering; if the center point of the t +1 step is different from the t step, the new center point is assigned to the original center point
Figure SMS_97
Continuing to repeat the step S32 until the calculation is finished;
step S35: according to the finally determined clusters, the clusters can be divided into three categories: strong water flow, normal water flow and weak water flow, wherein the strong water flow is the dominant water flow channels identified, and a table 2 is obtained.
TABLE 2 Final determined clustering Table
Figure SMS_98
Fig. 4 is a schematic diagram of an automatic identification system for a multi-parameter dominant water flow channel based on a clustering algorithm according to an exemplary embodiment of the present invention, and as shown in fig. 4, an automatic identification system for a multi-parameter dominant water flow channel based on a clustering algorithm according to an exemplary embodiment of the present invention is provided, and the system includes: the device comprises a collecting unit, a principal component analyzing unit and a clustering algorithm unit; the device comprises a collecting unit, a control unit and a control unit, wherein the collecting unit is used for collecting the parameters of an oil-water well layer by layer of an oil field; the principal component analysis unit is used for obtaining principal components by using a principal component analysis method based on the parameters of the oil-water well interlayer; and the clustering algorithm unit is used for clustering the main components by using a K-Means clustering algorithm and identifying a dominant water flow channel.
Further, the parameters in the acquisition unit include: the accumulated water injection amount, the scouring time, the instantaneous water injection amount, the water injection speed, the water consumption rate, the water-flooding oil amount, the water-flooding liquid amount and the water injection strength among wells. The principal component analysis unit includes: the device comprises a construction module, a standardization processing module, a first calculation module, a second calculation module, a third calculation module and an acquisition module; the construction module is used for constructing a sample data matrix X of n X m according to the sample data and the parameters of each oil-water well interlayer, wherein n represents the number of samples, and m represents the number of the parameters of the oil-water well interlayer; the standardization processing module is used for carrying out standardization processing on the sample data matrix X; the first calculation module is used for calculating a covariance matrix of the sample data matrix X after the standardization processing; the second calculation module is used for calculating an eigenvalue and an eigenvector of the covariance matrix; the third calculation module is used for calculating the principal component contribution rate and the accumulated contribution rate based on the characteristic value and the characteristic vector; and the acquisition module is used for acquiring p principal components based on the principal component contribution rate and the accumulated contribution rate. The clustering algorithm unit comprises: the device comprises a random selection module, a classification module, a fourth calculation module, a judgment module and an identification module; the random selection module is used for randomly selecting 3 data in a sample with a layer as a unit as a central point; the classification module is used for respectively calculating the distance from each sample point to the selected 3 central points and classifying the sample points according to the distances; the fourth calculation module is used for calculating each sample point in the classified data, and recalculating the average value as a new central point; the judging module is used for finishing clustering if the newly calculated central point is the same as the original central point; if the new center point calculated again is different from the original center point, the new center point is assigned to the original center point, and the step S32 is continuously repeated until the calculation is finished; the identification module is used for dividing the obtained clusters into three categories: the strong water flow is used for identifying each dominant water flow channel.
Finally, it should be noted that: although the present invention has been described in detail with reference to the foregoing embodiments, it will be apparent to those skilled in the art that modifications may be made to the embodiments or portions thereof without departing from the spirit and scope of the invention.

Claims (10)

1. A multi-parameter dominant water flow channel automatic identification method based on a clustering algorithm is characterized by comprising the following steps:
step S1: collecting parameters of an oil-water well layer of an oil field by taking the layer as a unit;
step S2: obtaining a principal component by using a principal component analysis method based on the interlayer parameter of the oil-water well;
and step S3: and clustering the main components by using a K-Means clustering algorithm to identify a dominant water flow channel.
2. The clustering algorithm-based multi-parameter dominant water flow channel automatic identification method according to claim 1, wherein the parameters comprise: the accumulated water injection amount, the scouring time, the instantaneous water injection amount, the water injection speed, the water consumption rate, the water-flooding oil amount, the water-flooding liquid amount and the water injection strength among wells.
3. The method for automatically identifying the multiparameter dominant water flow channel based on the clustering algorithm as claimed in claim 1 or 2, wherein the step S2: based on the oil-water interlayer parameters, obtaining principal components by using a principal component analysis method, wherein the method comprises the following steps:
step S21: constructing a sample data matrix X of n X m according to the sample data and the parameters of each oil-water well layer, wherein n represents the number of samples, and m represents the number of the parameters of the oil-water well layer;
step S22: carrying out standardization processing on the sample data matrix X;
step S23: calculating a covariance matrix of the sample data matrix X after the standardization processing;
step S24: calculating an eigenvalue and an eigenvector of the covariance matrix;
step S25: calculating a principal component contribution rate and an accumulated contribution rate based on the eigenvalues and the eigenvectors;
step S26: based on the principal component contribution rate and the cumulative contribution rate, p principal components are obtained.
4. The method for automatically identifying the multiparameter dominant water flow channel based on the clustering algorithm as claimed in claim 1, wherein the step S3: clustering the principal components by using a K-Means clustering algorithm to identify a dominant water flow channel, comprising:
step S31: randomly selecting 3 data from samples taking a layer as a unit as a central point;
step S32: respectively calculating the distance from each sample point to the selected 3 central points, and classifying the sample points according to the distances;
step S33: calculating each sample point in the classified sample points, and recalculating the average value as a new central point;
step S34: if the new center point recalculated is the same as the original center point, finishing clustering; if the new center point calculated again is different from the original center point, the new center point is assigned to the original center point, and the step S32 is continuously repeated until the calculation is finished;
step S35: according to the finally obtained clusters, the clusters can be divided into three categories: the strong water flow is used for identifying each dominant water flow channel.
5. The method for automatically identifying the multiparameter dominant water flow channel based on the clustering algorithm as claimed in claim 4, wherein the step S32: respectively calculating the distance from each sample point to the selected 3 central points, and classifying the sample points according to the distances, wherein the method comprises the following steps:
step 321: calculating the distance from each sample point to 3 central points according to the following formula:
Figure QLYQS_1
in the formula (I), the compound is shown in the specification,
Figure QLYQS_2
represents the distance from sample i to the center point 1 at time t;
Figure QLYQS_3
represents the distance of sample i to the center point 2 at time t;
Figure QLYQS_4
represents the distance of sample i to the center point 3 at time t;
Figure QLYQS_5
a value of a principal component k representing the center point 1;
Figure QLYQS_6
a value of principal component k representing the center point 2;
Figure QLYQS_7
a value of a principal component k representing the center point 3;
Figure QLYQS_8
a value representing a principal component k of a sample i; i represents a sample number, k represents a principal component number;
step 322: sequentially comparing the distance from each sample point to each center, dividing the sample object into the clusters of the centers closest to each other, and obtaining 3 new cluster types
Figure QLYQS_9
Figure QLYQS_10
Figure QLYQS_11
6. The method for automatically identifying the multiparameter dominant water flow channel based on the clustering algorithm as claimed in claim 4, wherein the new central point is calculated as follows:
Figure QLYQS_12
in the formula (I), the compound is shown in the specification,
Figure QLYQS_14
represents the center point 1 at time t + 1;
Figure QLYQS_16
represents the center point 2 at time t + 1;
Figure QLYQS_20
center point 3 representing time t + 1;
Figure QLYQS_15
represents the distance from sample i to the center point 1 at time t;
Figure QLYQS_18
represents the distance of sample i to the center point 2 at time t;
Figure QLYQS_19
represents the distance of sample i to the center point 3 at time t;
Figure QLYQS_21
is a first cluster;
Figure QLYQS_13
is a second type cluster;
Figure QLYQS_17
is a third type cluster.
7. A multi-parameter dominant water flow channel automatic identification system based on a clustering algorithm is characterized by comprising: the device comprises a collecting unit, a principal component analyzing unit and a clustering algorithm unit; wherein the content of the first and second substances,
the acquisition unit is used for acquiring the parameters of the oil-water well layer by layer in the oil field;
the principal component analysis unit is used for obtaining principal components by using a principal component analysis method based on the parameters of the oil-water well interlayer;
and the clustering algorithm unit is used for clustering the main components by using a K-Means clustering algorithm and identifying a dominant water flow channel.
8. The system for automatically identifying the multi-parameter dominant water flow channel based on the clustering algorithm as claimed in claim 7, wherein the parameters in the acquisition unit comprise: the accumulated water injection amount, the scouring time, the instantaneous water injection amount, the water injection speed, the water consumption rate, the water-flooding oil amount, the water-flooding liquid amount and the water injection strength among wells.
9. The system for automatically identifying the multiparameter dominant water flow channel based on the clustering algorithm according to claim 7 or 8, wherein the principal component analysis unit comprises: the device comprises a construction module, a standardization processing module, a first calculation module, a second calculation module, a third calculation module and an acquisition module; wherein the content of the first and second substances,
the construction module is used for constructing a sample data matrix X of n X m according to the sample data and the parameters of each oil-water well interlayer, wherein n is the number of samples, and m is the number of the parameters of the oil-water well interlayer;
the standardization processing module is used for carrying out standardization processing on the sample data matrix X;
the first calculation module is used for calculating a covariance matrix of the sample data matrix X after the standardization processing;
the second calculation module is used for calculating an eigenvalue and an eigenvector of the covariance matrix;
the third calculation module is used for calculating the principal component contribution rate and the accumulated contribution rate based on the characteristic value and the characteristic vector;
and the acquisition module is used for acquiring p principal components based on the principal component contribution rate and the accumulated contribution rate.
10. The system for automatically identifying the multiparameter dominant water flow channel based on the clustering algorithm as claimed in claim 7, wherein the clustering algorithm unit comprises: the device comprises a random selection module, a classification module, a fourth calculation module, a judgment module and an identification module; wherein the content of the first and second substances,
the random selection module is used for randomly selecting 3 data in a sample with a layer as a unit as a central point;
the classification module is used for respectively calculating the distance from each sample point to the selected 3 central points and classifying the sample points according to the distances;
the fourth calculation module is used for calculating each sample point in the classified data, and recalculating the average value as a new central point;
the judging module is used for finishing clustering if the newly calculated central point is the same as the original central point; if the new center point calculated again is different from the original center point, the new center point is assigned to the original center point, and the step S32 is continuously repeated until the calculation is finished;
the identification module is used for dividing the clusters into three categories according to the finally obtained clusters: the strong water flow is used for identifying each dominant water flow channel.
CN202310107756.5A 2023-02-14 2023-02-14 Clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system Pending CN115828130A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310107756.5A CN115828130A (en) 2023-02-14 2023-02-14 Clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310107756.5A CN115828130A (en) 2023-02-14 2023-02-14 Clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system

Publications (1)

Publication Number Publication Date
CN115828130A true CN115828130A (en) 2023-03-21

Family

ID=85521177

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310107756.5A Pending CN115828130A (en) 2023-02-14 2023-02-14 Clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system

Country Status (1)

Country Link
CN (1) CN115828130A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102419787A (en) * 2011-10-19 2012-04-18 中国石油天然气股份有限公司 Method and system for identifying water flow dominant channels of high-water-content oil field
US20130013255A1 (en) * 2011-07-06 2013-01-10 Honeywell International Inc. Automatic identification of operating parameters for power plants
WO2015099766A1 (en) * 2013-12-27 2015-07-02 Halliburton Energy Services, Inc. Synthetic gas-oil-ratio determination for gas dominant fluids
CN108678728A (en) * 2018-05-15 2018-10-19 北京理工大学 A kind of oilwell parameter analysis combinational algorithm based on k-means
CN109740641A (en) * 2018-12-18 2019-05-10 清华四川能源互联网研究院 A kind of area user identification zone method clustered based on principal component analysis and k-means
CN113283180A (en) * 2021-06-28 2021-08-20 中国石油大学(华东) K-means and SVR combination-based tight reservoir horizontal well fracturing productivity prediction method and application

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130013255A1 (en) * 2011-07-06 2013-01-10 Honeywell International Inc. Automatic identification of operating parameters for power plants
CN102419787A (en) * 2011-10-19 2012-04-18 中国石油天然气股份有限公司 Method and system for identifying water flow dominant channels of high-water-content oil field
WO2015099766A1 (en) * 2013-12-27 2015-07-02 Halliburton Energy Services, Inc. Synthetic gas-oil-ratio determination for gas dominant fluids
CN108678728A (en) * 2018-05-15 2018-10-19 北京理工大学 A kind of oilwell parameter analysis combinational algorithm based on k-means
CN109740641A (en) * 2018-12-18 2019-05-10 清华四川能源互联网研究院 A kind of area user identification zone method clustered based on principal component analysis and k-means
CN113283180A (en) * 2021-06-28 2021-08-20 中国石油大学(华东) K-means and SVR combination-based tight reservoir horizontal well fracturing productivity prediction method and application

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
梅永贵;骆裕明;王景悦;薛占新;石延霞;张斌;: "基于聚类分析的煤层气抽油机井能耗特征研究" *

Similar Documents

Publication Publication Date Title
CN108009716B (en) Multi-level evaluation method for influence factors of horizontal well volume fracturing effect
CN104318109B (en) Shale gas reservoir stratum identification method based on support vector machines
CN105760673B (en) A kind of fluvial depositional reservoir seismic-sensitive parameterized template analysis method
CN107387051B (en) Repeated fracturing well selection method for multi-stage fractured horizontal well with low-permeability heterogeneous oil reservoir
CN112529341B (en) Drilling well leakage probability prediction method based on naive Bayesian algorithm
CN111046341A (en) Unconventional natural gas fracturing effect evaluation and capacity prediction method based on principal component analysis
CN109184677A (en) Reservoir evaluation methods for heterogeneous alternating layers sand body
CN105986811A (en) Five-dimension lithologic character judgment and recognition method based on conventional logging data
CN115358285B (en) Method, device and equipment for selecting key geological parameters of block to be surveyed
CN109594967A (en) A kind of lock of tool drilling detection method for early warning based on well logging big data
CN116384618A (en) Method and system for profile control and flooding well selection of large well spacing oil reservoir
CN109670729A (en) A kind of top plate aquifer water well evaluation method
CN116562174A (en) Method and device for determining water distribution and injection quantity
CN110866551B (en) Drilling data dimension reduction method based on high correlation filtering algorithm and PCA algorithm
CN115828130A (en) Clustering algorithm-based multi-parameter dominant water flow channel automatic identification method and system
CN116562428A (en) Fracturing construction parameter optimization method based on machine learning
CN110956388A (en) Method for generating yield increasing scheme of offshore oil and gas reservoir
CN114562236B (en) Geological engineering integrated lost circulation real-time early warning method based on integrated learning model
CN117332668A (en) Shale gas well fracturing effect analysis method and system based on data driving
CN114444964B (en) Drilling process operation performance evaluation method
CN103049463A (en) Seismic attribute optimal selection method based on rough set theory
CN113627640A (en) Productivity well testing prediction method and system for fracture-cavity type oil reservoir oil and gas well
CN110656924A (en) Ultra-low permeability oil reservoir classification method
CN112132369A (en) Petroleum geology evaluation index assignment method
CN104462792A (en) Log data lithological stratum numerical value reduction method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination