CN116662847A - Data distortion identification method based on association density clustering and application thereof - Google Patents

Data distortion identification method based on association density clustering and application thereof Download PDF

Info

Publication number
CN116662847A
CN116662847A CN202310279317.2A CN202310279317A CN116662847A CN 116662847 A CN116662847 A CN 116662847A CN 202310279317 A CN202310279317 A CN 202310279317A CN 116662847 A CN116662847 A CN 116662847A
Authority
CN
China
Prior art keywords
clustering
data
association
window
strain
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202310279317.2A
Other languages
Chinese (zh)
Inventor
王娟
杜晓莹
申祖晨
祁鑫
杨娜
白凡
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jiaotong University
Original Assignee
Beijing Jiaotong University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jiaotong University filed Critical Beijing Jiaotong University
Priority to CN202310279317.2A priority Critical patent/CN116662847A/en
Publication of CN116662847A publication Critical patent/CN116662847A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F17/00Digital computing or data processing equipment or methods, specially adapted for specific functions
    • G06F17/10Complex mathematical operations
    • G06F17/16Matrix or vector computation, e.g. matrix-matrix or matrix-vector multiplication, matrix factorization
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Mathematical Physics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Computational Mathematics (AREA)
  • Pure & Applied Mathematics (AREA)
  • Mathematical Analysis (AREA)
  • Mathematical Optimization (AREA)
  • Evolutionary Computation (AREA)
  • Computing Systems (AREA)
  • Algebra (AREA)
  • Databases & Information Systems (AREA)
  • Software Systems (AREA)
  • Testing Or Calibration Of Command Recording Devices (AREA)

Abstract

The application discloses a data distortion identification method based on association density clustering, which comprises the following steps: and for the historical time series of n strain measuring points of the strain sensor, respectively measuring the association degree between the measuring point sequences through the angle of the overall displacement difference representing the similarity of the n strain measuring points and the overall first-order slope difference representing the similarity of the development trend and the overall second-order slope difference of the similarity of the development speed based on gray type B association, constructing a classification model, and identifying real-time monitoring data based on the classification model. The application can complete the data distortion identification caused by hardware faults, can realize the fault diagnosis of multi-measuring point equipment by utilizing the relevance of multi-source data, and improves the reliability and the robustness of clustering.

Description

Data distortion identification method based on association density clustering and application thereof
Technical Field
The application relates to the technical field of data processing, in particular to a data distortion identification method based on association density clustering and application thereof.
Background
When the structural safety state is evaluated based on the monitoring data, real and reliable actually measured data is a precondition for searching the actual state of the structure, so that the data obtained by the monitoring sensor cannot be simply and accurately defaulted. For most health monitoring systems, the service lives of the sensors and other equipment are only decades or even more than ten years, compared with the service lives of ancient wood structures for hundreds of years, faults caused by external environment, aging, electromagnetic interference and other reasons are not rare in the service process, so that the real-time identification of data distortion caused by equipment faults is realized, the faults can be processed timely, the reliability of the monitoring system is ensured, reliable and accurate data are provided for structural safety evaluation, and structural abnormality misjudgment of the monitoring system can be reduced.
The data distortion caused by the system hardware fault is different from the abnormality caused by the structural damage, so that the correct distinction is ensured, the data distortion caused by the equipment abnormality is often single or few measuring points, the structural strain has obvious correlation among the strain measuring points under the influence of the environmental temperature, and the data information of a single sensor is considered to be single and the unilateral performance is strong, and meanwhile, the structural strain is utilized to have a relatively stable time sequence characteristic.
Disclosure of Invention
The application aims to provide a data distortion identification method based on association density clustering, which aims to solve the technical problems in the prior art.
In order to solve the technical problems, the application specifically provides the following technical scheme:
in a first aspect of the present application, there is provided a data distortion identification method based on association density clustering, including:
historical time series x= { X for n strain measurement points of strain sensor 1 (t),x 2 (t),...,x n (t) } wherein 0.ltoreq.t.ltoreq.m, based on grey type B correlations, respectively by an overall displacement difference representing the closeness of the n strain pointsAnd measuring the correlation degree among the measuring point sequences at the angle of the overall first-order slope difference representing the similarity of the development trend and the overall second-order slope difference representing the similarity of the development speed, constructing a classification model, and identifying real-time monitoring data based on the classification model.
Preferably, the method further comprises a normalization process for the historical time series:
dividing a step-by-step clustering window for n measuring point history time sequences of a certain period of time, wherein the length w determines the time section of abnormal positioning, and the shorter the window is, the stronger the instantaneity of judgment is;
performing standardization processing on each sequence of the window by utilizing the z-score to eliminate the dimension or balance the measurement dimension of each strain sensor;
the time sequence of a certain window after normalization is shown as a formula (6),
wherein n is the measuring point sequence number, and m is the time sequence number.
Preferably, the method further comprises smoothing and denoising the normalized historical time series:
the Savitzky-Golay method is selected for smooth denoising, the convolution and polynomial regression are combined to realize smooth filtering, and different smooth effects are realized by adjusting the sliding window and the fitting order, which are expressed as follows:
wherein 2l+1 is the sliding window length, x k Represents the center of the sliding window, h i The smoothing coefficient is obtained by a least square fitting polynomial;
wherein, the smaller the polynomial fitting order, the longer the window length, the more remarkable the smoothing effect.
Preferably, constructing the classification model includes:
respectively carrying out differential processing on the smoothed and denoised window sequence as an overall displacement difference, an overall first-order slope difference and an overall second-order slope difference according to columns, merging the window sequence with a matrix X to form a new matrix, and adding a weight coefficient into the new matrix to obtain a clustering feature matrix Y;
performing cluster analysis, and determining Manhattan distance as a calculation mode of the inter-sequence distance according to a calculation method of the inter-sequence distance in a cluster calculation process;
and determining parameters (E, minPts) according to actual engineering requirements, obtaining a final clustering result, and constructing a classification model.
Preferably, the new matrix comprises:
overall displacement difference matrix:
first order differential matrix:
second order differential matrix:
clustering feature moment
Wherein dif ij =x i(j+1) -x ijw 1 ,w 2 ,w 3 As a weight, if more dependence on proximity features in cluster analysis increases w 1 Reduce w 2 And w 3 The method comprises the steps of carrying out a first treatment on the surface of the Increasing w if more dependent on similarity features 2 And w 3 Reduce w 1
Preferably, the manhattan distance is as in formula (11):
preferably, identifying real-time monitoring data based on the classification model includes:
the method comprises the steps of carrying out clustering analysis on monitoring data collected in real time step by adopting the same window length w;
when the clustering results of all the measuring points are consistent with the clustering model, judging that the time data is normal;
when the clustering results are inconsistent and the outliers appear, the clustering results are regarded as distortion;
and when the clustering results are inconsistent and the outliers are not available, further carrying out data distortion identification.
On the other hand, the application provides application of the data distortion identification method, which is applied to health monitoring of wood structures.
Preferably, the measurement points include strain measurement points of the homogeneous component and strain measurement points of the heterogeneous component.
Compared with the prior art, the application has the following beneficial effects:
the application completes the data distortion recognition caused by hardware faults based on the gray B-type association density clustering method, can realize the fault diagnosis of multi-measuring point equipment by utilizing the association of multi-source data, and the improved gray B-type association degree realizes the determination and calculation of characteristic attributes, thereby improving the reliability and robustness of clustering and better recognizing the distorted data.
Drawings
In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below. It will be apparent to those of ordinary skill in the art that the drawings in the following description are exemplary only and that other implementations can be obtained from the extensions of the drawings provided without inventive effort.
FIG. 1 is a strain sensor layout of the present application;
FIG. 2 is a schematic representation of a strain and temperature and humidity sensor arrangement according to the present application;
FIG. 3 is a schematic diagram of a similar component strain measurement point cluster mode I according to the present application;
FIG. 4 is a schematic diagram of a similar component strain measurement point cluster mode II according to the present application;
FIG. 5 is a schematic diagram of a similar component strain measurement point cluster mode III of the present application.
FIG. 6 is a schematic diagram of a cluster mode one of strain measurement points of a heterogeneous component according to the present application;
FIG. 7 is a schematic diagram of a cluster pattern two of strain measurement points of a heterogeneous component according to the present application;
FIG. 8 is a schematic diagram of a cluster mode III of strain measurement points of a heterogeneous component according to the present application;
FIG. 9 is a flow chart of data distortion identification based on density clustering in accordance with the present application;
FIG. 10 is a schematic diagram of C2-4 outlier identification of the present application;
FIG. 11 is a schematic diagram of C2-4 constant offset identification of the present application;
FIG. 12 is a schematic diagram of C2-3 data stuck identification of the present application;
fig. 13 is a flow chart of the overall method of the present application.
Detailed Description
The following description of the embodiments of the present application will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present application, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the application without making any inventive effort, are intended to be within the scope of the application.
As shown in fig. 1, the application provides a data distortion identification method based on association density clustering, which is used for historical time series x= { X of n strain measuring points of a strain sensor 1 (t),x 2 (t),...,x n (t) } wherein 0.ltoreq.t.ltoreq.m, based on gray type B correlations, by an overall displacement difference representing the similarity of n strain measurement points, and an overall first-order slope difference and a sum of the differences representing the similarity of the development trends, respectivelyThe degree of association between measuring point sequences is measured by the angle of the overall second-order slope difference of the speed-expanding similarity, a classification model is constructed, and real-time monitoring data are identified based on the classification model.
Wherein, in theory, closeness and similarity can be expressed as:
proximity:
similarity:
wherein the method comprises the steps ofFor the total displacement difference>The overall first-order slope difference represents the similarity of the development trend; />The overall second order slope difference represents the similarity of development speed. And further defining the association degree as:
the core idea of measuring the correlation degree between sequences through two angles of similarity and similarity in the original theory is utilized.
When the association degree is calculated, according to the characteristics of the monitoring data, balancing the data scale of two measurement indexes of similarity and similarity, simultaneously considering the actual application condition of engineering, and referring to the specific gravity of the two measurement indexes, obtaining final association characteristics by multiplying the total displacement difference, the total first-order slope difference and the total second-order slope difference by the weight coefficient respectively, wherein the final association characteristics are shown as a formula (5):
if more dependence on proximity features in cluster analysis increases the weight w 1 Reduce the weight w 2 And weight w 3 . If the similarity feature is more dependent, the weight w is increased 2 And weight w 3 Reduce the weight w 1
The normalization of the historical time series is as follows:
dividing a step-by-step clustering window for n measuring point history time sequences of a certain period of time, wherein the length w determines the time section of abnormal positioning, and the shorter the window is, the stronger the instantaneity of judgment is;
performing standardization processing on each sequence of the window by utilizing the z-score to eliminate the dimension or balance the measurement dimension of each strain sensor;
the time sequence of a certain window after normalization is shown as a formula (6),
wherein n is the measuring point sequence number, and m is the time sequence number.
Smoothing and denoising the standardized historical time sequence as follows:
the Savitzky-Golay method is selected for smooth denoising, the convolution and polynomial regression are combined to realize smooth filtering, and different smooth effects are realized by adjusting the sliding window and the fitting order, which are expressed as follows:
wherein 2l+1 is the sliding window length, x k Represents the center of the sliding window, h i Fitting multiple by least squares for smoothing coefficientsObtaining a term;
wherein, the smaller the polynomial fitting order, the longer the window length, the more remarkable the smoothing effect.
The construction of the classification model comprises the following steps:
respectively carrying out differential processing on the smoothed and denoised window sequence as an overall displacement difference, an overall first-order slope difference and an overall second-order slope difference according to columns, merging the window sequence with a matrix X to form a new matrix, and adding a weight coefficient into the new matrix to obtain a clustering feature matrix Y;
the new matrix comprises:
overall displacement difference matrix:
first order differential matrix:
second order differential matrix:
clustering feature moment
Wherein dif ij =x i(j+1) -x ijw 1 ,w 2 ,w 3 As a weight, if more dependence on proximity features in cluster analysis increases w 1 Reduce w 2 And w 3 The method comprises the steps of carrying out a first treatment on the surface of the Increasing w if more dependent on similarity features 2 And w 3 Reduce w 1
Performing cluster analysis, and determining Manhattan distance as a calculation mode of the inter-sequence distance according to a calculation method of the inter-sequence distance in a cluster calculation process;
the manhattan distance is as in formula (11):
parameters (epsilon, minPts) are determined according to actual engineering requirements, a final clustering result is obtained, a classification model is constructed, minpts is the number of clusters generated by clustering, and epsilon refers to w1, w2 and w3 weight parameters.
Identifying real-time monitoring data based on the classification model as follows:
the method comprises the steps of carrying out clustering analysis on monitoring data collected in real time step by adopting the same window length w;
when the clustering results of all the measuring points are consistent with the clustering model, judging that the time data is normal;
when the clustering results are inconsistent and the outliers appear, the clustering results are regarded as distortion;
and when the clustering results are inconsistent and the outliers are not available, further carrying out data distortion identification.
The application also provides application of the data distortion identification method to health monitoring of the wood structure. The measuring points comprise strain measuring points of the same type of component and strain measuring points of different type of component.
Specific examples are provided below for illustration:
as shown in fig. 1 and fig. 2, a wood structure ancient architecture of a health monitoring system with a certain layout structure is taken as an object, a related diagram of a strain sensor is shown in fig. 1 and fig. 2, sampling time intervals are 10 minutes, the length of a distributed clustering window is 24 hours, and 144 sampling points are arranged in each window.
And carrying out standardization and smooth denoising pretreatment according to the method. The characteristics of the acquired data determine that the smoothing order is 3 and the length of the moving window is 25, so that noise can be filtered, and the influence of a smoothing process on distorted data is reduced as much as possible.
When a new characteristic attribute matrix is constructed after smooth noise reduction, the strain data characteristics applied at the time and the requirements on similarity characteristics and similarity characteristics are considered, and the selection weight coefficients are respectively as followsw 1 =0.1,w 2 =0.45,w 3 =0.45。
Cluster analysis of strain measuring points of similar components:
taking 12 through column strain measuring points as an example, training is carried out by utilizing historical strain data, and parameters (epsilon, minPts) in the clustering process are (5.4,1) which basically meet the requirement of obtaining a stable clustering result, so that the robustness is good. The final clustering result presents three modes:
cluster mode one: through cluster analysis, the 12 measuring points in the mode are all classified into two presentation modes, namely a timing diagram of the model presents a nearly sinusoidal form, and a non-fixed form is presented, as shown in figure 3.
Cluster mode two: in the mode, other measuring points except for three C1-2, C1-3 and C1-4 are classified into one type except that the three measuring points are easily classified into wild values, the expression forms of the measuring points classified into one type also show two states, namely, the time sequence diagram is similar to a sine curve form, and the other measuring points are expressed in a non-fixed form, as shown in figure 4.
Clustering mode three: in the mode, the time sequence diagram of each measuring point is disordered, the clustering condition is unobvious and regular, most measuring points cannot be classified, and a consistent clustering result is not provided, as shown in figure 5.
Cluster analysis of strain measurement points of heterogeneous components:
and adding frame beam strain on the basis of the through column strain analysis, and carrying out cluster analysis on 24 measuring points in total. The clustering result presented by the method under the two conditions of higher and lower environmental humidity is basically consistent with the simple column-through strain result. Except for the three post strain measuring points C1-2, C1-3 and C1-4, the measuring points B1-4, B3-2 and B3-4 are measuring points which are easy to be marked as wild values in the beam strain. Therefore, the strain clustering results of the heterogeneous components have consistent regularity, reflect that the clustering analysis method based on the gray theory is good in robustness, and can provide a reliable clustering model for the subsequent real-time identification of data distortion, as shown in fig. 6, 7 and 8.
And (3) verifying a distortion data identification method:
constructing a clustering model: a classification model is built by using 9 measuring points such as C1-1, C2-2, C2-3 and the like. The working condition during training can be adjusted to 2 conditions, and two clustering models are correspondingly constructed. Respectively "gather into a class" and "irregular". At this time, the "group" corresponding pattern one in training is shown in table 2, and the total duty ratio reaches 86%, which indicates that the clustering method can be used to process the data distortion condition of not less than 86% in the 9 measurement point time series at the same time. Therefore, when the cluster analysis of the real-time multi-measuring-point time sequence is carried out, if the cluster analysis is consistent with the first model, the representative data is normal; if the wild value appears, the corresponding measuring point can be judged as data distortion; and if different classification conditions occur, carrying out subsequent structural abnormality recognition to further judge. The specific identification flow is shown in fig. 9.
Table 2 build a clustering model
The method uses 9 through-column strain measuring points of 6 months in 2020 to perform analog distortion data identification verification, adds outliers, constant offset and data locking three forms of distortion analog data into strain sequences of 4, 6, 8, 14 and 15 days, and can identify corresponding data distortion measuring points, which are respectively shown in figures 10, 11 and 12.
To further explore the sensitivity of the method to three data distortion type identifications, hierarchical simulations were performed on the outlier data.
(1) Outliers
By analyzing the historical strain time sequence, the strain fluctuation range of each measuring point is less than 400 [ mu ] epsilon on average in years. The single point outlier anomalies were therefore classified into three classes, as shown in table 3, with 10 outlier distortion data applied for each class, with the anomalies applied for each simulation being random values within the class range. The probability of having outlier points is identified by the method, analyzed, at each level, as set forth in table 3. The results show that this method can effectively identify absolute outliers above 300 (mu epsilon), but cannot identify outliers below 200 (mu epsilon).
TABLE 3 outlier identification sensitivity control Table
Table 3Comparison of outlier identification sensitivity
(2) Data card death
For the continuous data distortion type of data stuck, the sensitivity of the method to the continuous distorted data time span needs to be explored. Under the sampling frequency of 10min average, through calculation, the method can effectively identify the data locking condition of more than 40 continuous sampling points, namely, the method is suitable for the condition of more than 6.7 hours of time span. Because the method performs cluster analysis under the condition of taking 24 hours as the window length, the data distortion category can be timely identified.
(3) Constant offset
The constant offset is also of a continuous data distortion type, and comprises two variables of a continuous distortion time span and an offset, wherein the variable is a control variable, and the sensitivity of the method to the offset is explored under the condition that the continuous time span is 7 hours by referring to the result of data blocking. The offset is identified as three classes, each class range is shown in table 4, each class applies 10 times of outlier distortion data, and each simulation applies an offset that is a random value within the class range. The result shows that the method can effectively identify the distortion data for the case that the absolute value of the offset is larger than 50 (mu epsilon).
TABLE 4 constant offset identification sensitivity comparison Table
Table 4Comparison of outlier identification sensitivity
By analyzing the sensitivity of three abnormal categories, the recognition sensitivity of outliers is lower than that of other types of data distortion recognition due to the limitation of a smooth noise reduction method, and the recognition sensitivity of continuous data distortion types such as constant offset, data locking and the like is better.
The above embodiments are only exemplary embodiments of the present application and are not intended to limit the present application, the scope of which is defined by the claims. Various modifications and equivalent arrangements of this application will occur to those skilled in the art, and are intended to be within the spirit and scope of the application.

Claims (9)

1. A data distortion identification method based on association density clustering, comprising:
historical time series x= { X for n strain measurement points of strain sensor 1 (t),x 2 (t),...,x n And (t) measuring the degree of association between measuring point sequences based on gray type B association through the angle of total displacement difference representing the similarity of n strain measuring points and the total second-order slope difference representing the similarity of the development trend and the similarity of the development speed respectively, constructing a classification model, and identifying real-time monitoring data based on the classification model.
2. The data distortion identification method based on association density clustering as claimed in claim 1, further comprising a normalization process for the historical time series:
dividing a step-by-step clustering window for n measuring point history time sequences of a certain period of time, wherein the length w determines the time section of abnormal positioning, and the shorter the window is, the stronger the instantaneity of judgment is;
performing standardization processing on each sequence of the window by utilizing the z-score to eliminate the dimension or balance the measurement dimension of each strain sensor;
the time sequence of a certain window after normalization is shown as a formula (6),
wherein n is the measuring point sequence number, and m is the time sequence number.
3. The data distortion identification method based on association density clustering as claimed in claim 2, further comprising smoothing and denoising the normalized historical time series:
the Savitzky-Golay method is selected for smooth denoising, the convolution and polynomial regression are combined to realize smooth filtering, and different smooth effects are realized by adjusting the sliding window and the fitting order, which are expressed as follows:
wherein 2l+1 is the sliding window length, x k Represents the center of the sliding window, h i The smoothing coefficient is obtained by a least square fitting polynomial;
wherein, the smaller the polynomial fitting order, the longer the window length, the more remarkable the smoothing effect.
4. A method of data distortion identification based on association density clustering as claimed in claim 3, wherein constructing the classification model comprises:
respectively carrying out differential processing on the smoothed and denoised window sequence as an overall displacement difference, an overall first-order slope difference and an overall second-order slope difference according to columns, merging the window sequence with a matrix X to form a new matrix, and adding a weight coefficient into the new matrix to obtain a clustering feature matrix Y;
performing cluster analysis, and determining Manhattan distance as a calculation mode of the inter-sequence distance according to a calculation method of the inter-sequence distance in a cluster calculation process;
and determining parameters (E, minPts) according to actual engineering requirements, obtaining a final clustering result, and constructing a classification model.
5. The method for identifying data distortion based on association density clustering as claimed in claim 4, wherein the new matrix comprises:
overall displacement difference matrix:
first order differential matrix:
second order differential matrix:
clustering feature moment
Wherein dif ij (0) =x (i+1)j -x ij ,dif ij =x i(j+1) -x ijw 1 ,w 2 ,w 3 As a weight, if more dependence on proximity features in cluster analysis increases w 1 Reduce w 2 And w 3 The method comprises the steps of carrying out a first treatment on the surface of the Increasing w if more dependent on similarity features 2 And w 3 Reduce w 1
6. The method for recognizing data distortion based on association density clustering as claimed in claim 5, wherein the manhattan distance is as follows:
7. the method of claim 6, wherein identifying real-time monitoring data based on the classification model comprises:
the method comprises the steps of carrying out clustering analysis on monitoring data collected in real time step by adopting the same window length w;
when the clustering results of all the measuring points are consistent with the clustering model, judging that the time data is normal;
when the clustering results are inconsistent and the outliers appear, the clustering results are regarded as distortion;
and when the clustering results are inconsistent and the outliers are not available, further carrying out data distortion identification.
8. Use of a data distortion identification method as claimed in any of claims 1-7 for health monitoring of wood structures.
9. The use according to claim 8, wherein,
the measuring points comprise strain measuring points of the same type of component and strain measuring points of different type of component.
CN202310279317.2A 2023-03-21 2023-03-21 Data distortion identification method based on association density clustering and application thereof Pending CN116662847A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202310279317.2A CN116662847A (en) 2023-03-21 2023-03-21 Data distortion identification method based on association density clustering and application thereof

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202310279317.2A CN116662847A (en) 2023-03-21 2023-03-21 Data distortion identification method based on association density clustering and application thereof

Publications (1)

Publication Number Publication Date
CN116662847A true CN116662847A (en) 2023-08-29

Family

ID=87712416

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202310279317.2A Pending CN116662847A (en) 2023-03-21 2023-03-21 Data distortion identification method based on association density clustering and application thereof

Country Status (1)

Country Link
CN (1) CN116662847A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117057517A (en) * 2023-10-12 2023-11-14 国网吉林省电力有限公司长春供电公司 Efficient processing method and system for electric power data based on digital twin

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117057517A (en) * 2023-10-12 2023-11-14 国网吉林省电力有限公司长春供电公司 Efficient processing method and system for electric power data based on digital twin
CN117057517B (en) * 2023-10-12 2024-01-12 国网吉林省电力有限公司长春供电公司 Efficient processing method and system for electric power data based on digital twin

Similar Documents

Publication Publication Date Title
CN113838054B (en) Mechanical part surface damage detection method based on artificial intelligence
CN106909756A (en) A kind of rolling bearing method for predicting residual useful life
CN114509266B (en) Bearing health monitoring method based on fault feature fusion
CN108399434B (en) Analysis and prediction method of high-dimensional time series data based on feature extraction
CN116662847A (en) Data distortion identification method based on association density clustering and application thereof
CN110991471B (en) Fault diagnosis method for high-speed train traction system
CN115828140A (en) Neighborhood mutual information and random forest fusion fault detection method, system and application
CN112414694A (en) Equipment multistage abnormal state identification method and device based on multivariate state estimation technology
CA3186873A1 (en) Activity level measurement using deep learning and machine learning
CN113298162A (en) Bridge health monitoring method and system based on K-means algorithm
CN114266289A (en) Complex equipment health state assessment method
CN114755017B (en) Variable-speed bearing fault diagnosis method of cross-domain data driving unsupervised field shared network
CN112766301A (en) Similarity judgment method for indicator diagram of oil extraction machine
CN114676749A (en) Power distribution network operation data abnormity judgment method based on data mining
CN114330486A (en) Power system bad data identification method based on improved Wasserstein GAN
CN113837591A (en) Equipment health assessment method oriented to multi-working-condition operation conditions
CN117076869A (en) Time-frequency domain fusion fault diagnosis method and system for rotary machine
Lai Fitting power law distributions to data
CN116773961A (en) Transmission line corrosion detection method based on vibration signal high-frequency characteristic analysis
CN113673609B (en) Questionnaire data analysis method based on linear hidden variables
CN114757495A (en) Membership value quantitative evaluation method based on logistic regression
CN114648060A (en) Fault signal standardization processing and classification method based on machine learning
CN112632469A (en) Method and device for detecting abnormity of business transaction data and computer equipment
Duan et al. A Spatio-temporal Fusion Autoencoder-based Health Indicator Automatic Construction Method for Rotating Machinery Considering Vibration Signal Expression
CN111881502A (en) Bridge state discrimination method based on fuzzy clustering analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination