CN105809203A - Hierarchical clustering-based system steady state detection algorithm - Google Patents

Hierarchical clustering-based system steady state detection algorithm Download PDF

Info

Publication number
CN105809203A
CN105809203A CN201610146318.XA CN201610146318A CN105809203A CN 105809203 A CN105809203 A CN 105809203A CN 201610146318 A CN201610146318 A CN 201610146318A CN 105809203 A CN105809203 A CN 105809203A
Authority
CN
China
Prior art keywords
class
matrix
sequence
cluster result
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610146318.XA
Other languages
Chinese (zh)
Other versions
CN105809203B (en
Inventor
赵均
季丁
季一丁
邵之江
徐祖华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201610146318.XA priority Critical patent/CN105809203B/en
Publication of CN105809203A publication Critical patent/CN105809203A/en
Application granted granted Critical
Publication of CN105809203B publication Critical patent/CN105809203B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Complex Calculations (AREA)
  • Image Analysis (AREA)

Abstract

The invention discloses a hierarchical clustering-based system steady state detection algorithm. The algorithm comprises the following steps: for industrial data in a period of continuous time interval, calculating distances between every two types of data to obtain a matrix, finding the minimum distance values in the matrix, combining the minimum distance values into a new type, deleting the new type, carrying out processing to obtain an updated matrix and carrying out iterative computation to obtain an expression matrix of a hierarchical clustering tree; for the expression matrix, carrying out computation from the last iterative numerical value to the first iterative numerical value in sequence so as to obtain clustering result reasonability values of the computations, carrying out solving and computation on the computed clustering result reasonability values to obtain final threshold values, carrying out clustering computation on the industrial data through the final threshold values so as to obtain a final clustering result sequence, and carrying out joint time sequence judgement to obtain system steady state condition. The algorithm disclosed in the invention is strong in applicability and has the effect of avoiding the sharp calculation amount increase phenomenon caused by data dimensionality during the data processing.

Description

A kind of systematic steady state detection algorithm based on hierarchical clustering
Technical field
The present invention relates to a kind of system detecting method, especially relate to a kind of systematic steady state detection algorithm based on hierarchical clustering.
Background technology
In the research to a procedures system, stable state is most important and modal hypothesis.Whether system is in stable state, is directly connected to the follow-up method to the modeling of system, control and optimization.The internal principle of procedures system and structure are complicated, show as physical quantity and there is stronger coupling, and system also exists extremely strong non-linear.When system is in unsteady state, the data characteristic variation of each variable of system acutely, numerically shows as instability or exception, there is relatively large deviation with the input/output relation of real system.Only system is under steady working condition, and parameters and variable just have stronger state consistency.Based on this kind of situation, the evaluation to equipment runnability, the analysis of plant characteristic and controller's effect, it is required for premised on the operation stable state to obtain system.
Development along with process industrial, in actual production process, system and the mode of production all tend to complicating, the procedures system object related to is often multivariate, high dimension and close coupling, and system general performance is non-linear, time variation, uncertainty and imperfection.Although study mechanism and modeling are caused great difficulty by complicated system, but simultaneously because DCS controls the application of system and intelligence instrument so that increasing process data is recorded.Strong and the complicated coupled relation existed between actual each variable in procedures system makes the stability being studied whole system by measurement data become possibility, along with the development of data mining and Statistical Learning Theory and perfect, process industrial field also progressively using related algorithm solving practical problems, creates such as fields such as statistical Process Control.In the problem processing stable state detection, the thought of big data is also different from traditional Study of Control Process method with method, the latter judges whether system is in stablize and carries out decision-making often by several key variables to formulate stable state standard, the former then requires from whole data, system to be analyzed, the result obtained based on total data theoretically will reflect the practical situation of system more comprehensively and truly, therefore can obtain result more accurately and reliably.
Stable state test problems is from last century since the eighties is suggested, domestic and international many scholars are proposed the method for different stable state detections, but due to the complexity of field data, the testing result that much the existing stable state detection means demonstrating effectiveness in l-G simulation test obtains in actual applications is not exactly accurate.And various method is subject to the real needs restriction in itself affect and practical application, the occasion of application is also not quite similar.
Cluster is technology critically important in unsupervised learning field, and for the individuality in data sample is divided into different classifications so that in class, individuality has similarity high as far as possible, individual between class have diversity high as far as possible.Hierarchical clustering thought is proposed in 1967 by JohnsonSC. the earliest, other clusters such as difference and EM and K-means, hierarchical clustering need not give classification number in advance, and does not need iterative optimization procedure, can obtain cluster result by given similarity function and threshold value.Owing to being not related to the problems such as Optimization Solution, therefore can avoid " dimension disaster ", but owing to hierarchical clustering needs to calculate distance between sample between two, complexity can increase by square multiple along with the increase of sample size.
The flow chart of Traditional calculating methods structure such as Fig. 1, it is assumed that having N number of individual of sample in sample data sets to be clustered, the algorithm of hierarchical clustering specifically has the steps:
1) definition similarity function and algorithm end condition threshold value (being generally maximum inter-object distance or infima species spacing);
2) being gathered by each individuality in data set is a class, altogether N class;
3) current data set is allocated as k class, calculates the similarity between every class, d (i, j) represents the similarity between i and j class, if d (i j) is minima in current similarity set between any two, then merges i-th and jth class.Now cluster number is become k-1 by k;
4) calculating end condition value now, if meeting threshold value, terminating algorithm;
5) 3 are returned to), as k=1, all of sampled point is all gathered for same class, and clustering algorithm cannot continue, then algorithm stops.
Thus, traditional method major downside is that the dependency to process object is strong, and universality is poor, and carries out stable state detection mainly for single argument, and therefore the suitability is not strong, and cannot avoid the problem processing the amount of calculation sharp increase produced in data because of data dimension.
Summary of the invention
In order to solve Problems existing in background technology, the present invention proposes a kind of stable state detection algorithm based on hierarchical clustering.
The technical solution used in the present invention is:
STEP1 generates clustering tree:
1.1) comprised to the industrial data { d of N number of sampled point one period of continuous time in intervali, i=1,2,3 ..., N, in set, sampled point is as class, diRepresent the industrial data of class;
1.2) calculating all distances between class between two, obtain the matrix A of N × N, in matrix A, the element of i row j row is designated as aij, aii=0, aijRepresent class diWith class djBetween distance, the matrix A obtained is as follows:
1.3) the lowest distance value a in matrix A is foundmn, amnFor its class spacing.amnRepresent that m class and the n-th class are closest class, m class and the n-th class are merged into a new class, m row in puncture table A, n row, m row and n row, adopt step 1.2 by class remaining in matrix A again with merging the new class obtained) identical mode carries out the matrix A after processing acquisition renewal;
1.4) step 1.2 is repeated)~1.3) be iterated, until matrix A becomes the matrix of 1 × 1, record m, n and a in each iterative processmn, constitute the expression matrix Z of N × 3 of hierarchical clustering tree;
Z = Z 11 Z 21 Z 31 . . . . . . . . . Z N 1 Z N 1 ... Z 3 N
STEP2 threshold value is chosen:
2.1) for expression matrix Z, last iterative numerical calculate successively to the order of first time iterative numerical and be calculated in the following ways:
With current lowest distance value amnFor threshold value to industrial data { diCarrying out cluster calculation, the cluster result sequence obtained is Tk, k represents iteration ordinal number, cluster result sequence TkIt is the integer sequence of 1 × N, wherein, TkI () represents cluster result sequence TkI-th cluster result, Tk(i)=p;Cluster result sequence T is asked in calculatingkDifference sequence D, be namely that each adjacent element in sequence is subtracted each other acquisition difference, calculate in difference sequence D the number of zero as cluster result reasonability value D_zero (k).
2.2) cluster result reasonability value sequence D_zero it is made up of each calculated cluster result reasonability value D_zero (k), calculate the difference sequence asking for cluster result reasonability value sequence D_zero, namely it is that each adjacent element in sequence is subtracted each other acquisition difference, therefrom find the sequence number k at difference value maximum in difference sequence and place thereof, and with Z3kAs final threshold value.
Described step STEP2 is calculated, to the order of first time iterative numerical, the centre time iterative numerical referring to first time iterative numerical or the k=N/2 calculating k=1 by last iterative numerical successively.
When k is too little, the result of cluster is nonsensical to final stable state identification, is typically chosen the k=N/2 stop condition as algorithm to reduce amount of calculation.
STEP3 combines sequential and judges stable state:
Element T in final cluster result sequence TiIf meeting the following conditions, then it is assumed that the system between m-th sampled point to the m+k-1 sampled point is in stable state:
Ti=c, i=m, m+1, m+2 ..., m+k-1
Wherein, TiFor the cluster result that the i-th sampled point in final cluster result sequence T is corresponding, c represents result constant, k=τ/Ts, τ is above-mentioned time span threshold value, TsSampling time interval for data.
The size of τ is determined according to the time response of institute's object of study, generally takes τ=3t*, t*Unit-step response regulating time for system.
Described step 1.3) the middle end merged in the matrix A after the new class obtained is placed in renewal, end adds the row/column of new can be denoted as [a1(N+1)a2(N+1)…]T
In described matrix A, the ranks digital number of remaining class remains unchanged, the ranks rank-numeral merging the new class obtained adopts new ranks digital number and the ranks digital number with all classes before all to differ so that each class of whole system process has unique ranks digital number.
Described step 1.3) in the distance that merges in the class that obtains and matrix A between each class remaining adopt below equation to calculate:
asi=α ami+(1-α)an*
Wherein, α is weight parameter, 0≤α≤1, asiRepresent the spacing between s class and i class, aniRepresent the spacing between n class and i class, amiRepresent the spacing between m class and i class.
Each sampled point of system is regarded as " characteristic vector " (or the characteristic point) of a state space, when system is in stable state, characteristic point fluctuation in certain interval, it is rendered as the higher-dimension Gauss distribution centered by a certain specified point.When system is in an interim state or when unstable state, state point will be disengaged from original Gauss distribution, and then presents other scattered distribution.This feature widely different of state between moment before and after when being in unstable state based on system, it is possible to carried out the detection of procedures system stable state by the difference degree between comparison system state characteristic vector.Here introducing clustering algorithm, in stable state detects, have significantly high similarity between steady state data, can be gathered in cluster is a class, there is very big-difference simultaneously, then can be assigned to different classifications between dynamic data point and steady state data.
Due to the characteristic of system, can only ensure that distributions time system is in stable state compares concentration, but if system is in fluctuation status, the fluctuation of its distribution is very big in state space.It is to say, while there is similarity between stable state sampled point, the similarity between dynamic sampling point is only small, in cluster result, the dynamic point in continuous time may be gathered at much different apoplexy due to endogenous wind.Therefore, it was the number that cannot determine the class that finally there are before cluster, adopts hierarchical clustering algorithm of the present invention based on these.
The invention has the beneficial effects as follows:
" big data " thought and technology are introduced in stable state detection by the present invention, and by comparing in data set the similarity degree between each sampled point, the temporal characteristics in combination with data carries out stable state detection, and proposes the method how determining cluster threshold value.
It is characteristic of the invention that the suitability is strong, and avoid and process the amount of calculation sharp increase phenomenon produced because of data dimension in data.
Accompanying drawing explanation
Fig. 1 is the flow chart of hierarchical clustering algorithm.
Fig. 2 is the flow chart of the inventive method threshold value selection course.
Fig. 3 is that the inventive method judges the flow chart of stable state in conjunction with sequential.
Fig. 4 is the input-output curve figure of embodiment 1 second order analogue system.
Fig. 5 is embodiment 1 second-order system state point cluster result figure.
Fig. 6 is that embodiment 1 clusters testing result figure.
Fig. 7 is the input-output curve figure of embodiment 2 second order analogue system.
Fig. 8 is the cluster result comparison diagram that embodiment 2 obtains when taking different threshold value.
Fig. 9 is the pending data and curves figure of embodiment 3 input.
Figure 10 is the cluster result in the 50th iteration of embodiment 3.
Figure 11 be in all 50 iteration of embodiment 3 in D 0 number and change profile figure.
Figure 12 is the part cluster result that embodiment 3 is representative.
Figure 13 is the stable state testing result figure of embodiment 3.
Detailed description of the invention
Below in conjunction with drawings and Examples, the invention will be further described.
The inventive method is applied particularly in the detection of procedures system stable state, as in figure 2 it is shown, there is following flow process: data point { d to be sortedi; first define the distance of point in its state space and between class distance metric form (due to the state point of procedures system in the state space that it is corresponding according to certain probability and aggregation extent random distribution, in the method for stable state detection, we select to use Euclidean distance to weigh).When weighing between class distance, our stable state of interest is a class in cluster result, and we will improve the aggregation extent of each class as far as possible, selects ultimate range more to meet above-mentioned requirements.
The input of algorithm: according to the system history data state point { d of time-sequencingi, wherein { diFor p dimension vector data, i=1,2,3 ..., N, N is sampled point number.
The output of algorithm: classification T, the T that each sampled point is corresponding is length is the one-dimensional sequence of N, i.e. sampled point class number sequence.Here class number only plays the other effect of region class, it does not have any physics and meaning numerically.
Step according to hierarchical clustering, following algorithm is iterated by calculating the distance between data set midpoint, and each step checks whether and reaches algorithm end condition, and algorithm terminates then output cluster result T.
Data object owing to analyzing is the time series according to Time alignment, when system is in stable state within a period of time, and its spatial distribution also Relatively centralized.Hence with the timing of data, meeting following two conditions, be namely regarded as stable state: 1) data acquisition sampling point is continuous in time;2) similarity of data acquisition sampling point significantly high (namely data are gathered an apoplexy due to endogenous wind by clustering algorithm).
Stable state is obtained by cluster result, it needs to be determined that the threshold value of cluster, and judge stable minimum time interval T, under selected threshold value δ, when in a period of time that length is not less than T, the state point of system is gathered in same class, namely illustrate that in this period, system is in stable state.The determination of minimum time interval T can according to the time response of system object, below main choosing and the threshold value impact on algorithm of cluster threshold value δ is discussed.
In the process of cluster calculation, data acquisition system to be clustered has N number of sample, data are aggregated to gradually from N class the clustering tree matrix Z of process record N*3 of 1 class, wherein Zi1The first kind sequence number of two apoplexy due to endogenous wind, Z is merged when recording the i-th step clusteri2The Equations of The Second Kind sequence number of two apoplexy due to endogenous wind, Z is merged when recording the i-th step clusteri3The distance between two classes is merged when recording the i-th step cluster.Therefore acquisition optimum cluster threshold value is and finds a line in Z, as the end line of algorithm output, wherein Zi3It is threshold value.Here we take the mode that the result of every a line is enumerated to obtain optimal threshold.
Additionally the innovation of the present invention is in that to adopt the rational quantizating index of appointment to find rational threshold value to be calculated, and concrete principle is: TkEnumerate, for kth time, the cluster result obtained, use its Difference Terms D=diff (Tk) weigh overall Clustering Effect, along with threshold value reduces, in D, the number of 0 element is gradually reduced, ordinary circumstance is such as (1)~(9), and in D, the number of 0 reduces in slower trend, but when a class stable state is split off out, result there will be system point in a period of time frequently switch between two classes, this will make the number of 0 in D sharply reduce, accordingly, it would be desirable to undertaken enumerating by cluster result until the threshold value enumerated too little to stable state identification meaningless time stop.Find out the maximum step of number change of in wherein D 0, be the suitable stable state threshold value that the termination step that cluster threshold value reduces, the threshold value of its correspondence namely current data cluster are required, it is determined that the flow chart of threshold value such as 3.
Embodiments of the invention are as follows:
Embodiment 1
Embodiment 1 is for a second order single-input single-output system, adopting the input after the inventive method simulation process and output as shown in Figure 4, simulation time length is 100s, and the sampling interval is 0.1s, Fig. 4 (a) represents input variable curve, and Fig. 4 (b) represents output variable curve.
In embodiment 1 variable no noise added, can see from the curve of Fig. 4, system is in stable state in the incipient stage (t=0~300), produce a slope in the input of t=300 place to adjust, system enters transitive state, when t=600 rear slopes signal ended, output also tends to stable final system and enters another stable state.Due to no noise added, the system mode extent of polymerization when stable state is very high, the method that wouldn't adopt above-mentioned algorithms selection threshold value here, and direct labor selects an only small threshold value t=0.01, and data are clustered, and the result obtained is as shown in Figure 5.
Curve from Fig. 4 can be seen that, in whole process, system has two different steady statues, and in Fig. 5 cluster result, the data point category label being in same stable state is consistent, it can be seen that Fig. 5 exists the part of two sections of levels, corresponding with the two of system equilibrium transport on the time.
The result of cluster illustrates state point distance in space, it is judged that whether system is in stable state needs the timing in conjunction with data.When system is in stable state within a period of time, then the data in this section of time interval will be gathered same class because similarity is high.In turn can as steady-state criterion, being all clustered that algorithm gathers when all data in one section of continuous time τ is a class, then the system that is considered as is in stable state.Wherein the length τ of time period determines according to the time response of institute's object of study, takes the unit-step response regulating time that τ=2t*, t* are system here.
Therefore the system stability result obtained is as shown in Figure 6, Fig. 6 (a) represents second-order system status number strong point cluster result, Fig. 6 (b) represents the final stable state testing result of system, in steady result, 1 represents stable, 0 represents instability, result can be seen that, system is approximately near t=300 and departs from from first stable state, enter transition state, it is again introduced into stable state when close to t=700, contrast input and output, although input just finishes ramp signal and enters stable when t=600, but output y experienced by the adjustment of a period of time later and just settles out.
Embodiment 2
Embodiment 2 adds noise and emulates, and the single-input single-output systematic procedure used based on embodiment 1 further illustrates threshold value and chooses process, add the analogue system input and output of noise as shown in Figure 7, Fig. 7 (a) represents input variable curve, and Fig. 7 (b) represents output variable curve:
Can be seen that in Fig. 7, system, in the interval of time t=0~300, is in a stable state, and then input u produces a slope change, and through transition state after a while, system enters another one stable state.In whole status of processes space, the state point aggregation extent of two stable states is higher, and transitive state point then ratio is relatively decentralized.Choosing of threshold value takes descending order to enumerate, and along with reducing of cluster threshold value, the intensity within each class is more and more higher.When proceed to jth step, threshold value narrows down to and to a certain degree then the class point originally belonging to same stable state can be separated into multiclass, it is believed that threshold value now is the too little expectation beyond us, continue to zoom out threshold value also it is not necessary that.Now, the threshold value that back and jth-1 step reach is currently suitable value.
The single-input single-output second-order system of embodiment 2 carries out enumerating front 12 steps of cluster as shown in Figure 8: as can be seen from Figure 8, in Fig. 8 (1), threshold value is bigger, data only be divide into two classes by algorithm, in Fig. 8 (2)~Fig. 8 (9), middle data in an interim state are constantly separated under the driving of algorithm.When cluster threshold value reduces, the extent of polymerization in each class is more and more higher, and the result of the present invention is more accurate.But in Fig. 8 (10), when t is 700~1000, the state point of system originally belongs to same class, but after this step threshold value reduces, many points are therefrom stripped out, and two classes being separated out are interlaced with each other in time sequencing.If this two class being all considered as the stable state of system, Fig. 8 (10) representing, during this period of time system switches continually between two stable states, and this is non-existent in natural procedures system to be absent from transitive state.
It is thus regarded that, the state in Fig. 8 (10) has reached the limit that threshold value reduces, and rational distance threshold size should take threshold value that in Fig. 8 (9), result is corresponding as final threshold value.
Embodiment 3:
Embodiment 3 be applied in real process produce historical data, employ certain power plant 60MW unit boiler data and carry out data experiments.Data specifically include that boiler load instruction, generated output, coal input quantity, intake, each some soda pop temperature of measuring, pressure etc., totally 180 tie up.Choosing wherein 10000 point data uses above-mentioned clustering algorithm to carry out stable state detection.Variable change situation main in this period is as shown in Figure 9:
As seen from Figure 9, in the time period studied, system loading has bigger twice adjustment, generally creates the data of 3 sections of stable states in interval.Main steam temperature and main steam pressure have bigger fluctuation, only even do not see the fluctuation situation of system from vapor (steam) temperature curve.Below the state point that all of for system variable forms is clustered, according to the method above selecting threshold value, selecting total iterations is 50 times, data experiments proves result such as Figure 10 of the 50th time, cannot obtain useful message from result, therefore here stop continuing iteration, in the D finally obtained 0 number and change such as Figure 11, Figure 11 (a) represents in 50 iteration the number distribution of in D 0, and Figure 11 (b) represents in 50 iteration the situation of change of 0 number in D.
Embodiment is when 16 step, and in D, the number of 0 element is undergone mutation, and in order to more intuitively arrive the change of cluster, the representative iteration result of selected part is as shown in Figure 12.Can be seen that in Figure 12, when iteration is from the 16th step to 17 step, the point in 0~2000 time period is split into two classes, and the picture on cluster result coincide with the sudden change of 0 element number in D, also illustrate that the reasonability of the inventive method.
According to the result that cluster obtains, suitable time span is selected to carry out stable state detection, the characteristic according to industrial object, it is believed that when system remains stable in 500 sampled points, then it is believed that system is in stable state.In the cluster result of embodiment, adopt in cluster result be not less than in 500 interval to cluster a little number all identical, then think that system is in steady statue, obtain the stable state detection output result of system accordingly as shown in figure 13, Figure 13 (c) wherein 1 expression is stable, and 0 represents instability.
The final visible present invention has its significant technique effect, the stable state detection suitability is strong, it is to avoid process the amount of calculation sharp increase phenomenon produced in data because of data dimension.
Above-described embodiment is not the restriction for the present invention, and the present invention is not limited only to above-described embodiment, as long as meeting application claims, belongs to protection scope of the present invention.

Claims (6)

1. the systematic steady state detection algorithm based on hierarchical clustering, it is characterised in that:
STEP1 generates clustering tree:
1.1) comprised to the industrial data { d of N number of sampled point one period of continuous time in intervali, i=1,2,3 ..., N, in set, sampled point is as class, diRepresent the industrial data of class;
1.2) calculate all distances between class between two, obtain the matrix of N × NIn matrix A, the element of p row q row is designated as apq, apqRepresent class dpWith class dqBetween distance, app=0;
1.3) the class spacing minima a in matrix A is foundmnNamely m class and the n-th class are two classes that current distance is nearest, m class and the n-th class are merged into a new class, m row in puncture table A, n row, m arrange and n row, the class of current residual is adopted step 1.2 again with merging the new class obtained) identical mode calculates class spacing, it is thus achieved that the matrix A after renewal;
1.4) step 1.2 is repeated)~1.3) be iterated, until matrix A becomes the matrix of 1 × 1, record m, n and a in each iterative processmn, constitute expression matrix Z, the m of N × 3 of hierarchical clustering tree, n and amnThe first, second, third columns value respectively as matrix Z;
Z = Z 11 Z 21 Z 31 . . . . . . . . . Z N 1 Z N 1 ... Z 3 N
STEP2 threshold value is chosen:
For expression matrix Z, calculate cluster result reasonability value D_zero (k) obtaining each time successively to the order of first time iterative numerical by last iterative numerical, asked for calculate by each calculated cluster result reasonability value D_zero (k) and obtain final threshold value, with z3kAs final threshold value to industrial data { diCarry out cluster calculation, it is thus achieved that final cluster result sequence T;
STEP3 combines sequential and judges stable state:
Element T in final cluster result sequence TiIf meeting the following conditions, then it is assumed that the system between m-th sampled point to the m+l-1 sampled point is in stable state:
Ti=c, i=m, m+1, m+2 ..., m+l-1
Wherein, TiFor the cluster result that the i-th sampled point in final cluster result sequence T is corresponding, c represents result constant, l=τ/Ts, τ is above-mentioned time span threshold value, TsSampling time interval for data.
2. a kind of systematic steady state detection algorithm based on hierarchical clustering according to claim 1, it is characterized in that: described step 1.3) the middle end merged in the matrix A after the new class obtained is placed in renewal, in described matrix A, the ranks digital number of remaining class remains unchanged, and the ranks rank-numeral merging the new class obtained adopts new ranks digital number and the ranks digital number with all classes before all to differ.
3. a kind of systematic steady state detection algorithm based on hierarchical clustering according to claim 1, it is characterized in that: described step 1.3) in merge in the class that obtains and matrix A between each class remaining distance calculation as follows, class after note m class and the merging of n class is numbered s, i class is any sort except s class, then the distance calculating s class and i class has:
asi=α ami+(1-α)αni
Wherein, α is weight parameter, 0≤α≤1, asiRepresent the spacing between s class and i class, aniRepresent the spacing between n class and i class, amiRepresent the spacing between m class and i class.
4. a kind of systematic steady state detection algorithm based on hierarchical clustering according to claim 1, it is characterised in that: described step STEP2 calculates every time and calculates all in the following ways: with current lowest distance value amnFor threshold value to industrial data { diCarrying out cluster calculation, the cluster result sequence obtained is Tk, k represents iteration ordinal number, cluster result sequence TkIt is the integer sequence of 1 × N, calculates and ask for cluster result sequence TkDifference sequence D, calculate in difference sequence D the number of zero as cluster result reasonability value D_zero (k).
5. a kind of systematic steady state detection algorithm based on hierarchical clustering according to claim 1, it is characterized in that: described step STEP2 is asked for by each calculated cluster result reasonability value D_zero (k) calculating and obtains final threshold value particularly as follows: be made up of cluster result reasonability value sequence D_zero each calculated cluster result reasonability value D_zero (k), calculate the difference sequence asking for cluster result reasonability value sequence D_zero, therefrom find the sequence number j at difference value maximum in difference sequence and place thereof, and with Z3jAs final threshold value.
6. a kind of systematic steady state detection algorithm based on hierarchical clustering according to claim 1, it is characterised in that: described step STEP2 is calculated, to the order of first time iterative numerical, the centre time iterative numerical referring to first time iterative numerical or the k=N/2 calculating k=1 by last iterative numerical successively.
CN201610146318.XA 2016-03-15 2016-03-15 A kind of systematic steady state detection algorithm based on hierarchical clustering Expired - Fee Related CN105809203B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610146318.XA CN105809203B (en) 2016-03-15 2016-03-15 A kind of systematic steady state detection algorithm based on hierarchical clustering

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610146318.XA CN105809203B (en) 2016-03-15 2016-03-15 A kind of systematic steady state detection algorithm based on hierarchical clustering

Publications (2)

Publication Number Publication Date
CN105809203A true CN105809203A (en) 2016-07-27
CN105809203B CN105809203B (en) 2019-01-18

Family

ID=56467412

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610146318.XA Expired - Fee Related CN105809203B (en) 2016-03-15 2016-03-15 A kind of systematic steady state detection algorithm based on hierarchical clustering

Country Status (1)

Country Link
CN (1) CN105809203B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108154173A (en) * 2017-12-21 2018-06-12 陕西科技大学 A kind of oil-water interface measuring device of crude oil storage tank and method
CN108596207A (en) * 2018-03-21 2018-09-28 上海海事大学 A kind of gantry crane motor data processing method based on hierarchical clustering
CN108664000A (en) * 2018-03-26 2018-10-16 中南大学 A kind of alumina producing evaporation process steady state detecting method for use and system
CN108805168A (en) * 2017-05-04 2018-11-13 唯亚威通讯技术有限公司 The end-point detection for the manufacturing process that near infrared spectroscopy and machine learning techniques carry out
CN112148942A (en) * 2019-06-27 2020-12-29 北京达佳互联信息技术有限公司 Business index data classification method and device based on data clustering
CN118277802A (en) * 2024-03-29 2024-07-02 欧亚高科数字技术有限公司 Data interaction method for virtual simulation of anesthesiology crisis skills

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101286702A (en) * 2008-05-06 2008-10-15 深圳航天科技创新研究院 Adaptive digital DC/DC control method and converter with fast dynamic response
US20120197560A1 (en) * 2011-01-28 2012-08-02 Hampden Kuhns Signal identification methods and systems
CN103105556A (en) * 2013-01-30 2013-05-15 西安交通大学 Intelligent power grid load testing and recognition method based on steady state and transient state characteristic joint matching

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101286702A (en) * 2008-05-06 2008-10-15 深圳航天科技创新研究院 Adaptive digital DC/DC control method and converter with fast dynamic response
US20120197560A1 (en) * 2011-01-28 2012-08-02 Hampden Kuhns Signal identification methods and systems
CN103105556A (en) * 2013-01-30 2013-05-15 西安交通大学 Intelligent power grid load testing and recognition method based on steady state and transient state characteristic joint matching

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
JEFFREY D. KELLY 等: "A steady-state detection (SSD) algorithm to detect non-stationary drifts in processes", 《JOURNAL OF PROCESS CONTROL》 *
季一丁: "多变量复杂系统的稳态检测和提取方法研究", 《中国优秀硕士学位论文全文数据库 工程科技Ⅰ辑》 *
林敬茂 等: "聚类算法在机电系统稳定性检测中的应用", 《机电工程》 *
饶宛 等: "基于Fisher有序聚类的汽轮机试验数据稳态检测方法", 《电站系统工程》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108805168A (en) * 2017-05-04 2018-11-13 唯亚威通讯技术有限公司 The end-point detection for the manufacturing process that near infrared spectroscopy and machine learning techniques carry out
CN108805168B (en) * 2017-05-04 2023-10-13 唯亚威通讯技术有限公司 Endpoint detection for manufacturing processes by near infrared spectroscopy and machine learning techniques
CN108154173A (en) * 2017-12-21 2018-06-12 陕西科技大学 A kind of oil-water interface measuring device of crude oil storage tank and method
CN108154173B (en) * 2017-12-21 2021-08-24 陕西科技大学 Device and method for measuring oil-water interface of crude oil storage tank
CN108596207A (en) * 2018-03-21 2018-09-28 上海海事大学 A kind of gantry crane motor data processing method based on hierarchical clustering
CN108664000A (en) * 2018-03-26 2018-10-16 中南大学 A kind of alumina producing evaporation process steady state detecting method for use and system
CN112148942A (en) * 2019-06-27 2020-12-29 北京达佳互联信息技术有限公司 Business index data classification method and device based on data clustering
CN112148942B (en) * 2019-06-27 2024-04-09 北京达佳互联信息技术有限公司 Business index data classification method and device based on data clustering
CN118277802A (en) * 2024-03-29 2024-07-02 欧亚高科数字技术有限公司 Data interaction method for virtual simulation of anesthesiology crisis skills

Also Published As

Publication number Publication date
CN105809203B (en) 2019-01-18

Similar Documents

Publication Publication Date Title
CN105809203A (en) Hierarchical clustering-based system steady state detection algorithm
Marcus et al. Plan-structured deep neural network models for query performance prediction
CN110488218B (en) Electric energy meter running state evaluation method and evaluation device
CN103649858A (en) Method and device for predicting the condition of a component or system, computer program product
CN112749840B (en) Method for acquiring energy efficiency characteristic index reference value of thermal power generating unit
CN102832617A (en) Large power grid transient state stabilization analyzing method based on precision pattern discrimination
CN109921426A (en) Wind-electricity integration system probability load flow calculation method based on CV-KDE
CN108038211A (en) A kind of unsupervised relation data method for detecting abnormality based on context
CN118170004B (en) Control method and system based on Internet of things
Bontempi et al. Recursive lazy learning for modeling and control
CN113780420A (en) Method for predicting concentration of dissolved gas in transformer oil based on GRU-GCN
CN113809365A (en) Method and system for determining voltage attenuation of hydrogen fuel cell system and electronic equipment
Romer et al. Data-driven inference of passivity properties via Gaussian process optimization
CN112052496B (en) Operation method of high arch dam valley amplitude deformation influence factor analysis system based on VAR model
CN109960146A (en) The method for improving soft measuring instrument model prediction accuracy
CN105787507B (en) LS SVMs on-line study methods based on budget supporting vector collection
Xu et al. Non-conservative design of robust tracking controllers based on input-output data
CN113111588B (en) NO of gas turbine X Emission concentration prediction method and device
CN115600102B (en) Abnormal point detection method and device based on ship data, electronic equipment and medium
CN114925605A (en) Method for selecting training data in integrated circuit design
CN114510257A (en) Software architecture recovery method based on hierarchical clustering combination
Jiang et al. A trend prediction method based on fusion model and its application
CN112650770B (en) MySQL parameter recommendation method based on query work load analysis
CN111651935B (en) Multi-dimensional expansion prediction method and device for non-stationary time series data
CN117854650B (en) Polymer material multi-target attribute prediction method based on Gaussian process regression

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20190118