CN110135012A - Method for determining the regression coefficients of a system linear regression prediction model

Info

Publication number: CN110135012A
Application number: CN201910334251.6A
Authority: CN
Inventors: 王新攀; 靳洋; 牛道恒; 陈灵奎; 王彦兵; 王俊
Original assignee: Being Tsingyun Solar Energy Tech Co Ltd
Current assignee: Being Tsingyun Solar Energy Tech Co Ltd
Priority date: 2019-04-24
Filing date: 2019-04-24
Publication date: 2019-08-16
Anticipated expiration: 2039-04-24
Also published as: CN110135012B

Abstract

The present invention discloses a method and a system for determining the regression coefficients of a system linear regression prediction model. The method comprises: calculating and recording the initial model coefficients of the system linear regression prediction model from an initial sequence of system sampling data; each time a system sampling point is added, obtaining a new sequence of system sampling data and determining the sampling point data that has been added and the sampling point data that has been removed; updating the model coefficients based on the added (or removed) sampling point data and the current model coefficients; and then updating the model coefficients again based on the removed (or added) sampling point data and the updated model coefficients, the result being recorded as the model coefficients corresponding to that added sampling point. Because the model coefficients are computed iteratively, the invention avoids heavy consumption of server resources during calculation, improves the efficiency of computing the linear regression model coefficients, and allows the coefficients to be computed in real time.

Description

Method for determining the regression coefficients of a system linear regression prediction model

Technical field

The present invention relates to the technical field of computer data analysis and processing, and in particular to an iteration-based method for determining the regression coefficients of a system linear regression prediction model, suited to real-time calculation on large volumes of data.

Background art

In an actual industrial system, a condition-monitoring subsystem records the operating parameters of key equipment while the system runs. These parameters are typically temperature, pressure, flow, current, voltage and similar data. When the system is analyzed and optimized, however, the quantities of greater interest are usually component performance degradation, expected revenue, operation and maintenance (O&M) plans and the like. Such quantities cannot be read directly from the monitoring subsystem; instead they are reflected in the long-term trends of the acquired data.

Based on an analysis of the system's operating principles, the system can be linearized or locally linearized, yielding a simple linear relationship among the system parameter data, i.e. a linear model of the system. According to those principles, the coefficients of the model are directly related to component performance, conversion efficiency, maintenance thresholds and so on. The long-term trend of the model coefficients can therefore be used for tasks such as assessing system performance degradation, forecasting long-term revenue and optimizing O&M scheduling.

The linearized system model can be written as $Y^T = W^T X$, where $X = [x_1\ x_2\ \dots\ x_n]$ holds the system inputs at a number of sampling points, $Y^T = [y_1\ y_2\ \dots\ y_n]$ holds the system outputs predicted by the model at the corresponding sampling points, and $W$ holds the model coefficients. In a real system, different sampling points yield different actual input and output values; substituting them into the model gives an overdetermined system of equations, from which $W$ can be estimated by the least squares method. The model coefficients at a given sampling point can thus be estimated from several groups of actual system parameters preceding that point, and system performance can then be analyzed from the trend of the model coefficients.

When the model dimension is high or the number of data groups entering the equations is large, the least squares method involves operations on large matrices, and inverting the intermediate matrix consumes substantial computing resources. Performing a detailed model analysis on a server therefore ties up a large share of its computing resources, which both increases the load on the server and makes detailed model analysis less feasible.

To suppress the influence of random errors and burst disturbances on the coefficient estimate, measured data covering a long period usually has to be included in the calculation. For example, obtaining the model coefficients for a single sampling point may require one month, or even one year, of continuous operating data preceding that point. If every calculation has to fetch this much data, it places enormous pressure on server storage I/O bandwidth, memory capacity and so on.

Summary of the invention

The object of the present invention is to provide a method for determining the regression coefficients of a system linear regression prediction model that computes the model coefficients iteratively, thereby avoiding heavy consumption of server resources during calculation, improving the efficiency of computing the linear regression model coefficients, and allowing the coefficients to be computed in real time.

The technical solution adopted by the invention is a method for determining the regression coefficients of a system linear regression prediction model, comprising:

S1: based on a preset data window length, obtaining an initial sequence of system sampling data, calculating the model coefficients of the system linear regression prediction model from that sequence, and recording them;

S2: obtaining the added system sampling point data and, based on the preset data window length, forming a new sequence of system sampling data; determining the system sampling point data added to, and the system sampling point data removed from, the new sequence relative to the initial sampling data sequence;

S3: updating the model coefficients based on the added (or removed) system sampling point data and the current model coefficients; then updating the model coefficients again based on the removed (or added) system sampling point data and the updated model coefficients, and recording them;

S4: as system sampling points are added, repeating steps S2 to S3 to obtain and record the model coefficients after each addition of sampling point data and the resulting change of the sampling data sequence.

The invention also supports analysis of changes in system performance; that is, it further comprises step S5: analyzing changes in system performance based on the recorded model coefficients. The analysis of system performance from changes in the model coefficients can use the prior art.

In one specific embodiment, in step S1, for the system model $Y^T = W^T X$, where $Y$ is the system output corresponding to the sampled input $X$ and $W$ holds the model coefficients, the intermediate matrix is $C_N = (XX^T)^{-1}$ and the model coefficients are $W_N^T = Y^T X^T C_N$.

Because ridge regression copes well with highly collinear sampling point data, step S1 preferably uses it: for the system model $Y^T = W^T X$, where $Y$ is the system output corresponding to the sampled input $X$ and $W$ holds the model coefficients, the intermediate matrix is $C_N = (XX^T + \lambda I)^{-1}$, where $\lambda$ is the ridge regression coefficient and $I$ is the identity matrix, and the current model coefficient matrix is $W_N^T = Y^T X^T C_N$. Choosing this initialization converts the method as a whole to ridge regression.
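The initialization of step S1 can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the function name `fit_batch`, the parameter name `lam` and the random test data are hypothetical, samples are stored one per column as in the text, and a single scalar output per sampling point is assumed.

```python
import numpy as np

def fit_batch(X, Y, lam=0.0):
    """Batch initialization for step S1 (illustrative sketch).

    X   : (k, m) array, one sampling point per column.
    Y   : (m,) array of the corresponding outputs.
    lam : ridge coefficient; 0.0 gives plain least squares.
    Returns the coefficients W (k,) and the intermediate matrix C (k, k).
    """
    k = X.shape[0]
    C = np.linalg.inv(X @ X.T + lam * np.eye(k))
    W = C @ (X @ Y)  # transpose of W^T = Y^T X^T C (C is symmetric)
    return W, C

# Hypothetical noise-free data generated from known coefficients.
rng = np.random.default_rng(0)
X = rng.normal(size=(3, 50))
W_true = np.array([1.5, -2.0, 0.5])
Y = W_true @ X

W_ls, C_ls = fit_batch(X, Y)              # plain least squares
W_ridge, C_ridge = fit_batch(X, Y, 1e-3)  # ridge variant
```

With noise-free data the least squares result reproduces `W_true`, while the ridge result is biased slightly towards zero by `lam`.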

In step S2, each newly added sampling point is written into the sampling data window; the newly written sampling point data is the "added system sampling point data". Because the sampling data window has a preset length, the sampling point with the earliest sampling time among the data already in the window is squeezed out of the window; the squeezed-out sampling point data is the "removed sampling point data". The length of the sampling data window can be set as required, which is prior art.
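The window bookkeeping of step S2 can be illustrated with a short sketch. The helper name `push_sample` and the sample labels are hypothetical; the only assumption is a first-in, first-out window of preset length `m`.

```python
from collections import deque

def push_sample(window, sample, m):
    """Write one sample into a window of preset length m.

    Returns the sample squeezed out of the window, or None while the
    window is still filling up.
    """
    removed = window.popleft() if len(window) >= m else None
    window.append(sample)
    return removed

window = deque()
# Each event pairs the added sample with the sample it squeezed out.
events = [(s, push_sample(window, s, 3)) for s in ["s1", "s2", "s3", "s4", "s5"]]
```

Once the window is full, every write yields exactly one added sample and one removed sample, which are the two inputs needed by step S3.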

In step S3, the model coefficients may first be updated from the added system sampling point data and the current model coefficients, and then updated from the removed system sampling point data and the current model coefficients; the order may also be reversed.

Preferably, updating the model coefficients based on the added system sampling point data and the current model coefficients comprises:

Defining the input matrix of the initial sampling data sequence as $X = [x_{N-m+1}\ \dots\ x_N]$,

and the output sampling data matrix as $Y^T = [y_{N-m+1}\ \dots\ y_N]$.

The added input/output system sampling point data are $x_{N+1}$ / $y_{N+1}$, forming the matrices $x_+ = x_{N+1}$ and $y_+ = y_{N+1}$.

The intermediate matrix after the sampling point data is added is then:

$$C_{N+} = C_N - C_N x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N,$$

and the model coefficient matrix after the sampling point data is added is:

$$W_{N+}^T = W_N^T - W_N^T x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N + y_+^T x_+^T C_{N+}.$$

The intermediate matrix $C_{N+}$ and the model coefficient matrix $W_{N+}$ obtained after the addition then become the current intermediate matrix $C_N$ and the current model coefficient matrix $W_N$.
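The update for an added sampling point can be sketched as below for the common case of one point per step, where the bracketed term $(I + x_+^T C_N x_+)$ reduces to a scalar and its inverse to a division. The function name `add_point`, the scalar-output assumption and the random check data are illustrative assumptions.

```python
import numpy as np

def add_point(W, C, x, y):
    """Update (W, C) after one sampling point (x, y) enters the window.

    W : (k,) current coefficients;  C : (k, k) current intermediate matrix.
    x : (k,) added input;           y : scalar added output.
    """
    Cx = C @ x
    denom = 1.0 + x @ Cx                  # scalar form of (I + x^T C x)
    C_new = C - np.outer(Cx, Cx) / denom  # relies on C being symmetric
    W_new = W - Cx * (x @ W) / denom + (C_new @ x) * y
    return W_new, C_new

# Check against a direct solve on hypothetical random data.
rng = np.random.default_rng(0)
X = rng.normal(size=(3, 20))
Y = rng.normal(size=20)
C = np.linalg.inv(X[:, :19] @ X[:, :19].T)   # state for the first 19 points
W = C @ (X[:, :19] @ Y[:19])
W_add, C_add = add_point(W, C, X[:, 19], Y[19])
W_ref = np.linalg.solve(X @ X.T, X @ Y)      # all 20 points at once
```

The updated pair should agree with the direct solution over all 20 points up to rounding error.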

Preferably, updating the model coefficients based on the removed system sampling point data and the current model coefficients comprises:

Defining the removed input/output system sampling point data as $x_{N-m+1}$ / $y_{N-m+1}$, forming the matrices $x_- = x_{N-m+1}$ and $y_- = y_{N-m+1}$.

The intermediate matrix after the sampling point data is removed is then:

$$C_{N-} = C_N + C_N x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N,$$

and the model coefficient matrix after the sampling point data is removed is:

$$W_{N-}^T = W_N^T + W_N^T x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N - y_-^T x_-^T C_{N-}.$$

The minus sign inside the bracket follows from applying the matrix inversion lemma to $(C_N^{-1} - x_- x_-^T)^{-1}$.

The intermediate matrix $C_{N-}$ and the model coefficient matrix $W_{N-}$ obtained after the removal then become the current intermediate matrix $C_N$ and the current model coefficient matrix $W_N$.
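The update for a removed sampling point can be sketched in the same way. Applying the matrix inversion lemma to $(C_N^{-1} - x_- x_-^T)^{-1}$ gives a bracketed term $(I - x_-^T C_N x_-)$, which is a scalar when one point is removed. The function name `remove_point` and the check data are hypothetical, and a scalar output per point is assumed.

```python
import numpy as np

def remove_point(W, C, x, y):
    """Update (W, C) after one sampling point (x, y) leaves the window.

    W : (k,) current coefficients;  C : (k, k) current intermediate matrix.
    x : (k,) removed input;         y : scalar removed output.
    """
    Cx = C @ x
    denom = 1.0 - x @ Cx                  # scalar form of (I - x^T C x)
    C_new = C + np.outer(Cx, Cx) / denom  # relies on C being symmetric
    W_new = W + Cx * (x @ W) / denom - (C_new @ x) * y
    return W_new, C_new

# Check against a direct solve on hypothetical random data.
rng = np.random.default_rng(0)
X = rng.normal(size=(3, 20))
Y = rng.normal(size=20)
C = np.linalg.inv(X @ X.T)                   # state for all 20 points
W = C @ (X @ Y)
W_rem, C_rem = remove_point(W, C, X[:, 0], Y[0])
W_ref = np.linalg.solve(X[:, 1:] @ X[:, 1:].T, X[:, 1:] @ Y[1:])
```

The scalar `denom` stays positive as long as the remaining points still determine the coefficients; it approaches zero when the removed point is the only one carrying information in some direction.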

In step S1, the window length of the initial data sequence is adjusted according to need and to the available computing power. If the initial data sequence actually obtained is long, say of length M, a smaller m can be chosen as the data window length: step S1 is applied to the m-sample sequence to obtain the initial model coefficients, and steps S2 to S3 are then applied as each sampling point after the m-th is added. After every change of the sequence the corresponding model coefficients are obtained, recorded, and used as the current model coefficients in the next iteration, until the model coefficients corresponding to sampling point M are obtained.

Preferably, in steps S1, S3 and S4, each calculated set of model coefficients is recorded as the model coefficients of the latest sampling point in the corresponding sampling data sequence.

The invention also discloses a system for determining the regression coefficients of a system linear regression prediction model, comprising:

an initial model coefficient determination module, configured to obtain an initial sequence of system sampling data based on a preset data window length, calculate the model coefficients of the system linear regression prediction model from that sequence, and record them;

a sampling data change acquisition module, configured to obtain the added system sampling point data, form a new sequence of system sampling data based on the preset data window length, and determine the system sampling point data added to, and the system sampling point data removed from, the new sequence relative to the initial sampling data sequence;

a model coefficient update calculation module, configured to update the model coefficients based on the added (or removed) system sampling point data and the current model coefficients, then update the model coefficients again based on the removed (or added) system sampling point data and the updated model coefficients, and record them.

Each time a system sampling point is added, the sampling data change acquisition module obtains the new sequence of system sampling data, and the model coefficient update calculation module calculates and records the model coefficients after the addition of sampling point data and the resulting change of the sampling data sequence.

Beneficial effect

Compared with the prior art, the present invention has the following advantages and improvements:

1. The model coefficients are computed iteratively. Each iteration need only fetch, from the monitoring system or an existing database, the newly added sample values and a fixed number of historical sample values, independently of the sampling window length. Even if the sampling window corresponds to one year or longer, each iteration step fetches only a small amount of data, which lowers the demands on server storage I/O bandwidth, memory capacity and so on.

2. The next step of the iteration depends only on the model coefficients and the intermediate matrix obtained in the previous step; these are the state variables carried between adjacent iterations. If the model coefficient dimension is $k$, the state consists of only $k(k+1)$ values, so very little storage is needed to support the iteration.

3. The iteration uses only matrix addition, subtraction and multiplication and avoids costly inversion or decomposition of large matrices, so the computational load is small and the results can be delivered in real time.

4. Because a single iteration is computationally light and fetches little data, a large number of models can be processed in parallel with limited server resources, and the model coefficients can be updated in real time as monitoring samples arrive, giving the continuous trend of the coefficients in real time.
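The state size mentioned in item 2 can be made concrete with a small calculation. The function name, the choice of $k = 9$ and the 8-byte floating-point storage are assumptions made only for illustration.

```python
def state_size(k, bytes_per_value=8):
    """Stored values for W (k) plus C (k x k), and their size in bytes."""
    values = k + k * k          # equals k * (k + 1)
    return values, values * bytes_per_value

# Hypothetical model with k = 9 coefficients stored as 64-bit floats.
values, n_bytes = state_size(9)   # 90 values, 720 bytes
```

The state per model is therefore fixed by the coefficient dimension alone and does not grow with the window length.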

Brief description of the drawings

Fig. 1 is a flow diagram of the method of the invention;

Fig. 2 is a flow diagram of a specific embodiment of the method of the invention;

Fig. 3 shows the results of an application example of the invention.

Detailed description of the embodiments

The invention is further described below with reference to the drawings and specific embodiments.

Symbols:

X: system input; Y: system output; W: model coefficients; C: intermediate matrix; I: identity matrix. Note: unless otherwise stated, vectors are column vectors.

Superscripts:

T: transpose; -1: inverse; H: conjugate transpose

Subscripts:

+: added data; -: removed data

Embodiment 1

Referring to Fig. 1, the method of this embodiment for determining the regression coefficients of a system linear regression prediction model comprises steps S1 to S4 as set out in the Summary of the invention.

The invention is also used to analyze changes in system performance; that is, it further comprises step S5: analyzing changes in system performance based on the recorded model coefficients. The analysis of system performance from changes in the model coefficients can use the prior art.

In step S1, for the system model $Y^T = W^T X$, where $Y$ is the system output corresponding to the sampled input $X$ and $W$ holds the model coefficients, the intermediate matrix is $C_N = (XX^T)^{-1}$ and the model coefficients are $W_N^T = Y^T X^T C_N$.

Because ridge regression copes well with highly collinear sampling point data, step S1 may instead use, for the system model $Y^T = W^T X$, the intermediate matrix $C_N = (XX^T + \lambda I)^{-1}$, where $\lambda$ is the ridge regression coefficient and $I$ is the identity matrix, with the current model coefficient matrix $W_N^T = Y^T X^T C_N$. Choosing this initialization converts the method as a whole to ridge regression.

In step S2, each newly added sampling point is written into the sampling data window; the newly written sampling point data is the "added system sampling point data". Because the sampling data window has a preset length, the sampling point with the earliest sampling time among the data already in the window is squeezed out of the window; the squeezed-out sampling point data is the "removed sampling point data". The length of the sampling data window can be set as required, which is prior art; the iteration itself is independent of the window length.

The model coefficients are updated based on the added system sampling point data and the current model coefficients as follows:

Define the input matrix of the initial sampling data sequence as $X = [x_{N-m+1}\ \dots\ x_N]$,

and the output sampling data matrix as $Y^T = [y_{N-m+1}\ \dots\ y_N]$.

With $x_+$, $y_+$ and $C_{N+}$ as defined in the Summary of the invention, the model coefficient matrix after the sampling point data is added is:

$$W_{N+}^T = W_N^T - W_N^T x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N + y_+^T x_+^T C_{N+}.$$

The model coefficients are updated based on the removed system sampling point data and the current model coefficients as follows:

Define the removed input/output system sampling point data as $x_{N-m+1}$ / $y_{N-m+1}$, forming the matrices $x_- = x_{N-m+1}$ and $y_- = y_{N-m+1}$.

The intermediate matrix after the sampling point data is removed is then:

$$C_{N-} = C_N + C_N x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N,$$

and the model coefficient matrix after the sampling point data is removed is:

$$W_{N-}^T = W_N^T + W_N^T x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N - y_-^T x_-^T C_{N-}.$$

In steps S1, S3 and S4, each calculated set of model coefficients is recorded as the model coefficients of the latest sampling point in the corresponding sampling data sequence.

The principle of the method is as follows. In the overdetermined system $Y^T = W^T X$, $W$ holds the undetermined coefficients and each column of the input sampling matrix $X$ is one sampling point. The formula for $W$ is then:

$$W^T = Y^T X^T (XX^T)^{-1}.$$

The matrix inversion lemma states:

$$(A + XY^H)^{-1} = A^{-1} - A^{-1} X (I + Y^H A^{-1} X)^{-1} Y^H A^{-1}$$
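The lemma can be checked numerically. The sketch below uses hypothetical real-valued random matrices (so the conjugate transpose reduces to the ordinary transpose) and scales the perturbation so that $A + XY^T$ stays well conditioned; all names and sizes are illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)
k, p = 4, 2

# Symmetric positive definite A and a small rank-p perturbation X Y^T.
A = rng.normal(size=(k, k))
A = A @ A.T + k * np.eye(k)
X = 0.1 * rng.normal(size=(k, p))
Y = 0.1 * rng.normal(size=(k, p))

A_inv = np.linalg.inv(A)
lhs = np.linalg.inv(A + X @ Y.T)
# Only a p x p matrix is inverted on the right-hand side.
small = np.linalg.inv(np.eye(p) + Y.T @ A_inv @ X)
rhs = A_inv - A_inv @ X @ small @ Y.T @ A_inv
```

The point of the lemma for this method is that, once $A^{-1}$ is known, the right-hand side needs only the inverse of a $p \times p$ matrix, where $p$ is the number of sampling points added or removed.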

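Based on the above formula, the cases of adding new sampling points to the original sampling data set $X$ and of removing original sampling points from it are discussed below, with $C = (XX^T)^{-1}$ denoting the intermediate matrix.

Computing the coefficients iteratively when data is added:

Suppose $W_N$ and $C_N$ have been obtained for the sampling window $X_N$, $Y_N$. After new sampling points $x_+$, $y_+$ are added, the new $X$ is $X_{N+1} = [X_N\ x_+]$ and the new $Y$ is $Y_{N+1}^T = [Y_N^T\ y_+^T]$.

The iterative formula for $C_{N+1}$ is therefore:

$$\begin{aligned}
C_{N+1} &= (X_{N+1} X_{N+1}^T)^{-1} = (X_N X_N^T + x_+ x_+^T)^{-1} \\
&= (X_N X_N^T)^{-1} - (X_N X_N^T)^{-1} x_+ \left(I + x_+^T (X_N X_N^T)^{-1} x_+\right)^{-1} x_+^T (X_N X_N^T)^{-1} \\
&= C_N - C_N x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N
\end{aligned}$$

Because $W_N^T = Y_N^T X_N^T C_N$, the iterative formula for $W_{N+1}$ is:

$$\begin{aligned}
W_{N+1}^T &= Y_{N+1}^T X_{N+1}^T C_{N+1} = (Y_N^T X_N^T + y_+^T x_+^T) C_{N+1} \\
&= Y_N^T X_N^T C_N - Y_N^T X_N^T C_N x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N + y_+^T x_+^T C_{N+1} \\
&= W_N^T - W_N^T x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N + y_+^T x_+^T C_{N+1}
\end{aligned}$$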

Computing the coefficients iteratively when data is removed:

Suppose $W_N$ and $C_N$ have been obtained for the sampling window $X_N$, $Y_N$. Write $X_N = [x_-\ X_{N+1}]$ and $Y_N^T = [y_-^T\ Y_{N+1}^T]$; after the original sampling points $x_-$, $y_-$ are removed, the new sampling window data is $X_{N+1}$, $Y_{N+1}$.

Then:

$$X_N X_N^T = [x_-\ X_{N+1}][x_-\ X_{N+1}]^T = X_{N+1} X_{N+1}^T + x_- x_-^T$$

The iterative formula for $C_{N+1}$ follows from the matrix inversion lemma with $X = -x_-$ and $Y = x_-$:

$$\begin{aligned}
C_{N+1} &= (X_{N+1} X_{N+1}^T)^{-1} = (X_N X_N^T - x_- x_-^T)^{-1} \\
&= (X_N X_N^T)^{-1} + (X_N X_N^T)^{-1} x_- \left(I - x_-^T (X_N X_N^T)^{-1} x_-\right)^{-1} x_-^T (X_N X_N^T)^{-1} \\
&= C_N + C_N x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N
\end{aligned}$$

and the iterative formula for $W_{N+1}$ is:

$$\begin{aligned}
W_{N+1}^T &= Y_{N+1}^T X_{N+1}^T (X_{N+1} X_{N+1}^T)^{-1} = (Y_N^T X_N^T - y_-^T x_-^T) C_{N+1} \\
&= Y_N^T X_N^T C_N + Y_N^T X_N^T C_N x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N - y_-^T x_-^T C_{N+1} \\
&= W_N^T + W_N^T x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N - y_-^T x_-^T C_{N+1}
\end{aligned}$$

Embodiment 1-1

The method of the invention comprises two independent processes: iteratively adding data and iteratively removing data. Given the regression coefficients for a set of sampling points, iteratively adding data yields the regression coefficients for that set together with the added sampling points. Given the regression coefficients for a set of sampling points, iteratively removing data yields the regression coefficients after some sampling points have been removed from the set.

Referring to Fig. 2, to ensure numerical stability, each iteration in this embodiment first adds data and then removes data. The specific steps are as follows.

1. Initialization:

1.1 Assume the current sampling point N points to sampling point i;

1.2 Obtain the original input data $x_{N-m+1}, \dots, x_N$ and form the matrix $X = [x_{N-m+1}\ \dots\ x_N]$;

1.3 Obtain the original output data $y_{N-m+1}, \dots, y_N$ and form the matrix $Y^T = [y_{N-m+1}\ \dots\ y_N]$;

1.4 Calculate the intermediate matrix $C_N$ according to the formula $C_N = (XX^T)^{-1}$;

1.5 Calculate the model coefficients $W_N$ according to the formula $W_N^T = Y^T X^T C_N$;

1.6 Record the state $W_N$, $C_N$ as the current model coefficients and the current intermediate matrix.

At this point $W_N$ and $C_N$ have been calculated from the m data points at sampling points i-m+1 to i. That is, $W_N$ and $C_N$ are the model coefficients and intermediate matrix corresponding to sampling point i.

2. Adding a data point:

2.1 Obtain the added original input data $x_{N+1}$ and form the matrix $x_+ = x_{N+1}$; the new input sampling matrix is $X_{N+1} = [X_N\ x_+]$;

2.2 Obtain the added original output data $y_{N+1}$ and form the matrix $y_+ = y_{N+1}$; the new output sampling matrix is $Y_{N+1}^T = [Y_N^T\ y_+^T]$;

2.3 According to the formula $C_{N+} = C_N - C_N x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N$, iteratively calculate $C_{N+}$ from the current intermediate matrix $C_N$;

2.4 According to the formula $W_{N+}^T = W_N^T - W_N^T x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N + y_+^T x_+^T C_{N+}$, iteratively calculate $W_{N+}$ from the current model coefficients $W_N$, the current intermediate matrix $C_N$ and the result $C_{N+}$ of step 2.3;

2.5 Record the state $W_N = W_{N+}$, $C_N = C_{N+}$ as the current model coefficients and the current intermediate matrix.

At this point $W_N$ and $C_N$ have been calculated from the m+1 data points at sampling points N-m+1 to N+1.

3. Removing a data point:

3.1 Obtain the removed original input data $x_{N-m+1}$ and form the matrix $x_- = x_{N-m+1}$;

3.2 Obtain the removed original output data $y_{N-m+1}$ and form the matrix $y_- = y_{N-m+1}$;

3.3 According to the formula $C_{N-} = C_N + C_N x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N$, iteratively calculate $C_{N-}$ from the current intermediate matrix $C_N$;

3.4 According to the formula $W_{N-}^T = W_N^T + W_N^T x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N - y_-^T x_-^T C_{N-}$, iteratively calculate $W_{N-}$ from the current model coefficients $W_N$, the current intermediate matrix $C_N$ and the result $C_{N-}$ of step 3.3;

3.5 Record the state $W_N = W_{N-}$, $C_N = C_{N-}$ as the current model coefficients and the current intermediate matrix.

At this point $W_N$ and $C_N$ have been calculated from the m data points at sampling points N-m+2 to N+1. That is, $W_N$ and $C_N$ are the model coefficients and intermediate matrix of sampling point i+1.

3.6 Save the state $W_N$ to the database as the model coefficients of sampling point i+1 for use in subsequent analysis.

4. Point the current sampling point N to i+1 and repeat steps 2, 3 and 4 to obtain the model coefficients of the subsequent sampling points in turn.
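Steps 1 to 4 above can be put together in a compact sketch. It assumes one scalar output per sampling point, stores samples one per column, and uses the scalar forms of the bracketed terms (with $I - x_-^T C_N x_-$ for removal, as given by the matrix inversion lemma). All function names and the random test data are hypothetical.

```python
import numpy as np

def add_point(W, C, x, y):
    """Step 2: a sampling point (x, y) enters the window."""
    Cx = C @ x
    denom = 1.0 + x @ Cx
    C_new = C - np.outer(Cx, Cx) / denom
    return W - Cx * (x @ W) / denom + (C_new @ x) * y, C_new

def remove_point(W, C, x, y):
    """Step 3: the oldest sampling point (x, y) leaves the window."""
    Cx = C @ x
    denom = 1.0 - x @ Cx
    C_new = C + np.outer(Cx, Cx) / denom
    return W + Cx * (x @ W) / denom - (C_new @ x) * y, C_new

def rolling_coefficients(X, Y, m):
    """Return one coefficient vector per sampling point from index m-1 on."""
    C = np.linalg.inv(X[:, :m] @ X[:, :m].T)             # step 1: initialization
    W = C @ (X[:, :m] @ Y[:m])
    history = [W]
    for n in range(m, X.shape[1]):
        W, C = add_point(W, C, X[:, n], Y[n])             # step 2: add newest
        W, C = remove_point(W, C, X[:, n - m], Y[n - m])  # step 3: drop oldest
        history.append(W)                                 # step 4: record, advance
    return np.array(history)

# Hypothetical noisy data from known coefficients.
rng = np.random.default_rng(0)
k, m, total = 3, 30, 80
X = rng.normal(size=(k, total))
Y = np.array([1.0, -2.0, 0.5]) @ X + 0.01 * rng.normal(size=total)
history = rolling_coefficients(X, Y, m)

# Direct least squares on the final window, for comparison.
Xw, Yw = X[:, -m:], Y[-m:]
W_direct = np.linalg.solve(Xw @ Xw.T, Xw @ Yw)
```

Each pass through the loop touches only the newest and the oldest sample, so its cost depends on the coefficient dimension `k` but not on the window length `m`.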

The above process is equivalent to opening a window of length m on the sequence of sampling points. In each iteration, one new sampling point is first added at the tail of the window and one sampling point is then removed at the head of the window, which amounts to sliding the whole window forward by one point. From the values of $W_N$ and $C_N$ before the iteration and the sampling point data added to and removed from the window, $W_{N+1}$ and $C_{N+1}$ at the next sampling point can be calculated, independently of the window length m. When the model coefficients at all sampling points have to be calculated in turn, the iterative algorithm therefore greatly reduces the computational complexity.

In the above process, the number k of sampling points added at the tail of the window in each iteration may be greater than 1. The number of sampling points removed at the head of the window must then match the number added, i.e. it is also k. From the values of $W_N$ and $C_N$ before the iteration and the sampling point data added to and removed from the window, $W_{N+k}$ and $C_{N+k}$ after k sampling points can be calculated.
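A block version of the two updates might look as follows. The number of points moved per iteration is called `p` in the code to avoid clashing with the coefficient dimension; the bracketed term is then a `p x p` matrix, handled here with `np.linalg.solve`. Function names, sizes and data are illustrative assumptions, and one scalar output per sampling point is assumed.

```python
import numpy as np

def add_block(W, C, xb, yb):
    """xb: (k, p) added inputs as columns; yb: (p,) added outputs."""
    p = xb.shape[1]
    G = np.linalg.solve(np.eye(p) + xb.T @ C @ xb, xb.T @ C)  # (p, k)
    C_new = C - C @ xb @ G
    W_new = W - G.T @ (xb.T @ W) + C_new @ (xb @ yb)
    return W_new, C_new

def remove_block(W, C, xb, yb):
    """xb: (k, p) removed inputs as columns; yb: (p,) removed outputs."""
    p = xb.shape[1]
    G = np.linalg.solve(np.eye(p) - xb.T @ C @ xb, xb.T @ C)  # (p, k)
    C_new = C + C @ xb @ G
    W_new = W + G.T @ (xb.T @ W) - C_new @ (xb @ yb)
    return W_new, C_new

# Slide a window of m points forward by p points at once (hypothetical data).
rng = np.random.default_rng(0)
k_dim, m, p = 3, 30, 4
X = rng.normal(size=(k_dim, m + p))
Y = rng.normal(size=m + p)
C = np.linalg.inv(X[:, :m] @ X[:, :m].T)
W = C @ (X[:, :m] @ Y[:m])
W, C = add_block(W, C, X[:, m:], Y[m:])       # add the p newest points
W, C = remove_block(W, C, X[:, :p], Y[:p])    # remove the p oldest points
W_direct = np.linalg.solve(X[:, p:] @ X[:, p:].T, X[:, p:] @ Y[p:])
```

The only linear system solved per update has size `p x p`, so the cost remains independent of the window length.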

In the above process, the initialization step need not load all m sampling points of the window at once; instead, step 2 can be called repeatedly to add the sampling points one at a time until all m have been added. This reduces the server memory occupied during initialization.
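One way to realize this incremental initialization is sketched below. It assumes the ridge variant, because an empty window has no invertible $XX^T$ to start from, whereas with ridge regression the empty-window state is simply $C = (\lambda I)^{-1}$ and $W = 0$. The function names, the value of `lam` and the data are hypothetical.

```python
import numpy as np

def add_point(W, C, x, y):
    """Add one sampling point (x, y) to the state (W, C)."""
    Cx = C @ x
    denom = 1.0 + x @ Cx
    C_new = C - np.outer(Cx, Cx) / denom
    return W - Cx * (x @ W) / denom + (C_new @ x) * y, C_new

def init_incremental(X, Y, lam):
    """Build (W, C) for a window by adding its points one at a time."""
    k = X.shape[0]
    C = np.eye(k) / lam    # inverse of lam * I: window with no samples yet
    W = np.zeros(k)
    for n in range(X.shape[1]):
        W, C = add_point(W, C, X[:, n], Y[n])
    return W, C

rng = np.random.default_rng(0)
X = rng.normal(size=(3, 40))
Y = rng.normal(size=40)
lam = 1e-2
W_inc, C_inc = init_incremental(X, Y, lam)
W_batch = np.linalg.solve(X @ X.T + lam * np.eye(3), X @ Y)  # batch ridge
```

Only one sampling point needs to be held in memory at a time, at the cost of m small updates instead of one batch computation.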

Embodiment 2

As an application example, the invention is used for photovoltaic power station operation and maintenance, where historical data is typically logged at intervals of 15 minutes or less. On this basis, about 1,000,000 records are logged over the 25-year life cycle of a power station, of which one year accounts for about 40,000. To estimate the effect of component aging on power generation, an estimation model is built with 8 monitored parameters, such as weather conditions, as inputs and the actual power generation as output. In the model, the component performance degradation rate increases gradually over time and is reflected in the continuous variation over time of the coefficients of the 8 input parameters and of the model constant term. To reduce the influence of incidental events such as shading, cleaning and maintenance on the aging estimate, one continuous year of data is used as the time window for the model calculation. After the power station has been running for a full year, the system therefore calculates the model coefficients at every monitoring sampling time from the data of the preceding year. In the end, the model coefficients at every sampling time over the remaining 24 years of the life cycle are obtained. From the changes in the model coefficients, the performance degradation in different phases of the power station's whole life cycle can be assessed.
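The orders of magnitude quoted above can be checked with simple arithmetic. The sketch assumes exactly one record every 15 minutes and 365-day years, both of which are simplifying assumptions; the function name is hypothetical.

```python
def records(interval_minutes, years):
    """Number of logged records at a fixed interval over whole years."""
    per_day = 24 * 60 // interval_minutes
    return per_day * 365 * years

per_year = records(15, 1)    # 35040 records in one year
lifetime = records(15, 25)   # 876000 records in 25 years
```

These counts are consistent with the rounded figures of about 40,000 and about 1,000,000 records; shorter logging intervals raise both numbers proportionally.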

To evaluate the error of the iterative calculation, a simulation was carried out for the above application scenario. The simulation created 1,000,000 records of noisy power station operating data and, using 40,000 records as the analysis window, iteratively calculated the absolute error of the model coefficients at every data point. In the figure, the abscissa is the sampling point index and the ordinate is the absolute error between the regressed values of the 8 input coefficients and the constant term and their preset values. As the figure shows, none of the error terms diverges; data9, the absolute error of the constant term, is the largest of all the errors but still does not exceed the order of 1e-10.

Those skilled in the art will understand that embodiments of the present application may be provided as a method, a system or a computer program product. The application may therefore take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware. Moreover, the application may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to magnetic disk storage, CD-ROM and optical storage) containing computer-usable program code.

The application is described with reference to flowcharts and/or block diagrams of the method, device (system) and computer program product according to its embodiments. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks therein, can be implemented by computer program instructions. These instructions may be provided to the processor of a general-purpose computer, special-purpose computer, embedded processor or other programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce means for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.

These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means that implement the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.

These computer program instructions may also be loaded onto a computer or other programmable data processing device, so that a series of operational steps is executed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowchart and/or one or more blocks of the block diagram.

Embodiments of the invention have been described above with reference to the drawings, but the invention is not limited to these specific embodiments, which are illustrative rather than restrictive. Guided by the invention, those of ordinary skill in the art can devise many further forms without departing from the purpose of the invention or the scope protected by the claims, and all of these fall within the protection of the invention.

Claims

1. A method for determining the regression coefficients of a system linear regression prediction model, characterized by comprising:

S1: based on a preset data window length, obtaining an initial sequence of system sampling data, calculating the model coefficients of the system linear regression prediction model from that sequence, and recording them;

S2: obtaining the added system sampling point data and, based on the preset data window length, forming a new sequence of system sampling data; determining the system sampling point data added to, and the system sampling point data removed from, the new sequence relative to the initial sampling data sequence;

S3: updating the model coefficients based on the added (or removed) system sampling point data and the current model coefficients; then updating the model coefficients again based on the removed (or added) system sampling point data and the updated model coefficients, and recording them;

S4: as system sampling points are added, repeating steps S2 to S3 to obtain and record the model coefficients after each addition of sampling point data and the resulting change of the sampling data sequence.

2. The method according to claim 1, characterized by further comprising step S5: analyzing changes in system performance based on the recorded model coefficients.

3. The method according to claim 1, characterized in that, in step S1, for the system model $Y^T = W^T X$, where $Y$ is the system output corresponding to the sampled input $X$ and $W$ holds the model coefficients, the intermediate matrix is $C_N = (XX^T)^{-1}$ and the model coefficients are $W_N^T = Y^T X^T C_N$.

4. The method according to claim 1, characterized in that, in step S1, for the system model $Y^T = W^T X$, where $Y$ is the system output corresponding to the sampled input $X$ and $W$ holds the model coefficients, the intermediate matrix is $C_N = (XX^T + \lambda I)^{-1}$, where $\lambda$ is the ridge regression coefficient and $I$ is the identity matrix, and the current model coefficient matrix is $W_N^T = Y^T X^T C_N$.

5. The method according to claim 3 or 4, characterized in that updating the model coefficients based on the added system sampling point data and the current model coefficients comprises:

defining the input matrix of the initial sampling data sequence as $X = [x_{N-m+1}\ \dots\ x_N]$,

and the output sampling data matrix as $Y^T = [y_{N-m+1}\ \dots\ y_N]$;

the model coefficient matrix after the sampling point data is added being:

$$W_{N+}^T = W_N^T - W_N^T x_+ (I + x_+^T C_N x_+)^{-1} x_+^T C_N + y_+^T x_+^T C_{N+};$$

and updating the current intermediate matrix $C_N$ and the current model coefficient matrix $W_N$ to the intermediate matrix $C_{N+}$ and the model coefficient matrix $W_{N+}$ obtained after the sampling point data is added.

6. The method according to claim 5, characterized in that updating the model coefficients based on the removed system sampling point data and the current model coefficients comprises:

defining the removed input/output system sampling point data as $x_{N-m+1}$ / $y_{N-m+1}$, forming the matrices $x_- = x_{N-m+1}$ and $y_- = y_{N-m+1}$;

the intermediate matrix after the sampling point data is removed being:

$$C_{N-} = C_N + C_N x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N,$$

the model coefficient matrix after the sampling point data is removed being:

$$W_{N-}^T = W_N^T + W_N^T x_- (I - x_-^T C_N x_-)^{-1} x_-^T C_N - y_-^T x_-^T C_{N-};$$

and updating the current intermediate matrix $C_N$ and the current model coefficient matrix $W_N$ to the intermediate matrix $C_{N-}$ and the model coefficient matrix $W_{N-}$ obtained after the sampling point data is removed.

7. The method according to claim 1, characterized in that, in steps S1, S3 and S4, each calculated set of model coefficients is recorded as the model coefficients of the latest sampling point in the corresponding sampling data sequence.

8. A system for determining the regression coefficients of a system linear regression prediction model, characterized by comprising:

an initial model coefficient determination module, configured to obtain an initial sequence of system sampling data based on a preset data window length, calculate the model coefficients of the system linear regression prediction model from that sequence, and record them;

a sampling data change acquisition module, configured to obtain the added system sampling point data, form a new sequence of system sampling data based on the preset data window length, and determine the system sampling point data added to, and the system sampling point data removed from, the new sequence relative to the initial sampling data sequence;

a model coefficient update calculation module, configured to update the model coefficients based on the added (or removed) system sampling point data and the current model coefficients, then update the model coefficients again based on the removed (or added) system sampling point data and the updated model coefficients, and record them;

wherein, each time a system sampling point is added, the sampling data change acquisition module obtains the new sequence of system sampling data, and the model coefficient update calculation module calculates and records the model coefficients after the addition of sampling point data and the resulting change of the sampling data sequence.