CN107220483A

CN107220483A - A kind of mode prediction method of polynary time series data

Info

Publication number: CN107220483A
Application number: CN201710324105.6A
Authority: CN
Inventors: 肖云; 许震洲; 王欣; 王选宏; 高颢函; 陈晓江; 房鼎益
Original assignee: Northwest University
Current assignee: Shaanxi Dahang Wujiang Information Technology Co ltd
Priority date: 2017-05-09
Filing date: 2017-05-09
Publication date: 2017-09-29
Anticipated expiration: 2037-05-09
Also published as: CN107220483B

Abstract

The invention discloses a kind of mode prediction method of polynary time series data, including three phases, the time series formed first to each conditional-variable and decision variable finds candidate's interest mode collection, each candidate's interest mode collection is clustered respectively；Secondly, the prediction between Production conditions variable and decision variable；The interest mode that finally the conditional-variable execution stage one of testing data is obtained goes the prediction rule of matching stage two, if meeting the former piece of prediction rule, exports predicting the outcome for decision variable.The mode prediction method amount of calculation of the polynary time series data of the present invention is small, the effective time complexity reduced in model prediction, solves the problem of time complexity is too high in conventional method.

Description

A kind of mode prediction method of polynary time series data

Technical field

The invention belongs to computer realm, the more particularly to Data Mining in computer, and in particular to a kind of polynary The mode prediction method of time series data.

Background technology

Time series forecasting is in weather forecast, and the field such as stock is a very important research direction.One in time series forecasting Individual most important method is exactly the behavior that can remove to predict its dependent variable according to the trend of some variables, and this is just called polynary sequential Prediction.For example, if we consider that two variable correlations, we may wonder that temperature is added for example in weather forecast Whether 10% have impact on the trend of humidity.

In polynary prediction, main certain methods can be divided into mathematics and artificial method by us.In mathematical method Such as ARIMA, (Autoregressive integrated Moving Average Model, nonstationary time series is converted into Stationary time series, then only carries out recurrence institute by dependent variable to its lagged value and the present worth of stochastic error and lagged value The model of foundation) or exponential smoothing algorithm handle real world in non-linear irregular data when it is unreliable.Manually Neutral net, SVMs and k nearest neighbor are all the machine learning methods that some are applied to time series forecasting.Yet with very Many time variables can be translated and stretched over time, and these traditional methods will fail.In order to solve this problem, one Solution is exactly one variate-value of behavior rather than consideration for considering a sequence.For example certain methods are in time series analysis Middle carry out model prediction.These methods all assume it is a kind of data are represented then as possible look for most frequent pattern.However, this The subject matter of solutions presence is a bit：Data represent not reduce the especially high dimension of data dimension in these methods According to, and they must also go to cause time complexity to improve with the method processing data of such as cluster；Another question is Their research has no ability to explain output rule and relation, therefore reduction and explanation output rule and the relation of time complexity Need effectively to solve.

The content of the invention

For the defect and deficiency of prior art, it is an object of the invention to provide a kind of model prediction of polynary time series data Method, solves the problem of existing data processing method time complexity is high.

To achieve these goals, the present invention, which is adopted the following technical scheme that, is achieved：

A kind of mode prediction method of polynary time series data, comprises the following steps：

Stage one：The time series formed to each conditional-variable and decision variable finds candidate's interest mode collection, Each candidate's interest mode collection is clustered respectively；

Step 1：Find candidate's interest mode collection；

Step 1.1：Searching can use initial subsequence

For time series S={ s₁,…,s_l, from s₁Start to find slope m successively₁≠ 0 two adjacent time sequences Value, regard search out first two adjacent time sequential values as initial subsequence S_i={ s_i,s_i+1, wherein, i=1,2 ..., L-1, l are the length of time series, slope m₁Calculation formula be：

Step 1.2：Calculate the slope of adjacent time sequential value

Increase next s to available initial subsequence_i+2, calculate s_i+2And s_i+1Slope m₂；

Step 1.3：Obtain interest mode

If m₂It is not equal to m₁, obtain interest mode p_α={ s_i,s_i+1,s_i+2}；

If m₂Equal to m₁, continue step 1.2, until m_kIt is not equal to m₁Untill, obtain interest mode p_α={ s_i,s_i+1,…, s_i+k, wherein, m_kFor s_i+kAnd s_i+k-1Slope, k=1,2 ..., l-2；

Step 1.4 obtains candidate's interest mode collection

For time series S={ s₁,…,s_l, from interest mode p_αLast time sequential value start, repeat walk Rapid 1.1 to step 1.3, until finding whole time series S={ s₁,…,s_lIn all interest mode, form candidate's interest Set of patterns P_c={ p₁,p₂,…,p_α,…,p_β,…,p_n}；

Step 2：Candidate's interest mode clustering；

Step 2.1：1. 2. it is assigned to using following prune rule with prune rule to the pattern distance value for meeting rule condition It is infinitely great；

Prune rule is 1.：If candidate's interest mode collection P_cIn any two interest mode p_α,p_βDo not appear in simultaneously Peak width is w_sThe same area in, by the D in Distance matrix D_αβIt is assigned to infinity；Wherein, w_sIt is the parameter that user specifies, D is the distance matrix of interest mode,

D_αβ=d_αβ(p_α,p_β), d_αβFor p_αAnd p_βEuclidean distance；

Prune rule is 2.：If interest mode p_αSlope be negative, and interest mode p_βSlope for just, by Distance matrix D In D_αβIt is assigned to infinity；

Step 2.2：Calculate D in distance matrix_αβFor the distance of non-infinitely great element, and assignment is corresponding into distance matrix Position；

Step 2.3：Compare d_αβ(p_α,p_β) and the d that specifies of user_minBetween size, if d_αβ≤d_min, from P_cMiddle deletion p_α And p_βThe less interest mode of number of time sequential value in two interest modes, finally gives new interest mode collection P；

Wherein, d_minTake certain between Euclidean distance minimum value and the maximum between two neighboring time sequential value Individual value, is specifically specified by user；

Stage two：Produce prediction rule

Step 3：Correlation rule is calculated with Apriori algorithm

The interest mode collection P for merging each time variable obtains P_all, using Apriori algorithm to P_allIn interest mode Rule digging is associated, multiple correlation rules between different time variable are obtained；

Step 4：Generate prediction rule

1. m (p_vMMultiple correlation rules of)=m (p ')=1 merge to form following prediction rule：

A₁≤V(p_v1)≤B₁, and A₂≤V(p_v2)≤B₂..., and A_j≤V(p_v)_j≤B_j..., and A_λ≤V(p_vλ)≤B, then C₁≤V(p′)≤C₂, and postpone Δ T₁The individual unit interval；

Wherein, p_vjBe conditional-variable formation interest mode, j=1,2 ..., λ, λ >=1, λ be conditional-variable number, p ' It is the interest mode of decision variable formation；

m(p_vM) it is s_LAnd s₁Between slope, m (p_vM)=sgn (s_L-s₁), p_vMIt is to decision variable shadow in conditional-variable Ring maximum interest mode, s_LIt is p_vMIn last time sequential value, s₁It is p_vMIn first time sequential value；M (p ') is s′_LAnd s₁' between slope, s '_LRepresent the time sequential value of last in p ', s₁' represent first time sequential value in p '；

A_jAnd B_jIt is m (p respectively_vMV (p in the interest mode correlation rule of)=m (p ')=1_vj) minimum value and maximum, C₁And C₂It is m (p respectively_vMV (p ') minimum value and maximum, A in the interest mode correlation rule of)=m (p ')=1_j、B_j、C₁ And C₂It is positive number；

V(p_vj) it is interest mode p in conditional-variable_vjVariable quantity, V (p ') is the change of interest mode p ' in decision variable Change amount,

V(p_vj)=(max (p_vj)-min(p_vj))×m(p_vj)

V (p ')=(max (p ')-min (p ')) × m (p ')

max(p_vj) and min (p_vj) interest mode p is represented respectively_vjMaximum time sequential value and minimum time sequential value；

Time delay Δ T₁=max (Δ (r_g)), Δ (r_g)=I_pvM-I_p′, I_pvMIt is p_vMStart Time value, I_p′It is p ' starting Time value；

2. m (p_vM)=m (p ')=- 1 multiple correlation rules merge to form following prediction rule：

E₁≤V(p_v1)≤F₁, and E₂≤V(p_v2)≤F₂..., and E_j≤V(p_v)_j≤ F,_j..., and E_η≤V(p_vη)≤F_η, then G₁≤V(p′)≤G₂, and postpone Δ T₂The individual unit interval；

Wherein, E_jAnd F_jIt is m (p respectively_vMV (p in)=m (p ')=- 1 interest mode correlation rule_j) minimum value and most Big value, G₁And G₂It is m (p respectively_vMV (p ') minimum value and maximum, j in)=m (p ')=- 1 interest mode correlation rule =1,2 ..., η, η >=1, η are the number of conditional-variable；E_j、F_j、G₁And G₂It is negative, j is natural number；ΔT₂=max (Δs (r_g))；

Stage three：The interest mode that the conditional-variable execution stage one of testing data is obtained goes the prediction of matching stage two Rule, if meeting the former piece of prediction rule, exports predicting the outcome for decision variable.

Further, candidate's interest mode clustering method can be replaced with the following method in described step 2：

Step 2.1：The MBR for each pattern concentrated using R-tree structure candidate patterns, the data structure of rock mechanism, Obtain the index of pattern；

Step 2.2：To each child nodes i and j in R-tree data structures, following prune rule 1 and rule 2 are utilized Infinity is assigned to the pattern distance value for meeting rule condition.

Prune rule is 1.：If two interest mode p_α,p_βPeak width is not appeared in simultaneously for w_sThe same area In, by the D in distance matrix_αβIt is assigned to infinity；Wherein, w_sIt is the parameter that user specifies, D is the distance matrix of interest mode,

D_αβ=d_αβ(p_α,p_β), d_αβFor p_αAnd p_βEuclidean distance；

Step 2.3：D in Distance matrix D_αβCalculated for non-infinitely great element according to Euclidean distance, and assignment The corresponding position into distance matrix；

Step 2.4：Compare d_αβ(p_α,p_β) and the d that specifies of user_minBetween size, if d_αβ≤d_min, from P_cMiddle deletion p_α And p_βThe less interest mode of number of time series in two interest modes, finally gives new interest mode collection P；Wherein, d_minSome value between Euclidean distance minimum value and the maximum between two neighboring time series is taken, specifically by user Specify.

Compared with prior art, the beneficial effects of the invention are as follows：The mode prediction method of the polynary time series data of the present invention Amount of calculation is small, the effective time complexity reduced in model prediction, solves that time complexity in conventional method is too high to ask Topic.

Brief description of the drawings

Fig. 1 is the timing diagram of air themperature and rammed earth temperature.

Fig. 2 is the variable relation figure of air themperature and rammed earth temperature.

Fig. 3 is the MBR between pattern.

Fig. 4 is the data structure diagram based on R-tree for being used to retrieve candidate pattern proposed by the invention.

Fig. 5 is that the use beta pruning of Euclidean distance measurement carries out the performance of distance matrix calculating with unused Pruning strategy With sequence number figure.

Fig. 6 is to enter row distance square using the Euclidean distance calculating of the prune rule time used and using beta pruning and R-tree The time performance that battle array is calculated compares figure.

Fig. 7 is the six regular performance evaluations generated in embodiment.

Explanation is further explained in detail to the particular content of the present invention with reference to embodiments.

Embodiment

Conditional-variable in the present invention is the variable that can be used for predicting its dependent variable, and decision variable is exactly can be by other The variable of variable prediction.

The procedure of the present invention is divided into three phases：

Stage one：The time series formed to each conditional-variable and decision variable finds candidate's interest mode collection, Each candidate's interest mode collection is clustered respectively；Stage two：Produce prediction rule；Stage three：Generated according to the stage two Prediction rule is predicted to testing data.

The wherein stage one and stage two is the training stage, is predicted after existing data are performed into stage one and stage two Rule, the stage three is to be directed to data to be measured, and data to be measured are undergone to the interest mode obtained after the stage one and go to match rank Duan Erzhong prediction rule, if meeting the former piece of prediction rule, exports predicting the outcome for decision variable.

In the stage one：Interest mode, and the behavior of summary data are found, for the change of data, slope is found for just It is negative pattern with slope.Because sequence data may be clustered and divided to these patterns comprising the pattern repeated, the algorithm Group, be specially：

Step 1：Find candidate's interest mode collection

Step 1.1：Searching can use initial subsequence

For time series S={ s₁,…,s_l, from s₁Start to find slope m successively₁Be 0 two adjacent time sequences Train value, regard search out first two adjacent time sequential values as initial subsequence S_i={ s_i,s_i+1, wherein, i=1, 2 ..., l-1, l are the length of time series,Slope m₁Calculation formula be：

Step 1.2：Calculate the slope of adjacent time sequential value

Step 1.3：Obtain interest mode

If m₂It is not equal to m₁, obtain interest mode p_α={ s_i,s_i+1,s_i+2}；

Step 1.4 obtains candidate's interest mode collection

For time series S={ s₁,…,s_l, from interest mode p_αLast time sequential value (i.e. s_i+k) start, Repeat step 1.1 is to step 1.3, until finding whole time series S={ s₁,…,s_lN interest mode, formed candidate it is emerging Interesting set of patterns P_c={ p₁,p₂,…,p_α,…,p_β,…,p_n}；

Step 2：Candidate's interest mode clustering；

Concentrated in candidate pattern and be grouped parallel pattern, the first step for finding parallel pattern is to generate one between modes Distance matrix.For every a pair of patterns, the element of a distance matrix shows the distance of two patterns.But traditional algorithm Time loss it is too big, in order to solve this problem, two methods are proposed in the present invention：One kind is prune rule, another For R-tree combination prune rules.

It is specific as follows for prune rule：

Prune rule is 1.：If candidate's interest mode collection P_cIn any two interest mode p_α,p_βDo not appear in simultaneously Peak width is w_sThe same area in, then the distance of the two patterns for infinity, by the D in Distance matrix D_αβIt is assigned to nothing It is poor big；Wherein, w_sIt is the parameter that user specifies, D is the distance matrix of interest mode,

D_αβ=d_αβ(p_α,p_β), d_αβFor p_αAnd p_βEuclidean distance；

Prune rule is 2.：The slope of use pattern carries out beta pruning, if interest mode p_αSlope be negative, and interest mould Formula p_βSlope for just, then they be not construed as it is similar, by the D in Distance matrix D_αβIt is filled with infinity；

Step 2.2：Calculate D in distance matrix_αβFor the Euclidean distance between non-infinitely great element, and assignment to away from From corresponding position in matrix.

Step 2.3：Compare d_αβ(p_α,p_β) and the d that specifies of user_minBetween size, if d_αβ≤d_min, from P_cMiddle deletion p_α And p_βThe less interest mode of number of time sequential value in two interest modes, finally gives new interest mode collection P；Its In, d_minTake some value between Euclidean distance minimum value and the maximum between two neighboring time sequential value, time sequence Train value is specifically specified by user.

For R-tree combination prune rules, R-tree is used for index candidate set of patterns P_c, as shown in figure 4, P₁-P₉To wait The leaf node that each in lectotype, R tree constructions is set is the MBR of a candidate pattern.It is specific as follows：

Step 2.1：The MBR for each pattern concentrated using R-tree structure candidate patterns, the data structure of rock mechanism, Obtain the index of pattern；Each of which leaf node is the MBR of a pattern, R-Tree middle entry with neighbouring MBR come Indexing model.This data structure will be used to reduce the time complexity of algorithm by reducing the quantity of the pattern of processing.Fig. 3 Illustrate pattern p₁With pattern p₂Between MBR.

D_αβ=d_αβ(p_α,p_β), d_αβFor p_αAnd p_βEuclidean distance；

Step 2.3：Calculate D in distance matrix_αβFor the Euclidean distance between non-infinitely great element, and assignment to away from From corresponding position in matrix.

Step 2.4：Compare d_αβ(p_α,p_β) and the d that specifies of user_minBetween size, if d_αβ≤d_min, from P_cMiddle deletion p_α And p_βThe less interest mode of number of time sequential value in two interest modes, finally gives new interest mode collection P；Its In, d_minTake some value between Euclidean distance minimum value and the maximum between two neighboring time sequential value, specifically by User specifies.

Stage two：Produce prediction rule

Step 3：Correlation rule is calculated with Apriori algorithm

Merge the interest mode collection P of each time variable into P_all, using Apriori algorithm to P_allIn interest mode enter Row association rule mining, obtains the correlation rule of interest mode between different time variable：

Wherein, g=1,2 ..., R, p_vjIt is the interest mode of conditional-variable formation, p ' is the interest mould of decision variable formation Formula；

Correlation rule r is calculated according to formula below_gDirection, time delay and variable quantity：

The direction calculating formula of correlation rule：

Wherein, m (p_vM) it is s_LAnd s₁Between slope, m (p_vM)=sgn (s_L-s₁), p_vMIt is decision-making to be become in conditional-variable The maximum interest mode of amount influence, s_LRepresent p_vMIn last time sequential value, s₁Represent p_vMIn first time sequential value； M (p ') is s '_LSlope between s ', s '_LThe time sequential value of last in p ' is represented, s ' represents first time sequence in p ' Train value；

The calculation formula of time delay：

ΔT₁=max (Δ (r_g)), Δ (r_g)=I_pvM-I_p′ (4)

Wherein, I_pvMIt is p_vMStart Time value, I_p′It is p ' Start Time value；

The variable quantity of rule：

V(p_vj)=(max (p_vj)-min(p_vj))×m(p_vj)

V (p ')=(max (p ')-min (p ')) × m (p ') (5)

Wherein, V (p_vj) it is interest mode p in conditional-variable_vjVariable quantity, V (p ') is interest mode p ' in decision variable Variable quantity, max (p_vj) and min (p_vj) interest mode p is represented respectively_vjMaximum time sequential value and minimum time sequential value.

Step 4：Generate prediction rule

Wherein, A_jAnd B_jIt is m (p respectively_vMThe interest mode correlation rule r of)=m (p ')=1_gMiddle V (p_vj) minimum value and Maximum, C₁And C₂It is m (p respectively_vMThe interest mode correlation rule r of)=m (p ')=1_gThe minimum value and maximum of middle V (p '), J=1,2 ..., λ, λ >=1, λ are the number of conditional-variable, A_j、B_j、C₁And C₂It is positive number.

Wherein, E_jAnd F_jIt is m (p respectively_vMV (p in)=m (p ')=- 1 interest mode correlation rule_j) minimum value and most Big value, G₁And G₂It is m (p respectively_vMV (p ') minimum value and maximum, E in)=m (p ')=- 1 interest mode correlation rule_j、 F_j、G₁And G₂It is negative, j=1,2 ..., η, η >=1, η is the number of conditional-variable；ΔT₂=max (Δ (r_g)), Δ (r_g)= I_pvM-I_p′, I_pvMIt is p_vMStart Time value, I_p′It is p ' Start Time value；

The present invention is used for the performance for illustrating inventive algorithm using hit rate H, and H is defined as：

Wherein, N is the number of the interest mode of accurate match prediction rule in conditional-variable, and M is interest in conditional-variable The total number of pattern.

Specific embodiment of the invention given below is, it is necessary to which explanation is that the invention is not limited in implement in detail below Example, all equivalents done on the basis of technical scheme each fall within protection scope of the present invention.

Embodiment 1

The present embodiment provides the soil temperature for detecting Ruins of Great Wall in advance by inventive algorithm, wherein, Fig. 1 and Fig. 2 divide Not Wei bright Ruins of Great Wall air themperature and rammed earth temperature timing diagram and its variable relation figure, the time series data is passed through above-mentioned three The processing in individual stage, wherein, the present embodiment has only used prune rule candidate's interest mode clustering, its result in the stage one Such as Fig. 5, wherein, what naive curves were represented be both without using prune rule or without using the cluster result of R-tree data structures, Pruning-based curves represent the cluster result using prune rule, it can be seen that sequence quantity over time Increase, the Euclidean distance that prune rule is not used calculates time journey exponential increase used, and enters line-spacing using prune rule Time from calculating increasess slowly.

Embodiment 2

The present embodiment and the distinctive points of embodiment 1 are：The present embodiment has used R-tree data knot in the stage one Structure combination prune rule candidate's interest mode clustering, this data structure will be used to by reducing the quantity of the pattern of processing The time complexity of algorithm is reduced, as shown in Figure 6.Transverse axis is the quantity of time series, it can be seen that sequence quantity over time Increase, using only prune rule enter row distance square using the prune rule combination R-tree time ratios for carrying out distance matrix calculating The time that battle array is calculated increasess slowly.In the timing diagram of air themperature, p₁And p₂The candidate pattern collection recognized for inventive algorithm, The MBR between two patterns can be built, Fig. 3 show pattern p₁With pattern p₂Between MBR.

Table 1 is six specific prediction rules that the present embodiment is generated,

The prediction rule of table 1

Fig. 7 is the present embodiment predicting the outcome according to above-mentioned six prediction rules, and transverse axis is monitored area in figure, and the longitudinal axis is Hit rate.The contrast for the hit rate H that six prediction rules of generation are predicted in region 1,2,3,4,5 is illustrated in Fig. 7 As a result, wherein the mean hit rate highest of rule 3.

Claims

1. a kind of mode prediction method of polynary time series data, it is characterised in that：Comprise the following steps：

Stage one：The time series formed to each conditional-variable and decision variable finds candidate's interest mode collection, respectively Each candidate's interest mode collection is clustered；

Step 1：Find candidate's interest mode collection；

Step 1.1：Searching can use initial subsequence

For time series S={ s₁,…,s_l, from s₁Start to find slope m successively₁≠ 0 two adjacent time sequential values, will The two adjacent time sequential values searched out first are as initial subsequence S_i={ s_i,s_i+1, wherein, i=1,2 ..., l-1, l For the length of time series, slope m₁Calculation formula be：

Step 1.2：Calculate the slope of adjacent time sequential value

Step 1.3：Obtain interest mode

If m₂It is not equal to m₁, obtain interest mode p_α={ s_i,s_i+1,s_i+2}；

Step 1.4 obtains candidate's interest mode collection

For time series S={ s₁,…,s_l, from interest mode p_αLast time sequential value start, repeat step 1.1 To step 1.3, until finding whole time series S={ s₁,…,s_lIn all interest mode, form candidate's interest mode collection P_c={ p₁,p₂,…,p_α,…,p_β,…,p_n}；

Step 2：Candidate's interest mode clustering；

Step 2.1：1. 2. it is assigned to using following prune rule with prune rule to the pattern distance value for meeting rule condition infinite Greatly；

Prune rule is 1.：If candidate's interest mode collection P_cIn any two interest mode p_α,p_βRegion is not appeared in simultaneously Width is w_sThe same area in, by the D in Distance matrix D_αβIt is assigned to infinity；Wherein, w_sIt is the parameter that user specifies, D is The distance matrix of interest mode,

D_αβ=d_αβ(p_α,p_β), d_αβFor p_αAnd p_βEuclidean distance；

Prune rule is 2.：If interest mode p_αSlope be negative, and interest mode p_βSlope for just, by Distance matrix D D_αβIt is assigned to infinity；

Step 2.2：Calculate D in distance matrix_αβFor the distance of non-infinitely great element, and assignment corresponding position into distance matrix Put；

Step 2.3：Compare d_αβ(p_α,p_β) and the d that specifies of user_minBetween size, if d_αβ≤d_min, from P_cMiddle deletion p_αAnd p_β The less interest mode of number of time sequential value in two interest modes, finally gives new interest mode collection P；

Wherein, d_minSome value between Euclidean distance minimum value and the maximum between two neighboring time sequential value is taken, Specifically specified by user；

Stage two：Produce prediction rule

Step 3：Correlation rule is calculated with Apriori algorithm

The interest mode collection P for merging each time variable obtains P_all, using Apriori algorithm to P_allIn interest mode carry out Association rule mining, obtains multiple correlation rules between different time variable；

Step 4：Generate prediction rule

A₁≤V(p_v1)≤B₁, and A₂≤V(p_v2)≤B₂..., and A_j≤V(p_vj)≤B_j..., and A_λ≤V(p_vλ)≤B_λ, then C₁≤V (p′)≤C₂, and postpone Δ T₁The individual unit interval；

Wherein, p_vjBe conditional-variable formation interest mode, j=1,2 ..., λ, λ >=1, λ be conditional-variable number, p ' is to determine The interest mode of plan variable formation；

m(p_vM) it is s_LAnd s₁Between slope, m (p_vM)=sgn (s_L-s₁), p_vMIt is that maximum is influenceed on decision variable in conditional-variable Interest mode, s_LIt is p_vMIn last time sequential value, s₁It is p_vMIn first time sequential value；M (p ') is s '_LAnd s₁′ Between slope, s '_LRepresent the time sequential value of last in p ', s '₁Represent first time sequential value in p '；

A_jAnd B_jIt is m (p respectively_vMV (p in the interest mode correlation rule of)=m (p ')=1_vj) minimum value and maximum, C₁With C₂It is m (p respectively_vMV (p ') minimum value and maximum, A in the interest mode correlation rule of)=m (p ')=1_j、B_j、C₁And C₂ It is positive number；

V(p_vj) it is interest mode p in conditional-variable_vjVariable quantity, V (p ') is the variable quantity of interest mode p ' in decision variable,

V(p_vj)=(max (p_vj)-min(p_vj))×m(p_vj)

V (p ')=(max (p ')-min (p ')) × m (p ')

Time delay Δ T₁=max (Δ (r_g)), It is p_vMStart Time value, I_p′It is p ' initial time Value；

E₁≤V(p_v1)≤F₁, and E₂≤V(p_v2)≤F₂..., and E_j≤V(p_vj)≤F_j..., and E_η≤V(p_vη)≤F_η, then G₁≤V (p′)≤G₂, and postpone Δ T₂The individual unit interval；

Wherein, E_jAnd F_jIt is m (p respectively_vMV (p in)=m (p ')=- 1 interest mode correlation rule_j) minimum value and maximum Value, j=1,2 ..., η, η >=1, η is the number of conditional-variable, G₁And G₂It is m (p respectively_vM)=m (p ')=- 1 interest mode V (p ') minimum value and maximum in correlation rule；E_j、F_j、G₁And G₂It is negative；ΔT₂=max (Δ (r_g))；

Stage three：The interest mode that the conditional-variable execution stage one of testing data is obtained removes the pre- gauge of matching stage two Then, if meeting the former piece of prediction rule, predicting the outcome for decision variable is exported.

2. the mode prediction method of polynary time series data as claimed in claim 1, it is characterised in that：Waited in described step 2 Interest mode clustering method is selected to replace with the following method：

Step 2.1：The MBR for each pattern that candidate pattern is concentrated is built using R-tree, the data structure of rock mechanism is obtained The index of pattern；

Step 2.2：To each child nodes i and j in R-tree data structures, using following prune rule 1 and rule 2 to full The pattern distance value of sufficient rule condition is assigned to infinity.

Prune rule is 1.：If two interest mode p_α,p_βPeak width is not appeared in simultaneously for w_sThe same area in, will D in distance matrix_αβIt is assigned to infinity；Wherein, w_sIt is the parameter that user specifies, D is the distance matrix of interest mode,

D_αβ=d_αβ(p_α,p_β), d_αβFor p_αAnd p_βEuclidean distance；

Step 2.3：D in Distance matrix D_αβCalculated for non-infinitely great element according to Euclidean distance, and assignment to away from From corresponding position in matrix；

Step 2.4：Compare d_αβ(p_α,p_β) and the d that specifies of user_minBetween size, if d_αβ≤d_min, from P_cMiddle deletion p_αAnd p_β The less interest mode of number of time series in two interest modes, finally gives new interest mode collection P；Wherein, d_minTake Some value between Euclidean distance minimum value and maximum between two neighboring time series, is specifically specified by user.