CN107516114A - A kind of time Series Processing method and device - Google Patents

A kind of time Series Processing method and device Download PDF

Info

Publication number
CN107516114A
CN107516114A CN201710753494.4A CN201710753494A CN107516114A CN 107516114 A CN107516114 A CN 107516114A CN 201710753494 A CN201710753494 A CN 201710753494A CN 107516114 A CN107516114 A CN 107516114A
Authority
CN
China
Prior art keywords
target component
dynamic similarity
similarity degree
time series
parameter
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710753494.4A
Other languages
Chinese (zh)
Other versions
CN107516114B (en
Inventor
黎灿兵
章江
曹家
曹一家
周斌
刘新东
张卉钰
周珑
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Hunan University
Original Assignee
Hunan University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Hunan University filed Critical Hunan University
Priority to CN201710753494.4A priority Critical patent/CN107516114B/en
Publication of CN107516114A publication Critical patent/CN107516114A/en
Application granted granted Critical
Publication of CN107516114B publication Critical patent/CN107516114B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Economics (AREA)
  • Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Quality & Reliability (AREA)
  • General Engineering & Computer Science (AREA)
  • Game Theory and Decision Science (AREA)
  • Evolutionary Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Operations Research (AREA)
  • Development Economics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A kind of time Series Processing method and device provided in an embodiment of the present invention, data mining technology field.This method by first obtain in predetermined time period comprising parameter to be measured target component target component time series, the influence factor time series of the target component in the predetermined time period is obtained again, the target component time series and the influence factor time series are finally based on, the dynamic similarity sequence of the parameter to be measured is chosen from the M target component subsequence.So as to which the change procedure of target component and the change procedure of each influence factor of target component synthesis are included in the selection rule of Similar Time Series Based on Markov Chain, so that in the time series forecasting by accumulation effects, selection accuracy for Similar Time Series Based on Markov Chain is higher and also of a relatively high to the accuracy of time series forecasting, and then significantly reduces the prediction error to time series in the prior art.

Description

A kind of time Series Processing method and device
Technical field
The present invention relates to data mining technology field, in particular to a kind of time Series Processing method and device.
Background technology
In the existing research that time series forecasting is carried out using Similar Time Series Based on Markov Chain method, more by the change of target component Rule is deduced and studied as the changing rule of time series, and thinks that this rule is changeless.It is but actual On, this rule is time-varying.Thus, the still larger technical problem of prediction error be present in prior art.
The content of the invention
The present invention provides a kind of time Series Processing method and device, it is intended to improves above-mentioned technical problem.
A kind of time Series Processing method provided by the invention, including:Obtain in predetermined time period and include parameter to be measured Target component target component time series, wherein, the target component time series includes M target component subsequence, Wherein, M is the integer more than or equal to 2;Obtain the influence factor time series of the target component in the predetermined time period; Based on the target component time series and the influence factor time series, chosen from the M target component subsequence The dynamic similarity sequence of the parameter to be measured.
Preferably, it is described to be based on the target component time series and the influence factor time series, from the M mesh The dynamic similarity sequence of parameter to be measured described in being chosen in parameter subsequence is marked, including:Obtain the M target component subsequence Length of time series;The parameter subsequence to be measured of the parameter to be measured is obtained based on the length of time series;Obtain each Multiple first dynamic similarity degree of the target component subsequence and the parameter subsequence to be measured;When obtaining the influence factor Between sequence and the parameter subsequence to be measured multiple second dynamic similarity degree;Based on the multiple first dynamic similarity degree and institute State the dynamic similarity sequence of parameter to be measured described in multiple second dynamic similarity retrievals.
Preferably, it is described based on described in the multiple first dynamic similarity degree and the acquisition of the multiple second dynamic similarity degree The dynamic similarity sequence of parameter to be measured, including:Obtain the target component dynamic similarity degree corresponding to the first dynamic similarity degree Influence factor dynamic similarity degree weight corresponding to weight and the second dynamic similarity degree, wherein, the target component dynamic Similarity weight is 1 with the influence factor dynamic similarity degree weight sum;Obtain each first dynamic similarity degree and institute State the first product of target component dynamic similarity degree weight;Obtain each two dynamic similarities degree and influence factor dynamic Second product of similarity weight;The dynamic similarity of the parameter to be measured is obtained based on first product and second product Sequence.
Preferably, the dynamic similarity sequence that the parameter to be measured is obtained based on first product and second product Row, including:Obtain first product and second sum of products;First product and second sum of products are made For comprehensive dynamic similarity degree;The sequence number of corresponding subsequence during numerical value maximum based on the comprehensive dynamic similarity degree;Will Dynamic similarity sequence of the target component subsequence corresponding to the sequence number as the parameter to be measured.
Preferably, the comprehensive dynamic similarity degree meets:
Wherein, i=1,2 ..., r-1, β0For the target component dynamic similarity degree weight, βc(c=1,2 ..., C) is The influence factor dynamic similarity degree weight.
A kind of time Series Processing device provided by the invention, including:First data capture unit, during for obtaining default Between in length the target component comprising parameter to be measured target component time series, wherein, the target component time series bag Containing M target component subsequence, wherein, M is the integer more than or equal to 2;Second data capture unit, it is described default for obtaining The influence factor time series of the target component in time span;Data processing unit, during for based on the target component Between sequence and the influence factor time series, the dynamic of the parameter to be measured is chosen from the M target component subsequence Similar sequences.
Preferably, the data processing unit includes:First data acquisition subelement, for obtaining the M target ginseng The length of time series of number subsequence;Second data acquisition subelement, for based on the length of time series obtain described in treat Survey the parameter subsequence to be measured of parameter;3rd data acquisition subelement, for obtaining each target component subsequence and institute State multiple first dynamic similarity degree of parameter subsequence to be measured;4th data acquisition subelement, for obtaining the influence factor Time series and multiple second dynamic similarity degree of the parameter subsequence to be measured;5th data acquisition subelement, for based on The multiple first dynamic similarity degree and the multiple second dynamic similarity degree obtain the dynamic similarity sequence of the parameter to be measured.
Preferably, the 5th data acquisition subelement includes:First data acquisition module, it is dynamic for obtaining described first The influence factor corresponding to target component dynamic similarity degree weight and the second dynamic similarity degree corresponding to state similarity is moved State similarity weight, wherein, the target component dynamic similarity degree weight and the influence factor dynamic similarity degree weight sum For 1;Second data acquisition module, weighed for obtaining each first dynamic similarity degree with the target component dynamic similarity degree First product of weight;3rd data acquisition module, moved for obtaining each second dynamic similarity degree with the influence factor Second product of state similarity weight;Data processing module, for obtaining institute based on first product and second product State the dynamic similarity sequence of parameter to be measured.
Preferably, the data processing module includes:First data acquisition submodule, for obtain first product with Second sum of products;Data processing submodule, for using first product and second sum of products as synthesis Dynamic similarity degree;Second data acquisition submodule, it is corresponding during for numerical value maximum based on the comprehensive dynamic similarity degree The sequence number of subsequence;3rd data acquisition submodule, for using target component subsequence corresponding to the sequence number as institute State the dynamic similarity sequence of parameter to be measured.
Preferably, the comprehensive dynamic similarity degree meets:
Wherein, i=1,2 ..., r-1, β0For the target component dynamic similarity degree weight, βc(c=1,2 ..., C) is The influence factor dynamic similarity degree weight.
A kind of time Series Processing method and device that the invention described above provides, this method are grown by first obtaining preset time The target component time series of target component comprising parameter to be measured in degree, then obtain the target in the predetermined time period The influence factor time series of parameter, the target component time series and the influence factor time series are finally based on, from The dynamic similarity sequence of the parameter to be measured is chosen in the M target component subsequence.So as to by the change of target component The change procedure synthesis of each influence factor of journey and target component is included in the selection rule of Similar Time Series Based on Markov Chain so that is being involved In the time series forecasting of product effects, the selection accuracy for Similar Time Series Based on Markov Chain is higher and to time series forecasting Accuracy it is also of a relatively high, and then significantly reduce the prediction error to time series in the prior art.
Brief description of the drawings
In order to illustrate the technical solution of the embodiments of the present invention more clearly, below by embodiment it is required use it is attached Figure is briefly described, it will be appreciated that the following drawings illustrate only certain embodiments of the present invention, therefore be not construed as pair The restriction of scope, for those of ordinary skill in the art, on the premise of not paying creative work, can also be according to this A little accompanying drawings obtain other related accompanying drawings.
Fig. 1 is the structured flowchart of electronic equipment provided in an embodiment of the present invention;
Fig. 2 is a kind of flow chart for time Series Processing method that first embodiment of the invention provides;
Fig. 3 is a kind of flow chart for time Series Processing method that second embodiment of the invention provides;
Fig. 4 is a kind of high-level schematic functional block diagram for time Series Processing device that third embodiment of the invention provides.
Embodiment
To make the purpose, technical scheme and advantage of the embodiment of the present invention clearer, below in conjunction with the embodiment of the present invention In accompanying drawing, the technical scheme in the embodiment of the present invention is clearly and completely described, it is clear that described embodiment is Part of the embodiment of the present invention, rather than whole embodiments.Based on the embodiment in the present invention, those of ordinary skill in the art The every other embodiment obtained under the premise of creative work is not made, belongs to the scope of protection of the invention.Therefore, The detailed description of the embodiments of the invention to providing in the accompanying drawings is not intended to limit the model of claimed invention below Enclose, but be merely representative of the selected embodiment of the present invention.Based on the embodiment in the present invention, those of ordinary skill in the art are not having There is the every other embodiment made and obtained under the premise of creative work, belong to the scope of protection of the invention.
As shown in figure 1, the structured flowchart for a kind of electronic equipment provided in an embodiment of the present invention.The electronic equipment 300 Including time Series Processing device 400, memory 302, storage control 303, processor 304 and Peripheral Interface 305.
The memory 302, storage control 303, processor 304 and 305 each element of Peripheral Interface are direct between each other Or be electrically connected with indirectly, to realize the transmission of data or interaction.For example, these elements can pass through one or more between each other Communication bus or signal wire, which are realized, to be electrically connected with.The time Series Processing device 400 include it is at least one can be with software or solid The form of part (firmware) is stored in the memory 302 or is solidificated in the operating system of the electronic equipment 300 Software function module in (operating system, OS).The processor 304 is used to perform what is stored in memory 302 Executable module, such as the software function module or computer program that the time Series Processing device 400 includes.
Wherein, memory 302 may be, but not limited to, random access memory (Random Access Memory, RAM), read-only storage (Read Only Memory, ROM), programmable read only memory (Programmable Read- Only Memory, PROM), erasable read-only memory (Erasable Programmable Read-Only Memory, EPROM), electricallyerasable ROM (EEROM) (Electric Erasable Programmable Read-Only Memory, EEPROM) etc..Wherein, memory 302 is used for storage program, and the processor 304 is after execute instruction is received, described in execution Program, the method performed by server 100 that the stream process that foregoing any embodiment of the embodiment of the present invention discloses defines can answer Realized in processor 304, or by processor 304.
Processor 304 is probably a kind of IC chip, has the disposal ability of signal.Above-mentioned processor 304 can To be general processor, including central processing unit (Central Processing Unit, abbreviation CPU), network processing unit (Network Processor, abbreviation NP) etc.;Can also be digital signal processor (DSP), application specific integrated circuit (ASIC), Ready-made programmable gate array (FPGA) either other PLDs, discrete gate or transistor logic, discrete hard Part component.It can realize or perform disclosed each method, step and the logic diagram in the embodiment of the present invention.General processor Can be microprocessor or the processor can also be any conventional processor etc..
Various input/output devices are coupled to processor 304 and memory 302 by the Peripheral Interface 305.At some In embodiment, Peripheral Interface 305, processor 304 and storage control 303 can be realized in one single chip.Other one In a little examples, they can be realized by independent chip respectively.
Referring to Fig. 2, it is a kind of flow chart for time Series Processing method that first embodiment of the invention provides.Below will Idiographic flow shown in Fig. 2 is described in detail.
Step S101, the target component time series of the target component comprising parameter to be measured in predetermined time period is obtained, Wherein, the target component time series includes M target component subsequence, wherein, M is the integer more than or equal to 2.
Wherein, the selection of the predetermined time period can enter choose according to specific implementation process, for example, when described default Between length can be one week or 3 days, can also be one month.Here, it is not especially limited.
As a kind of embodiment, the target ginseng of the target component of period and its preceding some periods where obtaining parameter to be measured Number time series.
In the present embodiment, the target component subsequence can be obtained with passage time sliding window, according to set Sliding window size can from target component time series obtain M target component subsequence.
In the present embodiment, the parameter to be measured refers to the parameter for being ready for detection.
Step S102, obtain the influence factor time series of the target component in the predetermined time period.
In the present embodiment, each target component corresponds to one or more influence factor time serieses.
The influence factor refers to the factor for influenceing target component, i.e., the factor that can be had an impact to target component. For example, the target component is per day load, the influence factor can be max. daily temperature/mean daily temperature etc..
The influence factor time series refers to the influence factor in the predetermined time period.
Step S103, based on the target component time series and the influence factor time series, from the M target The dynamic similarity sequence of the parameter to be measured is chosen in parameter subsequence.
The dynamic similarity sequence refers to the son with the parameter to be measured in the change procedure of the subsequence of target component The maximum sequence of " comprehensive similarity " of sequence.It is rule searching pair by the change procedure of target component and each influence factor As, from parameter temporal node to be measured before target component and each influence factor time series in search for and choose and parameter to be measured Place target component and its all similar target component of each influence factor and the change of each influence factor period and its preceding some periods Change process.Parameter time series to be measured are predicted with the dynamic effects that take into account cumulative effect, and then is significantly reduced Error, so as to get the changing rule of target component time series more conform to actual conditions, when having existing for cumulative effect Between in sequence prediction, its precision of prediction obtains further improving relatively.
In the present embodiment, by first determining the length of target component and each influence factor time subsequence, recycle dynamic The similar thought of state determines target component dynamic similarity degree and each influence factor dynamic similarity degree description indexes respectively, then comprehensive again Close and consider target component dynamic similarity degree and each influence factor dynamic similarity degree, it is determined that comprehensive dynamic similarity degree description indexes, The maximum target component subsequence of comprehensive dynamic similarity degree is finally chosen, as target component subsequence dynamic where variable to be measured Similar sequences.The foundation chosen as Similar Time Series Based on Markov Chain, embedding data excavate prediction algorithm, further realize time sequence The prediction of row.
Referring to Fig. 3, it is a kind of flow chart for time Series Processing method that second embodiment of the invention provides.Below will Idiographic flow shown in Fig. 3 is described in detail.
Step S201, the target component time series of the target component comprising parameter to be measured in predetermined time period is obtained, Wherein, the target component time series includes M target component subsequence, wherein, M is the integer more than or equal to 2.
Step S202, obtain the influence factor time series of the target component in the predetermined time period.
Step S201 and step S202 embodiment refer to step corresponding in first embodiment, here, Repeat no more.
Step S203, obtain the length of time series of the M target component subsequence.
As a kind of embodiment, passage time sliding window produces M target in the target component sequence successively Parameter subsequence.Wherein, M value is the integer more than or equal to 2.And by set the size of the time slide window from And obtain the length of time series of each target component subsequence.I.e. length of time series is the size of time slide window.Example Such as, the size of the time slide window can be for the sliding window of t-1 (the general values 3,4,5 of t).For example, set target ginseng The length of number time series is n.If the length of sliding time window be t-1 (the general values 3,4,5 of t) sliding window successively Produce target component subsequence.R target component subsequence, r=n-t+1 can be produced altogether.Its i-th section of target component subsequence can To be expressed as Li={ Li(1), Li..., L (2)i(t-1)}.Wherein, i=1,2 ..., r.
Step S204, the parameter subsequence to be measured of the parameter to be measured is obtained based on the length of time series.
In the present embodiment, the time span of the parameter subsequence to be measured and the time of the target component subsequence are grown Spend equal.Therefore, by obtaining the time span of target component subsequence so as to getting the parameter subsequence to be measured.
Step S205, obtain multiple first dynamics of each target component subsequence and the parameter subsequence to be measured Similarity.
In the present embodiment, all it is t-1 target component subsequence L provided with time spani={ Li(1), Li(2) ..., Li} and parameter subsequence L to be measured (t-1)r={ Lr(1), Lr..., L (2)r(t-1) }, i=1,2 ..., r-1.Then target component Dynamic similarity degree describes the similarity degree of t-1 target component changing rule before the two subsequences.For example, use dL.i(i =1,2 ..., r-1) " relative change rate's square distance and " is represented, it can be described as LiAnd LrThe rate of change of middle target component Difference degree.Calculation formula is:
In formula, LiAnd L (u)r(u) it is respectively target component subsequence LiWith parameter subsequence L to be measuredrIn u (u=1, 2 ..., t-1) individual target component.The first dynamic similarity degree SL.iMeet:
In formula, max (dL.i) and min (dL.i) it is respectively dL.iMaximum and minimum value, i=1,2 ..., r-1.
Step S206, obtain the influence factor time series and multiple second dynamic phases of the parameter subsequence to be measured Like degree.
As a kind of embodiment, obtain in each influence factor subsequence and the parameter subsequence to be measured Multiple second dynamic similarity degree of the subsequence of C influence factors, for example, being provided with and LiAnd LrCorresponding two sections are long Degree is all the subsequence of c-th of influence factor of t target componentWith parameter to be measured Influence factor subsequence corresponding to subsequence Wherein, what influence factor dynamic similarity degree characterized is the similarity degree of influence factor change, and reaction is to target component size With the influence degree of changing rule.For example, use dc.i(c=1,2 ..., C, i=1,2 ..., r-1) represent " relative change rate it With ", it can be described asWithRate of change difference degree.Calculation formula is:
In formula, xc.iAnd x (v)c.r(v) it is respectively influence factor subsequenceWithIn v (v=1,2 ..., t) Individual influence factor value.The second dynamic similarity degree Sc.iMeet:
In formula, max (dc.i) and min (dc.i) it is respectively dc.iMaximum and minimum value, i=1,2 ..., r-1.
Step S207, based on described in the multiple first dynamic similarity degree and the multiple second dynamic similarity retrieval The dynamic similarity sequence of parameter to be measured.
As a kind of embodiment, according to target component dynamic similarity degree weight set in advance and influence factor dynamic phase Like degree weight, the first dynamic similarity degree and the product and described two dynamics of the target component dynamic similarity degree weight are obtained respectively The sum of products of similar sequences and the influence factor dynamic similarity degree weight, will be being obtained and as integrating dynamic similarity Degree, according to the sequence number of corresponding subsequence when finding out comprehensive dynamic similarity degree maximum to comprehensive dynamic similarity degree, by the sequence Similar Time Series Based on Markov Chain of the target component subsequence corresponding to row number as subsequence where parameter to be measured, i.e. dynamic similarity sequence. The target component dynamic similarity degree weight is 1 with the influence factor dynamic similarity degree weight sum.And the target component Dynamic similarity degree weight is all higher than zero with the influence factor dynamic similarity degree weight.
Wherein, the comprehensive dynamic similarity degree OiMeet:
Wherein, i=1,2 ..., r-1, β0For target component dynamic similarity degree weight, βc(c=1,2 ..., C) is target Each influence factor dynamic similarity degree weight of parameter.
In the present embodiment, for β0And βc(c=1,2 ..., C's) asks for, and can pass through known mesh in historical data Dynamic state of parameters similarity, influence factor dynamic similarity degree and comprehensive dynamic similarity degree are marked, is tried to achieve using least square method to optimize.
In the present embodiment, according to comprehensive dynamic similarity degree OiThe calculating of (i=1,2 ..., r-1), is sorted by size And the sequence number of corresponding subsequence during comprehensive dynamic similarity degree maximum is found out, be designated as I, i.e. I=i | max (Oi)}.Choose mesh Mark parameter subsequence LI={ LI(1), LI..., L (2)I(t-1) } the Similar Time Series Based on Markov Chain as subsequence where parameter to be measured, That is, dynamic similarity sequence.And LI(t) the then dynamic similarity parameter as parameter to be measured.
By accessed dynamic similarity sequence, the foundation chosen as Similar Time Series Based on Markov Chain, embedding data excavates pre- Method of determining and calculating, to carry out the prediction of time series.
For example, in August, the 2014 per day load of 3 to 8 is predicted in the per day load prediction of power system.Its In, target component is the per day load from January 1st, 2012 to prediction day proxima luce (prox. luc), and each influence factor data are from 2012 The meteorological data on January 1, in prediction same day day.Retrievable meteorological data includes:Daily maximum temperature, daily minimal tcmperature, day Temperature on average, wind speed, precipitation and relative humidity.It is similar with experience by the Relative Error that dynamic similarity method is calculated The Relative Error of method is shown in Table 1.By table 1 it can be found that dynamic similarity prediction error is integrally missed significantly lower than experience comparability prediction Difference.
Table 1
Referring to Fig. 4, it is a kind of time Series Processing apparatus function module diagram that third embodiment of the invention provides. The time Series Processing device 400 includes the first data capture unit 410, the second data capture unit 420 and data processing Unit 430.
First data capture unit 410, for obtaining the mesh of the target component comprising parameter to be measured in predetermined time period Mark parameter time series, wherein, the target component time series include M target component subsequence, wherein, M for more than etc. In 2 integer.
Second data capture unit 420, for obtaining the influence factor of the target component in the predetermined time period Time series.
Data processing unit 430, for based on the target component time series and the influence factor time series, from The dynamic similarity sequence of the parameter to be measured is chosen in the M target component subsequence
Wherein, the data processing unit 430 includes:First data acquisition subelement, the second data acquisition subelement, Three data acquisition subelements, the 4th data acquisition subelement and the 5th data acquisition subelement.
First data acquisition subelement, for obtaining the length of time series of the M target component subsequence.
Second data acquisition subelement, for obtaining the parameter to be measured of the parameter to be measured based on the length of time series Subsequence.
3rd data acquisition subelement, for obtaining each target component subsequence and the parameter subsequence to be measured Multiple first dynamic similarity degree.
4th data acquisition subelement, for obtaining the influence factor time series and the parameter subsequence to be measured Multiple second dynamic similarity degree.
5th data acquisition subelement, for based on the multiple first dynamic similarity degree and the multiple second dynamic phase The dynamic similarity sequence of the parameter to be measured is obtained like degree.
Wherein, the 5th data acquisition subelement includes:First data acquisition module, the second data acquisition module, Three data acquisition modules and data processing module.
First data acquisition module, for obtaining the target component dynamic similarity degree corresponding to the first dynamic similarity degree Influence factor dynamic similarity degree weight corresponding to weight and the second dynamic similarity degree, wherein, the target component dynamic Similarity weight is 1 with the influence factor dynamic similarity degree weight sum.
Second data acquisition module, for obtaining each first dynamic similarity degree and the target component dynamic similarity Spend the first product of weight.
3rd data acquisition module, for obtaining each second dynamic similarity degree and the influence factor dynamic similarity Spend the second product of weight.
Data processing module, for obtaining the dynamic of the parameter to be measured based on first product and second product Similar sequences.
Wherein, the data processing module includes:First data acquisition submodule, data processing submodule and the second data Acquisition submodule.
First data acquisition submodule, for obtaining first product and second sum of products;
Data processing submodule, for using first product and second sum of products as comprehensive dynamic similarity Degree.
Second data acquisition submodule, corresponding sub- sequence during for numerical value maximum based on the comprehensive dynamic similarity degree The sequence number of row.
3rd data acquisition submodule, for using target component subsequence corresponding to the sequence number as the ginseng to be measured Several dynamic similarity sequences.
Wherein, the data processing submodule is specifically used for:Based on the first dynamic similarity degree and second dynamic Similarity obtains multiple comprehensive dynamic similarity degree;The synthesis that numerical value is maximum in the multiple comprehensive dynamic similarity degree is obtained to move Subsequence corresponding to state similarity;The target subsequences corresponding to the subsequence are obtained, by the target subsequences Dynamic similarity sequence as parameter to be measured.Wherein, the comprehensive dynamic similarity degree meets:
Wherein, i=1,2 ..., r-1, β0For the target component dynamic similarity degree weight, βc(c=1,2 ..., C) is The influence factor dynamic similarity degree weight.
In summary, the present invention provides a kind of time Series Processing method and device, when this method is by first obtaining default Between in length the target component comprising parameter to be measured target component time series, then obtain described in the predetermined time period The influence factor time series of target component, it is finally based on the target component time series and the influence factor time sequence Row, the dynamic similarity sequence of the parameter to be measured is chosen from the M target component subsequence.So as to by the change of target component The change procedure synthesis of each influence factor of change process and target component is included in the selection rule of Similar Time Series Based on Markov Chain so that In time series forecasting by accumulation effects, the selection accuracy for Similar Time Series Based on Markov Chain is higher and to time series The accuracy of prediction is also of a relatively high, and then significantly reduces the prediction error to time series in the prior art.
In several embodiments provided herein, it should be understood that disclosed apparatus and method, can also pass through Other modes are realized.Device embodiment described above is only schematical, for example, flow chart and block diagram in accompanying drawing Show the device of multiple embodiments according to the present invention, method and computer program product architectural framework in the cards, Function and operation.At this point, each square frame in flow chart or block diagram can represent the one of a module, program segment or code Part, a part for the module, program segment or code include one or more and are used to realize holding for defined logic function Row instruction.It should also be noted that at some as in the implementation replaced, the function that is marked in square frame can also with different from The order marked in accompanying drawing occurs.For example, two continuous square frames can essentially perform substantially in parallel, they are sometimes It can perform in the opposite order, this is depending on involved function.It is it is also noted that every in block diagram and/or flow chart The combination of individual square frame and block diagram and/or the square frame in flow chart, function or the special base of action as defined in performing can be used Realize, or can be realized with the combination of specialized hardware and computer instruction in the system of hardware.
In addition, each functional module in each embodiment of the present invention can integrate to form an independent portion Point or modules individualism, can also two or more modules be integrated to form an independent part.
If the function is realized in the form of software function module and is used as independent production marketing or in use, can be with It is stored in a computer read/write memory medium.Based on such understanding, technical scheme is substantially in other words The part to be contributed to prior art or the part of the technical scheme can be embodied in the form of software product, the meter Calculation machine software product is stored in a storage medium, including some instructions are causing a computer equipment (can be People's computer, server, or network equipment etc.) perform all or part of step of each embodiment methods described of the present invention. And foregoing storage medium includes:USB flash disk, mobile hard disk, read-only storage (ROM, Read-Only Memory), arbitrary access Memory (RAM, Random Access Memory), magnetic disc or CD etc. are various can be with the medium of store program codes.Need It is noted that herein, such as first and second or the like relational terms are used merely to an entity or operation Made a distinction with another entity or operation, and not necessarily require or imply these entities or exist between operating any this Actual relation or order.Moreover, term " comprising ", "comprising" or its any other variant are intended to nonexcludability Comprising so that process, method, article or equipment including a series of elements not only include those key elements, but also wrapping Include the other element being not expressly set out, or also include for this process, method, article or equipment intrinsic want Element.In the absence of more restrictions, the key element limited by sentence "including a ...", it is not excluded that wanted including described Other identical element also be present in the process of element, method, article or equipment.
The preferred embodiments of the present invention are the foregoing is only, are not intended to limit the invention, for the skill of this area For art personnel, the present invention can have various modifications and variations.Within the spirit and principles of the invention, that is made any repaiies Change, equivalent substitution, improvement etc., should be included in the scope of the protection.It should be noted that:Similar label and letter exists Similar terms is represented in following accompanying drawing, therefore, once being defined in a certain Xiang Yi accompanying drawing, is then not required in subsequent accompanying drawing It is further defined and explained.

Claims (10)

  1. A kind of 1. time Series Processing method, it is characterised in that including:
    The target component time series of the target component comprising parameter to be measured in predetermined time period is obtained, wherein, the target Parameter time series include M target component subsequence, wherein, M is the integer more than or equal to 2;
    Obtain the influence factor time series of the target component in the predetermined time period;
    Based on the target component time series and the influence factor time series, from the M target component subsequence Choose the dynamic similarity sequence of the parameter to be measured.
  2. 2. according to the method for claim 1, it is characterised in that described to be based on the target component time series and the shadow Factors time sequence is rung, the dynamic similarity sequence of the parameter to be measured is chosen from the M target component subsequence, including:
    Obtain the length of time series of the M target component subsequence;
    The parameter subsequence to be measured of the parameter to be measured is obtained based on the length of time series;
    Obtain multiple first dynamic similarity degree of each target component subsequence and the parameter subsequence to be measured;
    Obtain multiple second dynamic similarity degree of the influence factor time series and the parameter subsequence to be measured;
    The dynamic of the parameter to be measured is obtained based on the multiple first dynamic similarity degree and the multiple second dynamic similarity degree Similar sequences.
  3. 3. according to the method for claim 2, it is characterised in that described based on the multiple first dynamic similarity degree and described Multiple second dynamic similarity degree obtain the dynamic similarity sequence of the parameter to be measured, including:
    Obtain the target component dynamic similarity degree weight corresponding to the first dynamic similarity degree and the second dynamic similarity degree Corresponding influence factor dynamic similarity degree weight, wherein, the target component dynamic similarity degree weight and the influence factor Dynamic similarity degree weight sum is 1;
    Obtain each first dynamic similarity degree and the first product of the target component dynamic similarity degree weight;
    Obtain each second dynamic similarity degree and the second product of the influence factor dynamic similarity degree weight;
    The dynamic similarity sequence of the parameter to be measured is obtained based on first product and second product.
  4. 4. according to the method for claim 3, it is characterised in that described to be obtained based on first product with second product The dynamic similarity sequence of the parameter to be measured is taken, including:
    Obtain first product and second sum of products;
    Using first product and second sum of products as comprehensive dynamic similarity degree;
    The sequence number of corresponding subsequence during numerical value maximum based on the comprehensive dynamic similarity degree;
    Dynamic similarity sequence using target component subsequence corresponding to the sequence number as the parameter to be measured.
  5. 5. according to the method for claim 4, it is characterised in that the comprehensive dynamic similarity degree meets:
    Wherein, i=1,2 ..., r-1, β0For the target component dynamic similarity degree weight, βc(c=1,2 ..., C) is described Influence factor dynamic similarity degree weight.
  6. A kind of 6. time Series Processing device, it is characterised in that including:
    First data capture unit, for obtain in predetermined time period comprising parameter to be measured target component target component when Between sequence, wherein, the target component time series includes M target component subsequence, wherein, M is whole more than or equal to 2 Number;
    Second data capture unit, for obtaining the influence factor time sequence of the target component in the predetermined time period Row;
    Data processing unit, for based on the target component time series and the influence factor time series, from the M The dynamic similarity sequence of the parameter to be measured is chosen in target component subsequence.
  7. 7. device according to claim 6, it is characterised in that the data processing unit includes:
    First data acquisition subelement, for obtaining the length of time series of the M target component subsequence;
    Second data acquisition subelement, for obtaining the sub- sequence of parameter to be measured of the parameter to be measured based on the length of time series Row;
    3rd data acquisition subelement, for obtaining the more of each target component subsequence and the parameter subsequence to be measured Individual first dynamic similarity degree;
    4th data acquisition subelement, for obtaining the multiple of the influence factor time series and the parameter subsequence to be measured Second dynamic similarity degree;
    5th data acquisition subelement, for based on the multiple first dynamic similarity degree and the multiple second dynamic similarity degree Obtain the dynamic similarity sequence of the parameter to be measured.
  8. 8. device according to claim 7, it is characterised in that the 5th data acquisition subelement includes:
    First data acquisition module, for obtaining the target component dynamic similarity degree weight corresponding to the first dynamic similarity degree With the influence factor dynamic similarity degree weight corresponding to the second dynamic similarity degree, wherein, the target component dynamic similarity It is 1 to spend weight with the influence factor dynamic similarity degree weight sum;
    Second data acquisition module, weighed for obtaining each first dynamic similarity degree with the target component dynamic similarity degree First product of weight;
    3rd data acquisition module, weighed for obtaining each second dynamic similarity degree with the influence factor dynamic similarity degree Second product of weight;
    Data processing module, for obtaining the dynamic similarity of the parameter to be measured based on first product and second product Sequence.
  9. 9. device according to claim 8, it is characterised in that the data processing module includes:
    First data acquisition submodule, for obtaining first product and second sum of products;
    Data processing submodule, for using first product and second sum of products as comprehensive dynamic similarity degree;
    Second data acquisition submodule, corresponding subsequence during for numerical value maximum based on the comprehensive dynamic similarity degree Sequence number;
    3rd data acquisition submodule, for using target component subsequence corresponding to the sequence number as the parameter to be measured Dynamic similarity sequence.
  10. 10. device according to claim 9, it is characterised in that the comprehensive dynamic similarity degree meets:
    Wherein, i=1,2 ..., r-1, β0For the target component dynamic similarity degree weight, βc(c=1,2 ..., C) is described Influence factor dynamic similarity degree weight.
CN201710753494.4A 2017-08-28 2017-08-28 Time sequence processing method and device Active CN107516114B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710753494.4A CN107516114B (en) 2017-08-28 2017-08-28 Time sequence processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710753494.4A CN107516114B (en) 2017-08-28 2017-08-28 Time sequence processing method and device

Publications (2)

Publication Number Publication Date
CN107516114A true CN107516114A (en) 2017-12-26
CN107516114B CN107516114B (en) 2020-06-19

Family

ID=60724328

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710753494.4A Active CN107516114B (en) 2017-08-28 2017-08-28 Time sequence processing method and device

Country Status (1)

Country Link
CN (1) CN107516114B (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108537466A (en) * 2018-04-25 2018-09-14 北京腾云天下科技有限公司 A kind of method and computing device of statistics application operation indicator
CN110674375A (en) * 2019-09-25 2020-01-10 联想(北京)有限公司 Data processing method and electronic equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101082972A (en) * 2007-05-30 2007-12-05 华为技术有限公司 Method and device for forecasting user's interest to commercial product and method for publishing advertisement thereof
CN101739814A (en) * 2009-11-06 2010-06-16 吉林大学 SCATS coil data-based traffic state online quantitative evaluation and prediction method
CN101877077A (en) * 2009-11-25 2010-11-03 天津工业大学 Time series predicting model
CN103577562A (en) * 2013-10-24 2014-02-12 河海大学 Multi-measurement time series similarity analysis method
CN105608096A (en) * 2014-11-21 2016-05-25 财团法人资讯工业策进会 Time series data processing method and system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101082972A (en) * 2007-05-30 2007-12-05 华为技术有限公司 Method and device for forecasting user's interest to commercial product and method for publishing advertisement thereof
CN101739814A (en) * 2009-11-06 2010-06-16 吉林大学 SCATS coil data-based traffic state online quantitative evaluation and prediction method
CN101877077A (en) * 2009-11-25 2010-11-03 天津工业大学 Time series predicting model
CN103577562A (en) * 2013-10-24 2014-02-12 河海大学 Multi-measurement time series similarity analysis method
CN105608096A (en) * 2014-11-21 2016-05-25 财团法人资讯工业策进会 Time series data processing method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
MARTINEZ V,ET AL.: "CONVEX:similarity-based algorithms for forecasting group behavior", 《IEEE INTELLIGENT SYSTEMS》 *
莫维仁等: "短期负荷预测中选择相似日的探讨", 《清华大学学报(自然科学版)》 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108537466A (en) * 2018-04-25 2018-09-14 北京腾云天下科技有限公司 A kind of method and computing device of statistics application operation indicator
CN108537466B (en) * 2018-04-25 2020-09-15 北京腾云天下科技有限公司 Method for counting application operation indexes and computing equipment
CN110674375A (en) * 2019-09-25 2020-01-10 联想(北京)有限公司 Data processing method and electronic equipment

Also Published As

Publication number Publication date
CN107516114B (en) 2020-06-19

Similar Documents

Publication Publication Date Title
Azar et al. Dynamic policy programming
CN107526667A (en) A kind of Indexes Abnormality detection method, device and electronic equipment
CN105224543A (en) For the treatment of seasonal effect in time series method and apparatus
CN107516114A (en) A kind of time Series Processing method and device
CN103036231A (en) Forecasting method, device, and upper computer of power load
CN111831704A (en) Abnormal data determination method and device, storage medium and electronic equipment
JP2020501232A (en) Risk control event automatic processing method and apparatus
CN107370766A (en) A kind of network flow abnormal detecting method and system
CN104517020A (en) Characteristic extraction method and device used for cause and effect analysis
Vogel et al. A novel method to optimize electricity generation from wind energy
Wang A big data framework for stock price forecasting using fuzzy time series
Cormier et al. Simple models and insights for warehouse sizing
CN102023993B (en) Cluster page ranking equipment and method based on clustering/classification and time
CN116384574A (en) W-LSTM-based non-parameter load interval prediction method
Haug Closed form valuation of American barrier options
CN111175810B (en) Microseismic signal arrival time picking method, device, equipment and storage medium
CN117092582A (en) Electric energy meter abnormality detection method and device based on contrast self-encoder
Xylogiannopoulos et al. Periodicity data mining in time series using suffix arrays
CN115238779B (en) Cloud disk abnormality detection method, device, equipment and medium
Prodinger Approximate counting with m counters: a detailed analysis
US20190205804A1 (en) Information processing device, information processing method and computer readable medium
CN113850427A (en) Time series prediction method, electronic device, and computer storage medium
CN114358382A (en) Power system equivalent inertia probability prediction method, device, product and storage medium
CN107392220A (en) The clustering method and device of data flow
CN107605472B (en) Method and device for determining oil reservoir recovery ratio

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant