CN101794345A - Data processing method and device - Google Patents

Data processing method and device Download PDF

Info

Publication number
CN101794345A
CN101794345A CN200910244151A CN200910244151A CN101794345A CN 101794345 A CN101794345 A CN 101794345A CN 200910244151 A CN200910244151 A CN 200910244151A CN 200910244151 A CN200910244151 A CN 200910244151A CN 101794345 A CN101794345 A CN 101794345A
Authority
CN
China
Prior art keywords
data
smoothing
historical data
data information
smoothing factor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN200910244151A
Other languages
Chinese (zh)
Other versions
CN101794345B (en
Inventor
申小次
贾学力
李建军
庄明亮
付新刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Cennavi Technologies Co Ltd
Original Assignee
Beijing Cennavi Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Cennavi Technologies Co Ltd filed Critical Beijing Cennavi Technologies Co Ltd
Priority to CN 200910244151 priority Critical patent/CN101794345B/en
Publication of CN101794345A publication Critical patent/CN101794345A/en
Priority to PCT/CN2010/079716 priority patent/WO2011079705A1/en
Application granted granted Critical
Publication of CN101794345B publication Critical patent/CN101794345B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions

Abstract

The invention discloses a data processing method and a device, and relates to the technical field of intelligent traffic systems. The invention aims to solve the problem that the medium filtering smoothing technology in the prior art has the detect of random prediction process, so that the prediction precision is low, thereby failing to satisfy the requirements for actual prediction. The embodiment of the invention provides a data processing method which comprises the following steps: acquiring historical data information; carrying out data preprocessing on the historical data information; carrying out data combination and filling on the preprocessed historical data information; and carrying out dynamic exponential smoothing process on the historical data information. The embodiment of the invention can enhance the prediction precision and satisfy the requirements for actual prediction.

Description

A kind of data processing method and device
Technical field
The present invention relates to the intelligent transportation system technical field, relate in particular to a kind of data processing method and device.
Background technology
Advanced transportation information service systems (Advanced Traffic Information System, ATIS) be based upon on the perfect information network basis, this system can obtain all kinds of transport information by sensor or the data transmission set that is equipped in road, car, transfer stop, parking lot and forecast center, carries out overall treatment according to the described data that get access to.This system can provide Real-time Road traffic congestion information comprehensively and accurately to society in real time.But the data accessed by described equipment can not cover all roads fully, fill up thereby need carry out real time data by the similar inquiry of historical data, and available historical data are predicted after by analysis.
In order to improve the availability of dynamic information, the function that needs the information prediction of increase system, need carry out independent analysis to the historical road condition data of past certain hour in the cycle, obtain the variation tendency of every road in the traffic of historical data in the cycle, the mode by interface offers transportation information service systems and uses.Yet in the prior art, adopt median filter smoothness of image to handle usually historical data is handled, thereby the purpose of realization data prediction.
In realizing process of the present invention, the inventor finds that there are the following problems at least in the prior art: because the median filter smoothness of image treatment technology forecasting process that prior art adopted comparatively at random, make precision of prediction lower, can't satisfy the needs of actual prediction.
Summary of the invention
Embodiments of the invention provide a kind of data processing method and device.
For achieving the above object, embodiments of the invention adopt following technical scheme:
A kind of data processing method comprises:
Obtain historical data information;
Described historical data information is carried out the data pre-service;
Described pretreated historical data information is carried out data to be merged and fills up;
Described data are merged and fill up after historical data information carry out the dynamic index smoothing processing.
A kind of data processing equipment comprises:
Information acquisition unit is used to obtain historical data information;
Pretreatment unit is used for described historical data information is carried out the data pre-service;
Data merge shim, are used for that described pretreated historical data information is carried out data and merge and fill up;
The data smoothing unit is used for described data merging and the historical data information after filling up is carried out the dynamic index smoothing processing.
Data processing method that the embodiment of the invention provides and device are by obtaining historical data information; Described historical data information is carried out the data pre-service; Described pretreated historical data information is carried out data to be merged and fills up; Described data are merged and fill up after historical data information carry out the dynamic index smoothing processing.Compared with prior art, the dynamic index smoothing processing of employing of the present invention can be so that precision of prediction be higher and can satisfy the needs of actual prediction.
Description of drawings
A kind of data processing method process flow diagram that Fig. 1 provides for the embodiment of the invention;
In a kind of data processing method that Fig. 2 provides for the embodiment of the invention described historical data information is carried out the implementation procedure process flow diagram of the pretreated step of data;
In a kind of data processing method that Fig. 3 provides for the embodiment of the invention described pretreated historical data information carried out that data merge and the implementation procedure process flow diagram flow chart of the step filled up;
In a kind of data processing method that Fig. 4 provides for the embodiment of the invention described data are merged and fill up after historical data information carry out the implementation procedure process flow diagram of the step of dynamic index smoothing processing;
A kind of data processing equipment structural representation that Fig. 5 provides for the embodiment of the invention.
Embodiment
Below in conjunction with accompanying drawing embodiment of the invention data processing method and device are described in detail.
As shown in Figure 1, be a kind of data processing method that the embodiment of the invention provides, this method comprises:
101: obtain historical data information;
102: described historical data information is carried out the data pre-service; This step mainly is described historical data to be carried out the rejecting of abnormal data, improves the quality of historical data.
103: described pretreated historical data information is carried out data merge and fill up;
104: described data are merged and fill up after historical data information carry out the dynamic index smoothing processing.
Because described historical data exists various interference can produce a collection of abnormal data in collection and processing output procedure, if being carried out the data analysis meeting, described abnormal data has influence on the accuracy that predicts the outcome at last, so need carry out pre-service to described historical data, eliminate described abnormal data.Can adopt a kind of statistical method to come the rejecting abnormalities vehicle speed value in the embodiment of the invention based on the time band.
The implementation procedure of the pretreated step of data is provided described historical data information in a kind of data processing method that provides for the embodiment of the invention as shown in Figure 2; If the value of the time dimension of described historical data is 00:00-23:59; Per 5 minutes is a time period; The time band is meant one day vehicle speed value of certain road; Wherein, to be a time band half an hour.Described historical data pre-service is rejected some abnormal datas based on continuous urban history road condition data more than month, and its concrete implementation procedure comprises:
201: described historical data information is carried out the time band divide; Wherein, described historical data can be the historical vehicle speed value of reading from database; Described time band divide be to one day the speed of a motor vehicle of particular link dividing half an hour, thereby tentatively obtain 48 times bands.
202: the time band according to dividing merges verification; The specific implementation process of this step is:
Being with the T that successively uses F check and two samples to check the described ready-portioned time judges whether and can be with the merging time; Wherein, described F check is used to judge whether the variance of two time bands to be tested equates; The T check of described pair of sample is used to judge whether the average of two time bands to be tested equates; If the T check of described F check and described pair of sample all by merge two adjacent times bands.
The formula of structure F test statistics is: F = S 1 2 S 2 2 ~ F ( n 1 - 1 , n 2 - 1 ) ;
Wherein, S 1 2 = 1 n 1 - 1 Σ i = 1 n 1 ( X i - X ‾ ) 2 , S 2 2 = 1 n 2 - 1 Σ i = 1 n 2 ( Y i - Y ‾ ) 2 ;
X i, Y iBe respectively each vehicle speed value of two normal populations;
X, Y are sample average.
The region of rejection of F check is:
W = { F < F &alpha; 2 ( n 1 - 1 , n 2 - 1 ) Or F > F 1 - &alpha; 2 ( n 1 - 1 , n 2 - 1 ) }
α is an insolation level.
The formula of structure T test statistics is: T = X &OverBar; - Y &OverBar; S W &CenterDot; 1 n 1 + 1 n 2 ~ t ( n 1 + n 2 - 2 ) ;
Wherein, n 1, n 2Be respectively the effective value number of two normal populations;
s W 2 = ( n 1 - 1 ) S 1 2 + ( n 2 - 1 ) S 2 2 n 1 + n 2 - 2 (S 1 2, S 2 2Be respectively the variance of two normal populations)
The region of rejection of T check is: W = { | T | > t 1 - &alpha; 2 ( n 1 + n 2 - 2 ) } .
It should be noted that if band of described time satisfies the merging condition, then it merged that the time band after will merging is then handled as a time band; If described time band does not satisfy the merging condition, then it is still carried out follow-up processing according to the time band of dividing after its pre-service.
203: carry out unusual verification with described through the time band that merges verification, provide unusual check results; The implementation procedure of this step is: traversal is through merging the time band of handling, described effective time band is carried out the T check of U check or single sample respectively, if described effective time band is not by described check, think that then these data are abnormal data, record described abnormal data in the unusual check results.Wherein, described U check is applicable to fully big situation of sample size.
It should be noted that the U check is used under the very big situation of sample number, condition is that speed of a motor vehicle effective value number is greater than 30 herein.The formula of structure U test statistics is: U = x i - &mu; &sigma; / n
Wherein, x iRepresent the speed of a motor vehicle variable in this time band, the speed of a motor vehicle average of μ representative time band, the speed of a motor vehicle variance of σ representative time band, the effective value number of n representative time band;
The region of rejection of U check is: W={|U|>μ α/2;
Single sample T check is used under the situation of small sample amount, and condition is that speed of a motor vehicle effective value number is less than or equal to 30 herein.The formula of structure T test statistics is: T = x i - X &OverBar; s / n ;
Wherein, x iRepresent the speed of a motor vehicle variable in this time band, the speed of a motor vehicle average of μ representative time band, the speed of a motor vehicle variance of s representative time band, the effective value number of n representative time band;
The region of rejection of T check is: W={|T|>t α(n-1) }.
204:, abnormal data is rejected according to described unusual check results.
As shown in Figure 3, in a kind of data processing method that provides for the embodiment of the invention described pretreated historical data information carried out that data merge and the implementation procedure of the step filled up, this process comprises:
301: receive the described historical data information of rejecting through abnormal data;
302: the data in the identical moment in the described historical data information are merged processing; Concretely, be exactly that the data on the identical time point merge to identical week characteristic day, simply use the method for arithmetic mean to merge processing after, obtain one group of data.
303: whether the data that detect after described merging is handled exist no datat on the moment point;
304:, then carry out data filling and handle if there is no datat on the moment point.Wherein, the described data filling that carries out is handled and can be adopted the method for least square method to fill up, and for example: each two temporal data is done sample point before and after can getting the time point that needs fill data; Its specific implementation process is as follows:
Setting up regression model is Y ^ = &beta; ^ 0 + &beta; ^ 1 x ;
Wherein: x need to represent the time point of padding data;
Figure G2009102441510D00052
Represent the data that to fill up on this time point;
&beta; ^ 0 = y &OverBar; ^ - x &OverBar; &beta; ^ 1 &beta; ^ 1 = L xy / L xx ;
Wherein, L xy = def &Sigma; i = 1 n ( x i - x &OverBar; ) ( y i - y &OverBar; ) = &Sigma; i = 1 n x i y i - n x &OverBar; y &OverBar; , L xx = def &Sigma; i = 1 n ( x i - x &OverBar; ) 2 = &Sigma; i = 1 n x i 2 - n x &OverBar; 2 ;
(x i, y i) be each two temporal data value before and after the x time point of choosing.
As shown in Figure 4, in a kind of data processing method that provides for the embodiment of the invention described data are merged and fill up after historical data information carry out the implementation procedure of the step of dynamic index smoothing processing, the initial value that this process is established smoothing factor α is α 0, the initial value of exponential smoothing and control accuracy ε.Wherein, described control accuracy is used for judging whether being approximate optimal solution.This process specifically comprises:
401: receive the described historical data information of filling up processing through number;
402: the initial value α that obtains smoothing factor α 0, exponential smoothing initial value and control accuracy ε;
403:, obtain the exponential smoothing numerical value of described next moment point of exponential smoothing initial value according to the initial value and the described exponential smoothing initial value of described smoothing factor; Wherein, described exponential smoothing numerical value is that computing formula by exponential smoothing obtains.Described exponential smoothing is a kind of information processing method that is come by the development of moving average method.This method does not need to store the time series data of n phase, and gives recent real data with bigger flexible strategy, gives data at a specified future date with less flexible strategy, and the flexible strategy index of coincidence rule of each issue certificate.The exponential smoothing computing formula is:
y ^ t + 1 = &alpha; y t + ( 1 - &alpha; ) y ^ t - - - ( 1 - 1 )
In the formula, because the data in the embodiment of the invention are per 5 minutes among a 0:00-23:59 data, t value 1-288, expression begins per 5 minutes time points from 0:00, t=1, expression 0:00; T=288, expression 23:55;
Figure G2009102441510D00062
--smooth value of 5 minutes behind the-period t;
y t---period t actual value;
---period t smooth value;
α----smoothing factor claims weighting factor again, and span is 0≤α≤1.
Following formula shows: the exponential smoothing prediction is constantly the predicted value of last issue to be revised with predicated error, and obtains the predicted value of current period.
Figure G2009102441510D00064
Be
Figure G2009102441510D00065
And y tThe weighted arithmetic mean number, along with the size variation of α value, the decision
Figure G2009102441510D00066
Right with yt
Figure G2009102441510D00067
Influence degree.
Figure G2009102441510D00068
Have by the phase and review character, include the influence of full issue certificate.
404:, judge whether optimum smoothing factor of described smoothing factor according to described exponential smoothing data and the control accuracy that gets access to; Concrete implementation procedure is:
Calculate &dtri; SSE ( &alpha; 0 ) = ( dSSE / d&alpha; ) | &alpha; = &alpha; 0 ; If | | &dtri; SSE ( &alpha; 0 ) | | &le; &epsiv; , α then 0It is exactly approximate optimal solution; Otherwise not optimum solution, then begin the smoothing factor search.
405: if optimum smoothing factor is then proceeded the exponential smoothing of next moment point and handled;
406:, then carry out the smoothing factor search if not optimum smoothing factor.The specific implementation process of this step is as follows:
Step 1: for described smoothing factor is set up squared prediction error and minimum Optimization Model;
min SSE = &Sigma; t = 1 n e t 2 = &Sigma; t = 1 n ( y t - y ^ t ) 2 - - - ( 1 - 2 )
Described exponential smoothing computing formula (1-1) is given
Figure G2009102441510D000612
After the assignment, following formula can be expressed as
min SSE = &Sigma; t = 1 n [ y t - ( &alpha; y t - 1 + ( 1 - &alpha; ) y t - 1 ^ ) ] 2 - - - ( 1 - 3 )
Successively will
Figure G2009102441510D00071
Value bring following formula into, obtain after the arrangement:
min SSE = &Sigma; t = 1 n [ y t - &alpha; &Sigma; j = 1 t - 1 ( 1 - &alpha; ) j - 1 y t - j - ( 1 - &alpha; ) t - 1 y ^ 1 ] 2 - - - ( 1 - 4 )
Because when the t value is bigger,
Figure G2009102441510D00073
In fact very little, can be approximated to be 0.Then target equation (1-4) can be reduced to:
min SSE = &Sigma; t = 1 n [ y t - &alpha; &Sigma; j = 1 t - 1 ( 1 - &alpha; ) j - 1 y t - j ] 2 - - - ( 1 - 5 )
Find the solution above-mentioned nonlinear optimization model (1-5) and can obtain best α value.Ask the minimum value of following formula that a lot of methods are arranged, the steepest decline process of iteration of designed following fast convergence rate in embodiments of the present invention, easily carrying out: i.e. optimum gradient method.
Described optimum gradient method is the direction of search of the negative gradient direction of application target function as each step iteration.Because each step is all got the optimal step size of negative gradient direction, so be called optimum gradient method.Use optimum gradient method, it is the fastest that its target function value was descended in former steps, so be called method of steepest descent.
For example: the gradient of a n dimension nonlinear function f (X) is defined as:
&dtri; f ( x ) = &PartialD; f &PartialD; X = [ &PartialD; f &PartialD; x 1 , &PartialD; f &PartialD; x 2 , . . . , &PartialD; f &PartialD; x n ] T
This is a n dimension local derviation vector.
The vector of unit length of gradient is: S = &dtri; f ( X ) | | &dtri; f ( X ) | |
As can be seen, gradient direction is the normal direction of function just, the negative gradient direction with-S in the same way.Wherein
Figure G2009102441510D00077
Be called gradient Mould, also be norm.The computing formula of mould is
| | &dtri; f ( X ) | | = ( &PartialD; f &PartialD; x 1 ) 2 + ( &PartialD; f &PartialD; x 2 ) 2 + . . . + ( &PartialD; f &PartialD; x n ) 2
The character of gradient direction has: it is the fastest to descend along negative gradient directivity function value, is direction of steepest descent.The iterative formula of optimum gradient method:
Schilling X K+1=X k+ λ kS k
Wherein, vector of unit length S k = - &dtri; f ( X k ) | | &dtri; f ( X k ) | | ;
λ kBe optimal step size.Therefore, the concrete definition to optimum gradient method is: the optimization numerical method of getting the optimal step size search all the time along the negative gradient direction.
By the iterative formula of optimum gradient method, the formula of function minimization can be arranged
min &lambda; ( X k + &lambda; S k ) = f ( X k + &lambda; k S k ) = f ( X k + 1 )
With f (X) at X kBe launched into Taylor series near the point f ( X ) = f ( X k ) + &dtri; T f ( X k ) &Delta;X + 1 2 &Delta; X T A&Delta;X ;
Wherein, A = &PartialD; 2 f &PartialD; x 1 2 , &PartialD; 2 f &PartialD; x 1 &PartialD; x 2 , . . . . &PartialD; 2 f &PartialD; x 1 &PartialD; x n &PartialD; 2 f &PartialD; x 2 &PartialD; x 1 , &PartialD; 2 f &PartialD; x 2 2 , . . . . &PartialD; 2 f &PartialD; x 2 &PartialD; x n &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &PartialD; 2 f &PartialD; x n &PartialD; x 1 , &PartialD; 2 f &PartialD; x n &PartialD; x 2 , . . . . &PartialD; 2 f &PartialD; x n 2 ;
With Δ X=λ S kThe substitution following formula has
f ( X ) = f ( X k ) + &dtri; T f ( X k ) &lambda; S k + 1 2 ( &lambda; S k ) T A ( &lambda; S k )
And
&PartialD; f &PartialD; &lambda; | &lambda; = &lambda; k = &dtri; T f ( X k ) S k + &lambda; S k T A S k = 0
So optimal step size can be expressed as
&lambda; k = - &dtri; T f ( X ) AS k S k T A S k
Since in the embodiment of the invention for asking min SSE = &Sigma; t = 1 n [ y t - &alpha; &Sigma; j = 1 t - 1 ( 1 - &alpha; ) j - 1 y t - j ] 2 Minimum value, this function is the one dimension function that contains variable α.So α of derivation one dimension kValue, derivation is as follows:
α k+1=α k+λS k
Wherein s k = - &dtri; f ( &alpha; k ) | | &dtri; f ( &alpha; k ) | | , λ is a step-length.
min &lambda; f ( &alpha; k + &lambda; s k ) = f ( &alpha; k + &lambda; k s k ) = f ( &alpha; k + 1 )
With f (x) at α kBe launched into Taylor series near the point f ( x ) = f ( &alpha; k ) + &dtri; f ( &alpha; k ) &Delta;x + 1 2 f &prime; &prime; ( &alpha; k ) &Delta; x 2 ;
Because of Δ x=λ s k, bring following formula into, have
f ( x ) = f ( &alpha; k ) + &dtri; f ( &alpha; k ) &lambda; s k + 1 2 f &prime; &prime; ( x ) ( &lambda; s k ) 2
And &PartialD; f &PartialD; &lambda; | &lambda; = &lambda; k = &dtri; f ( &alpha; k ) s k + &lambda; f &prime; &prime; ( &alpha; k ) s k 2 = 0
So optimal step size can be expressed as
&lambda; k = - &dtri; f ( &alpha; k ) f &prime; &prime; ( &alpha; k ) s k
Like this, utilize the mode of optimum gradient method, from α 0The edge of setting out
Figure G2009102441510D00095
Direction is carried out linear search, asks optimal step size λ K-1Thereby with the formula of optimum gradient method,
Get access to &alpha; k = &alpha; k - 1 - &lambda; k - 1 &dtri; SSE ( &alpha; k - 1 ) , k &GreaterEqual; 1 ;
If | | &dtri; SSE ( &alpha; k ) | | &le; &epsiv; , α then kBe exactly approximate optimal solution, output α kAnd turn to step 2, otherwise turn to step 1.
Step 2:, obtain described optimum smoothing factor according to the Optimization Model of described foundation.To be similar to optimum solution α exactly concretely kBring the exponential smoothing model into, and be used for prediction.
As shown in Figure 5, be a kind of data processing equipment that the embodiment of the invention provides, this device comprises:
Information acquisition unit 501 is used to obtain historical data information;
Pretreatment unit 502 is used for described historical data information is carried out the data pre-service;
Data merge shim 503, are used for that described pretreated historical data information is carried out data and merge and fill up;
Data smoothing unit 504 is used for described data merging and the historical data information after filling up is carried out the dynamic index smoothing processing.
Wherein, described pretreatment unit comprises:
The time band is divided subelement, is used for that described historical data information is carried out the time band and divides;
Merge the syndrome unit, be used for merging verification according to the time band of dividing;
Unusual syndrome unit is used for carrying out unusual verification with described through the time band that merges verification, provides unusual check results;
The abnormality value removing subelement is used for according to described unusual check results abnormal data being rejected.
Described data merge shim, comprising:
Data message receives subelement, is used to receive the described historical data information of rejecting through abnormal data;
Data merge subelement, are used for the data in the described identical moment of historical data information are merged processing;
The data check subelement, whether the data that are used to detect after described merging is handled exist no datat on the moment point;
The data filling subelement if be used for existing no datat on the moment point, then carry out data filling and handles.
Described data merge shim, comprising:
Data message receives subelement, is used to receive the described historical data information of rejecting through abnormal data;
Data merge subelement, are used for the data in the described identical moment of historical data information are merged processing;
The data check subelement, whether the data that are used to detect after described merging is handled exist no datat on the moment point;
The data filling subelement if be used for existing no datat on the moment point, then carry out data filling and handles.
Described data smoothing unit comprises:
Data message receives subelement, is used to receive the described historical data information of filling up processing through number;
The parameter acquiring subelement is used to obtain the initial value of smoothing factor, exponential smoothing initial value and control accuracy;
The exponential smoothing value is obtained subelement, is used for initial value and described exponential smoothing initial value according to described smoothing factor, obtains the exponential smoothing numerical value of described next moment point of exponential smoothing initial value;
Optimum smoothing factor judgment sub-unit is used for the exponential smoothing data and the control accuracy that get access to according to described, judges whether optimum smoothing factor of described smoothing factor;
The smoothing processing subelement is used for if optimum smoothing factor is then proceeded the exponential smoothing of next moment point and handled;
Smoothing factor search subelement is used for if not optimum smoothing factor, then carries out the smoothing factor search.
It should be noted that described smoothing factor search subelement, may further include:
Optimization Model is set up subdivision, is used to described smoothing factor to set up squared prediction error and minimum Optimization Model;
Optimum smoothing factor obtains subdivision, is used for the Optimization Model according to described foundation, obtains described optimum smoothing factor.
Data processing method that the embodiment of the invention provides and device are by obtaining historical data information; Described historical data information is carried out the data pre-service; Described pretreated historical data information is carried out data to be merged and fills up; Described data are merged and fill up after historical data information carry out the dynamic index smoothing processing.Compared with prior art, the dynamic index smoothing processing of employing of the present invention can be so that precision of prediction be higher and can satisfy the needs of actual prediction.
Through the above description of the embodiments, one of ordinary skill in the art will appreciate that: realize that all or part of step in the foregoing description method is to instruct relevant hardware to finish by program, described program can be stored in the computer read/write memory medium, this program is when carrying out, comprise step as above-mentioned method embodiment, described storage medium, as: FLASH, ROM/RAM, magnetic disc, CD etc.
The above; only be the specific embodiment of the present invention, but protection scope of the present invention is not limited thereto, anyly is familiar with those skilled in the art in the technical scope that the present invention discloses; can expect easily changing or replacing, all should be encompassed within protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion by described protection domain with claim.

Claims (10)

1. a data processing method is characterized in that, comprising:
Obtain historical data information;
Described historical data information is carried out the data pre-service;
Described pretreated historical data information is carried out data to be merged and fills up;
Described data are merged and fill up after historical data information carry out the dynamic index smoothing processing.
2. data processing method according to claim 1 is characterized in that, described historical data information is carried out the pretreated step of data, comprising:
Described historical data information is carried out the time band divides;
Time band according to dividing merges verification;
Carry out unusual verification with described through the time band that merges verification, provide unusual check results;
According to described unusual check results, abnormal data is rejected.
3. data processing method according to claim 2 is characterized in that, described pretreated historical data information is carried out the step that data merge and fill up, and comprising:
Receive the described historical data information of rejecting through abnormal data;
The data in the identical moment in the described historical data information are merged processing;
Whether the data that detect after described merging is handled exist no datat on the moment point;
If there is no datat on the moment point, then carries out data filling and handle.
4. data processing method according to claim 3 is characterized in that, described data are merged and fill up after historical data information carry out the step of dynamic index smoothing processing, comprising:
Receive the described historical data information of filling up processing through number;
Obtain the initial value of smoothing factor, exponential smoothing initial value and control accuracy;
According to the initial value and the described exponential smoothing initial value of described smoothing factor, obtain the exponential smoothing numerical value of described next moment point of exponential smoothing initial value;
According to described exponential smoothing data and the control accuracy that gets access to, judge whether optimum smoothing factor of described smoothing factor;
If optimum smoothing factor is then proceeded the exponential smoothing of next moment point and is handled;
If not optimum smoothing factor, then carry out the smoothing factor search.
5. data processing method according to claim 4 is characterized in that, the step of described smoothing factor search comprises:
For described smoothing factor is set up squared prediction error and minimum Optimization Model;
According to the Optimization Model of described foundation, obtain described optimum smoothing factor.
6. a data processing equipment is characterized in that,
Information acquisition unit is used to obtain historical data information;
Pretreatment unit is used for described historical data information is carried out the data pre-service;
Data merge shim, are used for that described pretreated historical data information is carried out data and merge and fill up;
The data smoothing unit is used for described data merging and the historical data information after filling up is carried out the dynamic index smoothing processing.
7. data processing equipment according to claim 6 is characterized in that, described pretreatment unit comprises:
The time band is divided subelement, is used for that described historical data information is carried out the time band and divides;
Merge the syndrome unit, be used for merging verification according to the time band of dividing;
Unusual syndrome unit is used for carrying out unusual verification with described through the time band that merges verification, provides unusual check results;
The abnormality value removing subelement is used for according to described unusual check results abnormal data being rejected.
8. data processing equipment according to claim 7 is characterized in that, described data merge shim, comprising:
Data message receives subelement, is used to receive the described historical data information of rejecting through abnormal data;
Data merge subelement, are used for the data in the described identical moment of historical data information are merged processing;
The data check subelement, whether the data that are used to detect after described merging is handled exist no datat on the moment point;
The data filling subelement if be used for existing no datat on the moment point, then carry out data filling and handles.
9. data processing equipment according to claim 8 is characterized in that, described data smoothing unit comprises:
Data message receives subelement, is used to receive the described historical data information of filling up processing through number;
The parameter acquiring subelement is used to obtain the initial value of smoothing factor, exponential smoothing initial value and control accuracy;
The exponential smoothing value is obtained subelement, is used for initial value and described exponential smoothing initial value according to described smoothing factor, obtains the exponential smoothing numerical value of described next moment point of exponential smoothing initial value;
Optimum smoothing factor judgment sub-unit is used for the exponential smoothing data and the control accuracy that get access to according to described, judges whether optimum smoothing factor of described smoothing factor;
The smoothing processing subelement is used for if optimum smoothing factor is then proceeded the exponential smoothing of next moment point and handled;
Smoothing factor search subelement is used for if not optimum smoothing factor, then carries out the smoothing factor search.
10. data processing equipment according to claim 9 is characterized in that, described smoothing factor search subelement comprises:
Optimization Model is set up subdivision, is used to described smoothing factor to set up squared prediction error and minimum Optimization Model;
Optimum smoothing factor obtains subdivision, is used for the Optimization Model according to described foundation, obtains described optimum smoothing factor.
CN 200910244151 2009-12-30 2009-12-30 Data processing method and device Active CN101794345B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN 200910244151 CN101794345B (en) 2009-12-30 2009-12-30 Data processing method and device
PCT/CN2010/079716 WO2011079705A1 (en) 2009-12-30 2010-12-13 Data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200910244151 CN101794345B (en) 2009-12-30 2009-12-30 Data processing method and device

Publications (2)

Publication Number Publication Date
CN101794345A true CN101794345A (en) 2010-08-04
CN101794345B CN101794345B (en) 2013-05-15

Family

ID=42587035

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200910244151 Active CN101794345B (en) 2009-12-30 2009-12-30 Data processing method and device

Country Status (2)

Country Link
CN (1) CN101794345B (en)
WO (1) WO2011079705A1 (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101950483A (en) * 2010-09-15 2011-01-19 青岛海信网络科技股份有限公司 Repairing method and device for traffic data fault
WO2011079705A1 (en) * 2009-12-30 2011-07-07 北京世纪高通科技有限公司 Data processing method and device
CN102332011A (en) * 2011-09-09 2012-01-25 北京空间飞行器总体设计部 Method for selecting effective data of in-orbit spacecraft
CN102588210A (en) * 2011-12-21 2012-07-18 中能电力科技开发有限公司 Filtering method for preprocessing fitting data of power curve
CN103065041A (en) * 2012-12-18 2013-04-24 湖南大唐先一科技有限公司 Test method of redundant data
CN104679970A (en) * 2013-11-29 2015-06-03 高德软件有限公司 Data detection method and device
CN104865916A (en) * 2015-03-19 2015-08-26 上海航天能源股份有限公司 Natural gas supply data processing method
CN106290729A (en) * 2016-08-09 2017-01-04 成都润泰茂成科技有限公司 A kind of Monitoring Data processing means
CN107085943A (en) * 2015-12-23 2017-08-22 青岛海信网络科技股份有限公司 A kind of road travel time short term prediction method and system
CN112380310A (en) * 2020-11-26 2021-02-19 成都新橙北斗智联有限公司 GNSS high-precision anti-aliasing calculation result smoothing method

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1725208A (en) * 2004-07-19 2006-01-25 上海市市政工程管理处 Taffic information processing system for urban through street
CN100492434C (en) * 2006-11-30 2009-05-27 上海交通大学 Traffic flow state analysis required detection vehicle sampling quantity obtaining method
CN101325004B (en) * 2008-08-01 2011-10-05 北京航空航天大学 Method for compensating real time traffic information data
CN101794345B (en) * 2009-12-30 2013-05-15 北京世纪高通科技有限公司 Data processing method and device

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2011079705A1 (en) * 2009-12-30 2011-07-07 北京世纪高通科技有限公司 Data processing method and device
CN101950483B (en) * 2010-09-15 2013-03-20 青岛海信网络科技股份有限公司 Repairing method and device for traffic data fault
CN101950483A (en) * 2010-09-15 2011-01-19 青岛海信网络科技股份有限公司 Repairing method and device for traffic data fault
CN102332011A (en) * 2011-09-09 2012-01-25 北京空间飞行器总体设计部 Method for selecting effective data of in-orbit spacecraft
CN102332011B (en) * 2011-09-09 2012-12-26 北京空间飞行器总体设计部 Method for selecting effective data of in-orbit spacecraft
CN102588210A (en) * 2011-12-21 2012-07-18 中能电力科技开发有限公司 Filtering method for preprocessing fitting data of power curve
CN102588210B (en) * 2011-12-21 2014-02-12 中能电力科技开发有限公司 Filtering method for preprocessing fitting data of power curve
CN103065041B (en) * 2012-12-18 2016-08-03 湖南大唐先一科技有限公司 A kind of test method of redundant data
CN103065041A (en) * 2012-12-18 2013-04-24 湖南大唐先一科技有限公司 Test method of redundant data
CN104679970A (en) * 2013-11-29 2015-06-03 高德软件有限公司 Data detection method and device
CN104679970B (en) * 2013-11-29 2018-11-09 高德软件有限公司 A kind of data detection method and device
CN104865916A (en) * 2015-03-19 2015-08-26 上海航天能源股份有限公司 Natural gas supply data processing method
CN107085943A (en) * 2015-12-23 2017-08-22 青岛海信网络科技股份有限公司 A kind of road travel time short term prediction method and system
CN107085943B (en) * 2015-12-23 2020-06-30 青岛海信网络科技股份有限公司 Short-term prediction method and system for road travel time
CN106290729A (en) * 2016-08-09 2017-01-04 成都润泰茂成科技有限公司 A kind of Monitoring Data processing means
CN112380310A (en) * 2020-11-26 2021-02-19 成都新橙北斗智联有限公司 GNSS high-precision anti-aliasing calculation result smoothing method
CN112380310B (en) * 2020-11-26 2023-12-01 成都新橙北斗智联有限公司 GNSS high-precision anti-aliasing resolving result smoothing method

Also Published As

Publication number Publication date
CN101794345B (en) 2013-05-15
WO2011079705A1 (en) 2011-07-07

Similar Documents

Publication Publication Date Title
CN101794345B (en) Data processing method and device
CN101950477B (en) Method and device for processing traffic information
CN102087788B (en) Method for estimating traffic state parameter based on confidence of speed of float car
CN108665093B (en) Deep learning-based expressway traffic accident severity prediction method
CN104318757B (en) Bus bus or train route section Forecasting Methodology working time on a kind of public transportation lane
CN103280110B (en) The Forecasting Methodology and device of expressway travel time
CN104992244A (en) Airport freight traffic prediction analysis method based on SARIMA and RBF neural network integration combination model
CN111126868B (en) Road traffic accident occurrence risk determination method and system
CN111950603B (en) Prediction method and device for road section traffic accident rate and computer storage medium
CN111047078B (en) Traffic characteristic prediction method, system and storage medium
CN101964061B (en) Binary kernel function support vector machine-based vehicle type recognition method
CN109859477A (en) A kind of determination method and apparatus of congestion data
CN113378458A (en) Congestion early warning method, device, medium and equipment based on big data
CN105206040A (en) Bus bunching predication method based on IC card data
CN113297795A (en) Method for constructing running condition of pure electric vehicle
CN110837979B (en) Safe driving risk prediction method and device based on random forest
CN115204755B (en) Service area access rate measuring method and device, electronic equipment and readable storage medium
CN114463978B (en) Data monitoring method based on track traffic information processing terminal
CN115271565B (en) DEA-based method, device and equipment for evaluating highway pavement maintenance measures
CN101694747B (en) Method and device for indentifying abnormal vehicle speed
CN114281808A (en) Traffic big data cleaning method, device, equipment and readable storage medium
CN115169630A (en) Electric vehicle charging load prediction method and device
CN113222208A (en) Ada-XGboost-based traffic accident prediction system
CN113298309A (en) Method, device and terminal for predicting traffic congestion state
CN116579677B (en) Full life cycle management method and system for high-speed railway electric service vehicle-mounted equipment

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant