CN101794345B - Data processing method and device - Google Patents

Data processing method and device Download PDF

Info

Publication number
CN101794345B
CN101794345B CN 200910244151 CN200910244151A CN101794345B CN 101794345 B CN101794345 B CN 101794345B CN 200910244151 CN200910244151 CN 200910244151 CN 200910244151 A CN200910244151 A CN 200910244151A CN 101794345 B CN101794345 B CN 101794345B
Authority
CN
China
Prior art keywords
data
smoothing
historical data
data information
subelement
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN 200910244151
Other languages
Chinese (zh)
Other versions
CN101794345A (en
Inventor
申小次
贾学力
李建军
庄明亮
付新刚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Cennavi Technologies Co Ltd
Original Assignee
Beijing Cennavi Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Cennavi Technologies Co Ltd filed Critical Beijing Cennavi Technologies Co Ltd
Priority to CN 200910244151 priority Critical patent/CN101794345B/en
Publication of CN101794345A publication Critical patent/CN101794345A/en
Priority to PCT/CN2010/079716 priority patent/WO2011079705A1/en
Application granted granted Critical
Publication of CN101794345B publication Critical patent/CN101794345B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G08SIGNALLING
    • G08GTRAFFIC CONTROL SYSTEMS
    • G08G1/00Traffic control systems for road vehicles
    • G08G1/01Detecting movement of traffic to be counted or controlled
    • G08G1/0104Measuring and analyzing of parameters relative to traffic conditions

Abstract

The invention discloses a data processing method and a device, and relates to the technical field of intelligent traffic systems. The invention aims to solve the problem that the medium filtering smoothing technology in the prior art has the detect of random prediction process, so that the prediction precision is low, thereby failing to satisfy the requirements for actual prediction. The embodiment of the invention provides a data processing method which comprises the following steps: acquiring historical data information; carrying out data preprocessing on the historical data information; carrying out data combination and filling on the preprocessed historical data information; and carrying out dynamic exponential smoothing process on the historical data information. The embodiment of the invention can enhance the prediction precision and satisfy the requirements for actual prediction.

Description

A kind of data processing method and device
Technical field
The present invention relates to the intelligent transportation system technical field, relate in particular to a kind of data processing method and device.
Background technology
Advanced transportation information service systems (Advanced Traffic Information System, ATIS) be based upon on perfect information network basis, this system can obtain all kinds of transport information by sensor or the data transmission set that is equipped in road, car, transfer stop, parking lot and forecast center, carries out overall treatment according to the described data that get.This system can provide Real-time Road traffic congestion information comprehensively and accurately to society in real time.But the data accessed by described equipment can not cover all roads fully, fill up thereby need to carry out real time data by the similar inquiry of historical data, and available historical data are predicted by analysis afterwards.
In order to improve the availability of dynamic information, the function that needs the information prediction of increase system, need to carry out independent analysis to the historical road condition data of past certain hour in the cycle, obtain every road in the variation tendency of the traffic of historical data in the cycle, the mode by interface offers transportation information service systems and uses.Yet in the prior art, usually adopt median filter smoothness of image to process historical data is processed, thereby realize the purpose of data prediction.
In realizing process of the present invention, the inventor finds that in prior art, there are the following problems at least: because the median filter smoothness of image treatment technology forecasting process that prior art adopts is comparatively random, make precision of prediction lower, can't satisfy the needs of actual prediction.
Summary of the invention
Embodiments of the invention provide a kind of data processing method and device.
For achieving the above object, embodiments of the invention adopt following technical scheme:
A kind of data processing method comprises:
Obtain historical data information;
Described historical data information is carried out the data pre-service;
Described pretreated historical data information is carried out data to be merged and fills up;
Described data are merged and fill up after historical data information carry out based Dynamic Exponential Smoothing and process.
A kind of data processing equipment comprises:
Information acquisition unit is used for obtaining historical data information;
Pretreatment unit is used for described historical data information is carried out the data pre-service;
Data merge shim, are used for that described pretreated historical data information is carried out data and merge and fill up;
The data smoothing unit is used for described data merging and the historical data information after filling up is carried out the based Dynamic Exponential Smoothing processing.
The data processing method that the embodiment of the present invention provides and device are by obtaining historical data information; Described historical data information is carried out the data pre-service; Described pretreated historical data information is carried out data to be merged and fills up; Described data are merged and fill up after historical data information carry out based Dynamic Exponential Smoothing and process.Compared with prior art, the based Dynamic Exponential Smoothing of employing of the present invention is processed, can be so that precision of prediction is higher and can satisfy the needs of actual prediction.
Description of drawings
A kind of data processing method process flow diagram that Fig. 1 provides for the embodiment of the present invention;
In a kind of data processing method that Fig. 2 provides for the embodiment of the present invention, described historical data information is carried out the implementation procedure process flow diagram of the pretreated step of data;
In a kind of data processing method that Fig. 3 provides for the embodiment of the present invention, described pretreated historical data information is carried out that data merge and the implementation procedure process flow diagram flow chart of the step filled up;
In a kind of data processing method that Fig. 4 provides for the embodiment of the present invention, described data are merged and fill up after historical data information carry out the implementation procedure process flow diagram of the step that based Dynamic Exponential Smoothing processes;
A kind of data processing equipment structural representation that Fig. 5 provides for the embodiment of the present invention.
Embodiment
Below in conjunction with accompanying drawing, embodiment of the present invention data processing method and device are described in detail.
As shown in Figure 1, be a kind of data processing method that the embodiment of the present invention provides, the method comprises:
101: obtain historical data information;
102: described historical data information is carried out the data pre-service; This step is mainly described historical data to be carried out the rejecting of abnormal data, improves the quality of historical data.
103: described pretreated historical data information is carried out data merge and fill up;
104: described data are merged and fill up after historical data information carry out based Dynamic Exponential Smoothing and process.
Exist various interference can produce a collection of abnormal data in the acquisition and processing output procedure due to described historical data, if being carried out the data analysis meeting, described abnormal data has influence on the accuracy that predicts the outcome at last, so need to carry out pre-service to described historical data, eliminate described abnormal data.Can adopt a kind of statistical method of time-based band to come the rejecting abnormalities vehicle speed value in the embodiment of the present invention.
The implementation procedure of the pretreated step of data is provided described historical data information in a kind of data processing method that provides for the embodiment of the present invention as shown in Figure 2; If the value of the time dimension of described historical data is 00:00-23:59; Every 5 minutes is a time period; The time band refers to the vehicle speed value of a day of certain road; Wherein, be with take half an hour as a time.Described historical data pre-service is rejected some abnormal datas take continuous urban history road condition data more than month as the basis, and its concrete implementation procedure comprises:
201: described historical data information is carried out the time band divide; Wherein, described historical data can be the historical vehicle speed value of reading from database; Described time band divide be to the speed of a motor vehicle of a day of particular link dividing half an hour, thereby tentatively obtain 48 times bands.
202: the time band according to dividing merges verification; The specific implementation process of this step is:
Described ready-portioned time band is successively checked to judge whether with the T of F check and two samples can merging time band; Wherein, described F check is used for judging whether the variance of two time bands to be tested equates; The T check of described pair of sample is used for judging whether the average of two time bands to be tested equates; If the T check of described F check and described pair of sample all by merge two adjacent times bands.
The formula of structure F test statistics is: F = S 1 2 S 2 2 ~ F ( n 1 - 1 , n 2 - 1 ) ;
Wherein, S 1 2 = 1 n 1 - 1 Σ i = 1 n 1 ( X i - X ‾ ) 2 , S 2 2 = 1 n 2 - 1 Σ i = 1 n 2 ( Y i - Y ‾ ) 2 ;
X i, Y iBe respectively each vehicle speed value of two normal populations;
X, Y are sample average.
The region of rejection of F check is:
W = { F < F &alpha; 2 ( n 1 - 1 , n 2 - 1 ) Or F > F 1 - &alpha; 2 ( n 1 - 1 , n 2 - 1 ) }
α is insolation level.
The formula of structure T test statistics is: T = X &OverBar; - Y &OverBar; S W &CenterDot; 1 n 1 + 1 n 2 ~ t ( n 1 + n 2 - 2 ) ;
Wherein, n 1, n 2Be respectively the effective value number of two normal populations;
s W 2 = ( n 1 - 1 ) S 1 2 + ( n 2 - 1 ) S 2 2 n 1 + n 2 - 2 (S 1 2, S 2 2Be respectively the variance of two normal populations)
The region of rejection of T check is: W = { | T | > t 1 - &alpha; 2 ( n 1 + n 2 - 2 ) } .
It should be noted that if band of described time satisfies the merging condition, it is merged, the time band after then merging is processed as a time band; If described time band does not satisfy the merging condition, it is still carried out follow-up processing according to the time band of dividing after its pre-service.
203: carry out abnormal verification with described through the time band that merges verification, provide abnormal check results; The implementation procedure of this step is: traversal is through merging the time band of processing, described effective time band is carried out respectively the T check of U check or single sample, if described effective time band is not by described check, think that these data are abnormal data, are recorded to described abnormal data in abnormal check results.Wherein, described U check is applicable to fully large situation of sample size.
It should be noted that the U check is used in the very large situation of sample number, condition is that speed of a motor vehicle effective value number is greater than 30 herein.The formula of structure U test statistics is: U = x i - &mu; &sigma; / n
Wherein, x iRepresent the speed of a motor vehicle variable in this time band, μ represents the speed of a motor vehicle average of time band, and σ represents the speed of a motor vehicle variance of time band, and n represents the effective value number of time band;
The region of rejection of U check is: W={|U|>μ α/2;
Single sample T check is used in the situation of small sample amount, and condition is that speed of a motor vehicle effective value number is less than or equal to 30 herein.The formula of structure T test statistics is: T = x i - X &OverBar; s / n ;
Wherein, x iRepresent the speed of a motor vehicle variable in this time band, μ represents the speed of a motor vehicle average of time band, and s represents the speed of a motor vehicle variance of time band, and n represents the effective value number of time band;
The region of rejection of T check is: W={|T|>t α(n-1) }.
204: according to described abnormal check results, with rejecting abnormal data.
As shown in Figure 3, in a kind of data processing method that provides for the embodiment of the present invention, described pretreated historical data information is carried out that data merge and the implementation procedure of the step filled up, this process comprises:
301: receive described historical data information through rejecting abnormal data;
302: phase data in the same time in described historical data information are merged processing; Concretely, be exactly that the data on same time point merge to identical week characteristic day, after simply using the method for arithmetic mean to merge processing, obtain one group of data.
303: whether the data that detect after described merging is processed exist countless certificates on moment point;
304: if there are countless certificates on moment point, carry out data filling and process.Wherein, the described data filling that carries out is processed and can be adopted the method for least square method to fill up, and for example: before and after can getting the time point that needs fill data, each two temporal data are done sample point; Its specific implementation process is as follows:
Setting up regression model is Y ^ = &beta; ^ 0 + &beta; ^ 1 x ;
Wherein: x need to represent the time point of padding data;
Figure G2009102441510D00052
Represent the data that to fill up on this time point;
&beta; ^ 0 = y &OverBar; ^ - x &OverBar; &beta; ^ 1 &beta; ^ 1 = L xy / L xx ;
Wherein, L xy = def &Sigma; i = 1 n ( x i - x &OverBar; ) ( y i - y &OverBar; ) = &Sigma; i = 1 n x i y i - n x &OverBar; y &OverBar; , L xx = def &Sigma; i = 1 n ( x i - x &OverBar; ) 2 = &Sigma; i = 1 n x i 2 - n x &OverBar; 2 ;
(x i, y i) be each two temporal data values before and after the x time point of choosing.
As shown in Figure 4, in a kind of data processing method that provides for the embodiment of the present invention, described data are merged and fill up after historical data information carry out the implementation procedure of the step that based Dynamic Exponential Smoothing processes, the initial value that this process is established smoothing factor α is α 0, the initial value of exponential smoothing and control accuracy ε.Wherein, described control accuracy is used for determining whether approximate optimal solution.This process specifically comprises:
401: receive the described historical data information of filling up processing through number;
402: the initial value α that obtains smoothing factor α 0, exponential smoothing initial value and control accuracy ε;
403: according to initial value and the described exponential smoothing initial value of described smoothing factor, obtain the exponential smoothing numerical value of described next moment point of exponential smoothing initial value; Wherein, described exponential smoothing numerical value is that computing formula by exponential smoothing obtains.Described exponential smoothing is a kind of information processing method that is developed by the moving average method.The method does not need to store the time series data of n phase, and gives recent real data with larger flexible strategy, gives data at a specified future date with less flexible strategy, and the flexible strategy index of coincidence rule of each issue certificate.The exponential smoothing computing formula is:
y ^ t + 1 = &alpha; y t + ( 1 - &alpha; ) y ^ t - - - ( 1 - 1 )
In formula, because the data in the embodiment of the present invention are the data of every 5 minutes in 0:00-23:59, t value 1-288, expression begins every 5 minutes time points from 0:00, t=1, expression 0:00; T=288, expression 23:55;
Figure G2009102441510D00062
---smooth value of 5 minutes after period t;
y t---period t actual value;
Figure G2009102441510D00063
---period t smooth value;
α----smoothing factor claims again weighting factor, and span is 0≤α≤1.
Following formula shows: Smoothing Prediction is constantly the predicted value of last issue to be revised with predicated error, and obtains the predicted value of current period. Be
Figure G2009102441510D00065
And y tThe weighted arithmetic mean number, along with the size variation of α value, determine
Figure G2009102441510D00066
With yt pair
Figure G2009102441510D00067
Influence degree.
Figure G2009102441510D00068
Have and review character by the phase, include the impact of full issue certificate.
404: according to described exponential smoothing data and the control accuracy that gets, judge whether optimal smoothing coefficient of described smoothing factor; Concrete implementation procedure is:
Calculate &dtri; SSE ( &alpha; 0 ) = ( dSSE / d&alpha; ) | &alpha; = &alpha; 0 ; If | | &dtri; SSE ( &alpha; 0 ) | | &le; &epsiv; , α 0It is exactly approximate optimal solution; Otherwise not optimum solution, begin the smoothing factor search.
405: if the optimal smoothing coefficient is proceeded the exponential smoothing of next moment point and processed;
406: if not the optimal smoothing coefficient, carry out the smoothing factor search.The specific implementation process of this step is as follows:
Step 1: the Optimized model of setting up the Prediction sum squares minimum for described smoothing factor;
min SSE = &Sigma; t = 1 n e t 2 = &Sigma; t = 1 n ( y t - y ^ t ) 2 - - - ( 1 - 2 )
Described exponential smoothing computing formula (1-1) is given
Figure G2009102441510D000612
After assignment, following formula can be expressed as
min SSE = &Sigma; t = 1 n [ y t - ( &alpha; y t - 1 + ( 1 - &alpha; ) y t - 1 ^ ) ] 2 - - - ( 1 - 3 )
Successively will
Figure G2009102441510D00071
Value bring following formula into, obtain after arrangement:
min SSE = &Sigma; t = 1 n [ y t - &alpha; &Sigma; j = 1 t - 1 ( 1 - &alpha; ) j - 1 y t - j - ( 1 - &alpha; ) t - 1 y ^ 1 ] 2 - - - ( 1 - 4 )
Due to the t value when larger,
Figure G2009102441510D00073
In fact very little, can be approximated to be 0.Target equation (1-4) can be reduced to:
min SSE = &Sigma; t = 1 n [ y t - &alpha; &Sigma; j = 1 t - 1 ( 1 - &alpha; ) j - 1 y t - j ] 2 - - - ( 1 - 5 )
Find the solution above-mentioned Non-linear Optimal Model (1-5) and can obtain best α value.Ask the minimum value of following formula that a lot of methods are arranged, the steepest Descent iteration method of designed in embodiments of the present invention following fast convergence rate, easily carrying out: i.e. optimum gradient method.
Described optimum gradient method is that the negative gradient direction of application target function is as the direction of search of each step iteration.Because each step is all got the optimal step size of negative gradient direction, so be called optimum gradient method.Use optimum gradient method, its target function value is descended the fastest in former steps, therefore be called method of steepest descent.
For example: the gradient of a n dimension nonlinear function f (X) is defined as:
&dtri; f ( x ) = &PartialD; f &PartialD; X = [ &PartialD; f &PartialD; x 1 , &PartialD; f &PartialD; x 2 , . . . , &PartialD; f &PartialD; x n ] T
This is a n dimension local derviation vector.
The vector of unit length of gradient is: S = &dtri; f ( X ) | | &dtri; f ( X ) | |
Can find out, gradient direction is the normal direction of function namely, the negative gradient direction with-S in the same way.Wherein Be called gradient
Figure G2009102441510D00078
Mould, also be norm.The computing formula of mould is
| | &dtri; f ( X ) | | = ( &PartialD; f &PartialD; x 1 ) 2 + ( &PartialD; f &PartialD; x 2 ) 2 + . . . + ( &PartialD; f &PartialD; x n ) 2
The character of gradient direction has: descend along negative gradient directivity function value the fastest, be direction of steepest descent.The iterative formula of optimum gradient method:
Schilling X k+1=X k+ λ kS k
Wherein, vector of unit length S k = - &dtri; f ( X k ) | | &dtri; f ( X k ) | | ;
λ kBe optimal step size.Therefore, to the specific definition of optimum gradient method be: the optimization numerical method of getting all the time the optimal step size search along the negative gradient direction.
By the iterative formula of optimum gradient method, the formula of Function Minimization can be arranged
min &lambda; ( X k + &lambda; S k ) = f ( X k + &lambda; k S k ) = f ( X k + 1 )
With f (X) at X kBe launched into Taylor series near point f ( X ) = f ( X k ) + &dtri; T f ( X k ) &Delta;X + 1 2 &Delta; X T A&Delta;X ;
Wherein, A = &PartialD; 2 f &PartialD; x 1 2 , &PartialD; 2 f &PartialD; x 1 &PartialD; x 2 , . . . . &PartialD; 2 f &PartialD; x 1 &PartialD; x n &PartialD; 2 f &PartialD; x 2 &PartialD; x 1 , &PartialD; 2 f &PartialD; x 2 2 , . . . . &PartialD; 2 f &PartialD; x 2 &PartialD; x n &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &CenterDot; &PartialD; 2 f &PartialD; x n &PartialD; x 1 , &PartialD; 2 f &PartialD; x n &PartialD; x 2 , . . . . &PartialD; 2 f &PartialD; x n 2 ;
With Δ X=λ S kThe substitution following formula has
f ( X ) = f ( X k ) + &dtri; T f ( X k ) &lambda; S k + 1 2 ( &lambda; S k ) T A ( &lambda; S k )
And
&PartialD; f &PartialD; &lambda; | &lambda; = &lambda; k = &dtri; T f ( X k ) S k + &lambda; S k T A S k = 0
So optimal step size can be expressed as
&lambda; k = - &dtri; T f ( X ) AS k S k T A S k
Due in the embodiment of the present invention for asking min SSE = &Sigma; t = 1 n [ y t - &alpha; &Sigma; j = 1 t - 1 ( 1 - &alpha; ) j - 1 y t - j ] 2 Minimum value, this function is the one dimension function that contains variable α.So α of derivation one dimension kValue, derivation is as follows:
α k+1=α k+λS k
Wherein s k = - &dtri; f ( &alpha; k ) | | &dtri; f ( &alpha; k ) | | , λ is step-length.
min &lambda; f ( &alpha; k + &lambda; s k ) = f ( &alpha; k + &lambda; k s k ) = f ( &alpha; k + 1 )
With f (x) at α kBe launched into Taylor series near point f ( x ) = f ( &alpha; k ) + &dtri; f ( &alpha; k ) &Delta;x + 1 2 f &prime; &prime; ( &alpha; k ) &Delta; x 2 ;
Because of Δ x=λ s k, bring following formula into, have
f ( x ) = f ( &alpha; k ) + &dtri; f ( &alpha; k ) &lambda; s k + 1 2 f &prime; &prime; ( x ) ( &lambda; s k ) 2
And &PartialD; f &PartialD; &lambda; | &lambda; = &lambda; k = &dtri; f ( &alpha; k ) s k + &lambda; f &prime; &prime; ( &alpha; k ) s k 2 = 0
So optimal step size can be expressed as
&lambda; k = - &dtri; f ( &alpha; k ) f &prime; &prime; ( &alpha; k ) s k
Like this, utilize the mode of optimum gradient method, from α 0The edge of setting out
Figure G2009102441510D00095
Direction is carried out linear search, asks optimal step size λ k-1Thereby with the formula of optimum gradient method,
Get &alpha; k = &alpha; k - 1 - &lambda; k - 1 &dtri; SSE ( &alpha; k - 1 ) , k &GreaterEqual; 1 ;
If | | &dtri; SSE ( &alpha; k ) | | &le; &epsiv; , α kBe exactly approximate optimal solution, output α kAnd turn to step 2, otherwise turn to step 1.
Step 2: according to the Optimized model of described foundation, obtain described optimal smoothing coefficient.To be similar to optimum solution α exactly concretely kBring the exponential smoothing model into, and be used for prediction.
As shown in Figure 5, be a kind of data processing equipment that the embodiment of the present invention provides, this device comprises:
Information acquisition unit 501 is used for obtaining historical data information;
Pretreatment unit 502 is used for described historical data information is carried out the data pre-service;
Data merge shim 503, are used for that described pretreated historical data information is carried out data and merge and fill up;
Data smoothing unit 504 is used for described data merging and the historical data information after filling up is carried out the based Dynamic Exponential Smoothing processing.
Wherein, described pretreatment unit comprises:
The time band is divided subelement, is used for that described historical data information is carried out the time band and divides;
Merging syndrome unit is used for merging verification according to the time band of dividing;
Abnormal syndrome unit is used for carrying out abnormal verification with described through the time band that merges verification, provides abnormal check results;
The abnormality value removing subelement is used for according to described abnormal check results, with rejecting abnormal data.
Described data merge shim, comprising:
Data message receives subelement, is used for receiving described historical data information through rejecting abnormal data;
Data merge subelement, are used for described historical data information phase data are in the same time merged processing;
The data check subelement, whether the data after processing for detection of described merging exist countless certificates on moment point;
The data filling subelement if be used for having countless certificates on moment point, carry out data filling and processes.
Described data merge shim, comprising:
Data message receives subelement, is used for receiving described historical data information through rejecting abnormal data;
Data merge subelement, are used for described historical data information phase data are in the same time merged processing;
The data check subelement, whether the data after processing for detection of described merging exist countless certificates on moment point;
The data filling subelement if be used for having countless certificates on moment point, carry out data filling and processes.
Described data smoothing unit comprises:
Data message receives subelement, is used for receiving the described historical data information of filling up processing through number;
The parameter acquiring subelement is for the initial value that obtains smoothing factor, exponential smoothing initial value and control accuracy;
The exponential smoothing value is obtained subelement, is used for initial value and described exponential smoothing initial value according to described smoothing factor, obtains the exponential smoothing numerical value of described next moment point of exponential smoothing initial value;
Optimal smoothing coefficient judgment sub-unit is used for according to described exponential smoothing data and the control accuracy that gets, and judges whether optimal smoothing coefficient of described smoothing factor;
The smoothing processing subelement is used for if the optimal smoothing coefficient is proceeded the exponential smoothing of next moment point and processed;
Smoothing factor search subelement is used for if not the optimal smoothing coefficient, carries out the smoothing factor search.
It should be noted that described smoothing factor search subelement, may further include:
Optimized model is set up subdivision, is used to described smoothing factor to set up the Optimized model of Prediction sum squares minimum;
The optimal smoothing coefficient obtains subdivision, is used for the Optimized model according to described foundation, obtains described optimal smoothing coefficient.
The data processing method that the embodiment of the present invention provides and device are by obtaining historical data information; Described historical data information is carried out the data pre-service; Described pretreated historical data information is carried out data to be merged and fills up; Described data are merged and fill up after historical data information carry out based Dynamic Exponential Smoothing and process.Compared with prior art, the based Dynamic Exponential Smoothing of employing of the present invention is processed, can be so that precision of prediction is higher and can satisfy the needs of actual prediction.
Through the above description of the embodiments, one of ordinary skill in the art will appreciate that: realize that all or part of step in above-described embodiment method is to come the relevant hardware of instruction to complete by program, described program can be stored in a computer read/write memory medium, this program is when carrying out, comprise the step as above-mentioned embodiment of the method, described storage medium, as: FLASH, ROM/RAM, magnetic disc, CD etc.
The above; be only the specific embodiment of the present invention, but protection scope of the present invention is not limited to this, anyly is familiar with those skilled in the art in the technical scope that the present invention discloses; can expect easily changing or replacing, within all should being encompassed in protection scope of the present invention.Therefore, protection scope of the present invention should be as the criterion by described protection domain with claim.

Claims (8)

1. a data processing method, is characterized in that, comprising:
Obtain historical data information; Described historical data is the historical vehicle speed value of reading from database;
Described historical data information is carried out the data pre-service, rejecting abnormalities data from described historical data;
Described pretreated historical data information is carried out data to be merged and fills up;
Described data are merged and fill up after historical data information carry out based Dynamic Exponential Smoothing and process;
Wherein, described described data are merged and fill up after historical data information carry out the step that based Dynamic Exponential Smoothing is processed, comprising:
Receive the described historical data information of filling up processing through number;
Obtain the initial value of smoothing factor, exponential smoothing initial value and control accuracy;
According to initial value and the described exponential smoothing initial value of described smoothing factor, obtain the exponential smoothing numerical value of described next moment point of exponential smoothing initial value;
According to described exponential smoothing data and the control accuracy that gets, judge whether optimal smoothing coefficient of described smoothing factor;
If the optimal smoothing coefficient is proceeded the exponential smoothing of next moment point and is processed;
If not the optimal smoothing coefficient, carry out the smoothing factor search.
2. data processing method according to claim 1, is characterized in that, described historical data information is carried out the pretreated step of data, comprising:
Described historical data information is carried out the time band divides;
Time band according to dividing merges verification;
Carry out abnormal verification with described through the time band that merges verification, provide abnormal check results;
According to described abnormal check results, with rejecting abnormal data.
3. data processing method according to claim 2, is characterized in that, described pretreated historical data information is carried out the step that data merge and fill up, and comprising:
Receive described historical data information through rejecting abnormal data;
Phase data in the same time in described historical data information are merged processing;
Whether the data that detect after described merging is processed exist countless certificates on moment point;
If there are countless certificates on moment point, carry out data filling and process.
4. data processing method according to claim 1, is characterized in that, the step of described smoothing factor search comprises:
Set up the Optimized model of Prediction sum squares minimum for described smoothing factor;
According to the Optimized model of described foundation, obtain described optimal smoothing coefficient.
5. a data processing equipment, is characterized in that,
Information acquisition unit is used for obtaining historical data information; Described historical data is the historical vehicle speed value of reading from database;
Pretreatment unit is used for described historical data information being carried out the data pre-service, rejecting abnormalities data from described historical data;
Data merge shim, are used for that described pretreated historical data information is carried out data and merge and fill up;
The data smoothing unit is used for described data merging and the historical data information after filling up is carried out the based Dynamic Exponential Smoothing processing;
Wherein, described data smoothing unit comprises:
Data message receives subelement, is used for receiving the described historical data information of filling up processing through number;
The parameter acquiring subelement is for the initial value that obtains smoothing factor, exponential smoothing initial value and control accuracy;
The exponential smoothing value is obtained subelement, is used for initial value and described exponential smoothing initial value according to described smoothing factor, obtains the exponential smoothing numerical value of described next moment point of exponential smoothing initial value;
Optimal smoothing coefficient judgment sub-unit is used for according to described exponential smoothing data and the control accuracy that gets, and judges whether optimal smoothing coefficient of described smoothing factor;
The smoothing processing subelement is used for if the optimal smoothing coefficient is proceeded the exponential smoothing of next moment point and processed;
Smoothing factor search subelement is used for if not the optimal smoothing coefficient, carries out the smoothing factor search.
6. data processing equipment according to claim 5, is characterized in that, described pretreatment unit comprises:
The time band is divided subelement, is used for that described historical data information is carried out the time band and divides;
Merging syndrome unit is used for merging verification according to the time band of dividing;
Abnormal syndrome unit is used for carrying out abnormal verification with described through the time band that merges verification, provides abnormal check results;
The abnormality value removing subelement is used for according to described abnormal check results, with rejecting abnormal data.
7. data processing equipment according to claim 6, is characterized in that, described data merge shim, comprising:
Data message receives subelement, is used for receiving described historical data information through rejecting abnormal data;
Data merge subelement, are used for described historical data information phase data are in the same time merged processing;
The data check subelement, whether the data after processing for detection of described merging exist countless certificates on moment point;
The data filling subelement if be used for having countless certificates on moment point, carry out data filling and processes.
8. data processing equipment according to claim 5, is characterized in that, described smoothing factor search subelement comprises:
Optimized model is set up subdivision, is used to described smoothing factor to set up the Optimized model of Prediction sum squares minimum;
The optimal smoothing coefficient obtains subdivision, is used for the Optimized model according to described foundation, obtains described optimal smoothing coefficient.
CN 200910244151 2009-12-30 2009-12-30 Data processing method and device Active CN101794345B (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN 200910244151 CN101794345B (en) 2009-12-30 2009-12-30 Data processing method and device
PCT/CN2010/079716 WO2011079705A1 (en) 2009-12-30 2010-12-13 Data processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 200910244151 CN101794345B (en) 2009-12-30 2009-12-30 Data processing method and device

Publications (2)

Publication Number Publication Date
CN101794345A CN101794345A (en) 2010-08-04
CN101794345B true CN101794345B (en) 2013-05-15

Family

ID=42587035

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 200910244151 Active CN101794345B (en) 2009-12-30 2009-12-30 Data processing method and device

Country Status (2)

Country Link
CN (1) CN101794345B (en)
WO (1) WO2011079705A1 (en)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101794345B (en) * 2009-12-30 2013-05-15 北京世纪高通科技有限公司 Data processing method and device
CN101950483B (en) * 2010-09-15 2013-03-20 青岛海信网络科技股份有限公司 Repairing method and device for traffic data fault
CN102332011B (en) * 2011-09-09 2012-12-26 北京空间飞行器总体设计部 Method for selecting effective data of in-orbit spacecraft
CN102588210B (en) * 2011-12-21 2014-02-12 中能电力科技开发有限公司 Filtering method for preprocessing fitting data of power curve
CN103065041B (en) * 2012-12-18 2016-08-03 湖南大唐先一科技有限公司 A kind of test method of redundant data
CN104679970B (en) * 2013-11-29 2018-11-09 高德软件有限公司 A kind of data detection method and device
CN104865916A (en) * 2015-03-19 2015-08-26 上海航天能源股份有限公司 Natural gas supply data processing method
CN107085943B (en) * 2015-12-23 2020-06-30 青岛海信网络科技股份有限公司 Short-term prediction method and system for road travel time
CN106290729A (en) * 2016-08-09 2017-01-04 成都润泰茂成科技有限公司 A kind of Monitoring Data processing means
CN112380310B (en) * 2020-11-26 2023-12-01 成都新橙北斗智联有限公司 GNSS high-precision anti-aliasing resolving result smoothing method

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1725208A (en) * 2004-07-19 2006-01-25 上海市市政工程管理处 Taffic information processing system for urban through street
CN101325004A (en) * 2008-08-01 2008-12-17 北京航空航天大学 Method for compensating real time traffic information data

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100492434C (en) * 2006-11-30 2009-05-27 上海交通大学 Traffic flow state analysis required detection vehicle sampling quantity obtaining method
CN101794345B (en) * 2009-12-30 2013-05-15 北京世纪高通科技有限公司 Data processing method and device

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1725208A (en) * 2004-07-19 2006-01-25 上海市市政工程管理处 Taffic information processing system for urban through street
CN101325004A (en) * 2008-08-01 2008-12-17 北京航空航天大学 Method for compensating real time traffic information data

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
高慧.短时交通流预测方法综述.《济南大学学报(自然科学版)》.2008,第22卷(第1期),第88-93页. *

Also Published As

Publication number Publication date
CN101794345A (en) 2010-08-04
WO2011079705A1 (en) 2011-07-07

Similar Documents

Publication Publication Date Title
CN101794345B (en) Data processing method and device
CN109117987B (en) Personalized traffic accident risk prediction recommendation method based on deep learning
Myung et al. Travel time prediction using k nearest neighbor method with combined data from vehicle detector system and automatic toll collection system
CN101950477B (en) Method and device for processing traffic information
CN102087788B (en) Method for estimating traffic state parameter based on confidence of speed of float car
CN103293014A (en) Bridge fatigue damage state and residual life evaluating method
CN104318757B (en) Bus bus or train route section Forecasting Methodology working time on a kind of public transportation lane
CN110310474A (en) A kind of vehicle flowrate prediction technique and device based on space-time residual error network
CN103280110B (en) The Forecasting Methodology and device of expressway travel time
CN111047078B (en) Traffic characteristic prediction method, system and storage medium
CN112345956A (en) Battery pack charge state detection method and device
CN109859477A (en) A kind of determination method and apparatus of congestion data
CN105206040A (en) Bus bunching predication method based on IC card data
CN103344395B (en) A kind of confirmation method of bridge strengthening target bearing capacity and device
CN110243598B (en) Train bearing temperature processing method and device and storage medium
CN110837979B (en) Safe driving risk prediction method and device based on random forest
Wang et al. Road surface state recognition based on semantic segmentation
CN115204755B (en) Service area access rate measuring method and device, electronic equipment and readable storage medium
CN101694747B (en) Method and device for indentifying abnormal vehicle speed
CN114463978B (en) Data monitoring method based on track traffic information processing terminal
CN114281808A (en) Traffic big data cleaning method, device, equipment and readable storage medium
CN114997748A (en) New energy automobile operation safety risk prediction method and system based on model fusion
CN102201166A (en) Preprocessing apparatus for floating car data and method thereof
Zhao et al. Travel time prediction based on pattern matching method
CN113222208A (en) Ada-XGboost-based traffic accident prediction system

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant