Summary of the invention
In order to overcome the defect of prior art, the object of the invention is to propose a kind of high ferro electric energy quality monitoring data classification method based on K average, first the method is isolated the power quality data that only has a high ferro operation on supply arm, then working time, power quality index and the statistical value thereof that utilizes high ferro as sample data normalization after, carry out K mean cluster, finally obtain the classification of the electric energy quality monitoring data of different automobile types.This invention is the quality of power supply characteristic after further studying certain vehicle and putting into operation, and assesses the impact that it brings to electrical network, and then takes positive counter-measure for electrical network and provide the foundation.
The present invention is achieved through the following technical solutions:
A high ferro electric energy quality monitoring data classification method based on K average, is characterized in that, the method comprises the steps:
(1) obtain the electric energy quality monitoring data of high ferro Traction Station by online electric energy quality monitor;
(2) electric energy quality monitoring data when foundation has car operation and high ferro, in the working time of supply arm, filter out the power quality data that only has a car operation on supply arm;
(3) for the power quality data that only has a car operation on supply arm, calculate the working time of a high ferro and the statistical value of power quality index;
(4) using the statistical value of the working time in step (3) and power quality index as sample data, and be normalized;
(5) sample data after being normalized is carried out to K mean cluster, to obtain the electric energy quality monitoring Data classification of different automobile types.
Further, in step (1), the electric energy quality monitoring data of described high ferro Traction Station comprise voltage deviation, electric current, negative-sequence current, frequency departure, active power, reactive power, electric weight, harmonic wave, a harmonic wave, phase angle, voltage fluctuation and flicker, tri-phase unbalance factor, voltage swell, voltage dip and short time voltage interruption etc.
Further, in step (2), described in have the electric energy quality monitoring data in car when operation to comprise active power, negative-sequence current and phase angle etc.
Further, in step (2), described in the power quality data that filters out while only having a car operation on supply arm comprise the steps:
(a), when the electric energy quality monitoring data in the time having car operation are greater than pre-set threshold value simultaneously, being considered as supply arm has the moment of car, has the moment of car to sort from front to back supply arm and obtains the time series of car period;
(b) according to there being car time series, calculate each time span that has continuously the car period;
(c) judge each time span that has continuously a car period and the size of short transit time, to there being continuously the car period to divide;
(d) judge each time span that has continuously a car period and the size of long transit time, by have continuously the car period be divided into the operation of many cars have the car period and only have a car operation have a car period, and then draw the power quality data while only having a car to move on supply arm.
Further, in step (3), described power quality index comprises content and the number of times etc. of active power, reactive power, electric current, negative-sequence current and harmonic wave; The statistical value of described power quality index comprises maximal value, minimum value, average and the variance etc. of power quality index.
Further, in step (3), the working time of a described high ferro is according to described corresponding of power quality data while only having the operation of car, only the absolute difference in moment obtains;
Suppose that power quality index p is P by the vector of Time alignment, P=[p (1), p (2), ..., p (i) ..., p (n)], wherein p (i) and p (n) are respectively the power quality index of i and n sampling instant, and the maximal value of described power quality index index p is p
max=max (P), minimum value are p
min=min (P), mean value are
variance is
Further, in step (4), the statistical value of the working time of a high ferro and power quality index is mapped to [0 ,+1] interval, sets X '
i=(x '
i, l, x '
i, 2..., x '
i, j... x '
i, M) be i sample data, tool is M eigenwert altogether, by following formula, sample data is normalized to calculating:
In formula, x '
i, jj eigenwert of i sample data, x
i, jit is the normalized value of j feature of i sample data.
Further, in step (5), K-mean cluster process is as follows:
(a) from sample data, choose 4 samples, as the initial cluster center of 4 bunches;
(b) sample data is distributed to the most contiguous cluster centre according to minimal distance principle;
(c), according to cluster result, recalculate sample average in each cluster as new cluster centre;
(d) repeating step b and c until cluster centre no longer change.
(e) cluster finishes, and obtains 4 bunches.
Compared with prior art, beneficial effect of the present invention is:
1) the present invention is according to high ferro Traction Station electric energy quality monitoring data, provides a kind of simple and efficient by the method that only has the power quality data of a high ferro operation to separate on certain supply arm; Utilize K means clustering algorithm, completed the power quality data classification for high ferro vehicle.
2) the present invention is that the power quality problem bringing to electrical network in certain type high ferro operational process of research provides the foundation.In addition, institute of the present invention extracting method also can carry out electric energy quality monitoring Data classification for operating mode (comprising load braking, startup, stable operation etc.), for the quality of power supply specificity analysis of the further high iron load of refinement provides basis.
Embodiment
Below in conjunction with accompanying drawing, the specific embodiment of the present invention is described in further detail.
In this example, set forth a kind of high ferro electric energy quality monitoring data classification method based on K average, its process flow diagram as shown in Figure 1, comprises the steps:
(1) obtain the electric energy quality monitoring data of high ferro Traction Station by online electric energy quality monitor.
High ferro traction power supply station electric energy quality monitoring packet is containing active power (P), reactive power (Q), electric current (I), harmonic wave (I
h, I
1represent fundamental current), a harmonic wave (I
ih), voltage (merit, flickering (P
st), negative phase-sequence (μ
2) etc. index.
(2) electric energy quality monitoring data when foundation has car operation and high ferro, at the transit time of supply arm, filter out the power quality data that only has a car operation on supply arm, and its specific implementation process as shown in Figure 2.
First I the present invention has the car period and carries out preliminary classification without the power quality data of car period supply arm for indexs such as active power, negative-sequence current and the phase angles of the quality of power supply.
According to there being high ferro to enter after supply arm, using some index of Monitoring Data such as power, negative-sequence current and phase angle are greater than predetermined threshold value as criterion simultaneously, obtain supply arm moment t1 (the i) (i=1 that takes up an official post, 2, n, n represents sampled point number) have a car situation: have car y=1 and without car y=0.The time series t (j) of car period will there be is moment corresponding to car y=1 situation to be arranged in order to be formed as.
Then II according to there being car time series, calculates each time span that has continuously the car period.
After utilizing, be a bit whether t (j+1)-t (j) is greater than 10s as criterion with the more front mistiming, find and on supply arm, have continuously the initial time t_b of car period and stop moment t_e.Detailed process is as follows: if t (j+1)-t (j) is <10s, j=j+1, enters next point and judge; If t (j+1)-t (j) >=10s, think that t (j) is n and has continuously the termination moment t_e (n) of car period, t (j+1) is n+1 the initial time t_b (n+1) of car period continuously.
Therefore, there is continuously the difference between the initial time t_b (n+1) of car period can try to achieve each time span that has continuously the car period according to a certain termination moment t_e (n) that has continuously a car period and this.
III has the division of car period continuously.
High ferro the normal transit time of supply arm be probably 4min between 6min, the shortest transit time is 4min, the longest run time is 6min.In order to obtain the more accurate complete car period that has continuously, the present invention utilizes the shortest transit time 4min of high ferro on supply arm further to judge.Concrete operations are as follows: calculating n has the time span t_interal=t_e-t_b of car period, if t_interal >=4min, representing this n, to have the car period be complete; If t_interal<4min, connect thereafter have a car period, form the new car period that has, until this new time span that has the car period is greater than the shortest transit time 4min, what this was new like this have, and the car period is also complete has a car period.
IV finally by have continuously the car period be divided into the operation of many cars have the car period and only have that a car moves have a car period.
In order to obtain only having the power quality data of a high ferro operation, the present invention utilizes the maximum duration of single high ferro operation as criterion, if there is the time span > 6min of car period, that thinks many cars operations has a car period; If there is time span≤6min of car period, that thinks the operation of car has a car period, and the power quality data that this period is corresponding, is exactly the power quality data that only has a high ferro operation that the present invention will obtain.
(3) power quality data when only having the operation of high ferro on supply arm, calculates working time, power quality index and the statistical value thereof of a high ferro.
Wherein can subtract each other acquisition according to the obtained corresponding start-stop of the power quality data moment that only has a high ferro operation working time of a car.Power quality index mainly refers to the indexs such as power, electric current, harmonic content, overtone order, negative-sequence current.The statistical value of power quality index comprises maximal value, minimum value, average, variance of power quality index etc.What suppose power index p is P by the vector of Time alignment, P=[p (1), p (2), ..., p (i) ..., p (n)], wherein p (i) and p (n) are respectively the power quality index value of i and n sampling instant, and the maximal value of this index p is p
max=max (P), minimum value are p
min=min (P), mean value are
Variance is
(4) using working time, power quality index and statistical value thereof as sample data, be normalized.
For improving the generalization ability of model, the time of minimizing procedural training, sample data is carried out to normalized.In this example, the working time of all high ferros, power quality index and statistical value thereof are mapped to [0 ,+1] interval, suppose X '
i=(x '
i, 1x '
i, 2' ..., x '
i, m) be i sample data, there is x '
i, 1, x '
i, 2..., x '
i, mm dimension variable, i.e. m eigenwert altogether.
Normalization formula is:
In above formula, x '
i,jj eigenwert of i sample data, x
i, jbe the normalized value of j feature of i sample data, i sample data after normalization is X
i=x
i, 1, x
i, 2..., x
i, m).
(5) carry out K-mean cluster
(5a), according to vehicle: bullet train 8 is organized into groups, bullet train 16 is organized into groups, medium trains 8 is organized into groups and medium trains 16 is organized into groups, the power quality data during by car operation is only divided into four classes;
(5b) choose 4 initial cluster center V
i(1), i=1,2 ..., the sequence number of 4, i representation class, parenthetic 1 represents primary iteration operation times;
For example: 4 initial cluster centers can be chosen to obtain comparatively according to actual conditions and disperse, such as 16 marshaling power are generally 2 times of 8 marshaling power, so just can go out to select initial cluster center at 1 times power and 2 times powers respectively.
(5c) one by one by data sample (X to be clustered
1, X
2..., X
j..., X
n) distribute to the some V in 4 cluster centres by minimum distance criterion
i(t).Calculate j data sample X
jwith 4 cluster centre V
i(t), i=1,2 ..., 4 distance d (X
j, V
i(t)), i=1,2,3,4, if d is (X
j, V
i(t)) minimum, X
jbe i class, be designated as X '
j;
Wherein, X
jand X
nrepresent a j and N data sample; T represents interative computation number of times, is initially 1; The sequence number of i representation class, i=1,2 ..., 4;
(5d) calculate 4 new cluster centres:
.C
irepresent the number of samples of i class.
If (5e) V
i(t+1)=V
i(t), i=1,2 ..., 4, algorithm finishes, and the most at last sample data to gather be four classes, obtain four class sample datas; Otherwise t=t+1, returns to step (5c).
(6) the four class cluster sample datas that obtain in step (5) are exactly the electric energy quality monitoring data of four kinds of different automobile types.
Finally should be noted that: above embodiment is only in order to illustrate that technical scheme of the present invention is not intended to limit, although the present invention is had been described in detail with reference to above-described embodiment, those of ordinary skill in the field are to be understood that: still can modify or be equal to replacement the specific embodiment of the present invention, and do not depart from any modification of spirit and scope of the invention or be equal to replacement, it all should be encompassed in the middle of claim scope of the present invention.