CN116702078A

CN116702078A - State detection method based on modular expandable cabinet power distribution unit

Info

Publication number: CN116702078A
Application number: CN202310649869.8A
Authority: CN
Inventors: 贾继伟; 邵国栋; 何炬亮; 姚伟军; 陈善民; 郑汉杰; 李兴
Original assignee: Zhejiang Post & Telecommunication Engineering Construction Co ltd; China Telecom Corp Ltd Zhejiang Branch
Current assignee: Zhejiang Post & Telecommunication Engineering Construction Co ltd; China Telecom Corp Ltd Zhejiang Branch
Priority date: 2023-06-02
Filing date: 2023-06-02
Publication date: 2023-09-05
Anticipated expiration: 2043-06-02
Also published as: CN116702078B

Abstract

A state detection method based on a modular expandable cabinet power distribution unit belongs to the technical field of current supply of electric communication technology, and an original non-supervision model U is utilized first _model For the unlabeled second training data set B _t Making predictions to generate virtual tags in advance, and giving the virtual tags a second training data set B _t As a later active learning supervision model S _model Finally, the data set is supervised by the supervision model S _model Responsible for the training and prediction of the active learning thereafter. The proposal adopts initiativeThe state detection of learning is performed by using expert feedback to automatically collect the marks of the samples so as to enhance the accuracy of state detection.

Description

State detection method based on modular expandable cabinet power distribution unit

Technical Field

The invention belongs to the technical field of current supply of electric communication technology, and particularly relates to a state detection method based on a power distribution unit of a modular expandable cabinet.

Background

The power distribution unit of the modular expandable cabinet, as shown in figure 1, consists of a base, a bus, a total incoming line unit, a movable module, an empty panel and the like. Each movable module can be independently plugged and unplugged from the bottom box, can be increased or decreased according to the requirement, and does not influence the communication of other extension module circuits due to the absence of any module. For example, chinese patent publication No. CN112333970a discloses an expandable cabinet power distribution unit.

For another example, in the "power supply network abnormality detection processing system and detection processing method thereof" disclosed in chinese patent publication No. CN111585846a, the processor module obtains the difference of each sampling point and calculates an abnormal component, and when the abnormal component reaches an alarm trigger point, the processor module sends out alarm information.

Therefore, the sensor module and the communication module are arranged in the active module of the power distribution unit, and the power data collected by the sensor is uploaded to the cloud end, which is the prior art in the field.

According to the scheme, the sensor module and the communication module are arranged in the power distribution unit of the modular expandable cabinet, and the power information is uploaded to the cloud, so that the power utilization state of the communication equipment is detected.

Aging or improper use of communication equipment is prone to fire. Therefore, it is necessary to collect power information of the communication equipment, detect the power utilization state, early warn the abnormal state, remind the user to maintain or replace the related equipment, and avoid the damage caused by the fault of the communication equipment.

However, in the system and method for detecting and processing abnormal components of electric power supply network disclosed in chinese patent publication No. CN111585846a, a statistical method is adopted to artificially determine a threshold value for determining the deviation of abnormal components. On the one hand, the setting of the threshold value does not have the real physical meaning behind the abnormality, and on the other hand, the setting of the threshold value does not have a scientific scale; the deviation is not calculated to be abnormal, and an exact standard is difficult to be found.

And for detecting the power utilization state of the communication equipment, non-supervision learning is adopted for classification, and then abnormal quantity is found out. As disclosed in chinese patent publication No. CN109740694a, "a method for detecting non-technical loss of smart grid based on unsupervised learning", which collects the load of power users and classifies them using k-means clustering method, it considers abnormal electricity consumption pattern detection to be essentially a binary classification problem, i.e. all users are classified into two categories: normal users and abnormal users. However, the original data has normal data and abnormal data, and the method ignores the abnormal data which occupies a relatively small amount, and when training the model, all the data are regarded as the normal data for training, and the result after training is not correlated with the initial abnormal data.

Disclosure of Invention

In view of the above-mentioned state of the art and shortcomings, it is an object of the present invention to provide a status detection method based on a modular scalable cabinet power distribution unit.

The state detection method based on the modular expandable cabinet power distribution unit comprises the following steps: step S1, data selection and cleaning: the sensor module is used for collecting power data of the communication equipment, recording and storing a time-series data set at intervals of every 3 minutes; power profile, including the point in time of occurrence and the actual power W;

step S2, converting data: converting the time-series data set into a space vector-series data set; space vector v _t A set of actual powers W recorded for consecutive N time acquisition intervals; n is the width of the space vector;

step S3, a pre-training process:

step S301, training an unsupervised model U _model 。

Half of the data in the data set is used as the training non-supervision model U _model Is not marked in the first training data set A _t The rest data is used as a training supervision model S _model Second training data set B _t The method comprises the steps of carrying out a first treatment on the surface of the Second training data set B _t In the method, the normal sample set is N _maj And the abnormal sample set is O _min The method comprises the steps of carrying out a first treatment on the surface of the From the normal sample set N _maj Randomly selecting H sample sets S _maj Then each sample set S _maj Are respectively matched with the abnormal sample set O _min Mixing to form H mixed samples, wherein the number of normal samples and abnormal samples in each mixed sample is 1:1;

for the first training data set A which is not marked _t Training an unsupervised model U using a unitary classification approach _model ；

Step S302, a mixed sample set is selected and guided into an unsupervised model U _model Predicting virtual tags, recording virtual tags in the virtual tag set PL, and adding the mixed sample set assigned virtual tags to the marked dataset X _t The method comprises the steps of carrying out a first treatment on the surface of the Then use the marked data set X _t Pre-training supervision model S _model Completing the supervision model S _model Initializing;

step S4, active learning feedback flow: supervision model S _model After initialization, selecting the next mixed sample set to the supervision model S _model Performing classification prediction;

step S5, turning to step S4, selecting the next mixed sample set and then entering the supervision model S _model Predictions are made until the mixed sample set is traversed.

Further, the steps ofS2, further comprising the following steps: conversion characteristics: for space vector v in space vector sequence _t Performing a conversion feature, the converted feature comprising:

average power

Maximum difference S _maxdiff ＝max(v _ti )–min(v _ti )；

The first quartile Q1, v _ti A number of 25% ranging from small to large;

second quartile Q2, v _ti A median number ranging from small to large;

third quartile Q3, v _ti A number of 75% ranging from small to large;

the quarter-bit difference iqr=q3-Q1;

standard deviation SD;

maximum data change ratio S _maxratio ＝max(v _ti )/V _tm ；

Minimum data change ratio S _minratio ＝min(v _ti )/V _tm ；

Discrete value S of data _dev ＝S _maxdiff /V _tm ；

Replacing the space vector v with the 10-dimensional features _t Thereby forming a new sequence of space vectors.

Further, step S2 further includes the steps of:

extraction characteristics: and (3) carrying out feature extraction on the 10-dimensional features in the space vector sequence by adopting a principal component analysis method, replacing the extracted new features with the 10-dimensional features to obtain a new space vector sequence, and taking the space vector sequence as a new data set.

Further, step S302, further includes the following steps:

method for calculating unsupervised model U using uncertainty sampling _model Uncertainty of each data in the list, find m which is the most uncertain ₁ Pen data, will be m ₁ And transmitting the data to an expert for updating the labels, and updating the virtual labels in the virtual label set PL according to the updated labels.

Further, step 4, further includes: method for calculating a supervision model S using uncertainty sampling _model Uncertainty of each data in the list, find m which is the most uncertain ₂ Pen data; let m ₂ The data are transmitted to an expert for labeling, and a newly added label is obtained; adding a newly added tag M2 to the marked dataset X _t The method comprises the steps of carrying out a first treatment on the surface of the Then use the marked data set X _t Retraining a supervision model S _model . The proposal firstly utilizes the original non-supervision model U _model For the unlabeled second training data set B _t Making predictions to generate virtual tags in advance, and giving the virtual tags a second training data set B _t As a later active learning supervision model S _model Finally, the data set is supervised by the supervision model S _model Responsible for the training and prediction of the active learning thereafter. It has the following advantages:

according to the scheme, active learning state detection is adopted, and the marks of the samples are automatically collected through feedback of an expert, so that the accuracy of state detection is improved. When the abnormal state of the communication equipment is detected, the abnormal state information is sent to a user for reference so as to carry out relevant maintenance or replacement measures, and the danger caused by the fault of the communication equipment is avoided.

The scheme solves the problem given by the initial label of the data by utilizing the virtual label in the semi-supervised learning, so that the semi-supervised learning scheme can be organically combined with the supervision model of the active learning, and the accuracy of the semi-supervised learning is improved.

If only the supervised model of active learning is used, since there is a limitation in active learning that two kinds of data must be distributed from the beginning, i.e. normal data and outlier abnormal data, and the solution is only a mixed data set of a large amount of normal data and a small amount of abnormal data from the beginning, single active learning is not suitable for the solution.

If the benefits of accuracy improvement of active learning are to be exploited, the supervision model must be trained using real labels. In the scheme, an unsupervised model U is utilized first _model For the second training data set B as unlabeled _t Predicting to obtain initial virtual label for training supervision model S _model Then, the model can be smoothly poured into active learning, retraining is carried out by inquiring uncertain records, and the accuracy of the supervision model is further improved.

Drawings

FIG. 1 is a block diagram of a distribution unit of the present invention;

FIG. 2 is a flow chart of the present invention;

FIG. 3 is a schematic diagram of a data set converted into a space vector sequence;

FIG. 4 is a schematic sampling of a random forest;

FIG. 5 is a diagram of anomaly data;

FIG. 6 is a graph of predicted results versus query times.

Detailed Description

The present invention will be described in further detail with reference to the accompanying drawings.

The power consumption information of the communication equipment has no label, and the traditional method for detecting the state cannot work normally at the moment when the communication equipment has a large amount of normal data or almost no corresponding abnormal data. The scheme solves the problem of data marking by using the concept of virtual marking in semi-supervised learning.

Since the number of samples of the marked data is too small, the model will initially obtain a less accurate classification boundary with low prediction accuracy. The scheme is used for selecting the record with the most influence on the accuracy in the test data, feeding back the record to a user or expert to give the correct mark again, and adding the correct mark into the original training data set to retrain the model. As such, the boundaries of the model classification will become more and more accurate.

FIG. 2 is a flow chart of the present invention; as shown in fig. 2. The state detection method based on the modular expandable cabinet power distribution unit comprises the following steps:

step S1, data selection and cleaning.

And the sensor module is used for collecting the power data of the communication equipment, recording and storing the power data in a database at the time acquisition interval of every 3 minutes.

According to the scheme, in the modular expandable cabinet power distribution unit, the sensor module and the communication module are arranged, and the power information is uploaded to the cloud end, so that the power utilization state of the communication equipment is detected. The structure is the same as that of an abnormality detection processing system of an electric power supply network disclosed in China patent publication No. CN 111585846A.

The database is used for recording the power information of a plurality of row time points, including the occurrence time point, the voltage V, the current A, the power factor PF, the actual power W and the apparent power VA.

The apparent power is the product of current and voltage, the power factor is the ratio of the actual power divided by the apparent power, and is between 0 and 1, and the higher the power factor is, the higher the power consumption efficiency of the communication equipment is. Because the value variation difference between the voltage and the power factor is very small and can be almost regarded as a fixed constant, the scheme adopts the actual power W to match the occurrence time point to form a time series data set.

In this scheme, 36 sensor modules are used, the time range is 2022, 1 month to 11 months, and more than 50 ten thousand records are formed to form a time series data set.

Step S2, converting the data, converting the characteristics and extracting the characteristics.

Converting data: the power consumption abnormality of the communication device is not determinable at a single data point. Thus, in this scheme, the object of investigation of the power consumption state is a set of data points for one period of time.

FIG. 3 is a schematic diagram of a data set converted into a space vector sequence; as shown in fig. 3, the time-series data set is converted into a space vector sequence, which is formulated as follows:

S _n ＝{v ₁ ,v ₂ ,v ₃ ,...,v _t ,...,v _n }；

wherein S is _n When expressedSpace vector sequence with interval continuous length of n, t represents the ordinal number of time acquisition interval and is not less than 1 and not more than n, v _t Representing the space vector recording N actual powers W starting with the t-th time acquisition interval, i.e. v _t ＝{v _t1 ,v _t2 ,...，v _ti ，...,v _tN }，v _t1 Representing a first actual power W recorded at a t-th time acquisition interval; v _ti Representing v _t The i-th actual power W in (a). Preferably, N is 3.ltoreq.N.ltoreq.10.

According to the scheme, state abnormality is predicted through state detection. Abnormality refers to a situation in which the behavior patterns of a small number of individuals in a population do not conform to a defined normal condition, or normal behavior expected with respect to a large number of individuals is referred to as abnormality. Because of the abnormal power consumption of the communication device, it is not a single data point that can be determined. Therefore, the method adopts the set exception, which means that a group of sets consisting of a plurality of record points are needed to form the exception, and the record points in the same set have a dependency relationship. For example, chinese patent publication No. CN109740694a discloses a method for detecting non-technical loss of smart grid based on non-supervised learning, which uses a time sequence and uses a moving average method to continuously analyze the time sequence to obtain a trend index.

Conversion characteristics: if principal component analysis PCA is directly adopted to perform feature extraction on the space vector sequence, the fluctuation of the actual power W is too fine, so that the too fine abnormal power fluctuation cannot be captured. Therefore, it is necessary to first use the space vector v in the space vector sequence _t And converting the characteristics to improve the identification accuracy.

The characteristics after transformation include:

average power

Maximum difference S _maxdiff ＝max(v _ti )–min(v _ti )；

The first quartile Q1, v _ti Number 25% of the small to large arrangementA value;

second quartile Q2, v _ti A median number ranging from small to large;

third quartile Q3, v _ti A number of 75% ranging from small to large;

the quarter-bit difference iqr=q3-Q1;

standard deviation SD;

maximum data change ratio S _maxratio ＝max(v _ti )/V _tm ；

Minimum data change ratio S _minratio ＝min(v _ti )/V _tm ；

Discrete value S of data _dev ＝S _maxdiff /V _tm 。

Replacing the space vector v with the 10-dimensional features _t Thereby forming a new sequence of space vectors. The 10-dimensional feature covers the power characteristics of most communication devices. Because the median eigenvalues are less susceptible to outliers than the power average, the first quartile Q1, the second quartile Q2, the third quartile Q3, and the quartile difference IQR are used to enhance the robustness of the system prediction.

Extraction characteristics: and adopting Principal Component Analysis (PCA), extracting features of 10 dimensions in the space vector sequence, replacing the extracted new features with the features of 10 dimensions to obtain a new space vector sequence, and taking the space vector sequence as a new data set.

Principal component analysis PCA is a statistical method, and in order to explore the correlation degree among a plurality of possible correlation variables, the maximum or minimum correlation direction is searched, and the purposes of data compression or denoising are achieved, the method comprises the following steps: let the variable class be n "; the number of samples for each variable is m';

1) Calculating the mean value of each variable, and then subtracting the mean value (removing the mean value) from each sampling value;

2) Solving covariance matrix, note divided by m "-1 (to get unbiased estimate);

3) And solving covariance matrix eigenvalues and eigenvectors.

Principal component analysis PCA is a conventional technical means and will not be described in detail.

And S3, pre-training the flow.

Step S301, training an unsupervised model U _model 。

Half of the data in the data set is used as the training non-supervision model U _model Is not marked in the first training data set A _t The rest data is used as a training supervision model S _model Second training data set B _t 。

Because the samples with abnormal states are too few and difficult to obtain, the unbalance of the data types can lead to serious bias of the trained prediction results to the majority types, so that the prediction accuracy of the minority types is reduced. Thus, the second training data set B _t It is desirable to solve the problem of data category imbalance.

Second training data set B _t In order, the normal sample set is N _maj The number is |N _maj I, the abnormal sample set is O _min In an amount of |O _min I, and I N _maj |>>|O _min I, then from the normal sample set N _maj Randomly selecting H sample sets S _maj And the number of which is S _maj |＝|O _min I, then each sample set S _maj Are respectively matched with the abnormal sample set O _min Mixing to form H mixed samples, wherein the number of normal samples and abnormal samples in each mixed sample is 1:1; then using the mixed sample set, for the supervision model S _model And (5) predicting.

For the first training data set A which is not marked _t Training an unsupervised model U using a unitary classification approach _model 。

Because the power consumption data of the communication equipment is not labeled at first, but a large amount of normal data and a small amount of abnormal data exist in the data according to the normal distribution assumption, the method is very suitable for adopting a semi-supervised state detection method, namely the problem of unified classification is solved, and therefore, the non-supervised model is trained by adopting the unified classification method. Unitary classification does not take into account the actual labels of the training materials themselves, but instead considers all training materials as the same positive class to train the classification model.

Step S302, a mixed sample set is selected and guided into an unsupervised model U _model Predicting virtual tags, recording the virtual tags in a virtual tag set PL, and calculating an unsupervised model U by using an uncertainty sampling method _model Uncertainty of each data in the list, find m which is the most uncertain ₁ Pen data, will be m ₁ The data are transmitted to an expert for updating the labels, and the virtual labels in the virtual label set PL are updated according to the updated labels; finally adding the mixed sample set endowed with the virtual tag to the marked data set X _t The method comprises the steps of carrying out a first treatment on the surface of the Then use the marked data set X _t Pre-training supervision model S _model Completing the supervision model S _model Initializing.

The mark is the mark in the data that defines whether the data point is normal or abnormal.

Because of the unsupervised model U _model The predicted virtual marking accuracy is limited, so the proposal adopts the mixed sample set of virtual labels as the first training data set for the supervision model S _model Pre-training is carried out; then, go through the supervision model S _model The predicted tag updates the virtual tag, thereby improving the accuracy of the virtual tag.

Step S4, actively learning the feedback flow.

Step S401, supervision model S _model Retraining and predicting.

Supervision model S _model After initialization, selecting the next mixed sample set to the supervision model S _model And (5) performing classification prediction to obtain a mark.

FIG. 4 is a schematic sampling of a random forest; as shown in fig. 4. According to the scheme, a random forest supervised learning algorithm is used for the mixed sample, different decision tree models are trained, and then the classification results are predicted jointly by the different decision tree models in a voting mode.

The random forest is a supervised learning algorithm, the mixed sample is trained into a plurality of different decision tree models, all the decision tree models are used for jointly predicting classification results of new samples in a voting mode, and the construction process is as follows:

1. obtaining m 'mixed samples as m' training sets;

2. respectively training m 'decision tree models for m' training sets;

3. for a single decision tree model, assuming that the number of training sample features is n', selecting the best features for splitting according to information gain, information gain ratio or a coefficient of a radix when splitting each time;

4. and forming a random forest by the generated multiple decision trees, and determining the final classification result according to the voting of the multiple tree classifiers for the classification problem.

As a plurality of different decision tree models exist, the normal sample set can be almost distributed into training samples of different models, so that the problem of information loss is avoided.

Step S402, the flow of the query and expert interaction of the material is not determined.

Method for calculating a supervision model S using uncertainty sampling _model Uncertainty of each data in the list, find m which is the most uncertain ₂ Pen data; let m ₂ The data are transmitted to an expert for labeling, and a newly added label is obtained; adding a newly added tag M2 to the marked dataset X _t . Then use the marked data set X _t Retraining a supervision model S _model 。

Uncertainty samples (Uncertainty Sampling) are unlabeled samples that are used to identify the vicinity of decision boundaries in the current learning model. The sample of the most uncertainty of the model may be data near the classification boundary; by looking at the samples where these classifications are the most difficult to obtain more information about class boundaries, their predictive labels can be sampled with the least confidence. Uncertainty sampling, which is common knowledge, can be referred to as: uncertainty-based active learning algorithm study [ D ], [ Wang Zhen ], university of Hebei, 2011.

Experiments prove that the scheme is as follows:

the TPR index true positive rate indicates the proportion of all abnormal data that is correctly identified as abnormal, and the higher the TPR index is, the better the TPR index is. The FPR index false positive rate indicates the proportion of the data which is erroneously determined to be abnormal to all normal data, and a high FPR index indicates a high probability of being erroneously determined to be abnormal, so that the lower the FPR index is, the better.

Firstly, using a part of data, taking the electric power data of a cooling fan of one communication device as an experiment, taking more than about 11 ten thousand data, adopting k-fold cross validation, and taking half of the data as an unsupervised model U according to an experiment plan _model Is not marked in the first training data set A _t The other half is used as a second training data set B _t And the method is divided into k equal parts in average, and each part is mixed with a certain proportion of abnormal data to obtain a mixed sample set, wherein one equal part is used for predicting the virtual tag, and the rest equal parts are used for the query flow of each active learning. The experiment improves the accuracy of the model by importing k-fold cross-validation by cycling different k aliquots. The mixed sample set must have data covering the actual anomalies so that the classification model can correctly find the anomalies.

FIG. 5 is a diagram of anomaly data; as shown in fig. 5. In this experiment, the rotation speed of the electric fan is reduced to be abnormal by increasing the friction force in a manner of inserting the foreign matters, and the foreign matters are inserted at different time points for two times, so that the power of the electric fan can be observed to rise from the original average 52w to 65w in fig. 5, and then each record slowly drops along with the time, so that abnormal data of rising before falling is obtained.

Number of inquiry data (m) in active learning ₁ +m ₂ ): querying 50 samples; m of queries ₁ Pen comes from unsupervised model U _model To correct the false positive virtual tag to increase accuracy, and the other half from each new test data.

Fig. 6 is a correlation diagram of the predicted result and the number of queries, as shown in fig. 6. At the beginning of the experiment, an unsupervised model of a unitary classification algorithm is adopted, the TPR index of a predicted result is 0.71, the FPR index is 0.049, the influence of the accuracy of the virtual tag is received, and the TPR index and the FPR index at the beginning of the active learning model are distributed almost in the vicinity of the range. It can be seen that the higher the accuracy of the virtual labels predicted by the unsupervised model, the later active learning model can reach convergence with fewer queries.

It will be understood that equivalents and modifications will occur to those skilled in the art in light of the present invention and their spirit, and all such modifications and substitutions are intended to be included within the scope of the present invention as defined in the following claims.

Claims

1. The state detection method based on the modular expandable cabinet power distribution unit is characterized by comprising the following steps of:

step S1, data selection and cleaning: the sensor module is used for collecting power data of the communication equipment, recording and storing a time-series data set at intervals of every 3 minutes; power profile, including the point in time of occurrence and the actual power W;

step S3, a pre-training process:

step S301, training an unsupervised model U _model The method comprises the steps of carrying out a first treatment on the surface of the Half of the data in the data set is used as the training non-supervision model U _model Is not marked in the first training data set A _t The rest data is used as a training supervision model S _model Second training data set B _t The method comprises the steps of carrying out a first treatment on the surface of the Second training data set B _t In the method, the normal sample set is N _maj And the abnormal sample set is O _min The method comprises the steps of carrying out a first treatment on the surface of the From the normal sample set N _maj Randomly selecting H sample sets S _maj Then each sample set S _maj Are respectively matched with the abnormal sample set O _min Mixing to form H mixed samples, wherein the number of normal samples and abnormal samples in each mixed sample is 1:1; for a pair ofIn the first untagged training data set A _t Training an unsupervised model U using a unitary classification approach _model ；

2. The method for detecting a status of a power distribution unit of a modular expandable rack according to claim 1, wherein step S2 further comprises the steps of: conversion characteristics: for space vector v in space vector sequence _t Performing a conversion feature, the converted feature comprising: average power

Maximum difference S _maxdiff ＝max(v _ti )–min(v _ti )；

The first quartile Q1, v _ti A number of 25% ranging from small to large;

second quartile Q2, v _ti A median number ranging from small to large;

third quartile Q3, v _ti A number of 75% ranging from small to large;

the quarter-bit difference iqr=q3-Q1;

standard deviation SD;

maximum data change ratio S _maxratio ＝max(v _ti )/V _tm ；

Minimum data change ratio S _minratio ＝min(v _ti )/V _tm ；

Discrete value S of data _dev ＝S _maxdiff /V _tm ；

3. The method for detecting a status of a power distribution unit of a modular expandable rack according to claim 2, wherein step S2 further comprises the steps of:

4. The method for detecting the status of a power distribution unit of a modular expandable rack according to claim 1 or 3, wherein step S302 further comprises the steps of:

5. The method of claim 1, wherein step 4 further comprises: method for calculating a supervision model S using uncertainty sampling _model Uncertainty of each data in the list, find m which is the most uncertain ₂ Pen data; let m ₂ The data are transmitted to an expert for labeling, and a newly added label is obtained; adding a newly added tag M2 to the marked dataset X _t The method comprises the steps of carrying out a first treatment on the surface of the Then use the marked data set X _t Retraining a supervision model S _model 。