WO2020125166A1 - Power grid anticipated fault set prediction method and apparatus, and electronic device and storage medium - Google Patents

Power grid anticipated fault set prediction method and apparatus, and electronic device and storage medium Download PDF

Info

Publication number
WO2020125166A1
WO2020125166A1 PCT/CN2019/110941 CN2019110941W WO2020125166A1 WO 2020125166 A1 WO2020125166 A1 WO 2020125166A1 CN 2019110941 W CN2019110941 W CN 2019110941W WO 2020125166 A1 WO2020125166 A1 WO 2020125166A1
Authority
WO
WIPO (PCT)
Prior art keywords
sample
feature variable
variable set
section
target
Prior art date
Application number
PCT/CN2019/110941
Other languages
French (fr)
Chinese (zh)
Inventor
张越
张佳楠
李如意
单连飞
吕宏伟
余建明
刘艳
卓峻峰
张连超
Original Assignee
北京科东电力控制系统有限责任公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京科东电力控制系统有限责任公司 filed Critical 北京科东电力控制系统有限责任公司
Publication of WO2020125166A1 publication Critical patent/WO2020125166A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Definitions

  • the present invention relates to the technical field of power grid analysis, and in particular, to a method, device, electronic equipment, and storage medium for predicting a fault set of a power grid.
  • the predicted fault set generation technology in static safety analysis is a necessary means to prevent power grid faults and simulation drills; at present, the predicted fault set generation mainly depends on the experience of the dispatcher to determine, but this method takes a long time and is blind. A large number of actual faults are missed.
  • the operation mode of the power grid is changed and new equipment is connected, it is necessary to manually adjust the expected fault set.
  • the fault set cannot be adaptively added, deleted, changed, or checked, nor can the influence of bad weather on the power grid be considered. Therefore, there is an urgent need to quickly and accurately predict the fault set generation method, get rid of the dependence of the expected fault set generation on the dispatcher's experience, and improve work efficiency.
  • the present invention provides a method for automatically generating a fault set, an electronic device and a storage medium to improve the above problems.
  • an embodiment of the present invention provides a method for predicting a predicted fault set of a power grid, including: acquiring a target cross-section feature variable set of a preset feature variable type of a target power grid cross-section; and determining the target in a preset sample library A sample feature variable set matched with a cross-section feature variable set; wherein, the sample library includes a correspondence between a sample fault set and a sample feature variable set; the feature variable type of the sample feature variable set is the same as the preset feature variable type; It is determined that the sample fault set corresponding to the matched sample feature variable set is the predicted fault set of the target power grid cross section.
  • the method before acquiring the target cross-section feature variable set of the preset feature variable type of the target grid cross-section, the method further includes: acquiring multiple grid variables as the initial feature variable set; A preset feature variable evaluation rule filters the initial feature variable set; it is determined that the type of feature variable included in the filtered initial feature variable set is the preset feature variable type.
  • the filtering of the initial feature variable set according to a preset feature variable evaluation rule includes: obtaining all feature variables of each sample feature variable set in the sample library; based on the Relief algorithm To determine the weight value of each feature variable in the initial feature variable set; delete the feature variables in the initial feature variable set whose weight value is lower than the preset threshold.
  • the method further includes: acquiring at least one initial sample library by at least one acquisition method, and combining at least one initial sample library to obtain a static sample library, and the initial sample library includes samples of each sample The fault feature set and the sample feature variable set corresponding to the sample fault set; the static sample library is filtered according to the preset sample evaluation rules; and the filtered static sample library is determined as the sample library.
  • the method before a sample feature variable set matching the target cross-section feature variable set is determined in the preset sample library, the method further includes: setting the target cross-section feature variable set and the sample library The initial evaluation value of the sample feature variable set in; update the initial evaluation value of the sample in the static sample library according to the result of the target cross-section feature variable set on the predicted fault set, where the prediction result includes: correct, wrong and irrelevant; Delete the sample feature variable set corresponding to the evaluation value lower than the preset evaluation threshold and the sample fault set corresponding to the sample feature variable set in the static sample library to obtain the sample library.
  • the determining a sample feature variable set that matches the target cross-section feature variable set in a preset sample library includes: comparing the target cross-section feature variable set Obtain the similarity between the target grid section and each sample in the sample library with the sample feature variable set in the sample library; determine the sample feature variable set with the highest similarity as the target section feature The sample feature variable set matching the variable set.
  • a sample feature variable set matching the target cross-section feature variable set is determined in a preset sample library, including: normalizing the target cross-section feature variable set and the sample feature variable Each feature variable in the set; calculate the classification distance between the target cross-section feature variable set and each sample feature variable set in the sample library based on the k-proximity algorithm; correspondingly, determine the sample feature variable set with the highest similarity as the target cross-section
  • the sample feature variable set matched by the feature variable set includes: determining the sample feature variable set with the shortest classification distance as the sample feature variable set with the highest similarity.
  • an embodiment of the present invention further provides a device for predicting a predicted set of power grid faults, including: a receiving module for acquiring a target profile feature variable set of a preset feature variable type of a target grid profile; a processing module for predicting A sample feature variable set matching the target cross-section feature variable set is determined from the set sample database, and a sample fault set corresponding to the matched sample feature variable set is determined as the predicted fault set of the target power grid cross-section; the sending module , Used to send the predicted fault set to the staff.
  • an embodiment of the present invention further provides an electronic device, the device including:
  • a memory and a processor The memory is used to store a fault prediction instruction, and the processor is used to run the fault prediction instruction to execute the power grid expected fault set prediction method described in any one of the implementation manners of the first aspect.
  • an embodiment of the present invention further provides a computer-readable storage medium, where the computer program is stored in the readable storage medium, and when the computer program runs on a computer, the computer is executed as in the first aspect
  • the method for predicting the expected fault set of the power grid described in any implementation.
  • the sample failure corresponding to the matched sample feature variable set in the sample library is obtained Set, and then predict the type of failure that occurs in the current grid section, instead of relying on the experience of dispatchers to analyze based on the grid variables of the current grid section and sample failures in the sample library, on the one hand, improve the efficiency of the staff, on the other hand
  • the analysis of the current grid section is more complete.
  • FIG. 1 is a schematic flowchart of a method for predicting an expected fault set of a power grid according to an embodiment of the present invention
  • FIG. 2 is a schematic flowchart of a process of optimizing and filtering to obtain preset feature variable types according to an embodiment of the present invention
  • FIG. 3 is a schematic flowchart of a method for generating a sample library according to an embodiment of the present invention
  • FIG. 4 is a functional block diagram of a device for predicting a fault set of a power grid according to an embodiment of the present invention.
  • the indicated orientation or position relationship is based on the orientation or position relationship shown in the drawings, or the orientation or position relationship conventionally placed when the product of the invention is used, or this
  • the orientation or positional relationship commonly understood by those skilled in the art, or the orientation or positional relationship normally placed when the product of the invention is used is only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the device or element It has a specific orientation, is constructed and operated in a specific orientation, and therefore cannot be understood as a limitation of the present invention.
  • the terms “first”, “second”, “third”, etc. are only used to distinguish the description and cannot be understood as indicating or implying relative importance.
  • FIG. 1 is a schematic flowchart of a method for predicting a fault set of a power grid according to an embodiment of the present invention. The method includes steps S101-S103:
  • Step S101 Acquire a target profile feature variable set of a preset feature variable type of the target grid profile
  • the sample library includes the corresponding relationship between the sample fault set and the sample characteristic variable set, and the type of the characteristic variable of the sample characteristic variable set is the same as the type of the preset characteristic variable, and the characteristics of the target section characteristic variable set
  • the variables include all variables in the power grid, such as bus voltage, branch power, etc., which is not limited in this embodiment.
  • Step S102 Determine a sample feature variable set matching the target cross-section feature variable set in a preset sample library
  • the target cross-section feature variable set contains the same feature category as the sample feature variable set.
  • the sample features are set in the preset sample library according to the preset matching rules.
  • the variable set and the target cross-section feature variable set are compared to determine a sample feature variable set that matches the target cross-section feature variable set.
  • Step S103 Determine the sample fault set corresponding to the matched sample feature variable set as the predicted fault set of the target power grid cross section.
  • the sample library By using the sample library, the multiple target features of the target grid section and the sample features of each sample in the sample library are compared, and then according to the matching result, the sample fault set corresponding to the matched sample feature variable set in the sample library is determined to be the current section Predict the fault set; thus replacing the existing way of predicting the current grid section fault set through staff experience, on the one hand, it improves the efficiency of the staff, on the other hand, it can more comprehensively set the fault corresponding to the grid section Set to prevent the issue of major failures from being missed.
  • the grid variables may be optimized and screened to obtain the preset characteristic variable types, and the characteristic variables that can more accurately obtain the fault set can be selected and added to the preset characteristic variable types.
  • FIG. 2 is a schematic flowchart of obtaining a preset feature variable type by optimized screening according to an embodiment of the present invention.
  • the method for screening feature variable types includes:
  • Step S201 Acquire multiple grid variables as initial feature variable sets
  • the multiple grid variables may include all grid variables that can be obtained or detected in the grid section, and then through screening, without worrying about the existence of many meaningless features, appropriate grid variables can be derived as the judgment characteristics.
  • Step S202 Filter the initial feature variable set according to a preset feature variable evaluation rule
  • the preset characteristic variable evaluation rule may adopt one or more of principal component analysis method, kernel principal component analysis method, partial least square method and Relief algorithm.
  • the method of acquiring the preset feature variable types includes: acquiring all feature variables of each sample feature variable set in the sample library; based on the Relief algorithm, determining the weight value of each feature variable in the initial feature variable set; Delete the feature variables in the initial feature variable set whose weight value is lower than the preset threshold.
  • the variable, y i is the sample fault set.
  • N is the number of samples and x i (p) is the p-th feature variable
  • is the threshold, in this example, the weight value of the feature variable with a large weight value and relatively stable is selected as the threshold value.
  • the principal component analysis method or the like may also be used to implement the selection of characteristic variables, which is not limited in this embodiment.
  • Step S203 Determine the filtered initial feature variable set as the determined feature variable set.
  • the influence of unrelated grid variables on the prediction of the target fault set can be reduced and the accuracy of the prediction can be improved.
  • this embodiment also proposes a method for generating the sample library, which is used to collect existing fault set sample acquisition methods, and screen the samples to generate the sample library.
  • FIG. 3 is a schematic flowchart of a method for generating the sample library according to this embodiment.
  • the steps of the method for generating the sample library include steps S301-S303:
  • Step S301 At least one initial sample library is acquired by at least one collection method, and the at least one initial failure set sample set is combined to obtain a static sample library;
  • the at least one acquisition method includes: static safety analysis N-1, N-2 methods, typical expected fault sets, and a variety of power grid analysis methods combined with the experience of power grid dispatchers.
  • N-1 static safety analysis
  • N-2 typical expected fault sets
  • a variety of power grid analysis methods can be used Collection mode to more comprehensively cover the information in the grid operation mode, which is not limited in this embodiment
  • the initial sample library includes the initial sample failure set of each sample and the initial sample failure corresponding to the initial sample failure set Feature variable set.
  • Step S302 Filter the static sample library according to a preset sample evaluation rule
  • the preset sample evaluation rule is used to remove valueless samples in the static sample library, and screen out samples in the static sample library that are more capable of determining the predicted fault set.
  • the preset sample evaluation rules include: a method based on credibility, a method based on the uniformity of the results of multiple classifiers, and a group-based incremental learning algorithm.
  • an implementation method for screening the static sample library based on a group-based incremental learning algorithm is: setting an initial evaluation of the target cross-section feature variable set and the sample feature variable set in the sample library Value; and update the initial evaluation value of the sample in the static sample library according to the result of the target cross-section feature variable set on the predicted fault set, where the predicted result includes: correct, wrong, and irrelevant; And deleting the sample feature variable set corresponding to the evaluation value below the preset evaluation threshold and the sample fault set corresponding to the sample feature variable set in the static sample library, thereby obtaining the sample library.
  • the samples are updated, and when the threshold is set to q 0 , the samples whose evaluation value is lower than q 0 in the sample library are removed to reduce the valuelessness or reduce the prediction accuracy Of samples to improve prediction accuracy.
  • k is 5
  • the learning rate parameter ⁇ is 0.9
  • the initial evaluation value q 0 is 0.5.
  • Step S303 Determine the screened static sample library as the sample library.
  • the new sample failure set can be continuously added, and the upper limit of the sample quantity of the sample library can be set, and the sample can be eliminated according to the preset sample evaluation rule
  • the most valuable samples in the library are used to continuously optimize the sample library.
  • the method includes: acquiring and adding new samples to the sample library, and then obtaining the preset upper limit of the number of samples in the sample library; Whether the number of samples in the sample library is less than or equal to the upper limit of the number of samples, and if not, delete the samples in the sample library according to the preset sample evaluation rule until the number of samples in the sample library is less than or equal to the Maximum sample size.
  • the new fault set sample may be automatically added to the sample library after each target grid section prediction is completed and the result of the prediction is correct or not, according to the preset sample evaluation rules in step S302 Screening to achieve dynamic optimization, the dispatcher may also input some new fault set samples based on experience, this embodiment does not limit this.
  • the upper limit of the sample size can be set by setting multiple upper limit of the sample size. After multiple predictions, the prediction accuracy of each upper limit of sample size is compared. Finally, the upper limit of the sample size with the highest prediction accuracy is selected as the upper limit of the sample size Similarly, the dispatcher can also make a selection according to experience or according to the processing capacity of the application processor, which is not limited in this embodiment.
  • multiple sample caps can be used to make simultaneous predictions. Based on the prediction results of the sample caps with multiple sample caps, such as selecting the most frequent fault set in the prediction results, this implementation Examples also do not limit this.
  • step S102 After obtaining the preset feature variable type and sample library, the step of determining the sample feature variable set matching the target cross-section feature variable set in step S102 may be performed.
  • one way to determine the matching sample feature variable set is to compare the target cross-section feature variable set of the target power grid cross-section with the sample feature variable set in the sample library to obtain the target power grid cross-section
  • the similarity between each sample in the sample library; determining that the sample fault set corresponding to the sample feature variable set with the highest similarity is the predicted fault set of the target power grid cross section.
  • a neural network algorithm or deep learning algorithm can be used to compare each feature variable of the target cross-section feature variable set with each of the sample feature variable sets to obtain a sample feature variable set with the highest similarity It is the matching sample feature variable set.
  • another implementation of determining the matching sample feature variable set may use the k-proximity algorithm, specifically: normalizing each feature variable in the target cross-section feature variable set and the sample feature variable set; and Calculate the classification distance between the target cross-section feature variable set and each sample feature variable set in the sample library based on the k-proximity algorithm; correspondingly, determine the sample feature variable set with the highest similarity as the sample feature variable matching the target cross-section feature variable set Sets include: determining the sample feature variable set with the shortest classification distance as the sample feature variable set with the highest similarity.
  • the characteristic variable, y j is the sample fault set.
  • the normalization formula is: among them, Is the value corresponding to the feature after normalization, x j is the value corresponding to the feature before normalization, and max(x j ) and min(x j ) are the types of the feature variable The maximum and minimum values after conversion.
  • the k-proximity algorithm is used to calculate the classification distance between the target section feature variable set and each sample feature variable set in the sample library.
  • the classification distance is calculated using the Euclidean distance.
  • the formula is: Where do o,j is the classification distance between the target cross-section feature variable set and the jth sample, H is the number of feature variables of the preset feature variable type, x on is the n-dimensional variable of the target feature variable set, x jn is The nth dimension variable of the jth sample.
  • the k sample feature variable sets closest to the target feature variable set in the sample library are obtained, and the k sample feature variable sets and the target are calculated
  • the average value of the distance of the characteristic variable set Determine the classification distance from k sample feature variable sets to the average
  • step S103 is executed, and according to the corresponding relationship between the sample fault set and the sample feature variable set in the sample library, the sample fault set corresponding to the matched sample feature variable set is determined as the predicted fault set, and completed Prediction of faults on the target power grid section.
  • FIG. 4 is a functional block diagram of a device 40 for predicting a grid fault set according to this embodiment.
  • This embodiment also proposes a device 40 for predicting the expected set of power grid faults.
  • the device includes: a receiving module 401, a processing module 402, and a sending module 403, wherein the receiving module 401 is used to obtain the preset characteristic variable types of the target power grid section The target cross-section feature variable set; the processing module 402 is used to determine a sample feature variable set matching the target cross-section feature variable set in a preset sample library, and determine a sample fault set corresponding to the matched sample feature variable set Is the predicted fault set of the target power grid section; the sending module 403 is used to send the predicted fault set to the staff.
  • the receiving module 401 is also used to obtain multiple grid variables as an initial feature variable set; correspondingly, the processing module 402 is also used to filter the initial feature variable set according to a preset feature variable evaluation rule and determine the filtered
  • the initial feature variable set is the preset feature variable type.
  • the receiving module 401 is also used to obtain all feature variables of each sample feature variable set in the sample library; correspondingly, the processing module 402 is also used to determine the value of each feature variable in the initial feature variable set based on the Relief algorithm Weight value; delete the feature variables in the initial feature variable set whose weight value is lower than the preset threshold.
  • the receiving module 401 is further used to obtain at least one initial sample library by using at least one collection method
  • the processing module 402 is further used to combine the at least one initial sample set to obtain a static sample library and filter according to preset sample evaluation rules In the static sample library, it is determined that the filtered static sample library is the sample library.
  • the processing module 402 is also used to set the initial evaluation value of the target cross-sectional feature variable set and the sample feature variable set in the sample library; and according to the results of the target cross-sectional feature variable set for predicting the fault set, the static sample library
  • the initial evaluation value of the sample is updated, where the prediction results include: correct, wrong, and irrelevant; and delete the sample feature variable set corresponding to the evaluation value below the preset evaluation threshold in the static sample library, and the corresponding Sample failure set, thereby obtaining a sample library.
  • the processing module 402 also compares the target cross-section feature variable set of the target power grid cross-section with the sample feature variable set in the sample library to obtain the target power grid cross-section and each sample in the sample library The similarity between them; determining that the sample fault set corresponding to the sample feature variable set with the highest similarity is the predicted fault set of the target power grid section.
  • the processing module 402 is also used to normalize each feature variable in the target cross-section feature variable set and the sample feature variable set; and calculate the target cross-section feature variable set and each sample feature variable in the sample library based on the k-proximity algorithm
  • the classification distance between the sets; correspondingly, the sample feature variable set with the highest similarity is determined as the sample feature variable set matching the target cross-section feature variable set, including: determining the sample feature variable set with the shortest classification distance as the sample with the highest similarity Feature variable set.
  • This embodiment also proposes an electronic device.
  • the electronic device includes a memory and a processor.
  • the memory is used to store a fault prediction instruction
  • the processor is used to run the fault prediction instruction to perform the above-mentioned grid prediction fault set prediction method.
  • This embodiment also proposes a computer-readable storage medium that stores a computer program, and when the computer program runs on a computer, causes the computer to execute the power grid prediction described in this embodiment Fault set prediction method. This embodiment will not repeat them here.
  • each block in the flowchart or block diagram may represent a module, program segment, or part of code that contains one or more of the Executable instructions.
  • the functions noted in the block may occur out of the order noted in the figures. For example, two consecutive blocks can actually be executed substantially in parallel, and sometimes they can also be executed in reverse order, depending on the functions involved.
  • each block in the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts can be implemented with a dedicated hardware-based system that performs specified functions or actions Or, it can be realized by a combination of dedicated hardware and computer instructions.
  • the functional modules in the embodiments of the present invention may be integrated together to form an independent part, or each module may exist alone, or two or more modules may be integrated to form an independent part.
  • the functions are implemented in the form of software function modules and sold or used as independent products, they can be stored in a computer-readable storage medium.
  • the technical solution of the present invention essentially or part of the contribution to the existing technology or part of the technical solution can be embodied in the form of a software product
  • the computer software product is stored in a storage medium, including Several instructions are used to enable a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in the embodiments of the present invention.
  • the aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program code .

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Economics (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Business, Economics & Management (AREA)
  • General Health & Medical Sciences (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Primary Health Care (AREA)
  • Marketing (AREA)
  • Supply And Distribution Of Alternating Current (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A power grid anticipated fault set prediction method and apparatus, and an electronic device and a storage medium. The method comprises: obtaining a target section characteristic variable set of a preset characteristic variable type of a target power grid section (S101); determining a sample characteristic variable set matching the target section characteristic variable set in a preset sample library (S102), wherein the sample library comprises a correspondence between a sample fault set and a sample characteristic variable set, the characteristic variable type of the sample characteristic variable set is the same as the preset characteristic variable type; and finally determining that the sample fault set corresponding to the matched sample characteristic variable set is a predicted fault set of the target power grid section (S103). The fault set corresponding to the current section is obtained by comparing a characteristic variable of a sample in the sample library with a characteristic variable of the target power grid section; a problem that the current section depends on manual experience is solved; the possibility of incomplete prediction of the fault set is reduced; the ultra-real-time prediction of the fault set is implemented; and the working efficiency of a worker is improved.

Description

一种电网预想故障集预测方法、装置、电子设备及存储介质Method, device, electronic equipment and storage medium for predicting fault set of power grid 技术领域Technical field
本发明涉及电网分析技术领域,具体而言,涉及一种电网预想故障集预测方法、装置、电子设备及存储介质。The present invention relates to the technical field of power grid analysis, and in particular, to a method, device, electronic equipment, and storage medium for predicting a fault set of a power grid.
背景技术Background technique
静态安全分析中的预想故障集生成技术是预防电网故障、模拟演练的必要手段;目前,预想故障集生成主要依靠调度员经验进行确定,但这种方法耗时较长、盲目性较大,常常漏选大量实际存在的故障。在电网运行方式改变、新设备接入时,需要人工调整预想故障集,不能自适应对故障集进行增、删、改、查,也不能考虑恶劣天气对电网的影响。因此,亟需快速、准确地预想故障集生成方法,摆脱预想故障集生成对调度员经验的依赖,提高工作效率。The predicted fault set generation technology in static safety analysis is a necessary means to prevent power grid faults and simulation drills; at present, the predicted fault set generation mainly depends on the experience of the dispatcher to determine, but this method takes a long time and is blind. A large number of actual faults are missed. When the operation mode of the power grid is changed and new equipment is connected, it is necessary to manually adjust the expected fault set. The fault set cannot be adaptively added, deleted, changed, or checked, nor can the influence of bad weather on the power grid be considered. Therefore, there is an urgent need to quickly and accurately predict the fault set generation method, get rid of the dependence of the expected fault set generation on the dispatcher's experience, and improve work efficiency.
发明内容Summary of the invention
为了克服上述现有技术中的不足,本发明提供一种故障集自动生成方法、电子设备及存储介质以改善上述问题。In order to overcome the above deficiencies in the prior art, the present invention provides a method for automatically generating a fault set, an electronic device and a storage medium to improve the above problems.
为了实现上述目的,本发明实施例所提供的技术方案如下所示:In order to achieve the above objective, the technical solutions provided by the embodiments of the present invention are as follows:
第一方面,本发明实施例提供一种电网预想故障集预测方法,包括:获取目标电网断面的预设特征变量种类的目标断面特征变量集;在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特 征变量集;其中,所述样本库包括样本故障集和样本特征变量集的对应关系;所述样本特征变量集的特征变量种类与所述预设特征变量种类相同;确定所述匹配的样本特征变量集对应的样本故障集为所述目标电网断面的预测故障集。In a first aspect, an embodiment of the present invention provides a method for predicting a predicted fault set of a power grid, including: acquiring a target cross-section feature variable set of a preset feature variable type of a target power grid cross-section; and determining the target in a preset sample library A sample feature variable set matched with a cross-section feature variable set; wherein, the sample library includes a correspondence between a sample fault set and a sample feature variable set; the feature variable type of the sample feature variable set is the same as the preset feature variable type; It is determined that the sample fault set corresponding to the matched sample feature variable set is the predicted fault set of the target power grid cross section.
结合第一方面,在一些可能的实现方式中,在获取目标电网断面的预设特征变量种类的目标断面特征变量集之前,所述方法还包括:获取多个电网变量作为初始特征变量集;根据预设特征变量评价规则筛选所述初始特征变量集;确定筛选后的所述初始特征变量集中包含的特征变量种类为所述预设特征变量种类。With reference to the first aspect, in some possible implementation manners, before acquiring the target cross-section feature variable set of the preset feature variable type of the target grid cross-section, the method further includes: acquiring multiple grid variables as the initial feature variable set; A preset feature variable evaluation rule filters the initial feature variable set; it is determined that the type of feature variable included in the filtered initial feature variable set is the preset feature variable type.
结合第一方面,在一些可能的实现方式中,所述根据预设特征变量评价规则筛选所述初始特征变量集,包括:获取样本库中每个样本特征变量集的所有特征变量;基于Relief算法,确定初始特征变量集中每个特征变量的权重值;删除初始特征变量集中,权重值低于预设阈值的特征变量。With reference to the first aspect, in some possible implementation manners, the filtering of the initial feature variable set according to a preset feature variable evaluation rule includes: obtaining all feature variables of each sample feature variable set in the sample library; based on the Relief algorithm To determine the weight value of each feature variable in the initial feature variable set; delete the feature variables in the initial feature variable set whose weight value is lower than the preset threshold.
结合第一方面,在一些可能的实现方式中,方法还包括:采用至少一种采集方式获取至少一个初始样本库,组合至少一个初始样本库得到静态样本库,初始样本库包括每一个样本的样本故障集及样本故障集对应的样本特征变量集;按照预设样本评价规则筛选静态样本库;确定筛选后的静态样本库为样本库。With reference to the first aspect, in some possible implementation manners, the method further includes: acquiring at least one initial sample library by at least one acquisition method, and combining at least one initial sample library to obtain a static sample library, and the initial sample library includes samples of each sample The fault feature set and the sample feature variable set corresponding to the sample fault set; the static sample library is filtered according to the preset sample evaluation rules; and the filtered static sample library is determined as the sample library.
结合第一方面,在一些可能的实现方式中,在预设的样本库中确定出与目标断面特征变量集匹配的样本特征变量集之前,方法还包括:设定目标断面特征变量集与样本库中的样本特征变量集的初始评价值;根据目标断面特征变量集对预测故障集的结果,对静态样本库中样本的初始评价值进行更新,其中,预测结果包括:正确、错误和不相关;删除静态样本库中,低于预设评价阈值的评价值对应的样本特征变量集以及样本特征变量集对应的样本故障集,得到样本库。With reference to the first aspect, in some possible implementations, before a sample feature variable set matching the target cross-section feature variable set is determined in the preset sample library, the method further includes: setting the target cross-section feature variable set and the sample library The initial evaluation value of the sample feature variable set in; update the initial evaluation value of the sample in the static sample library according to the result of the target cross-section feature variable set on the predicted fault set, where the prediction result includes: correct, wrong and irrelevant; Delete the sample feature variable set corresponding to the evaluation value lower than the preset evaluation threshold and the sample fault set corresponding to the sample feature variable set in the static sample library to obtain the sample library.
结合第一方面,在一些可能的实现方式中,所述在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特征变量集,包括:比对所述目标断面特征变量集与所述样本库中的样本特征变量集,获取所述目标电网断面与所述样本库中各个样本之间的相似度;确定所述相似度最高的样本特征变量集为与所述目标断面特征变量集匹配的样本特征变量集。With reference to the first aspect, in some possible implementation manners, the determining a sample feature variable set that matches the target cross-section feature variable set in a preset sample library includes: comparing the target cross-section feature variable set Obtain the similarity between the target grid section and each sample in the sample library with the sample feature variable set in the sample library; determine the sample feature variable set with the highest similarity as the target section feature The sample feature variable set matching the variable set.
结合第一方面,在一些可能的实现方式中,在预设的样本库中确定出与目标断面特征变量集匹配的样本特征变量集,包括:归一化处理目标断面特征变量集与样本特征变量集中的每一个特征变量;并基于k-邻近算法计算目标断面特征变量集与样本库中各个样本特征变量集之间的分类距离;对应的,确定相似度最高的样本特征变量集为与目标断面特征变量集匹配的样本特征变量集,包括:确定分类距离最短的样本特征变量集为相似度最高的样本特征变量集。With reference to the first aspect, in some possible implementations, a sample feature variable set matching the target cross-section feature variable set is determined in a preset sample library, including: normalizing the target cross-section feature variable set and the sample feature variable Each feature variable in the set; calculate the classification distance between the target cross-section feature variable set and each sample feature variable set in the sample library based on the k-proximity algorithm; correspondingly, determine the sample feature variable set with the highest similarity as the target cross-section The sample feature variable set matched by the feature variable set includes: determining the sample feature variable set with the shortest classification distance as the sample feature variable set with the highest similarity.
第二方面,本发明实施例还提供一种电网预想故障集预测装置,包括:接收模块,用于获取目标电网断面的预设特征变量种类的目标断面特征变量集;处理模块,用于在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特征变量集,并确定所述匹配的样本特征变量集对应的样本故障集为所述目标电网断面的预测故障集;发送模块,用于发送所述预测故障集给工作人员。In a second aspect, an embodiment of the present invention further provides a device for predicting a predicted set of power grid faults, including: a receiving module for acquiring a target profile feature variable set of a preset feature variable type of a target grid profile; a processing module for predicting A sample feature variable set matching the target cross-section feature variable set is determined from the set sample database, and a sample fault set corresponding to the matched sample feature variable set is determined as the predicted fault set of the target power grid cross-section; the sending module , Used to send the predicted fault set to the staff.
第三方面,本发明实施例还提供一种电子设备,所述设备包括:In a third aspect, an embodiment of the present invention further provides an electronic device, the device including:
存储器及处理器,存储器用于存储故障预测指令,处理器用于运行所述故障预测指令以执行如第一方面任意一种实现方式中所述的电网预想故障集预测方法。A memory and a processor. The memory is used to store a fault prediction instruction, and the processor is used to run the fault prediction instruction to execute the power grid expected fault set prediction method described in any one of the implementation manners of the first aspect.
第四方面,本发明实施例还提供一种计算机可读存储介质,所述可读存储介质中存储有计算机程序,当所述计算机程序在计算机上运 行时,使得所述计算机执行如第一方面任意一种实现方式中所述的电网预想故障集预测方法。According to a fourth aspect, an embodiment of the present invention further provides a computer-readable storage medium, where the computer program is stored in the readable storage medium, and when the computer program runs on a computer, the computer is executed as in the first aspect The method for predicting the expected fault set of the power grid described in any implementation.
本发明的有益效果包括:The beneficial effects of the present invention include:
通过获取到目标电网断面的预设特征变量种类的特征,并根据预设的样本库中每一个样本的样本特征变量集进行比对分析,得到样本库中匹配的样本特征变量集对应的样本故障集,进而预测到当前电网断面出现的故障类型,而不再依靠调度人员的经验根据当前电网断面的电网变量及样本库中的样本故障进行分析,一方面提高了工作人员的效率,另一方面对当前电网断面的分析更加充分。By obtaining the characteristics of the preset feature variable types of the target power grid section, and performing a comparative analysis according to the sample feature variable set of each sample in the preset sample library, the sample failure corresponding to the matched sample feature variable set in the sample library is obtained Set, and then predict the type of failure that occurs in the current grid section, instead of relying on the experience of dispatchers to analyze based on the grid variables of the current grid section and sample failures in the sample library, on the one hand, improve the efficiency of the staff, on the other hand The analysis of the current grid section is more complete.
为使本发明的上述目的、特征和优点能更明显易懂,下文举出本发明实施例,并配合所附附图,作详细说明。In order to make the above objects, features and advantages of the present invention more obvious and understandable, the following describes the embodiments of the present invention and the accompanying drawings for detailed description.
附图说明BRIEF DESCRIPTION
为了更清楚地说明本发明实施例的技术方案,下面将对实施例中所需要使用的附图作简单地介绍,应当理解,以下附图仅示出了本发明的某些实施例,因此不应被看作是对范围的限定,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他相关的附图。In order to more clearly explain the technical solutions of the embodiments of the present invention, the drawings required in the embodiments will be briefly introduced below. It should be understood that the following drawings only show some embodiments of the present invention, and therefore do not It should be regarded as a limitation on the scope. For those of ordinary skill in the art, without paying any creative labor, other related drawings can be obtained based on these drawings.
图1本发明实施例提供的一种电网预想故障集预测方法的流程示意图;FIG. 1 is a schematic flowchart of a method for predicting an expected fault set of a power grid according to an embodiment of the present invention;
图2为本发明实施例提供的一种优化筛选得到预设特征变量种类的流程示意图;FIG. 2 is a schematic flowchart of a process of optimizing and filtering to obtain preset feature variable types according to an embodiment of the present invention; FIG.
图3为本发明实施例提供的一种生成样本库方法的流程示意图;3 is a schematic flowchart of a method for generating a sample library according to an embodiment of the present invention;
图4为本发明实施例提供的一种电网预想故障集预测装置的功能框图。4 is a functional block diagram of a device for predicting a fault set of a power grid according to an embodiment of the present invention.
具体实施方式detailed description
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。通常在此处附图中描述和示出的本发明实施例的组件可以以各种不同的配置来布置和设计。To make the objectives, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be described clearly and completely in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of the embodiments of the present invention, but not all the embodiments. The components of the embodiments of the present invention generally described and illustrated in the drawings herein can be arranged and designed in various configurations.
因此,以下对在附图中提供的本发明的实施例的详细描述并非旨在限制要求保护的本发明的范围,而是仅仅表示本发明的选定实施例。基于本发明中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。Therefore, the following detailed description of the embodiments of the present invention provided in the drawings is not intended to limit the scope of the claimed invention, but only to represent selected embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by a person of ordinary skill in the art without making creative efforts fall within the protection scope of the present invention.
需要说明的是,在不冲突的情况下,本发明中的实施例及实施例中的特征可以相互组合。It should be noted that the embodiments of the present invention and the features in the embodiments can be combined with each other without conflict.
应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步定义和解释。It should be noted that similar reference numerals and letters indicate similar items in the following drawings, therefore, once an item is defined in one drawing, there is no need to further define and explain it in subsequent drawings.
在本发明实施例的描述中,需要说明的是,指示方位或位置关系为基于附图所示的方位或位置关系,或者是该发明产品使用时惯常摆放的方位或位置关系,或者是本领域技术人员惯常理解的方位或位置关系,或者是该发明产品使用时惯常摆放的方位或位置关系,仅是为了便于描述本发明和简化描述,而不是指示或暗示所指的装置或元件必须具有特定的方位、以特定的方位构造和操作,因此不能理解为对本发明的限制。此外,术语“第一”、“第二”、“第三”等仅用于区分描述,而不能理解为指示或暗示相对重要性。In the description of the embodiments of the present invention, it should be noted that the indicated orientation or position relationship is based on the orientation or position relationship shown in the drawings, or the orientation or position relationship conventionally placed when the product of the invention is used, or this The orientation or positional relationship commonly understood by those skilled in the art, or the orientation or positional relationship normally placed when the product of the invention is used, is only for the convenience of describing the present invention and simplifying the description, rather than indicating or implying that the device or element It has a specific orientation, is constructed and operated in a specific orientation, and therefore cannot be understood as a limitation of the present invention. In addition, the terms "first", "second", "third", etc. are only used to distinguish the description and cannot be understood as indicating or implying relative importance.
在本发明实施例的描述中,还需要说明的是,除非另有明确的规定和限定,术语“设置”、“安装”、“连接”应做广义理解,例如,可以是固定连接,也可以是可拆卸连接,或一体地连接;可以是直接连接, 也可以通过中间媒介间接连接。对于本领域的普通技术人员而言,可以具体情况理解上述术语在本发明中的具体含义。In the description of the embodiments of the present invention, it should also be noted that, unless otherwise clearly specified and defined, the terms “setup”, “installation”, and “connection” should be understood in a broad sense, for example, it can be a fixed connection or a It is a detachable connection, or a one-piece connection; it can be a direct connection or an indirect connection through an intermediate medium. For those of ordinary skill in the art, the specific meanings of the above terms in the present invention can be understood in specific situations.
本实施例提供一种电网预想故障集自动生成方法,用于预测某地调电网的预想故障集,并对所预测的预想故障集进行验证。根据存储的样本库及预测方法预测某一电网断面出现的故障集,具体的,请参考图1,图1为本发明实施例提供的一种电网预想故障集预测方法的流程示意图。所述方法包括步骤S101-步骤S103:This embodiment provides a method for automatically generating a predicted fault set of a power grid, which is used to predict a predicted fault set of a local power grid and verify the predicted expected fault set. According to the stored sample database and the prediction method, a fault set occurring in a certain power grid cross section is predicted. For details, please refer to FIG. 1. FIG. 1 is a schematic flowchart of a method for predicting a fault set of a power grid according to an embodiment of the present invention. The method includes steps S101-S103:
步骤S101:获取目标电网断面的预设特征变量种类的目标断面特征变量集;Step S101: Acquire a target profile feature variable set of a preset feature variable type of the target grid profile;
其中,所述样本库包括样本故障集和样本特征变量集的对应关系,且所述样本特征变量集的特征变量的种类与所述预设特征变量种类相同,所述目标断面特征变量集的特征变量包括电网中的全变量,例如母线电压、支路功率等,本实施例对此不做限制。Wherein, the sample library includes the corresponding relationship between the sample fault set and the sample characteristic variable set, and the type of the characteristic variable of the sample characteristic variable set is the same as the type of the preset characteristic variable, and the characteristics of the target section characteristic variable set The variables include all variables in the power grid, such as bus voltage, branch power, etc., which is not limited in this embodiment.
步骤S102:在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特征变量集;Step S102: Determine a sample feature variable set matching the target cross-section feature variable set in a preset sample library;
可以理解的是,所述目标断面特征变量集包含的特征类别也与所述样本特征变量集包含的特征变量的类别相同,通过根据预设的匹配规则,在预设的样本库中对样本特征变量集及目标断面特征变量集进行比对,确定出与所述目标断面特征变量集匹配的样本特征变量集。It is understandable that the target cross-section feature variable set contains the same feature category as the sample feature variable set. The sample features are set in the preset sample library according to the preset matching rules. The variable set and the target cross-section feature variable set are compared to determine a sample feature variable set that matches the target cross-section feature variable set.
步骤S103:确定所述匹配的样本特征变量集对应的样本故障集为所述目标电网断面的预测故障集。Step S103: Determine the sample fault set corresponding to the matched sample feature variable set as the predicted fault set of the target power grid cross section.
通过利用样本库,比对目标电网断面的多个目标特征及样本库中每个样本的样本特征,进而根据匹配结果,确定样本库中匹配的样本特征变量集对应的样本故障集为当前断面的预测故障集;从而代替了现有通过工作人员经验对当前电网断面故障集预想设定的方式,一方面提高了工作人员的效率,另一方面能够更为全面地设定电网断面所 对应的故障集,防止出现重大故障集遗漏的问题。By using the sample library, the multiple target features of the target grid section and the sample features of each sample in the sample library are compared, and then according to the matching result, the sample fault set corresponding to the matched sample feature variable set in the sample library is determined to be the current section Predict the fault set; thus replacing the existing way of predicting the current grid section fault set through staff experience, on the one hand, it improves the efficiency of the staff, on the other hand, it can more comprehensively set the fault corresponding to the grid section Set to prevent the issue of major failures from being missed.
可选的,也可以对电网变量进行优化筛选得到预设特征变量种类,选择更能得出正确预测故障集的特征变量加入到所述预设特征变量种类中。请参考图2,图2为本发明实施例提供的一种优化筛选得到预设特征变量种类的流程示意图。所述筛选特征变量种类的方法包括:Alternatively, the grid variables may be optimized and screened to obtain the preset characteristic variable types, and the characteristic variables that can more accurately obtain the fault set can be selected and added to the preset characteristic variable types. Please refer to FIG. 2, which is a schematic flowchart of obtaining a preset feature variable type by optimized screening according to an embodiment of the present invention. The method for screening feature variable types includes:
步骤S201:获取多个电网变量作为初始特征变量集;Step S201: Acquire multiple grid variables as initial feature variable sets;
其中,多个电网变量可以包括电网断面中可以获取或者检测到的所有电网变量,进而通过筛选,不会担心存在许多无意义特征,而能得出合适的电网变量作为判定特征。Among them, the multiple grid variables may include all grid variables that can be obtained or detected in the grid section, and then through screening, without worrying about the existence of many meaningless features, appropriate grid variables can be derived as the judgment characteristics.
步骤S202:根据预设特征变量评价规则筛选所述初始特征变量集;Step S202: Filter the initial feature variable set according to a preset feature variable evaluation rule;
其中,所述预设特征变量评价规则可以采用:主成分分析法、核主成分分析法、偏最小二乘法以及Relief算法等中的一种或多种。Wherein, the preset characteristic variable evaluation rule may adopt one or more of principal component analysis method, kernel principal component analysis method, partial least square method and Relief algorithm.
可选的,基于Relief算法,获取预设特征变量种类的方法包括:获取样本库中每个样本特征变量集的所有特征变量;基于Relief算法,确定初始特征变量集中每个特征变量的权重值;删除初始特征变量集中,权重值低于预设阈值的特征变量。Optionally, based on the Relief algorithm, the method of acquiring the preset feature variable types includes: acquiring all feature variables of each sample feature variable set in the sample library; based on the Relief algorithm, determining the weight value of each feature variable in the initial feature variable set; Delete the feature variables in the initial feature variable set whose weight value is lower than the preset threshold.
一种获取预设特征变量种类的实现方式为:设定样本库中的样本特征变量集为(x i,y i),i=1,2,...,N,x i为电网断面特征变量,y i为样本故障集。定义第p个特征变量所对应的权重为 An implementation method for obtaining the preset characteristic variable types is: setting the sample characteristic variable set in the sample library as (x i , y i ), i=1, 2, ..., N, and x i are the grid section characteristics The variable, y i is the sample fault set. Define the weight corresponding to the p-th feature variable as
Figure PCTCN2019110941-appb-000001
Figure PCTCN2019110941-appb-000001
其中N为样本数量,x i(p)为第p个特征变量,
Figure PCTCN2019110941-appb-000002
为与x i(p)欧式距离最近的同类样本,
Figure PCTCN2019110941-appb-000003
为与x i(p)欧式距离最近的异类样本。特征变量权重w(p)越大,则对应的特征变量的贡献作用就越大,进而使用 w(p)≥δ进行筛选特征变量,最终得出能够表征样本特征变量集的最优特征变量,其中δ为阈值,本实例中选择权重值较大且相对稳定的特征变量权值作为阈值。
Where N is the number of samples and x i (p) is the p-th feature variable,
Figure PCTCN2019110941-appb-000002
Is the same kind of sample with the closest Euclidean distance to x i (p),
Figure PCTCN2019110941-appb-000003
It is a heterogeneous sample with the closest Euclidean distance to x i (p). The larger the feature variable weight w(p), the greater the contribution of the corresponding feature variable, and then use w(p)≥δ to filter the feature variable, and finally obtain the optimal feature variable that can characterize the sample feature variable set, Where δ is the threshold, in this example, the weight value of the feature variable with a large weight value and relatively stable is selected as the threshold value.
根据上述公式计算得到所述每个特征变量的权重值,进而根据预设阈值,删除所述初始特征变量集中低于所述预设阈值的所述区别相似度对应的特征变量,完成对初始特征变量集的筛选,在其他实施例中,还可以采用主成分分析法等实现特征变量筛选,本实施例对此不做限制。Calculate the weight value of each feature variable according to the above formula, and then delete the feature variable corresponding to the difference similarity in the initial feature variable set that is lower than the preset threshold according to a preset threshold to complete the initial feature For the selection of the variable set, in other embodiments, the principal component analysis method or the like may also be used to implement the selection of characteristic variables, which is not limited in this embodiment.
步骤S203:确定筛选后的所述初始特征变量集为所述判定特征变量集。Step S203: Determine the filtered initial feature variable set as the determined feature variable set.
通过选择可以测得的电网变量中较能体现故障集的特征,能够减小不相关电网变量对目标故障集预测的影响,提高预测的准确性。By selecting the grid variables that can be measured to better reflect the characteristics of the fault set, the influence of unrelated grid variables on the prediction of the target fault set can be reduced and the accuracy of the prediction can be improved.
可选的,本实施例还提出一种生成所述样本库的方法,用于集合现有的故障集样本获取方式,并对样本进行筛选,生成所述样本库。请参考图3,图3为本实施例提供的一种生成所述样本库方法的流程示意图。所述生成所述样本库的方法的步骤包括步骤S301-步骤S303:Optionally, this embodiment also proposes a method for generating the sample library, which is used to collect existing fault set sample acquisition methods, and screen the samples to generate the sample library. Please refer to FIG. 3, which is a schematic flowchart of a method for generating the sample library according to this embodiment. The steps of the method for generating the sample library include steps S301-S303:
步骤S301:采用至少一种采集方式获取至少一个初始样本库,组合所述至少一个初始故障集样本集得到静态样本库;Step S301: At least one initial sample library is acquired by at least one collection method, and the at least one initial failure set sample set is combined to obtain a static sample library;
其中,所述至少一种采集方式包括:静态安全分析N-1、N-2方法、典型预想故障集以及结合电网调度员经验等多种电网分析方法,在本实施例中,可以采用多种采集方式,以更加全面地覆盖电网运行方式中的信息,本实施例对此不做限制;所述初始样本库包括每一个样本的初始样本故障集及所述初始样本故障集对应的初始样本故障特征变量集。Among them, the at least one acquisition method includes: static safety analysis N-1, N-2 methods, typical expected fault sets, and a variety of power grid analysis methods combined with the experience of power grid dispatchers. In this embodiment, a variety of power grid analysis methods can be used Collection mode to more comprehensively cover the information in the grid operation mode, which is not limited in this embodiment; the initial sample library includes the initial sample failure set of each sample and the initial sample failure corresponding to the initial sample failure set Feature variable set.
步骤S302:按照预设样本评价规则筛选所述静态样本库;Step S302: Filter the static sample library according to a preset sample evaluation rule;
其中,所述预设样本评价规则用于去除所述静态样本库中的无价值样本,筛选出所述静态样本库中更能确定预测故障集的样本。所述预设样本评价规则包括:基于可信度的方法、基于多分类器结果统一度的方法以及基于群体的增量学习算法等。Wherein, the preset sample evaluation rule is used to remove valueless samples in the static sample library, and screen out samples in the static sample library that are more capable of determining the predicted fault set. The preset sample evaluation rules include: a method based on credibility, a method based on the uniformity of the results of multiple classifiers, and a group-based incremental learning algorithm.
可选的,基于群体的增量学习算法,筛选所述静态样本库的一种实现方式为:设定所述目标断面特征变量集与所述样本库中的所述样本特征变量集的初始评价值;并根据所述目标断面特征变量集对所述预测故障集的结果,对所述静态样本库中样本的初始评价值进行更新,其中,所述预测结果包括:正确、错误和不相关;以及删除所述静态样本库中,低于预设评价阈值的所述评价值对应的样本特征变量集以及所述样本特征变量集对应的样本故障集,从而得到所述样本库。Optionally, an implementation method for screening the static sample library based on a group-based incremental learning algorithm is: setting an initial evaluation of the target cross-section feature variable set and the sample feature variable set in the sample library Value; and update the initial evaluation value of the sample in the static sample library according to the result of the target cross-section feature variable set on the predicted fault set, where the predicted result includes: correct, wrong, and irrelevant; And deleting the sample feature variable set corresponding to the evaluation value below the preset evaluation threshold and the sample fault set corresponding to the sample feature variable set in the static sample library, thereby obtaining the sample library.
其中,设定每一个样本在未经过筛选前的初始评价值为q 0,设定学习速率参数β,且0<β<1,如果样本i为待分类样本的k个最邻近之一,并且分类结果正确,则该样本的评价值更新为:q i1=βq i1+1-β;如果样本i为待分类样本的k个最邻近之一,但分类结果错误,则该样本的评价值更新为:q i2=βq i2;如果样本i不是待分类样本的k个最邻近样本中任意一个,判定关系为不相关,则该样本的评价值更新为:q i3=βq i3+(1-β)q 0。可以得知,在三种关系下,对样本进行更新,在设定阈值为q 0时,去除所述样本库中,评价值低于q 0的样本,以减小无价值或者降低预测正确率的样本,提高预测正确率。其中,本实例中k为5,学习速率参数β为0.9,初始评价值q 0为0.5。 Where the initial evaluation value of each sample before being screened is q 0 , the learning rate parameter β is set, and 0<β<1, if sample i is one of the k nearest neighbors of the sample to be classified, and If the classification result is correct, the evaluation value of the sample is updated to: q i1 =βq i1 +1-β; if sample i is one of the k nearest neighbors of the sample to be classified, but the classification result is wrong, the evaluation value of the sample is updated Is: q i2 = βq i2 ; if sample i is not any of the k nearest samples of the sample to be classified, and the judgment relationship is not relevant, the evaluation value of the sample is updated to: q i3 = βq i3 + (1-β )q 0 . It can be known that under the three relationships, the samples are updated, and when the threshold is set to q 0 , the samples whose evaluation value is lower than q 0 in the sample library are removed to reduce the valuelessness or reduce the prediction accuracy Of samples to improve prediction accuracy. In this example, k is 5, the learning rate parameter β is 0.9, and the initial evaluation value q 0 is 0.5.
步骤S303:确定筛选后的所述静态样本库为所述样本库。Step S303: Determine the screened static sample library as the sample library.
通过筛选出所述静态样本库中的无价值样本,能够防止该类无价值样本对步骤S103中确定当前断面的预测故障集结果的影响,提高 预测正确率。By filtering out the non-value samples in the static sample database, it is possible to prevent the non-value samples of this type from affecting the prediction fault set result of the current section determined in step S103, and improve the prediction accuracy.
可选的,步骤S303确定所述样本库后,还可以通过在不断加入新的样本故障集的同时,设置样本库的样本数量上限,并根据所述预设样本评价规则,剔除掉所述样本库中认为最无价值的样本,以实现不断地对样本库的优化,所述方法包括:获取并添加新的样本至所述样本库,再获取预设的样本库的样本数量上限;判断所述样本库的样本数量是否小于等于所述样本数量上限,在为否时,根据所述预设样本评价规则删除所述样本库中的样本,直至所述样本库的样本数量是否小于等于所述样本数量上限。Optionally, after the sample library is determined in step S303, the new sample failure set can be continuously added, and the upper limit of the sample quantity of the sample library can be set, and the sample can be eliminated according to the preset sample evaluation rule The most valuable samples in the library are used to continuously optimize the sample library. The method includes: acquiring and adding new samples to the sample library, and then obtaining the preset upper limit of the number of samples in the sample library; Whether the number of samples in the sample library is less than or equal to the upper limit of the number of samples, and if not, delete the samples in the sample library according to the preset sample evaluation rule until the number of samples in the sample library is less than or equal to the Maximum sample size.
可选的,所述新的故障集样本可以在每一次目标电网断面预测完毕并获取预测正确与否的结果后,自动添加到所述样本库,并根据步骤S302中的预设样本评价规则进行筛选,实现动态优化,也可以由调度员根据经验输入一些新的故障集样本,本实施例对此不做限制。Optionally, the new fault set sample may be automatically added to the sample library after each target grid section prediction is completed and the result of the prediction is correct or not, according to the preset sample evaluation rules in step S302 Screening to achieve dynamic optimization, the dispatcher may also input some new fault set samples based on experience, this embodiment does not limit this.
可选的,样本数量上限可以通过设置多个样本数量上限,经过多次预测后,比较每个样本数量上限预测的准确率,最终选择预测准确率最高的样本数量上限作为样本库的样本数量上限,同样的,也可以由调度员根据经验或者根据应用处理器的处理能力进行选择,本实施例对此不做限制。Optionally, the upper limit of the sample size can be set by setting multiple upper limit of the sample size. After multiple predictions, the prediction accuracy of each upper limit of sample size is compared. Finally, the upper limit of the sample size with the highest prediction accuracy is selected as the upper limit of the sample size Similarly, the dispatcher can also make a selection according to experience or according to the processing capacity of the application processor, which is not limited in this embodiment.
可以进一步理解的是,可以利用多个样本数量上限的样本库同时进行预测,根据多个样本数量上限的样本库的预测结果综合考虑,例如选择在预测结果中出现最多的故障集等,本实施例同样对此不做限制。It can be further understood that multiple sample caps can be used to make simultaneous predictions. Based on the prediction results of the sample caps with multiple sample caps, such as selecting the most frequent fault set in the prediction results, this implementation Examples also do not limit this.
在获取到预设特征变量种类及样本库后,即可执行步骤S102中确定出与所述目标断面特征变量集匹配的样本特征变量集的步骤。After obtaining the preset feature variable type and sample library, the step of determining the sample feature variable set matching the target cross-section feature variable set in step S102 may be performed.
可选的,确定出匹配的样本特征变量集的一种实现方式为:比对所述目标电网断面的目标断面特征变量集与所述样本库中的样本特 征变量集,获取所述目标电网断面与所述样本库中各个样本之间的相似度;确定所述相似度最高的样本特征变量集对应的样本故障集为所述目标电网断面的预测故障集。其中,可以采用神经网络算法或者深度学习算法等,将目标断面特征变量集的每个特征变量与每个所述样本特征变量集进行相似度比对等方式,得到相似度最高的样本特征变量集为匹配的样本特征变量集。Optionally, one way to determine the matching sample feature variable set is to compare the target cross-section feature variable set of the target power grid cross-section with the sample feature variable set in the sample library to obtain the target power grid cross-section The similarity between each sample in the sample library; determining that the sample fault set corresponding to the sample feature variable set with the highest similarity is the predicted fault set of the target power grid cross section. Among them, a neural network algorithm or deep learning algorithm can be used to compare each feature variable of the target cross-section feature variable set with each of the sample feature variable sets to obtain a sample feature variable set with the highest similarity It is the matching sample feature variable set.
可选的,确定出匹配的样本特征变量集的另一种实现方式,可以采用k-邻近算法,具体为:归一化处理目标断面特征变量集与样本特征变量集中的每一个特征变量;并基于k-邻近算法计算目标断面特征变量集与样本库中各个样本特征变量集之间的分类距离;对应的,确定相似度最高的样本特征变量集为与目标断面特征变量集匹配的样本特征变量集,包括:确定分类距离最短的样本特征变量集为相似度最高的样本特征变量集。Optionally, another implementation of determining the matching sample feature variable set may use the k-proximity algorithm, specifically: normalizing each feature variable in the target cross-section feature variable set and the sample feature variable set; and Calculate the classification distance between the target cross-section feature variable set and each sample feature variable set in the sample library based on the k-proximity algorithm; correspondingly, determine the sample feature variable set with the highest similarity as the sample feature variable matching the target cross-section feature variable set Sets include: determining the sample feature variable set with the shortest classification distance as the sample feature variable set with the highest similarity.
上述实现方式的具体步骤为:设所述样本特征变量集为(x j,y j),其中,j=1,2,...,N,N为样本数量,x j为电网断面最优特征变量,y j为样本故障集。先对每个样本的每个特征变量进行归一化处理,所述归一化公式为:
Figure PCTCN2019110941-appb-000004
其中,
Figure PCTCN2019110941-appb-000005
为所述经过归一化处理后的特征对应的数值,x j为归一化处理前的特征对应的数值,max(x j)和min(x j)分别为该特征变量的种类在归一化后的最大值和最小值。进而利用k-邻近算法,计算目标断面特征变量集和所述样本库中每个样本特征变量集的分类距离,所述分类距离采用欧式距离进行计算,公式为:
Figure PCTCN2019110941-appb-000006
其中,d o,j为目标断面特征变量集与第j个样本的分类距离,H为所述预设特征变量种类的特征变量数量,x on为目标 特征变量集第n维变量,x jn为第j个样本第n维变量。通过计算目标断面特征变量集和所述样本库中每个样本特征变量集的分类距离,获取样本库中距离目标特征变量集最近的k个样本特征变量集,计算k个样本特征变量集与目标特征变量集距离的平均值
Figure PCTCN2019110941-appb-000007
确定与k个样本特征变量集分类距离到平均值
Figure PCTCN2019110941-appb-000008
差值最小的样本特征变量集所对应的样本故障集作为预测故障集。其中,在本实例中k=5。
The specific steps of the above implementation are: set the sample feature variable set to (x j , y j ), where j=1, 2, ..., N, N is the number of samples, and x j is the optimal cross-section of the power grid The characteristic variable, y j is the sample fault set. First, normalize each feature variable of each sample, and the normalization formula is:
Figure PCTCN2019110941-appb-000004
among them,
Figure PCTCN2019110941-appb-000005
Is the value corresponding to the feature after normalization, x j is the value corresponding to the feature before normalization, and max(x j ) and min(x j ) are the types of the feature variable The maximum and minimum values after conversion. Then, the k-proximity algorithm is used to calculate the classification distance between the target section feature variable set and each sample feature variable set in the sample library. The classification distance is calculated using the Euclidean distance. The formula is:
Figure PCTCN2019110941-appb-000006
Where do o,j is the classification distance between the target cross-section feature variable set and the jth sample, H is the number of feature variables of the preset feature variable type, x on is the n-dimensional variable of the target feature variable set, x jn is The nth dimension variable of the jth sample. By calculating the classification distance between the target cross-section feature variable set and each sample feature variable set in the sample library, the k sample feature variable sets closest to the target feature variable set in the sample library are obtained, and the k sample feature variable sets and the target are calculated The average value of the distance of the characteristic variable set
Figure PCTCN2019110941-appb-000007
Determine the classification distance from k sample feature variable sets to the average
Figure PCTCN2019110941-appb-000008
The sample fault set corresponding to the sample feature variable set with the smallest difference is used as the predicted fault set. Among them, k=5 in this example.
在确定匹配的样本特征变量集后,执行步骤S103,根据所述样本库中样本故障集和样本特征变量集的对应关系,确定匹配的样本特征变量集对应的样本故障集作为预测故障集,完成对目标电网断面的故障预测。After determining the matched sample feature variable set, step S103 is executed, and according to the corresponding relationship between the sample fault set and the sample feature variable set in the sample library, the sample fault set corresponding to the matched sample feature variable set is determined as the predicted fault set, and completed Prediction of faults on the target power grid section.
请参考图4,图4为本实施例提供的一种电网预想故障集预测装置40的功能框图。本实施例还提出一种电网预想故障集预测装置40,所述装置包括:接收模块401、处理模块402和发送模块403,其中,接收模块401用于获取目标电网断面的预设特征变量种类的目标断面特征变量集;处理模块402用于在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特征变量集,并确定所述匹配的样本特征变量集对应的样本故障集为所述目标电网断面的预测故障集;发送模块403用于发送所述预测故障集给工作人员。Please refer to FIG. 4. FIG. 4 is a functional block diagram of a device 40 for predicting a grid fault set according to this embodiment. This embodiment also proposes a device 40 for predicting the expected set of power grid faults. The device includes: a receiving module 401, a processing module 402, and a sending module 403, wherein the receiving module 401 is used to obtain the preset characteristic variable types of the target power grid section The target cross-section feature variable set; the processing module 402 is used to determine a sample feature variable set matching the target cross-section feature variable set in a preset sample library, and determine a sample fault set corresponding to the matched sample feature variable set Is the predicted fault set of the target power grid section; the sending module 403 is used to send the predicted fault set to the staff.
可选的,接收模块401还用于获取多个电网变量作为初始特征变量集;对应的,处理模块402还用于根据预设特征变量评价规则筛选所述初始特征变量集,并确定筛选后的所述初始特征变量集为所述预设特征变量种类。Optionally, the receiving module 401 is also used to obtain multiple grid variables as an initial feature variable set; correspondingly, the processing module 402 is also used to filter the initial feature variable set according to a preset feature variable evaluation rule and determine the filtered The initial feature variable set is the preset feature variable type.
可选的,接收模块401还用于获取所述样本库中每个样本特征变量集的所有特征变量;对应的,处理模块402还用于基于Relief算法,确定初始特征变量集中每个特征变量的权重值;删除初始特征变量集 中,权重值低于预设阈值的特征变量。Optionally, the receiving module 401 is also used to obtain all feature variables of each sample feature variable set in the sample library; correspondingly, the processing module 402 is also used to determine the value of each feature variable in the initial feature variable set based on the Relief algorithm Weight value; delete the feature variables in the initial feature variable set whose weight value is lower than the preset threshold.
可选的,接收模块401还用于采用至少一种采集方式获取至少一个初始样本库,处理模块402还用于组合所述至少一个初始样本集得到静态样本库,并按照预设样本评价规则筛选所述静态样本库,确定筛选后的所述静态样本库为所述样本库。Optionally, the receiving module 401 is further used to obtain at least one initial sample library by using at least one collection method, and the processing module 402 is further used to combine the at least one initial sample set to obtain a static sample library and filter according to preset sample evaluation rules In the static sample library, it is determined that the filtered static sample library is the sample library.
可选的,处理模块402还用于设定目标断面特征变量集与样本库中的样本特征变量集的初始评价值;并根据目标断面特征变量集对预测故障集的结果,对静态样本库中样本的初始评价值进行更新,其中,预测结果包括:正确、错误和不相关;以及删除静态样本库中,低于预设评价阈值的评价值对应的样本特征变量集以及样本特征变量集对应的样本故障集,从而得到样本库。Optionally, the processing module 402 is also used to set the initial evaluation value of the target cross-sectional feature variable set and the sample feature variable set in the sample library; and according to the results of the target cross-sectional feature variable set for predicting the fault set, the static sample library The initial evaluation value of the sample is updated, where the prediction results include: correct, wrong, and irrelevant; and delete the sample feature variable set corresponding to the evaluation value below the preset evaluation threshold in the static sample library, and the corresponding Sample failure set, thereby obtaining a sample library.
可选的,处理模块402还用与比对所述目标电网断面的目标断面特征变量集与所述样本库中的样本特征变量集,获取所述目标电网断面与所述样本库中各个样本之间的相似度;确定所述相似度最高的样本特征变量集对应的样本故障集为所述目标电网断面的预测故障集。Optionally, the processing module 402 also compares the target cross-section feature variable set of the target power grid cross-section with the sample feature variable set in the sample library to obtain the target power grid cross-section and each sample in the sample library The similarity between them; determining that the sample fault set corresponding to the sample feature variable set with the highest similarity is the predicted fault set of the target power grid section.
可选的,处理模块402还用于归一化处理目标断面特征变量集与样本特征变量集中的每一个特征变量;并基于k-邻近算法计算目标断面特征变量集与样本库中各个样本特征变量集之间的分类距离;对应的,确定相似度最高的样本特征变量集为与目标断面特征变量集匹配的样本特征变量集,包括:确定分类距离最短的样本特征变量集为相似度最高的样本特征变量集。Optionally, the processing module 402 is also used to normalize each feature variable in the target cross-section feature variable set and the sample feature variable set; and calculate the target cross-section feature variable set and each sample feature variable in the sample library based on the k-proximity algorithm The classification distance between the sets; correspondingly, the sample feature variable set with the highest similarity is determined as the sample feature variable set matching the target cross-section feature variable set, including: determining the sample feature variable set with the shortest classification distance as the sample with the highest similarity Feature variable set.
本实施例还提出一种电子设备,所述电子设备包括:存储器和处理器,存储器用于存储故障预测指令,处理器用于运行所述故障预测指令以执行如上述的电网预想故障集预测方法。This embodiment also proposes an electronic device. The electronic device includes a memory and a processor. The memory is used to store a fault prediction instruction, and the processor is used to run the fault prediction instruction to perform the above-mentioned grid prediction fault set prediction method.
本实施例还提出一种计算机可读存储介质,所述可读存储介质中存储有计算机程序,当所述计算机程序在计算机上运行时,使得所述 计算机执行本实施例中所述的电网预想故障集预测方法。本实施例对此不再赘述。This embodiment also proposes a computer-readable storage medium that stores a computer program, and when the computer program runs on a computer, causes the computer to execute the power grid prediction described in this embodiment Fault set prediction method. This embodiment will not repeat them here.
在本申请所提供的几个实施例中,应该理解到,所揭露的装置和方法,也可以通过其它的方式实现。以上所描述的装置实施例仅仅是示意性的,例如,附图中的流程图和框图显示了根据本发明的多个实施例的装置、方法和计算机程序产品的可能实现的体系架构、功能和操作。在这点上,流程图或框图中的每个方框可以代表一个模块、程序段或代码的一部分,所述模块、程序段或代码的一部分包含一个或多个用于实现规定的逻辑功能的可执行指令。也应当注意,在有些作为替换的实现方式中,方框中所标注的功能也可以以不同于附图中所标注的顺序发生。例如,两个连续的方框实际上可以基本并行地执行,它们有时也可以按相反的顺序执行,这依所涉及的功能而定。也要注意的是,框图和/或流程图中的每个方框、以及框图和/或流程图中的方框的组合,可以用执行规定的功能或动作的专用的基于硬件的系统来实现,或者可以用专用硬件与计算机指令的组合来实现。In the several embodiments provided in this application, it should be understood that the disclosed device and method may also be implemented in other ways. The device embodiments described above are only schematic. For example, the flowcharts and block diagrams in the drawings show possible implementation architectures, functions, and functions of devices, methods, and computer program products according to multiple embodiments of the present invention. operating. In this regard, each block in the flowchart or block diagram may represent a module, program segment, or part of code that contains one or more of the Executable instructions. It should also be noted that in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two consecutive blocks can actually be executed substantially in parallel, and sometimes they can also be executed in reverse order, depending on the functions involved. It should also be noted that each block in the block diagrams and/or flowcharts, and combinations of blocks in the block diagrams and/or flowcharts, can be implemented with a dedicated hardware-based system that performs specified functions or actions Or, it can be realized by a combination of dedicated hardware and computer instructions.
另外,在本发明各个实施例中的各功能模块可以集成在一起形成一个独立的部分,也可以是各个模块单独存在,也可以两个或两个以上模块集成形成一个独立的部分。In addition, the functional modules in the embodiments of the present invention may be integrated together to form an independent part, or each module may exist alone, or two or more modules may be integrated to form an independent part.
所述功能如果以软件功能模块的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器 (RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。If the functions are implemented in the form of software function modules and sold or used as independent products, they can be stored in a computer-readable storage medium. Based on this understanding, the technical solution of the present invention essentially or part of the contribution to the existing technology or part of the technical solution can be embodied in the form of a software product, the computer software product is stored in a storage medium, including Several instructions are used to enable a computer device (which may be a personal computer, server, or network device, etc.) to perform all or part of the steps of the methods described in the embodiments of the present invention. The aforementioned storage media include: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program code . It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that these entities or operations There is any such actual relationship or order. Moreover, the terms "include", "include" or any other variant thereof are intended to cover non-exclusive inclusion, so that a process, method, article or device that includes a series of elements includes not only those elements, but also those not explicitly listed Or other elements that are inherent to this process, method, article, or equipment. Without more restrictions, the element defined by the sentence "include one..." does not exclude that there are other identical elements in the process, method, article or equipment that includes the element.
以上仅为本发明的优选实施例而已,并不用于限制本发明,对于本领域的技术人员来说,本发明可以有各种更改和变化。凡在本发明的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本发明的保护范围之内。The above are only preferred embodiments of the present invention and are not intended to limit the present invention. For those skilled in the art, the present invention may have various modifications and changes. Any modification, equivalent replacement, improvement, etc. made within the spirit and principle of the present invention should be included in the protection scope of the present invention.

Claims (10)

  1. 一种电网预想故障集预测方法,其特征在于,所述方法包括:A method for predicting an expected fault set of a power grid, characterized in that the method includes:
    获取目标电网断面的预设特征变量种类的目标断面特征变量集;Obtain the target profile characteristic variable set of the preset characteristic variable type of the target grid section;
    在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特征变量集;其中,所述样本库包括样本故障集和样本特征变量集的对应关系,所述样本特征变量集的特征变量种类与所述预设特征变量种类相同;A sample feature variable set matching the target cross-section feature variable set is determined in a preset sample library; wherein, the sample library includes a correspondence between a sample fault set and a sample feature variable set, the sample feature variable set The type of characteristic variable is the same as the type of the preset characteristic variable;
    确定所述匹配的样本特征变量集对应的样本故障集为所述目标电网断面的预测故障集。It is determined that the sample fault set corresponding to the matched sample feature variable set is the predicted fault set of the target power grid cross section.
  2. 根据权利要求1所述的电网预想故障集预测方法,其特征在于,在获取目标电网断面的预设特征变量种类的目标断面特征变量集之前,所述方法还包括:The method for predicting a predicted fault set of a power grid according to claim 1, wherein before acquiring the target cross-sectional feature variable set of the preset feature variable type of the target power grid cross-section, the method further comprises:
    获取多个电网变量作为初始特征变量集;Obtain multiple grid variables as initial feature variable sets;
    根据预设特征变量评价规则筛选所述初始特征变量集;Filtering the initial feature variable set according to preset feature variable evaluation rules;
    确定筛选后的所述初始特征变量集中包含的特征变量种类为所述预设特征变量种类。It is determined that the type of the characteristic variable included in the initial characteristic variable set after screening is the preset characteristic variable type.
  3. 根据权利要求2所述的电网预想故障集预测方法,其特征在于,所述根据预设特征变量评价规则筛选所述初始特征变量集,包括:The method for predicting an expected fault set of a power grid according to claim 2, wherein the filtering of the initial feature variable set according to a preset feature variable evaluation rule includes:
    获取所述样本库中每个样本特征变量集的所有特征变量;Acquiring all feature variables of each sample feature variable set in the sample library;
    基于Relief算法,确定所述初始特征变量集中每个所述特征变量的权重值;Based on the Relief algorithm, determine the weight value of each of the feature variables in the initial feature variable set;
    删除所述初始特征变量集中,权重值低于预设阈值的所述特征变量。Delete the feature variable in the initial feature variable set whose weight value is lower than a preset threshold.
  4. 根据权利要求1所述的电网预想故障集预测方法,其特征在于, 在所述在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特征变量集之前,所述方法还包括:The method for predicting an expected fault set of a power grid according to claim 1, wherein before the sample feature variable set matching the target cross-section feature variable set is determined in the preset sample library, the method further include:
    采用至少一种采集方式获取至少一个初始样本库,组合所述至少一个初始样本库得到静态样本库,所述初始样本库包括每一个样本的样本故障集及所述样本故障集对应的样本特征变量集;At least one initial sample library is acquired by at least one acquisition method, and the at least one initial sample library is combined to obtain a static sample library, the initial sample library includes a sample failure set of each sample and a sample feature variable corresponding to the sample failure set set;
    按照预设样本评价规则筛选所述静态样本库;Filter the static sample library according to preset sample evaluation rules;
    确定筛选后的所述静态样本库为所述样本库。It is determined that the screened static sample library is the sample library.
  5. 根据权利要求4所述的电网预想故障集预测方法,其特征在于,所述按照预设样本评价规则筛选所述静态样本库,包括:The method for predicting a fault set of a power grid according to claim 4, wherein the filtering the static sample library according to a preset sample evaluation rule includes:
    设定所述目标断面特征变量集与所述样本库中的所述样本特征变量集的初始评价值;Setting an initial evaluation value of the target cross-sectional feature variable set and the sample feature variable set in the sample library;
    根据所述目标断面特征变量集对所述预测故障集的结果,对所述静态样本库中样本的初始评价值进行更新,其中,所述预测结果包括:正确、错误和不相关;Update the initial evaluation value of the sample in the static sample library according to the result of the target cross-section feature variable set on the predicted fault set, where the predicted result includes: correct, wrong, and irrelevant;
    删除所述静态样本库中,低于预设评价阈值的所述评价值对应的样本特征变量集以及所述样本特征变量集对应的样本故障集,得到所述样本库。Delete the sample feature variable set corresponding to the evaluation value below the preset evaluation threshold and the sample fault set corresponding to the sample feature variable set in the static sample library to obtain the sample library.
  6. 根据权利要求1所述的电网预想故障集预测方法,其特征在于,所述在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特征变量集,包括:The method for predicting a predicted fault set of a power grid according to claim 1, wherein the determining of a sample feature variable set matching the target cross-section feature variable set in a preset sample library includes:
    比对所述目标断面特征变量集与所述样本库中的样本特征变量集,获取所述目标电网断面与所述样本库中各个样本之间的相似度;Comparing the feature variable set of the target cross-section with the feature variable set of the sample in the sample library to obtain the similarity between the target grid cross-section and each sample in the sample library;
    确定所述相似度最高的样本特征变量集为与所述目标断面特征变量集匹配的样本特征变量集。It is determined that the sample feature variable set with the highest similarity is a sample feature variable set that matches the target cross-sectional feature variable set.
  7. 根据权利要求6所述的电网预想故障集预测方法,其特征在于,所述比对所述目标断面特征变量集与所述样本库中的样本特征变量 集,获取所述目标电网断面与所述样本库中各个样本之间的相似度,包括:The method for predicting a fault set of a power grid according to claim 6, wherein the comparison between the target cross-section feature variable set and the sample feature variable set in the sample library obtains the target power grid cross-section and the The similarity between samples in the sample library, including:
    归一化处理所述目标断面特征变量集与所述样本特征变量集中的每一个特征变量;Normalize each feature variable in the target cross-section feature variable set and the sample feature variable set;
    并基于k-邻近算法计算目标断面特征变量集与所述样本库中各个样本特征变量集之间的分类距离;And calculate the classification distance between the target cross-section feature variable set and each sample feature variable set in the sample library based on the k-proximity algorithm;
    对应的,所述确定所述相似度最高的样本特征变量集为与所述目标断面特征变量集匹配的样本特征变量集,包括:Correspondingly, the sample feature variable set that determines the highest similarity is the sample feature variable set matching the target cross-section feature variable set, including:
    确定所述分类距离最短的样本特征变量集为所述相似度最高的样本特征变量集。It is determined that the sample feature variable set with the shortest classification distance is the sample feature variable set with the highest similarity.
  8. 一种电网预想故障集预测装置,其特征在于,包括:A power grid prediction fault set prediction device, which is characterized by comprising:
    接收模块,用于获取目标电网断面的预设特征变量种类的目标断面特征变量集;The receiving module is used to obtain the target profile feature variable set of the preset feature variable type of the target grid profile;
    处理模块,用于在预设的样本库中确定出与所述目标断面特征变量集匹配的样本特征变量集,并确定所述匹配的样本特征变量集对应的样本故障集为所述目标电网断面的预测故障集;The processing module is configured to determine a sample feature variable set matching the target cross-section feature variable set in a preset sample library, and determine the sample fault set corresponding to the matched sample cross-section feature variable set as the target power grid cross-section Set of predicted failures;
    发送模块,用于发送所述预测故障集给工作人员。The sending module is used to send the predicted fault set to the staff.
  9. 一种电子设备,其特征在于,所述电子设备包括:An electronic device, characterized in that the electronic device includes:
    存储器,用于存储故障预测指令;及Memory for storing fault prediction instructions; and
    处理器,用于运行所述故障预测指令以执行如权利要求1-7任一项所述的电网预想故障集预测方法。A processor, configured to execute the fault prediction instruction to execute the power grid expected fault set prediction method according to any one of claims 1-7.
  10. 一种存储介质,其特征在于,所述存储介质中存储有计算机程序,当所述计算机程序在计算机上运行时,使得所述计算机执行如权利要求1-7中任意一项所述的电网预想故障集预测方法。A storage medium characterized in that a computer program is stored in the storage medium, and when the computer program is run on a computer, the computer is caused to execute the power grid prediction according to any one of claims 1-7 Fault set prediction method.
PCT/CN2019/110941 2018-12-20 2019-10-14 Power grid anticipated fault set prediction method and apparatus, and electronic device and storage medium WO2020125166A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811563729.4A CN109711450A (en) 2018-12-20 2018-12-20 A kind of power grid forecast failure collection prediction technique, device, electronic equipment and storage medium
CN201811563729.4 2018-12-20

Publications (1)

Publication Number Publication Date
WO2020125166A1 true WO2020125166A1 (en) 2020-06-25

Family

ID=66256975

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/110941 WO2020125166A1 (en) 2018-12-20 2019-10-14 Power grid anticipated fault set prediction method and apparatus, and electronic device and storage medium

Country Status (2)

Country Link
CN (1) CN109711450A (en)
WO (1) WO2020125166A1 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109711450A (en) * 2018-12-20 2019-05-03 北京科东电力控制系统有限责任公司 A kind of power grid forecast failure collection prediction technique, device, electronic equipment and storage medium
CN110175002B (en) * 2019-07-08 2020-06-26 广东电网有限责任公司佛山供电局 Real-time section saving method and device for power distribution network and computer medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106874963A (en) * 2017-03-17 2017-06-20 南京邮电大学 A kind of Fault Diagnosis Method for Distribution Networks and system based on big data technology
CN107085164A (en) * 2017-03-22 2017-08-22 清华大学 A kind of electric network fault type determines method and device
US20180300631A1 (en) * 2015-06-30 2018-10-18 Arizona Technology Enterprises Method and apparatus for large scale machine learning
CN108898249A (en) * 2018-06-28 2018-11-27 鹿寨知航科技信息服务有限公司 A kind of electric network fault prediction technique
CN108919059A (en) * 2018-08-23 2018-11-30 广东电网有限责任公司 A kind of electric network failure diagnosis method, apparatus, equipment and readable storage medium storing program for executing
CN109711450A (en) * 2018-12-20 2019-05-03 北京科东电力控制系统有限责任公司 A kind of power grid forecast failure collection prediction technique, device, electronic equipment and storage medium

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101425686B (en) * 2008-12-11 2010-09-15 国网电力科学研究院 Electrical power system on-line safety and stability evaluation forecast failure collection adaptive selection method
CN102324743B (en) * 2011-09-21 2015-07-08 国电南瑞科技股份有限公司 Screening method for expected fault in online transient safety and stability assessment of power system
CN102629305B (en) * 2012-03-06 2015-02-25 上海大学 Feature selection method facing to SNP (Single Nucleotide Polymorphism) data
CN102638041B (en) * 2012-04-17 2014-05-21 国电南瑞科技股份有限公司 Method forquickly checking safety and stability of scheduling operation of power system on line
CN103955609B (en) * 2014-04-25 2016-09-21 国家电网公司 A kind of forecast failure collection automatic generation method based on typhoon model
CN104268576A (en) * 2014-10-11 2015-01-07 国家电网公司 Electric system transient stability classification method based on TNN-SVM
CN105204489B (en) * 2015-07-17 2018-02-13 国电南瑞科技股份有限公司 Meter and protection control the online preventive control Application of risk decision method of cost with peace certainly
CN105184084B (en) * 2015-09-14 2018-05-01 深圳供电局有限公司 A kind of automatic power-measuring terminal fault type prediction method and system
CN106602548B (en) * 2016-11-25 2019-07-05 国电南瑞科技股份有限公司 A kind of identification method of transient state stable key transmission cross-section based on EEAC
CN106712097A (en) * 2017-02-13 2017-05-24 云南电网有限责任公司电力科学研究院 Transient stability risk assessment method of electric power system containing large-scale wind power station
CN108344812A (en) * 2017-11-24 2018-07-31 北京国网富达科技发展有限责任公司 A kind of Diagnosis Method of Transformer Faults, device and storage medium

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20180300631A1 (en) * 2015-06-30 2018-10-18 Arizona Technology Enterprises Method and apparatus for large scale machine learning
CN106874963A (en) * 2017-03-17 2017-06-20 南京邮电大学 A kind of Fault Diagnosis Method for Distribution Networks and system based on big data technology
CN107085164A (en) * 2017-03-22 2017-08-22 清华大学 A kind of electric network fault type determines method and device
CN108898249A (en) * 2018-06-28 2018-11-27 鹿寨知航科技信息服务有限公司 A kind of electric network fault prediction technique
CN108919059A (en) * 2018-08-23 2018-11-30 广东电网有限责任公司 A kind of electric network failure diagnosis method, apparatus, equipment and readable storage medium storing program for executing
CN109711450A (en) * 2018-12-20 2019-05-03 北京科东电力控制系统有限责任公司 A kind of power grid forecast failure collection prediction technique, device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN109711450A (en) 2019-05-03

Similar Documents

Publication Publication Date Title
WO2019165673A1 (en) Reimbursement form risk prediction method, apparatus, terminal device, and storage medium
CN113391209B (en) Method, device and system for predicting health state of battery and battery
WO2020125166A1 (en) Power grid anticipated fault set prediction method and apparatus, and electronic device and storage medium
WO2021098384A1 (en) Data abnormality detection method and apparatus
US11494690B2 (en) Apparatus and method of high dimensional data analysis in real-time
CN110135603B (en) Power network alarm space characteristic analysis method based on improved entropy weight method
CN112231193A (en) Time series data capacity prediction method, time series data capacity prediction device, electronic equipment and storage medium
US20220188665A1 (en) Gas-oil separation plant virtual water cut predictor based on supervised learning framework on time series data
CN111008299B (en) Quality evaluation method and device of voice database and computer storage medium
CN116457802A (en) Automatic real-time detection, prediction and prevention of rare faults in industrial systems using unlabeled sensor data
CN113242213B (en) Power communication backbone network node vulnerability diagnosis method
Vieira et al. An Enhanced Seasonal-Hybrid ESD technique for robust anomaly detection on time series
CN110770753B (en) Device and method for real-time analysis of high-dimensional data
CN112100024A (en) Resource load abnormity detection method, device and equipment
CN114338351B (en) Network anomaly root cause determination method and device, computer equipment and storage medium
CN116186603A (en) Abnormal user identification method and device, computer storage medium and electronic equipment
CN108446907B (en) Safety verification method and device
CN114039837B (en) Alarm data processing method, device, system, equipment and storage medium
CN109359197B (en) Tax type authentication method, device and computer readable storage medium
CN110458383B (en) Method and device for realizing demand processing servitization, computer equipment and storage medium
CN114168409A (en) Service system running state monitoring and early warning method and system
CN110955811B (en) Power data classification method and system based on naive Bayes algorithm
CN111680572B (en) Dynamic judgment method and system for power grid operation scene
CN108123436B (en) Voltage out-of-limit prediction model based on principal component analysis and multiple regression algorithm
Anuar et al. Reverse Migration Factor in Machine Learning Models

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19898748

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19898748

Country of ref document: EP

Kind code of ref document: A1