CN113902062A - Transformer area line loss abnormal reason analysis method and device based on big data - Google Patents

Transformer area line loss abnormal reason analysis method and device based on big data Download PDF

Info

Publication number
CN113902062A
CN113902062A CN202111514070.5A CN202111514070A CN113902062A CN 113902062 A CN113902062 A CN 113902062A CN 202111514070 A CN202111514070 A CN 202111514070A CN 113902062 A CN113902062 A CN 113902062A
Authority
CN
China
Prior art keywords
data
dependent variable
user
abnormal
variable
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202111514070.5A
Other languages
Chinese (zh)
Inventor
欧阳文华
常乐
王卫平
戚沁雅
安义
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Jiangxi Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Jiangxi Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Electric Power Research Institute of State Grid Jiangxi Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN202111514070.5A priority Critical patent/CN113902062A/en
Publication of CN113902062A publication Critical patent/CN113902062A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/243Classification techniques relating to the number of classes
    • G06F18/24323Tree-organised classifiers
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/20Administration of product repair or maintenance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Marketing (AREA)
  • Health & Medical Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Data Mining & Analysis (AREA)
  • Strategic Management (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Evolutionary Computation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Artificial Intelligence (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • General Engineering & Computer Science (AREA)
  • Evolutionary Biology (AREA)
  • Public Health (AREA)
  • Water Supply & Treatment (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Supply And Distribution Of Alternating Current (AREA)

Abstract

The invention discloses a method and a device for analyzing causes of abnormal line loss of a transformer area based on big data, wherein the method comprises the following steps: acquiring the full data of low-voltage users in a sample platform area; taking the type of the abnormal user in the low-voltage user total data as an independent variable, taking a data field in the low-voltage user total data as a dependent variable, and calculating a combined correlation coefficient of a certain dependent variable and a single independent variable based on a complex correlation-Pearson coefficient analysis method to obtain a line loss influence factor association relation; and inputting the acquired real-time low-voltage user total data of the transformer area into a preset high-loss reason classification model, outputting the abnormal user type, and identifying the line loss abnormal reason of the transformer area according to the abnormal user type. The transformer area high loss reason is analyzed by constructing a transformer area high loss reason analysis model, suspected abnormal users and abnormal user types under the high loss transformer area are identified, operation and maintenance personnel are assisted to solve the problem of low voltage transformer area high loss, and the management efficiency and the benefit of a company are improved.

Description

Transformer area line loss abnormal reason analysis method and device based on big data
Technical Field
The invention belongs to the technical field of analysis of line loss abnormity of a transformer area, and particularly relates to a method and a device for analyzing causes of line loss abnormity of the transformer area based on big data.
Background
The line loss of a low-voltage distribution area in the power system has important influence on the whole power system, the high line loss rate enables the economic efficiency of power supply enterprises to be reduced and serious waste is caused to energy, the line loss of the low-voltage distribution area also becomes an important index for examining the power supply enterprises, and operation and maintenance units put a large amount of people and properties into the management of the line loss every year, so that the line loss of the distribution area is well treated, the method has very important significance for realizing more supply and less loss of the power supply enterprises and improving the operating income of the power supply enterprises.
In recent years, collected data is increasingly refined, and related data of a low-voltage transformer area is gradually improved, but the problem of the low-voltage high-loss transformer area still cannot be solved for a long time due to various reasons. Firstly, the number of high-loss transformer areas is large, the number of low-voltage users is huge, and operation and maintenance personnel and technical level cannot meet the requirement of the huge number of investigation and management; secondly, the reasons causing high loss of the low-voltage transformer area are numerous, such as file problems, acquisition problems, metering device faults, electricity stealing and the like, so that the identification of the reasons of abnormal operation of users is difficult; in addition, after the current analysis and treatment method aiming at a single high-loss transformer area falls down, the realization of accurate positioning of the abnormal users is difficult. Therefore, a method for efficiently solving the problem of the low-voltage high-loss transformer area is urgently needed.
Disclosure of Invention
The invention provides a method and a device for analyzing causes of line loss abnormality of a transformer area based on big data, which are used for solving at least one of the technical problems.
In a first aspect, the present invention provides a method for analyzing a cause of a line loss anomaly in a distribution room based on big data, including: acquiring low-voltage user full data of a sample station area from a power system data center station; taking the type of the abnormal user in the low-voltage user total data as an independent variable, taking a data field in the low-voltage user total data as a dependent variable, and calculating a combined correlation coefficient of a certain dependent variable and a single independent variable based on a complex correlation-Pearson coefficient analysis method to enable the correlation relationship of line loss influence factors to be achieved, wherein the type of the abnormal user comprises a file abnormal user, a collection abnormal user, a measurement abnormal user and a power stealing abnormal user, and the data field comprises station area daily loss electric quantity, a collection success rate, voltage and current; calculating the expression of the combined correlation coefficient of the dependent variable and the single independent variable as follows:
Figure 134156DEST_PATH_IMAGE001
in the formula (I), wherein,
Figure 214107DEST_PATH_IMAGE002
is the combined correlation coefficient of a dependent variable and a single independent variable,
Figure 567728DEST_PATH_IMAGE003
is the complex correlation coefficient of a dependent variable and a corresponding independent variable set,
Figure 638625DEST_PATH_IMAGE004
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 165421DEST_PATH_IMAGE005
is the weight of a dependent variable and the corresponding independent variable set,
Figure 834300DEST_PATH_IMAGE007
the weight of a dependent variable and a corresponding single independent variable; inputting the acquired real-time low-voltage user total data of the distribution room into a preset high-loss reason classification model, outputting the type of the abnormal user, and performing abnormal operation according to the abnormal operationAnd identifying the abnormal line loss reasons of the transformer area by the user type, wherein the preset high-loss reason classification model comprises the line loss influence factor incidence relation.
In a second aspect, the present invention provides an apparatus for analyzing cause of line loss abnormality in a distribution room based on big data, including: the acquisition module is configured to acquire low-voltage user full data of a sample station area from a power system data center station; the calculation module is configured to take the type of the abnormal user in the low-voltage user total data as an independent variable, take a data field in the low-voltage user total data as a dependent variable, and calculate a combined correlation coefficient of a certain dependent variable and a single independent variable based on a complex correlation-Pearson coefficient analysis method so as to enable a line loss influence factor correlation relationship to be achieved, wherein the type of the abnormal user comprises a file abnormal user, a collected abnormal user, a measured abnormal user and a power stealing abnormal user, and the data field comprises station area daily loss electric quantity, a collected success rate, voltage and current; calculating the expression of the combined correlation coefficient of the dependent variable and the single independent variable as follows:
Figure 375134DEST_PATH_IMAGE001
in the formula (I), wherein,
Figure 457359DEST_PATH_IMAGE002
is the combined correlation coefficient of a dependent variable and a single independent variable,
Figure 725529DEST_PATH_IMAGE003
is the complex correlation coefficient of a dependent variable and a corresponding independent variable set,
Figure 763761DEST_PATH_IMAGE004
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 724764DEST_PATH_IMAGE005
is the weight of a dependent variable and the corresponding independent variable set,
Figure 28707DEST_PATH_IMAGE007
the weight of a dependent variable and a corresponding single independent variable; and the output module is configured to input the acquired real-time low-voltage user full data of the transformer area into a preset high-loss reason classification model, so as to output the abnormal user type, and identify the line loss abnormal reason of the transformer area according to the abnormal user type, wherein the preset high-loss reason classification model contains the line loss influence factor association relation.
In a third aspect, an electronic device is provided, comprising: the analysis method comprises at least one processor and a memory which is in communication connection with the at least one processor, wherein the memory stores instructions which can be executed by the at least one processor, and the instructions are executed by the at least one processor, so that the at least one processor can execute the steps of the analysis method for the line loss anomaly cause of the transformer area based on the big data according to any embodiment of the invention.
In a fourth aspect, the present invention further provides a computer-readable storage medium, on which a computer program is stored, the computer program including program instructions, which, when executed by a computer, cause the computer to execute the steps of the method for analyzing cause of line loss abnormality of a transformer area based on big data according to any one of the embodiments of the present invention.
According to the method and the device for analyzing the abnormal reason of the line loss of the transformer area based on the big data, the reason of the high loss of the transformer area is analyzed by constructing a transformer area high loss reason analysis model, suspected abnormal users and abnormal user types under the transformer area with high loss are identified, operation and maintenance personnel are assisted to solve the problem of the high loss of the low-voltage transformer area, and the management efficiency and the benefit of a company are improved. And single user, multi-user, many district users can be analyzed, the security is high, the operability is high, it is convenient to use.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the description of the embodiments are briefly introduced below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and it is obvious for those skilled in the art to obtain other drawings based on the drawings without creative efforts.
Fig. 1 is a flowchart of a method for analyzing causes of line loss abnormality of a distribution room based on big data according to an embodiment of the present invention;
fig. 2 is a flowchart of another method for analyzing causes of line loss abnormality of a distribution room based on big data according to an embodiment of the present invention;
fig. 3 is a block diagram of a device for analyzing a cause of line loss abnormality of a distribution room based on big data according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Please refer to fig. 1, which shows a flowchart of a method for analyzing a cause of line loss anomaly in a distribution room based on big data according to the present application.
As shown in fig. 1, in step S101, low-voltage user full data of a sample station area is acquired in a power system data center station;
in step S102, a transaction user type in the low-voltage user total data is used as an independent variable, a data field in the low-voltage user total data is used as a dependent variable, and a combined correlation coefficient of a dependent variable and a single independent variable is calculated based on a complex correlation-pearson coefficient analysis method, so as to obtain a line loss influence factor association relationship, wherein the transaction user type includes a file transaction user, a collection transaction user, a measurement transaction user, and a power stealing transaction user, and the data field includes a station area daily power loss, a collection success rate, a voltage, and a current;
calculating the expression of the combined correlation coefficient of the dependent variable and the single independent variable as follows:
Figure 585721DEST_PATH_IMAGE001
in the formula (I), the compound is shown in the specification,
Figure 760350DEST_PATH_IMAGE002
is the combined correlation coefficient of a dependent variable and a single independent variable,
Figure 423413DEST_PATH_IMAGE003
is the complex correlation coefficient of a dependent variable and a corresponding independent variable set,
Figure 932761DEST_PATH_IMAGE004
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 277154DEST_PATH_IMAGE005
is the weight of a dependent variable and the corresponding independent variable set,
Figure 306290DEST_PATH_IMAGE007
the weight of a dependent variable and a corresponding single independent variable;
in step S103, inputting the acquired real-time low-voltage user full data of the distribution room into a preset high-loss reason classification model, so as to output a transaction user type, and identifying a distribution room line loss abnormal reason according to the transaction user type, wherein the preset high-loss reason classification model includes the line loss influence factor association relationship.
In summary, the method includes the steps of obtaining source end system data from a data center station, wherein the source end system data comprises system data such as an SG186 system, a power utilization information acquisition system and an inspection system, extracting features, carrying out line loss influence factor relevance analysis on the type of the abnormal motion user and the data field to obtain a relevance result, inputting the relevance result into a preset station area high loss reason classification model to judge the type of the abnormal motion and identify the abnormal motion user, and finally giving a conclusion according to a model judgment result to enable operation and maintenance personnel to carry out targeted management work on the high loss station area, so that the operation and maintenance personnel are assisted to solve the problem of high loss of the low voltage station area, and the management efficiency and the benefit of a company are improved. And single user, multi-user, many district users can be analyzed, the security is high, the operability is high, it is convenient to use.
Please refer to fig. 2, which shows a flowchart of another big data based analysis method for cause of line loss anomaly in a distribution room.
As shown in fig. 2, the method for analyzing the cause of line loss abnormality of the transformer area based on big data specifically includes the following steps:
step 1, data acquisition of data center
The method is characterized in that the sample platform area and the low-voltage user full data information under the sample platform area are obtained through the power system data center platform, and the method is different from the existing analysis method, all data are obtained from the data center platform, so that the barrier between systems and between professions is broken, the data real-time performance is higher, and the accuracy is higher. The standing book and archive data are mainly obtained from a relational database of a data center station, and the measurement data are mainly obtained from an HDFS distributed file system and comprise 51 data fields.
Analyzing the type, dimensionality, mean value, variance and the like of the data fields, quantizing the data of the fields, and extracting frequency characteristics, logic characteristics, statistical characteristics and the like of each data field according to the processed abnormal users and normal users. And (4) combining the conditions of the abnormal users and the normal users, and eliminating fields which have no influence on the loss through data dimension reduction.
Step 2, analyzing relevance of influence factors
A complex correlation-Pearson coefficient analysis method is provided for analyzing the correlation of the line loss influence factors, and specifically comprises the following steps:
firstly, classifying users into abnormal users and normal users according to the obtained data samples, wherein the abnormal user types are independent variables, the independent variable characteristics comprise file problems, acquisition problems, metering problems, electricity stealing problems and the like, and 31 data fields of daily power consumption, acquisition success rate, voltage, current and the like of a transformer area are dependent variables;
and secondly, calculating the Pearson coefficients of the respective variables and each dependent variable, and obtaining a Pearson coefficient between each independent variable and each dependent variable, wherein the Pearson coefficient of a certain dependent variable and a corresponding single independent variable is calculated by the following expression:
Figure 890986DEST_PATH_IMAGE008
in the formula (I), the compound is shown in the specification,
Figure 638362DEST_PATH_IMAGE009
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 786447DEST_PATH_IMAGE010
as to the number of times the argument characteristic occurs,
Figure 716095DEST_PATH_IMAGE012
the number of times the dependent variable characteristic occurs,
Figure 189801DEST_PATH_IMAGE013
is composed of
Figure 893315DEST_PATH_IMAGE010
The average value of the samples of (a),
Figure 392561DEST_PATH_IMAGE014
is composed of
Figure 396289DEST_PATH_IMAGE012
The average value of the samples of (a),
Figure 775318DEST_PATH_IMAGE015
is composed of
Figure 966127DEST_PATH_IMAGE010
The standard deviation of (a) is determined,
Figure 767599DEST_PATH_IMAGE016
is composed of
Figure 625834DEST_PATH_IMAGE012
Standard deviation of (d);
thirdly, the change of one dependent variable is often influenced by the synthesis of a plurality of independent variables, so all independent variables are sequentially grouped with a single dependent variable, complex correlation coefficients of each group of independent variables and dependent variables are calculated, and each group obtains one complex correlation coefficient, wherein the expression for calculating the complex correlation coefficient of a certain dependent variable and a corresponding dependent variable group is as follows:
Figure 175764DEST_PATH_IMAGE017
Figure 870181DEST_PATH_IMAGE018
in the formula (I), the compound is shown in the specification,
Figure 960497DEST_PATH_IMAGE019
is the complex correlation coefficient of a dependent variable and the corresponding dependent variable group,
Figure 407659DEST_PATH_IMAGE020
the number of times the dependent variable characteristic occurs,
Figure 643337DEST_PATH_IMAGE021
to obtain a regression value by regression on the independent variables,
Figure 871056DEST_PATH_IMAGE022
is the mean value of the independent variables,
Figure 968325DEST_PATH_IMAGE023
is as follows
Figure DEST_PATH_IMAGE025
The parameters of the linear regression equation for each independent variable feature,
Figure 551884DEST_PATH_IMAGE026
is as follows
Figure DEST_PATH_IMAGE027
The number of occurrences of the individual independent variable features;
fourthly, weighting and combining the Pearson coefficient and the complex correlation coefficient, and calculating the combined correlation coefficient of a single independent variable and a dependent variable, wherein the weighting and combining formula is as follows:
Figure 286360DEST_PATH_IMAGE001
in the formula (I), the compound is shown in the specification,
Figure 939058DEST_PATH_IMAGE002
is the combined correlation coefficient of a dependent variable and a single independent variable,
Figure 371176DEST_PATH_IMAGE003
is the complex correlation coefficient of a dependent variable and a corresponding independent variable set,
Figure 809242DEST_PATH_IMAGE004
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 871876DEST_PATH_IMAGE005
is the weight of a dependent variable and the corresponding independent variable set,
Figure 11870DEST_PATH_IMAGE007
the weight of a dependent variable and a corresponding single independent variable;
and fifthly, obtaining the incidence relation of the line loss influence factors by combining the correlation coefficients.
Step 3, constructing a high-loss reason classification model
A high-loss reason classification model is constructed by utilizing iterative training of a machine learning weak classifier (decision tree) and is used for classifying types of abnormal users in a data set. After the model construction is completed, the correlation analysis conclusion of the line loss influence factors in the previous step is input into a classification model, and the following results can be obtained through model calculation: firstly, judging the type of the transaction (metering problem, acquisition problem, file problem and electricity stealing problem), and secondly, identifying the transaction user according to the type of the transaction user.
And selecting 30% of data in the region as a test set to test the model, wherein the test set comprises a transaction user and a normal user. And extracting field data of the test user to perform relevance analysis, inputting an analysis result into a high-loss reason classification model to perform result prediction to obtain abnormal suspected users, and calculating the recall ratio and precision ratio of the model.
Wherein: recall = suspected user number/actual transaction user, precision = actual transaction user/suspected user.
Step 4, auxiliary treatment of the system
According to the model calculation result, the system identifies the cause of the suspected abnormal user, the electricity stealing problem, the metering device problem, the collection problem, the file problem and the like are found out, the operation and maintenance personnel are sent to the site for verification and treatment according to the system troubleshooting reason, the operation and maintenance personnel do not perform blind carpet type troubleshooting on the high loss reason any more, meanwhile, the order dispatching accuracy is improved, and the work efficiency of basic teams and groups is improved.
In summary, the method of the present application can achieve the following technical effects:
1) the method and the system are high in data accuracy and strong in practicability, can effectively obtain the high loss reason, can quickly position problem users, and reduce time and manpower and material resources wasted in on-site carpet type troubleshooting.
2) The analysis recall ratio and precision ratio can be continuously improved through self-learning along with the continuous increase of the number of samples in the using process.
Fig. 3 is a block diagram illustrating a structure of an apparatus for analyzing cause of line loss abnormality of a distribution room based on big data according to the present application.
As shown in fig. 3, the apparatus 200 for analyzing the cause of the line loss abnormality in the transformer substation includes an obtaining module 210, a calculating module 220, and an output module 230.
The acquisition module 210 is configured to acquire low-voltage user full data of a sample station area in a power system data center station;
a calculating module 220, configured to use a type of a different user in the total data of the low-voltage users as an independent variable, use a data field in the total data of the low-voltage users as a dependent variable, and calculate a combined correlation coefficient of a dependent variable and a single independent variable based on a complex correlation-pearson coefficient analysis method, so as to obtain a line loss influence factor association relationship, where the type of the different user includes a profile different user, a collection different user, a measurement different user, and a power stealing different user, and the data field includes a station area daily power loss, a collection success rate, a voltage, and a current;
calculating the expression of the combined correlation coefficient of the dependent variable and the single independent variable as follows:
Figure 762527DEST_PATH_IMAGE001
in the formula (I), the compound is shown in the specification,
Figure 304366DEST_PATH_IMAGE002
is the combined correlation coefficient of a dependent variable and a single independent variable,
Figure 537902DEST_PATH_IMAGE003
is the complex correlation coefficient of a dependent variable and a corresponding independent variable set,
Figure 165192DEST_PATH_IMAGE004
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 689845DEST_PATH_IMAGE005
is the weight of a dependent variable and the corresponding independent variable set,
Figure 86192DEST_PATH_IMAGE028
the weight of a dependent variable and a corresponding single independent variable;
the output module 230 is configured to input the acquired real-time low-voltage user full data of the transformer area into a preset high-loss reason classification model, so as to output the abnormal user type, and identify the line loss abnormal reason of the transformer area according to the abnormal user type, wherein the preset high-loss reason classification model includes the line loss influence factor association relation.
It should be understood that the modules depicted in fig. 3 correspond to various steps in the method described with reference to fig. 1. Thus, the operations and features described above for the method and the corresponding technical effects are also applicable to the modules in fig. 3, and are not described again here.
In other embodiments, an embodiment of the present invention further provides a computer-readable storage medium, where computer-executable instructions are stored, and the computer-executable instructions may execute the method for analyzing the cause of the line loss abnormality of the distribution room based on the big data in any of the above method embodiments;
as one embodiment, the computer-readable storage medium of the present invention stores computer-executable instructions configured to:
acquiring low-voltage user full data of a sample station area from a power system data center station;
taking the type of the abnormal user in the low-voltage user total data as an independent variable, taking a data field in the low-voltage user total data as a dependent variable, and calculating a combined correlation coefficient of a certain dependent variable and a single independent variable based on a complex correlation-Pearson coefficient analysis method to obtain a line loss influence factor association relation;
and inputting the acquired real-time low-voltage user total data of the transformer area into a preset high-loss reason classification model, outputting the abnormal user type, and identifying the line loss abnormal reason of the transformer area according to the abnormal user type.
The computer-readable storage medium may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created from use of the station area line loss abnormality cause analysis device based on large data, and the like. Further, the computer-readable storage medium may include high speed random access memory, and may also include memory, such as at least one magnetic disk storage device, flash memory device, or other non-volatile solid state storage device. In some embodiments, the computer readable storage medium optionally includes a memory remotely disposed with respect to the processor, and the remote memory may be connected to the big data based station area line loss anomaly cause analyzing apparatus through a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
Fig. 4 is a schematic structural diagram of an electronic device according to an embodiment of the present invention, and as shown in fig. 4, the electronic device includes: a processor 310 and a memory 320. The electronic device may further include: an input device 330 and an output device 340. The processor 310, the memory 320, the input device 330, and the output device 340 may be connected by a bus or other means, such as the bus connection in fig. 4. The memory 320 is the computer-readable storage medium described above. The processor 310 executes various functional applications and data processing of the server by running the nonvolatile software program, instructions and modules stored in the memory 320, that is, the method for analyzing the cause of the line loss abnormality of the transformer area based on the big data in the above method embodiment is realized. The input device 330 may receive input numeric or character information and generate key signal inputs related to user settings and function control of the station area line loss abnormality cause analysis device based on big data. The output device 340 may include a display device such as a display screen.
The electronic device can execute the method provided by the embodiment of the invention, and has the corresponding functional modules and beneficial effects of the execution method. For technical details that are not described in detail in this embodiment, reference may be made to the method provided by the embodiment of the present invention.
As an embodiment, the electronic device is applied to a station area line loss anomaly cause analysis device based on big data, and is used for a client, and the electronic device includes: at least one processor; and a memory communicatively coupled to the at least one processor; wherein the memory stores instructions executable by the at least one processor to cause the at least one processor to:
acquiring low-voltage user full data of a sample station area from a power system data center station;
taking the type of the abnormal user in the low-voltage user total data as an independent variable, taking a data field in the low-voltage user total data as a dependent variable, and calculating a combined correlation coefficient of a certain dependent variable and a single independent variable based on a complex correlation-Pearson coefficient analysis method to obtain a line loss influence factor association relation;
and inputting the acquired real-time low-voltage user total data of the transformer area into a preset high-loss reason classification model, outputting the abnormal user type, and identifying the line loss abnormal reason of the transformer area according to the abnormal user type.
Through the above description of the embodiments, those skilled in the art will clearly understand that each embodiment can be implemented by software plus a necessary general hardware platform, and certainly can also be implemented by hardware. With this understanding in mind, the above-described technical solutions may be embodied in the form of a software product, which can be stored in a computer-readable storage medium, such as ROM/RAM, magnetic disk, optical disk, etc., and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to execute the methods of the various embodiments or some parts of the embodiments.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the present invention has been described in detail with reference to the foregoing embodiments, it will be understood by those of ordinary skill in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some technical features may be equivalently replaced; and such modifications or substitutions do not depart from the spirit and scope of the corresponding technical solutions of the embodiments of the present invention.

Claims (8)

1. A big data-based analysis method for line loss anomaly reasons of a transformer area is characterized by comprising the following steps:
acquiring low-voltage user full data of a sample station area from a power system data center station;
taking the type of the abnormal user in the low-voltage user total data as an independent variable, taking a data field in the low-voltage user total data as a dependent variable, and calculating a combined correlation coefficient of a certain dependent variable and a single independent variable based on a complex correlation-Pearson coefficient analysis method to enable the correlation relationship of line loss influence factors to be achieved, wherein the type of the abnormal user comprises a file abnormal user, a collection abnormal user, a measurement abnormal user and a power stealing abnormal user, and the data field comprises station area daily loss electric quantity, a collection success rate, voltage and current;
calculating the expression of the combined correlation coefficient of the dependent variable and the single independent variable as follows:
Figure 499796DEST_PATH_IMAGE001
in the formula (I), the compound is shown in the specification,
Figure 177902DEST_PATH_IMAGE002
is the combined correlation coefficient of a dependent variable and a single independent variable,
Figure 487791DEST_PATH_IMAGE003
is the complex correlation coefficient of a dependent variable and a corresponding independent variable set,
Figure 997270DEST_PATH_IMAGE004
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 452522DEST_PATH_IMAGE005
is the weight of a dependent variable and the corresponding independent variable set,
Figure 952946DEST_PATH_IMAGE006
the weight of a dependent variable and a corresponding single independent variable;
inputting the acquired real-time low-voltage user full data of the transformer area into a preset high-loss reason classification model, outputting a transaction user type, and identifying the line loss abnormal reason of the transformer area according to the transaction user type, wherein the preset high-loss reason classification model comprises the line loss influence factor association relation.
2. The big-data-based analysis method for the line loss anomaly cause of the distribution room according to claim 1, wherein after the full amount of data of the low-voltage users of the sample distribution room is acquired, the method comprises the following steps:
and performing data dimension reduction processing on the low-voltage user full data to remove fields which have no influence on the line loss in the low-voltage user full data.
3. The method for analyzing the cause of the line loss abnormality of the transformer area based on the big data as claimed in claim 1, wherein the expression for calculating the complex correlation coefficient between a dependent variable and a corresponding dependent variable group is as follows:
Figure 50215DEST_PATH_IMAGE007
Figure 617463DEST_PATH_IMAGE008
in the formula (I), the compound is shown in the specification,
Figure 56666DEST_PATH_IMAGE009
is the complex correlation coefficient of a dependent variable and the corresponding dependent variable group,
Figure 709364DEST_PATH_IMAGE010
the number of times the dependent variable characteristic occurs,
Figure 656329DEST_PATH_IMAGE011
to obtain a regression value by regression on the independent variables,
Figure 812504DEST_PATH_IMAGE012
are all independent variablesThe value of the one or more of the one,
Figure 937455DEST_PATH_IMAGE013
is as follows
Figure 562602DEST_PATH_IMAGE014
The parameters of the linear regression equation for each independent variable feature,
Figure 63991DEST_PATH_IMAGE015
is as follows
Figure 340251DEST_PATH_IMAGE014
The number of occurrences of each independent variable feature.
4. The big-data-based analysis method for the cause of line loss abnormality of the transformer area according to claim 1, wherein the expression for calculating the pearson coefficient of a dependent variable and a corresponding single independent variable is as follows:
Figure 88633DEST_PATH_IMAGE016
in the formula (I), the compound is shown in the specification,
Figure 512661DEST_PATH_IMAGE017
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 755424DEST_PATH_IMAGE018
as to the number of times the argument characteristic occurs,
Figure 636923DEST_PATH_IMAGE019
the number of times the dependent variable characteristic occurs,
Figure 103677DEST_PATH_IMAGE020
is composed of
Figure 218263DEST_PATH_IMAGE018
The average value of the samples of (a),
Figure 982826DEST_PATH_IMAGE021
is composed of
Figure 499258DEST_PATH_IMAGE019
The average value of the samples of (a),
Figure 136912DEST_PATH_IMAGE022
is composed of
Figure 223948DEST_PATH_IMAGE018
The standard deviation of (a) is determined,
Figure 605251DEST_PATH_IMAGE023
is composed of
Figure 976190DEST_PATH_IMAGE019
Standard deviation of (2).
5. The big-data-based distribution room line loss anomaly cause analysis method according to claim 1, wherein the preset high-loss cause classification model is a model constructed based on machine learning weak classifier iterative training.
6. The utility model provides a platform district line loss anomaly reason analytical equipment based on big data which characterized in that includes:
the acquisition module is configured to acquire low-voltage user full data of a sample station area from a power system data center station;
the calculation module is configured to take the type of the abnormal user in the low-voltage user total data as an independent variable, take a data field in the low-voltage user total data as a dependent variable, and calculate a combined correlation coefficient of a certain dependent variable and a single independent variable based on a complex correlation-Pearson coefficient analysis method so as to enable a line loss influence factor correlation relationship to be achieved, wherein the type of the abnormal user comprises a file abnormal user, a collected abnormal user, a measured abnormal user and a power stealing abnormal user, and the data field comprises station area daily loss electric quantity, a collected success rate, voltage and current;
calculating the expression of the combined correlation coefficient of the dependent variable and the single independent variable as follows:
Figure 971696DEST_PATH_IMAGE001
in the formula (I), the compound is shown in the specification,
Figure 592033DEST_PATH_IMAGE002
is the combined correlation coefficient of a dependent variable and a single independent variable,
Figure 980289DEST_PATH_IMAGE003
is the complex correlation coefficient of a dependent variable and a corresponding independent variable set,
Figure 690888DEST_PATH_IMAGE004
is the pearson coefficient of a dependent variable and the corresponding single independent variable,
Figure 670345DEST_PATH_IMAGE005
is the weight of a dependent variable and the corresponding independent variable set,
Figure 246820DEST_PATH_IMAGE024
the weight of a dependent variable and a corresponding single independent variable;
and the output module is configured to input the acquired real-time low-voltage user full data of the transformer area into a preset high-loss reason classification model, so as to output the abnormal user type, and identify the line loss abnormal reason of the transformer area according to the abnormal user type, wherein the preset high-loss reason classification model contains the line loss influence factor association relation.
7. An electronic device, comprising: at least one processor, and a memory communicatively coupled to the at least one processor, wherein the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any of claims 1 to 5.
8. A computer-readable storage medium, on which a computer program is stored, which, when being executed by a processor, carries out the method of any one of claims 1 to 5.
CN202111514070.5A 2021-12-13 2021-12-13 Transformer area line loss abnormal reason analysis method and device based on big data Pending CN113902062A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111514070.5A CN113902062A (en) 2021-12-13 2021-12-13 Transformer area line loss abnormal reason analysis method and device based on big data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111514070.5A CN113902062A (en) 2021-12-13 2021-12-13 Transformer area line loss abnormal reason analysis method and device based on big data

Publications (1)

Publication Number Publication Date
CN113902062A true CN113902062A (en) 2022-01-07

Family

ID=79026199

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111514070.5A Pending CN113902062A (en) 2021-12-13 2021-12-13 Transformer area line loss abnormal reason analysis method and device based on big data

Country Status (1)

Country Link
CN (1) CN113902062A (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115392648A (en) * 2022-08-03 2022-11-25 中国电力科学研究院有限公司 Transformer area line loss fusion diagnosis system and diagnosis method thereof
CN116703368A (en) * 2023-08-08 2023-09-05 国网信通亿力科技有限责任公司 Synchronous line loss intelligent closed-loop monitoring method based on data mining
CN117110795A (en) * 2023-10-18 2023-11-24 国网安徽省电力有限公司合肥供电公司 Transformer area line fault positioning system

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109034244A (en) * 2018-07-27 2018-12-18 国家电网有限公司 Line loss abnormality diagnostic method and device based on electric quantity curve characteristic model
CN111781463A (en) * 2020-06-25 2020-10-16 国网福建省电力有限公司 Auxiliary diagnosis method for abnormal line loss of transformer area
CN112284521A (en) * 2020-10-27 2021-01-29 西安西热节能技术有限公司 Quantification and application method of vibration fault characteristics of steam turbine generator unit
CN112598234A (en) * 2020-12-14 2021-04-02 广东电网有限责任公司广州供电局 Low-voltage transformer area line loss abnormity analysis method, device and equipment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109034244A (en) * 2018-07-27 2018-12-18 国家电网有限公司 Line loss abnormality diagnostic method and device based on electric quantity curve characteristic model
CN111781463A (en) * 2020-06-25 2020-10-16 国网福建省电力有限公司 Auxiliary diagnosis method for abnormal line loss of transformer area
CN112284521A (en) * 2020-10-27 2021-01-29 西安西热节能技术有限公司 Quantification and application method of vibration fault characteristics of steam turbine generator unit
CN112598234A (en) * 2020-12-14 2021-04-02 广东电网有限责任公司广州供电局 Low-voltage transformer area line loss abnormity analysis method, device and equipment

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115392648A (en) * 2022-08-03 2022-11-25 中国电力科学研究院有限公司 Transformer area line loss fusion diagnosis system and diagnosis method thereof
CN116703368A (en) * 2023-08-08 2023-09-05 国网信通亿力科技有限责任公司 Synchronous line loss intelligent closed-loop monitoring method based on data mining
CN116703368B (en) * 2023-08-08 2023-10-13 国网信通亿力科技有限责任公司 Synchronous line loss intelligent closed-loop monitoring method based on data mining
CN117110795A (en) * 2023-10-18 2023-11-24 国网安徽省电力有限公司合肥供电公司 Transformer area line fault positioning system
CN117110795B (en) * 2023-10-18 2024-01-30 国网安徽省电力有限公司合肥供电公司 Transformer area line fault positioning system

Similar Documents

Publication Publication Date Title
CN113902062A (en) Transformer area line loss abnormal reason analysis method and device based on big data
CN106909933A (en) A kind of stealing classification Forecasting Methodology of three stages various visual angles Fusion Features
CN109784388A (en) Stealing user identification method and device
CN104504508A (en) Area closed-loop data analysis method on basis of analytical hierarchy process and wavelet regression
CN113435759B (en) Primary equipment risk intelligent assessment method based on deep learning
CN113554361B (en) Comprehensive energy system data processing and calculating method and processing system
CN107832927A (en) 10kV circuits line based on gray relative analysis method becomes relation evaluation method
CN113342784A (en) Database design method for risk assessment of main transformer equipment of power grid
CN115293257A (en) Detection method and system for abnormal electricity utilization user
CN106874676B (en) State evaluation method for electric energy metering device
Du et al. Text similarity detection method of power customer service work order based on tfidf algorithm
CN111475548A (en) Power utilization abnormity analysis decision system based on big data mining technology
Lan et al. Unsupervised power system event detection and classification using unlabeled pmu data
CN107194529B (en) Power distribution network reliability economic benefit analysis method and device based on mining technology
CN113808071A (en) Non-invasive load monitoring method and system based on deep learning
CN115146715A (en) Power utilization potential safety hazard diagnosis method, device, equipment and storage medium
CN114066219A (en) Electricity stealing analysis method for intelligently identifying electricity utilization abnormal points under incidence matrix
CN114626433A (en) Fault prediction and classification method, device and system for intelligent electric energy meter
CN114118759A (en) Distribution transformer area load overload state assessment method and device
Li et al. Semi-supervised learning for fault identification in electricity distribution networks
CN112016631A (en) Improvement scheme related to low-voltage treatment
Dong et al. The research on user short-term electricity load forecasting for judging electric theft
Meng et al. New energy power prediction optimization based on improved TF-IDF single machine information feature extraction
Xuesong et al. The risk quantification evaluation strategy for the distribution line based on emlr
CN118194202A (en) Transverse federal-based electricity stealing identification algorithm and prototype system thereof

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20220107