CN111680074A - Clustering algorithm-based electric power collection load leakage point feature mining method - Google Patents

Clustering algorithm-based electric power collection load leakage point feature mining method Download PDF

Info

Publication number
CN111680074A
CN111680074A CN201911403417.1A CN201911403417A CN111680074A CN 111680074 A CN111680074 A CN 111680074A CN 201911403417 A CN201911403417 A CN 201911403417A CN 111680074 A CN111680074 A CN 111680074A
Authority
CN
China
Prior art keywords
point
leakage
points
data
missing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201911403417.1A
Other languages
Chinese (zh)
Other versions
CN111680074B (en
Inventor
程清
裴旭斌
方舟
郁春雷
杨杰
吴春华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
State Grid Zhejiang Electric Power Co Ltd
Zhejiang Huayun Information Technology Co Ltd
Information and Telecommunication Branch of State Grid Zhejiang Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
State Grid Zhejiang Electric Power Co Ltd
Zhejiang Huayun Information Technology Co Ltd
Information and Telecommunication Branch of State Grid Zhejiang Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, State Grid Zhejiang Electric Power Co Ltd, Zhejiang Huayun Information Technology Co Ltd, Information and Telecommunication Branch of State Grid Zhejiang Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN201911403417.1A priority Critical patent/CN111680074B/en
Publication of CN111680074A publication Critical patent/CN111680074A/en
Application granted granted Critical
Publication of CN111680074B publication Critical patent/CN111680074B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2465Query processing support for facilitating data mining operations in structured databases
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/26Visual data mining; Browsing structured data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Systems or methods specially adapted for specific business sectors, e.g. utilities or tourism
    • G06Q50/06Electricity, gas or water supply

Abstract

The invention discloses a clustering algorithm-based electric power acquisition load leakage point feature mining method, and relates to the field of electric power information acquisition. At present, in the process of acquiring power load data, the condition of unsuccessful data acquisition often occurs, user load data is lost at certain moments to form a leakage point phenomenon, the electric leakage acquisition accuracy in the existing system is not high, the acquisition success rate is relatively low, and the information of mastering the leakage point condition is incomplete. The method comprises the following steps: collecting load data; extracting missing point data; extracting a missing point characteristic index through the missing point data; performing clustering analysis on the missing point data indexes to form clustering class characteristics; and finally extracting the missing point characteristics of the user. Through analyzing the load collection missing point result data and the collection process behavior data, the user missing point condition can be deeply mastered, the equipment missing point characteristics can be timely found, and the collection success rate is improved.

Description

Clustering algorithm-based electric power collection load leakage point feature mining method
Technical Field
The invention relates to the field of electric power information acquisition, in particular to an electric power acquisition load leakage point characteristic mining method based on a clustering algorithm.
Background
In the electric power collection service, the collection of user load data is a basic work in electric power data collection, and has great significance for the collection of user electric quantity loads and other data. At present, in the process of acquiring power load data, the condition of unsuccessful data acquisition often occurs, and the phenomenon of missing of user load data at certain moments occurs to form a missing point phenomenon. The current leakage collecting accuracy in the existing system is not high, the collecting success rate is relatively low, and the information for mastering the leakage point condition is incomplete.
Disclosure of Invention
The technical problem to be solved and the technical task to be solved by the invention are to perfect and improve the prior technical scheme, and provide a clustering algorithm-based electric power acquisition load leakage point characteristic mining method, aiming at deeply mastering the leakage point condition and improving the success rate of leakage point acquisition. Therefore, the invention adopts the following technical scheme.
A clustering algorithm-based electric power collection load leakage point feature mining method comprises the following steps:
1) collecting load data from a collection system;
2) extracting corresponding metering point missing point data;
3) extracting a missing point characteristic index through the missing point data;
4) analyzing the data indexes of the missing points through a clustering algorithm to form clustering class characteristics of the metering points;
5) and extracting the missing point characteristics of each user through the clustering category characteristics of the metering points.
The device has the advantages that the device can deeply master the leakage point condition of the user by analyzing the load collection leakage point result data and the collection process behavior data, the collection success rate is improved, reference is provided for collection personnel, collection work is convenient to carry out, the auxiliary collection rate is improved, and timely normal operation of the device is guaranteed.
As a preferable technical means: in step 3), counting indexes of each metering point of the leakage point data, wherein the index content comprises a total number of the leakage points, a continuity characteristic of the leakage points, a week characteristic of the leakage points and a moment characteristic of the leakage points, the total number of the leakage points is used for counting the whole leakage point condition of the metering point, the continuity characteristic of the leakage points is used for describing the number of times of 2 or more continuous leakage points of the metering points and reflecting the serious condition of the leakage points, the week characteristic of the leakage points is used for describing the time law of the leakage points of the metering points in a week period, and the moment characteristic of the leakage points is used for describing the moment law of the leakage points of the metering points of the leakage points 24 hours a day. And the extraction of the characteristic indexes of the missing points is effectively realized.
As a preferable technical means: in step 4), passing through the distance function
Figure RE-638739DEST_PATH_IMAGE001
Wherein
Figure RE-730323DEST_PATH_IMAGE002
And representing each calculation index of any two metering points to calculate the distance of each metering point, and gathering the points with similar distances together to form each characteristic category. Effectively realizing the clustering class characteristics of the metering points.
Has the advantages that: the device has the advantages that the device can deeply master the leakage point condition of the user by analyzing the load collection leakage point result data and the collection process behavior data, the collection success rate is improved, reference is provided for collection personnel, collection work is convenient to carry out, the auxiliary collection rate is improved, and timely normal operation of the device is guaranteed.
Drawings
FIG. 1 is a schematic flow diagram of the present invention.
Detailed Description
The technical scheme of the invention is further explained in detail by combining the drawings in the specification.
As shown in fig. 1, a method for mining characteristics of a power collection load leakage point based on a clustering algorithm includes the following steps:
1) acquiring the collection load data of the past month of the specific transformer user from the current date from the collection system.
2) And extracting corresponding measuring point missing point data by matching the acquired measuring points, time and related load data with the complete measuring points and time.
3) The data indexes of the leakage point data are summarized to be used as input of analysis and mining, each index of each metering point in the past month is counted, the index content comprises total leakage points, continuity characteristics of the leakage points, week characteristics of the leakage points and time characteristics of the leakage points, the total leakage points are used for counting the whole leakage point condition of the metering points, the continuity characteristics of the leakage points are used for describing 2 times or more of continuous leakage points of the metering points and reflecting the serious condition of the leakage points, the week characteristics of the leakage points are used for describing the time law of the leakage points in one week period and are totally divided into 7 leakage point indexes, the time characteristics of the leakage points are used for describing the time law of the leakage points 24 hours a day, and are totally divided into 24 leakage point indexes, and the table is specifically shown in the following table.
Figure RE-550512DEST_PATH_IMAGE003
4) Analyzing the data index of the leakage point by a clustering algorithm, counting each calculation index of the metering point, and then utilizing the clustering calculationThe method carries out clustering mining analysis on the indexes to form clustering classification characteristics of the metering points through a distance function
Figure RE-RE-840679DEST_PATH_IMAGE004
Wherein
Figure RE-RE-314386DEST_PATH_IMAGE005
And representing each calculation index of any two metering points to calculate the distance of each metering point, and gathering the points with similar distances together to form each characteristic category. Each category is specific to the calculation index, and for example, a certain category may be characterized by continuous missing points in the past month, a missing point occupation ratio of wednesday, and a missing point occupation ratio of 17 points.
5) And extracting the missing point characteristics of each user through the clustering category characteristics of the metering points, wherein for example, if a certain category characteristic is continuous missing points in the past month, the missing point occupancy ratio of the wednesday and the missing point occupancy ratio of 17 points, the user missing point characteristics in the category can be labeled as serious continuous missing points in the month, frequent missing points in the wednesday and frequent missing point moments in the 17 hours.
In this example, the clustering algorithm is an analysis process that groups a set of physical or abstract objects into a plurality of classes composed of similar objects, collects results of the load leakage points, and can classify the leakage point objects into different classes according to the similarity of the leakage point characteristics through clustering analysis, extract the leakage point information, and visually express the load leakage point characteristics. The distribution rule of load leakage points is researched currently, the quantity and the continuity of object leakage points are analyzed from the existing leakage point data, the distribution rule is distributed along with time, and the modeling is carried out on the leakage point condition according to different information by using a clustering algorithm.
The method for mining the characteristics of the power collection load leakage point based on the clustering algorithm shown in fig. 1 is a specific embodiment of the invention, has shown the outstanding substantive features and remarkable progress of the invention, and can modify the same in shape, structure and the like according to the practical use requirements and under the teaching of the invention, and the method is in the scope of protection of the scheme.

Claims (5)

1. A clustering algorithm-based electric power collection load leakage point feature mining method is characterized by comprising the following steps:
1) collecting load data from a collection system;
2) extracting corresponding metering point missing point data;
3) extracting a missing point characteristic index through the missing point data;
4) analyzing the data indexes of the missing points through a clustering algorithm to form clustering class characteristics of the metering points;
5) and extracting the missing point characteristics of each user through the clustering category characteristics of the metering points.
2. The clustering algorithm-based power collection load leakage point feature mining method according to claim 1, characterized in that: in the step 1), the time period for acquiring the acquired load data from the acquisition system is at least one month backward from the current date.
3. The clustering algorithm-based power collection load leakage point feature mining method according to claim 1, characterized in that: in the step 2), corresponding measurement point missing point data is extracted from the measurement point missing point data by matching the acquired measurement point, time and related load data with the complete measurement point and time.
4. The clustering algorithm-based power collection load leakage point feature mining method according to claim 1, characterized in that: in step 3), counting indexes of each metering point of the leakage point data, wherein the index content comprises a total number of the leakage points, a continuity characteristic of the leakage points, a week characteristic of the leakage points and a moment characteristic of the leakage points, the total number of the leakage points is used for counting the whole leakage point condition of the metering point, the continuity characteristic of the leakage points is used for describing the number of times of 2 or more continuous leakage points of the metering points and reflecting the serious condition of the leakage points, the week characteristic of the leakage points is used for describing the time law of the leakage points of the metering points in a week period, and the moment characteristic of the leakage points is used for describing the moment law of the leakage points of the metering points of the leakage points 24 hours a day.
5. The clustering algorithm-based power collection load leakage point feature mining method according to claim 1, characterized in that: in step 4), passing through the distance function
Figure DEST_PATH_IMAGE002
To calculate the distance of each metering point, and to group the points with similar distance together to form each characteristic class, wherein (A)
Figure DEST_PATH_IMAGE004
) Each calculation index representing any two measurement points.
CN201911403417.1A 2019-12-31 2019-12-31 Clustering algorithm-based power acquisition load leakage point feature mining method Active CN111680074B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201911403417.1A CN111680074B (en) 2019-12-31 2019-12-31 Clustering algorithm-based power acquisition load leakage point feature mining method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201911403417.1A CN111680074B (en) 2019-12-31 2019-12-31 Clustering algorithm-based power acquisition load leakage point feature mining method

Publications (2)

Publication Number Publication Date
CN111680074A true CN111680074A (en) 2020-09-18
CN111680074B CN111680074B (en) 2023-07-04

Family

ID=72451247

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201911403417.1A Active CN111680074B (en) 2019-12-31 2019-12-31 Clustering algorithm-based power acquisition load leakage point feature mining method

Country Status (1)

Country Link
CN (1) CN111680074B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116756528A (en) * 2023-08-18 2023-09-15 杭州鸿晟电力设计咨询有限公司 Method, device, equipment and medium for complementing electricity load data

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102651116A (en) * 2012-03-31 2012-08-29 上海市电力公司 Power load data refining method
CN106447206A (en) * 2016-10-09 2017-02-22 国网浙江省电力公司信息通信分公司 Power utilization analysis method based on acquisition data of power utilization information
CN108681973A (en) * 2018-05-14 2018-10-19 广州供电局有限公司 Sorting technique, device, computer equipment and the storage medium of power consumer
CN108761196A (en) * 2018-03-30 2018-11-06 国家电网公司 A kind of intelligent electric meter user missing voltage data restorative procedure
JP2019054715A (en) * 2017-09-15 2019-04-04 東京電力ホールディングス株式会社 Power theft monitoring system, power theft monitoring device, power theft monitoring method and program
CN110022226A (en) * 2019-01-04 2019-07-16 国网浙江省电力有限公司 A kind of data collection system and acquisition method based on object-oriented
CN110188919A (en) * 2019-04-22 2019-08-30 武汉大学 A kind of load forecasting method based on shot and long term memory network

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102651116A (en) * 2012-03-31 2012-08-29 上海市电力公司 Power load data refining method
CN106447206A (en) * 2016-10-09 2017-02-22 国网浙江省电力公司信息通信分公司 Power utilization analysis method based on acquisition data of power utilization information
JP2019054715A (en) * 2017-09-15 2019-04-04 東京電力ホールディングス株式会社 Power theft monitoring system, power theft monitoring device, power theft monitoring method and program
CN108761196A (en) * 2018-03-30 2018-11-06 国家电网公司 A kind of intelligent electric meter user missing voltage data restorative procedure
CN108681973A (en) * 2018-05-14 2018-10-19 广州供电局有限公司 Sorting technique, device, computer equipment and the storage medium of power consumer
CN110022226A (en) * 2019-01-04 2019-07-16 国网浙江省电力有限公司 A kind of data collection system and acquisition method based on object-oriented
CN110188919A (en) * 2019-04-22 2019-08-30 武汉大学 A kind of load forecasting method based on shot and long term memory network

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
CHENG Q 等: "Abnormal electricity consumption detection based on ensemble learning" *
罗清雷 等: "基于增长模型的电力设备缺失数据筛查算法研究" *
裴旭斌 等: "于蚁群算法改进One-Class SVM的电力离群用户检测算法研究" *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116756528A (en) * 2023-08-18 2023-09-15 杭州鸿晟电力设计咨询有限公司 Method, device, equipment and medium for complementing electricity load data
CN116756528B (en) * 2023-08-18 2023-11-28 杭州鸿晟电力设计咨询有限公司 Method, device, equipment and medium for complementing electricity load data

Also Published As

Publication number Publication date
CN111680074B (en) 2023-07-04

Similar Documents

Publication Publication Date Title
CN104408547A (en) Data-mining-based detection method for medical insurance fraud behavior
CN109145097A (en) A kind of judgement document's classification method based on information extraction
CN102426590B (en) Quality evaluation method and device
CN103823896A (en) Subject characteristic value algorithm and subject characteristic value algorithm-based project evaluation expert recommendation algorithm
CN103605651A (en) Data processing showing method based on on-line analytical processing (OLAP) multi-dimensional analysis
CN107958043A (en) A kind of electricity power engineering budget inventory automatic generation method
CN103164537B (en) A kind of method of search engine logs data mining of user oriented information requirement
CN108021683A (en) A kind of scale model retrieval implementation method based on three-dimensional labeling
CN104992297A (en) Electricity fee collection risk assessment device based on big data platform clustering algorithm and method thereof
CN109033322A (en) A kind of test method and device of multidimensional data
CN105574265B (en) Entire assembly model quantitative description towards model index
CN106599138A (en) Variety identification method for electrical appliances
CN111427725A (en) Big data storage system and method
CN103955596A (en) Accident hotspot comprehensive judging method based on traffic accident collection technology
CN111680074A (en) Clustering algorithm-based electric power collection load leakage point feature mining method
CN107766983B (en) Method for setting emergency rescue parking point of urban rail transit station
CN104166756A (en) Computation method for mass distribution of aircraft
CN103700027A (en) Method for achieving multi-dimensional bidirectional free-matching visual display of electric power data and graphics
CN107844962A (en) A kind of distribution Construction Cost Data based on standard data structure collects system
CN104598713A (en) Power grid theoretical line loss computation demand data synthesizing method
CN105445577A (en) Power quality disturbance source working condition identifying method
CN102737254B (en) Identification method of mark image
CN107705194A (en) Electricity power engineering project category Cost Comparison and Analysis method based on data analysis technique
CN115587314A (en) Platform region portrait drawing method based on cluster analysis method
CN104200338A (en) Line loss statistics and decision analysis system

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant