CN104850629A - Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm - Google Patents

Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm Download PDF

Info

Publication number
CN104850629A
CN104850629A CN201510263237.3A CN201510263237A CN104850629A CN 104850629 A CN104850629 A CN 104850629A CN 201510263237 A CN201510263237 A CN 201510263237A CN 104850629 A CN104850629 A CN 104850629A
Authority
CN
China
Prior art keywords
data
means algorithm
cluster
intelligent power
magnanimity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510263237.3A
Other languages
Chinese (zh)
Inventor
周天和
卢晓飞
张元元
蔡荣
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
HANGZHOU TIANKUAN TECHNOLOGY Co Ltd
Original Assignee
HANGZHOU TIANKUAN TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by HANGZHOU TIANKUAN TECHNOLOGY Co Ltd filed Critical HANGZHOU TIANKUAN TECHNOLOGY Co Ltd
Priority to CN201510263237.3A priority Critical patent/CN104850629A/en
Publication of CN104850629A publication Critical patent/CN104850629A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/35Clustering; Classification
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention relates to an analysis method of massive intelligent electricity-consumption data based on an improved k-means algorithm. The analysis method comprises: firstly, establishing a Map-Reduce parallel processing model of the data of electricity-consumption information of residential consumers, such as the number of residential consumers, house areas, the number of family members, daily electricity consumption, the peak of electricity consumption, and the number of household electrical appliances; using the k-means algorithm to comprehensively consider two factors of the selection of an initial clustering center and the selection of the number of clusters, to improve the improved k-means algorithm, wherein the density of data objects is used as a selection standard of the initial clustering center, and a distance between clusters and the dispersing degree of objects in the clusters are used as the important references to select the number of clusters; optimizing the initial clustering center under the Map-Reduce parallel processing model to accurately position the clustering center; performing parallel mining on the data which belongs to each cluster to complete the analysis of electricity-consumption data. Through experiments on Hadoop clusters, results prove that the method provided by the present invention is stable, efficient and feasible to operate, and has a high speed-up ratio.

Description

A kind of magnanimity intelligent power data analysing method based on improving k-means algorithm
Technical field
The present invention relates to intelligent power grid technology field, particularly relating to a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm.
Background technology
In recent years, in the face of electricity needs and the day by day nervous electric power supply of the strong growth of intelligent power field, and the energy-saving and emission-reduction task improved constantly, the intelligent grid turning to basic technical features with informationization, robotization, interaction becomes worldwide study hotspot.In China, the scope that power supply enterprise's power information collection covers specially becomes user from only covering important special line, expand to cover all kinds of special line gradually and specially become the multiple electric field such as user, general industrial and commercial producer, low pressure resident, the scale accessing all kinds of acquisition terminal and table meter also increases thereupon, wants the electricity consumption data volume exponentially level growth of sampling and processing every day.Relevance is there is between data, and in data, hide the electricity consumption behavioural habits of user, explore effective data mining algorithm, user is segmented, for the user of different electricity consumption classification, excavate a large amount of valuable information such as electricity consumption behavior, electricity consumption quickly and accurately, that instructs in the work such as user's reducing energy consumption is passive and delayed, support intelligent business diagnosis and decision-making, become the problem that intelligent power field must solve.Cluster analysis, as a kind of data mining algorithm be widely used, can obtain the distribution characteristics of data in global scope with higher treatment effeciency, and progressively be applied to intelligent power field.But there is the bottleneck that operand is large, treatment effeciency is low in the face of magnanimity intelligent power data in large scale in traditional clustering method, can not the efficient excavation demand of satisfying magnanimity intelligent power data.
Summary of the invention
The present invention overcomes above-mentioned weak point, object is to provide a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm, Map-Reduce parallel process model is set up for domestic consumer, using the size of object densities, bunch spacing, bunch in the degree of scatter of object as important references, choose reasonable meets cluster number k and the initial cluster center of the electricity consumption data of P mining demand, under Map-Reduce model, carry out P mining analysis to magnanimity domestic consumer electricity consumption data.This method is efficient, feasible, and has good speed-up ratio, can excavate the potential valuable information of magnanimity intelligent power data accurately and efficiently, for the formulation of optimum electricity consumption strategy provides favourable guidance.
The present invention achieves the above object by the following technical programs: a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm, comprises the steps:
(1) Map-Reduce parallel process model is set up to domestic consumer, and with Map-Reduce parallel process model for data object creates magnanimity intelligent power data analysis framework;
(2) utilize and improve k-means algorithm, based on the size of object densities, bunch spacing, bunch in the degree of scatter of object select initial cluster center and cluster number k;
(3) under Map-Reduce parallel process model, initial cluster center is optimized, accurately locates cluster centre;
(4) to each cluster bunch belonging to data carry out P mining, complete electricity consumption data analysis.
As preferably, the Map-Reduce parallel process model of described step (1) comprise the domestic consumer number of domestic consumer's power information, floor space, kinsfolk's number, every day power consumption, peak-valley electric energy, household electrical appliance number six kinds of data.
As preferably, the magnanimity intelligent power data analysis framework of described step (1) adopts master/slave architecture mode, comprises data source, cloud computing main control server and cloud computing from server; Cloud computing main control server receives the magnanimity point data of data source and defines by the dimension of point data, and cloud computing carries out Parallel Algorithm for Mining calculating from server to the electricity consumption data after the definition of cloud computing main control server.
As preferably, the improvement k-means algorithm of described step (2) is the clustering algorithm belonging to division methods, adopt Euclidean distance as the evaluation index of 2 sample similarity degrees, k point of random selecting data centralization is as initial cluster center, each sample according to data centralization carries out cluster to the distance size of k initial cluster center, calculate all sample means be grouped in each cluster, upgrade each initial cluster center, until square error criterion function is stabilized in minimum value.
As preferably, the method for the selection initial cluster center of described step (2) is as follows:
1) according to the distance d (x between 2 objects in following formula calculating object set M i, x j);
d(x i,x j)=[(x i1-x j1) 2+(x i2-x j2) 2+…+(x in-x jn) 2]
2) according to the mean distance MeanDis (M) between all objects in following formula calculating object set M;
MeanDis ( M ) = 1 n ( n - 1 ) Σd ( x i , x j )
3) according to following formula calculating object x idensity D en (x i);
Den ( x i ) = Σ j = 1 n u ( MeanDis ( M ) - d ( x i , x j ) )
Wherein when x >=0, u (x)=1, otherwise u (x)=0;
4) density collection D={Den (x is obtained from above 1), Den (x 2) ..., Den (x n), elect object maximum for density set D Midst density as the 1st initial cluster center O 1, the object selecting density second largest is as the 2nd primary election cluster centre O 2, the rest may be inferred, will meet as shown in the formula subconditional object y ias a kth cluster centre, until reach predetermined cluster numbers;
max(min(d(y i,O 1)),…,min(d(y i,O n-1)))。
As preferably, described step (2) utilizes to improve the minimum δ of k-means Algorithm for Solving to determine cluster number k, and computing formula is as follows:
Beneficial effect of the present invention is: (1) is simple based on improvement k-means Algorithm Analysis, fast convergence rate, improves the treatment effeciency of magnanimity electricity consumption data, shortens computing time; (2) this method has good speed-up ratio, can excavate the potential valuable information of magnanimity intelligent power data accurately and efficiently, for the formulation of optimum electricity consumption strategy provides favourable guidance.
Accompanying drawing explanation
Fig. 1 is the magnanimity intelligent power data analysis configuration diagram based on cloud computing;
Fig. 2 is the k-means Parallel Algorithms for Data Mining FB(flow block) improved.
Embodiment
Below in conjunction with specific embodiment, the present invention is described further, but protection scope of the present invention is not limited in this:
Embodiment: a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm, comprises the steps:
(1) first to domestic consumer set up comprise domestic consumer's power information domestic consumer number, floor space, kinsfolk's number, every day power consumption, peak-valley electric energy, household electrical appliance number etc. the Map-Reduce parallel process model of data, and with Map-Reduce parallel process model for data object creates magnanimity intelligent power data analysis framework;
Magnanimity intelligent power data analysis framework as shown in Figure 1, adopts master/slave architecture mode to realize the storage of power consumer magnanimity intelligent power data, and realizes the mining analysis of the potential valuable information of magnanimity intelligent power data based on Parallel Algorithm for Mining.Main control server receives the magnanimity electricity consumption data of the peripheral systems such as power information acquisition system, sales service application system, intelligent residents ' area management system and enterprise energy management system, the dimension that electricity consumption data comprise is defined, by valuable dimension combination composition data dimensional model, electricity consumption Data distribution8 stores to corresponding cloud computing from the data memory module server according to data dimension model by data management module, and records and be stored in from information such as the data block server and memory locations.When there being mass data analysis request, main control server receives request, and analyzes request, according to actual conditions, from data mining model storehouse, select one or more to be suitable for the Parallel Algorithm for Mining of this request, the task matching after dimension decomposition is given from server; Each from server according to being assigned to of task, data memory module and task execution module cooperatively interact, the Parallel Algorithm for Mining using main control server to select performs electricity consumption data mining task, realizes the analysis of magnanimity electricity consumption data, obtains valuable information fast and efficiently.
(2) utilize and improve k-means algorithm, based on the size of object densities, bunch spacing, bunch in the degree of scatter of object select initial cluster center and cluster number k;
Improving k-means algorithm is a kind of clustering algorithm belonging to division methods, usual employing Euclidean distance is as the evaluation index of 2 sample similarity degrees, its basic thought is: k point of random selecting data centralization is as initial cluster center, each sample according to data centralization carries out cluster to the distance size of k initial cluster center, calculate all sample means be grouped in each cluster, upgrade each initial cluster center, until square error criterion function is stabilized in minimum value.
Object set M={x 1, x 2..., x n, x i=(x i1, x i2..., x it), sample x iwith sample x jeuclidean distance computing formula as follows:
d(x i,x j)=[(x i1-x j1) 2+(x i2-x j2) 2+…+(x in-x jn) 2] (1)
Average criteria error function is as follows:
I C = Σ i = 1 k Σ j = 1 t i | | x i - n i | | 2 - - - ( 2 )
In formula: k is the number wanting cluster; t iit is the number of sample in the i-th class; n iit is the average of sample in the i-th class.
The k-means algorithm improved is simple, fast convergence rate, easily extensible and efficiency is high, but this algorithm exists cluster number determines difficulty, and initial cluster center is chosen the inaccurate cluster result that causes and is easily absorbed in the defects such as locally optimal solution.Therefore, the present invention is directed to this defect, based on Map-Reduce parallel process model, propose the mass data Parallel Algorithm for Mining improving k-means algorithm, realize the analysis of domestic consumer's mass data for target computing time with the execution efficiency, the shortening that improve algorithm.
The selection of (a) k-means initial cluster center
Euclidean distance between data object is less, and its similarity is larger.If the data object in region residing for a certain data object of data centralization is more, other object in region is less to the distance of this object, illustrate that the packing density of this object is larger, data distribution characteristics can be reflected preferably, more be conducive to the convergence of square error criterion function as cluster centre, and the randomness selected due to initial cluster center can be avoided to bring result to be absorbed in the deadlock of local optimum.
Distance d (the x between 2 objects is calculated according to formula (1) i, x j).
Mean distance MeanDis (M) according to the following formula in calculating object set M between all objects.
MeanDis ( M ) = 1 n ( n - 1 ) Σd ( x i , x j ) - - - ( 3 )
Calculating object x according to the following formula idensity D en (x i).
Den ( x i ) = Σ j = 1 n u ( MeanDis ( M ) - d ( x i , x j ) ) - - - ( 4 )
Wherein when x >=0, u (x)=1, otherwise u (x)=0.
Density set D={Den (x 1), Den (x 2) ..., Den (x n).
Elect object maximum for density set D Midst density as the 1st initial cluster center O 1, the object selecting density second largest is as the 2nd primary election cluster centre O 2, the rest may be inferred, will meet max (min (d (y i, O 1)) ..., min (d (y i, O n-1))) the object y of condition i(y iaccording to the descending selection of the density of object, each n that selects is individual) as a kth cluster centre, until reach predetermined cluster numbers.
Choosing of (b) cluster number
In bunch, object degree of scatter is less, and the distance between bunch is larger, and Clustering Effect is better.The number of preliminary supposition bunch is k, according to the center of (a) method choice each bunch, using the mean distance between data object in the i-th bunch as the degree of scatter of data object, uses p irepresent, represent the distance between the i-th bunch and individual bunch of jth, reset the value of k, repeat (a), select the k making following formula value minimum as best cluster number.
(3) under Map-Reduce parallel process model, initial cluster center is optimized, accurately locates cluster centre;
(4) to each cluster bunch belonging to data carry out P mining, complete electricity consumption data analysis.
In the present embodiment, as shown in Figure 2, excavate based on the domestic consumer's electricity consumption data parallel improving k-means algorithm and be divided into nine steps, particularly:
1. power information data set is stored in distributed file system with row form, makes pending electricity consumption data set can form electricity consumption data subset by burst by row.Task management determines cluster number k and an initialization k cluster centre according to this section to improving one's methods of k-means algorithm, is sent to the node of m pending Map task.
2. the magnanimity intelligent power data subset in distributed file system is formatd, produce <key 1, value 1> key-value pair, is specifically formatted as <UserID, info>, and UserID represents domestic consumer ID here, and info is domestic consumer's information, comprises floor space, kinsfolk's number, power consumption etc.
3. each record <UserID, the info> of domestic consumer electricity consumption data subset of Map function to input scans, and calculates the Euclidean distance of itself and k central point respectively, the central point kmin that recording distance is minimum.Map function generates and exports intermediate result <key 2, value 2> couple, is defined as <kmin here, info 1> couple, kmin represent the mark of affiliated bunch of this user, info 1comprise the information such as domestic consumer ID, floor space, kinsfolk's number, power consumption.
4. intermediate result is carried out Hash according to kmin by partition functions Partitioner, is divided into the individual different subregion of r, each subregion is assigned to the Reduce function of specifying.
5. the node being assigned with Reduce task reads corresponding intermediate result <kmin, list< (info from m Map task 1<userID 1... >), (info 1<userID 2>), >> also sorts to data according to kmin, make to have the data gathering of identical kmin together, the intermediate data after node traverses sequence, by <kmin, list> passes to Reduce function, then Reduce function is according to list value, calculates the mean vector with identical kmin data, upgrades the central point of corresponding bunch of kmin.
6. repeat 2.-5., until square error criterion function is stabilized in minimum value, export the data of k bunch respectively.
7. for the data set of the i-th bunch, re-start format, produce key-value to <EC, list 1>, wherein EC is user power utilization amount, list 1for other information of user.Dividing data integrates as data subset, transfers to m the node processing performing Map task.
8. the data subset of Map function to input scans, and generates intermediate result <EC, list 1>, the intermediate result that each Map task produces will be divided into r subregion p according to partition functions Partitioner by EC value 1, p 2..., p r, Map task can respectively to each subregion p simultaneously i(1≤i≤r) sorts by EC value, and r Reduce task obtains corresponding intermediate result respectively, and then carries out the data processing in Reduce stage.For ensureing that final Output rusults is by EC value global orderly, the cryptographic hash that Partitioner no longer presses EC divides intermediate result, but the scope of employing division principle, make the intermediate result [EC in each Reduce task process EC segment i, EC j].
9. after each Reduce task reads the intermediate result of respective partition from m Map task, Map-Reduce internal mechanism carries out merger (merge) operation to these data, thus make the output of Reduce task be that the data set of the i-th bunch is by one section of continuous print data subset after power consumption sequence, the output of all Reduce tasks is merged according to the order of sequence, obtains the ordered data collection by power consumption sequence in the i-th bunch.
The know-why being specific embodiments of the invention and using described in above, if the change done according to conception of the present invention, its function produced do not exceed that instructions and accompanying drawing contain yet spiritual time, must protection scope of the present invention be belonged to.

Claims (6)

1. the magnanimity intelligent power data analysing method based on improvement k-means algorithm, it is characterized in that comprising the steps: that (1) sets up Map-Reduce parallel process model to domestic consumer, and with Map-Reduce parallel process model for data object creates magnanimity intelligent power data analysis framework;
(2) utilize and improve k-means algorithm, based on the size of object densities, bunch spacing, bunch in the degree of scatter of object select initial cluster center and cluster number k;
(3) under Map-Reduce parallel process model, initial cluster center is optimized, accurately locates cluster centre;
(4) to each cluster bunch belonging to data carry out P mining, complete electricity consumption data analysis.
2. a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm according to claim 1, is characterized in that: the Map-Reduce parallel process model of described step (1) comprise the domestic consumer number of domestic consumer's power information, floor space, kinsfolk's number, every day power consumption, peak-valley electric energy, household electrical appliance number six kinds of data.
3. a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm according to claim 1, it is characterized in that: the magnanimity intelligent power data analysis framework of described step (1) adopts master/slave architecture mode, comprises data source, cloud computing main control server and cloud computing from server; Cloud computing main control server receives the magnanimity point data of data source and defines by the dimension of point data, and cloud computing carries out Parallel Algorithm for Mining calculating from server to the electricity consumption data after the definition of cloud computing main control server.
4. a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm according to claim 1, it is characterized in that: the improvement k-means algorithm of described step (2) is a kind of clustering algorithm belonging to division methods, adopt Euclidean distance as the evaluation index of 2 sample similarity degrees, k point of random selecting data centralization is as initial cluster center, each sample according to data centralization carries out cluster to the distance size of k initial cluster center, calculate all sample means be grouped in each cluster, upgrade each initial cluster center, until square error criterion function is stabilized in minimum value.
5. a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm according to claim 1, is characterized in that: the method for the selection initial cluster center of described step (2) is as follows:
1) according to the distance d (x between 2 objects in following formula calculating object set M i, x j);
d(x i,x j)=[(x i1-x j1) 2+(x i2-x j2) 2+…+(x in-x jn) 2]
2) according to the mean distance MeanDis (M) between all objects in following formula calculating object set M;
MeanDis ( M ) = 1 n ( n - 1 ) &Sigma;d ( x i , x j )
3) according to following formula calculating object x idensity D en (x i);
Den ( x i ) = &Sigma; j = 1 n u ( MeanDis ( M ) - d ( x i , x j ) )
Wherein when x >=0, u (x)=1, otherwise u (x)=0;
4) density collection D={Den (x is obtained from above 1), Den (x 2) ..., Den (x n), elect object maximum for density set D Midst density as the 1st initial cluster center O 1, the object selecting density second largest is as the 2nd primary election cluster centre O 2, the rest may be inferred, will meet as shown in the formula subconditional object y ias a kth cluster centre, until reach predetermined cluster numbers;
max(min(d(y i,O 1)),…,min(d(y i,O n-1)))。
6. a kind of magnanimity intelligent power data analysing method based on improving k-means algorithm according to claim 1, it is characterized in that: described step (2) utilizes to improve the minimum δ of k-means Algorithm for Solving to determine cluster number k, and computing formula is as follows:
CN201510263237.3A 2015-05-21 2015-05-21 Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm Pending CN104850629A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201510263237.3A CN104850629A (en) 2015-05-21 2015-05-21 Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201510263237.3A CN104850629A (en) 2015-05-21 2015-05-21 Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm

Publications (1)

Publication Number Publication Date
CN104850629A true CN104850629A (en) 2015-08-19

Family

ID=53850273

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201510263237.3A Pending CN104850629A (en) 2015-05-21 2015-05-21 Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm

Country Status (1)

Country Link
CN (1) CN104850629A (en)

Cited By (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105786996A (en) * 2016-02-18 2016-07-20 国网智能电网研究院 Electricity information data quality analyzing system
CN105844294A (en) * 2016-03-21 2016-08-10 全球能源互联网研究院 Electricity usage behavior analysis method based on FCM cluster algorithm
CN105867152A (en) * 2016-04-25 2016-08-17 国网浙江省电力公司 Method for electric appliance control based on user behaviors and electric appliances
CN105930353A (en) * 2016-04-08 2016-09-07 西安美林数据技术股份有限公司 Feature subdivision method for call center customer service staff in mass data
CN106022664A (en) * 2016-07-08 2016-10-12 大连大学 Big data analysis based network intelligent power saving monitoring method
CN106251034A (en) * 2016-07-08 2016-12-21 大连大学 Wisdom energy saving electric meter monitoring system based on cloud computing technology
CN106682079A (en) * 2016-11-21 2017-05-17 云南电网有限责任公司电力科学研究院 Detection method of user's electricity consumption behavior of user based on clustering analysis
CN108493946A (en) * 2018-04-12 2018-09-04 辽宁石油化工大学 Electric energy control method, device and equipment based on user power utilization analysis
CN108681973A (en) * 2018-05-14 2018-10-19 广州供电局有限公司 Sorting technique, device, computer equipment and the storage medium of power consumer
CN108763576A (en) * 2018-05-28 2018-11-06 大连理工大学 A kind of parallel k-means algorithms for higher-dimension text data
CN108764324A (en) * 2018-05-23 2018-11-06 南京航空航天大学 A kind of text data immediate processing method based on K-Means algorithms and co-occurrence word
CN109597302A (en) * 2018-10-29 2019-04-09 曹凯 A kind of configuration method and configuration system of the quick-fried carbon-based bacterial manure of bud root guard
CN109656696A (en) * 2018-12-03 2019-04-19 华南师范大学 A kind of processing method that data API is efficiently called
CN110047016A (en) * 2019-04-25 2019-07-23 国网浙江省电力有限公司 A kind of extensive electric power data analysis method of streaming based on Spark Streaming
CN111125469A (en) * 2019-12-09 2020-05-08 重庆邮电大学 User clustering method and device for social network and computer equipment
CN112286989A (en) * 2020-10-28 2021-01-29 上海电机学院 Big data clustering mining method and platform
CN112465014A (en) * 2020-11-26 2021-03-09 江苏方天电力技术有限公司 Mass security data online aggregation method facing power distribution Internet of things
CN112906736A (en) * 2021-01-13 2021-06-04 国网山东省电力公司日照供电公司 Community safety accurate management and control method and system based on household electricity consumption
CN114022790A (en) * 2022-01-10 2022-02-08 成都国星宇航科技有限公司 Cloud layer detection and image compression method and device in remote sensing image and storage medium
CN117076691A (en) * 2023-10-16 2023-11-17 青岛文达通科技股份有限公司 Commodity resource knowledge graph algorithm model oriented to intelligent communities

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8402027B1 (en) * 2010-02-11 2013-03-19 Disney Enterprises, Inc. System and method for hybrid hierarchical segmentation
CN104376057A (en) * 2014-11-06 2015-02-25 南京邮电大学 Self-adaptation clustering method based on maximum distance, minimum distance and K-means

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8402027B1 (en) * 2010-02-11 2013-03-19 Disney Enterprises, Inc. System and method for hybrid hierarchical segmentation
CN104376057A (en) * 2014-11-06 2015-02-25 南京邮电大学 Self-adaptation clustering method based on maximum distance, minimum distance and K-means

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
赵莉 等: "基于改进 k-means 算法的海量智能用电数据分析", 《电网技术》 *

Cited By (25)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105786996A (en) * 2016-02-18 2016-07-20 国网智能电网研究院 Electricity information data quality analyzing system
CN105844294A (en) * 2016-03-21 2016-08-10 全球能源互联网研究院 Electricity usage behavior analysis method based on FCM cluster algorithm
CN105930353A (en) * 2016-04-08 2016-09-07 西安美林数据技术股份有限公司 Feature subdivision method for call center customer service staff in mass data
CN105867152A (en) * 2016-04-25 2016-08-17 国网浙江省电力公司 Method for electric appliance control based on user behaviors and electric appliances
CN106022664A (en) * 2016-07-08 2016-10-12 大连大学 Big data analysis based network intelligent power saving monitoring method
CN106251034A (en) * 2016-07-08 2016-12-21 大连大学 Wisdom energy saving electric meter monitoring system based on cloud computing technology
CN106682079A (en) * 2016-11-21 2017-05-17 云南电网有限责任公司电力科学研究院 Detection method of user's electricity consumption behavior of user based on clustering analysis
CN106682079B (en) * 2016-11-21 2020-06-05 云南电网有限责任公司电力科学研究院 User electricity consumption behavior detection method based on cluster analysis
CN108493946A (en) * 2018-04-12 2018-09-04 辽宁石油化工大学 Electric energy control method, device and equipment based on user power utilization analysis
CN108681973A (en) * 2018-05-14 2018-10-19 广州供电局有限公司 Sorting technique, device, computer equipment and the storage medium of power consumer
CN108764324A (en) * 2018-05-23 2018-11-06 南京航空航天大学 A kind of text data immediate processing method based on K-Means algorithms and co-occurrence word
CN108763576A (en) * 2018-05-28 2018-11-06 大连理工大学 A kind of parallel k-means algorithms for higher-dimension text data
CN109597302A (en) * 2018-10-29 2019-04-09 曹凯 A kind of configuration method and configuration system of the quick-fried carbon-based bacterial manure of bud root guard
CN109656696A (en) * 2018-12-03 2019-04-19 华南师范大学 A kind of processing method that data API is efficiently called
CN109656696B (en) * 2018-12-03 2020-10-16 华南师范大学 Processing method for efficient calling of data API
CN110047016A (en) * 2019-04-25 2019-07-23 国网浙江省电力有限公司 A kind of extensive electric power data analysis method of streaming based on Spark Streaming
CN111125469A (en) * 2019-12-09 2020-05-08 重庆邮电大学 User clustering method and device for social network and computer equipment
CN111125469B (en) * 2019-12-09 2022-06-10 重庆邮电大学 User clustering method and device of social network and computer equipment
CN112286989A (en) * 2020-10-28 2021-01-29 上海电机学院 Big data clustering mining method and platform
CN112465014A (en) * 2020-11-26 2021-03-09 江苏方天电力技术有限公司 Mass security data online aggregation method facing power distribution Internet of things
CN112465014B (en) * 2020-11-26 2022-06-14 江苏方天电力技术有限公司 Mass security data online aggregation method facing power distribution Internet of things
CN112906736A (en) * 2021-01-13 2021-06-04 国网山东省电力公司日照供电公司 Community safety accurate management and control method and system based on household electricity consumption
CN114022790A (en) * 2022-01-10 2022-02-08 成都国星宇航科技有限公司 Cloud layer detection and image compression method and device in remote sensing image and storage medium
CN114022790B (en) * 2022-01-10 2022-04-26 成都国星宇航科技有限公司 Cloud layer detection and image compression method and device in remote sensing image and storage medium
CN117076691A (en) * 2023-10-16 2023-11-17 青岛文达通科技股份有限公司 Commodity resource knowledge graph algorithm model oriented to intelligent communities

Similar Documents

Publication Publication Date Title
CN104850629A (en) Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm
CN106709035B (en) A kind of pretreatment system of electric power multidimensional panoramic view data
Xu et al. Hierarchical K-means method for clustering large-scale advanced metering infrastructure data
Huang et al. A clustering based grouping method of nearly zero energy buildings for performance improvements
CN113837311B (en) Resident customer clustering method and device based on demand response data
CN110311420A (en) A kind of generation method of scene joint power output timing scene
CN107248031B (en) Rapid power consumer classification method aiming at load curve peak-valley difference
CN107807961B (en) Spark calculation engine-based power distribution network big data partition processing method
CN106296315A (en) Context aware systems based on user power utilization data
CN103632306A (en) Distribution network power supply area division method based on clustering analysis
CN116993555A (en) Partition method, system and storage medium for identifying territory space planning key region
CN106022578A (en) Residential electricity peak-valley-flat dividing method based on data dimension increasing and K-means clustering
Lv et al. Design of cloud data warehouse and its application in smart grid
CN112865089A (en) Improved large-scale scene analysis method for active power distribution network
CN110287237B (en) Social network structure analysis based community data mining method
Liu et al. Research on big data mining technology of electric vehicle charging behaviour
Liu et al. Research on k-means algorithm based on cloud computing
CN110909786A (en) New user load identification method based on characteristic index and decision tree model
Li et al. A data warehouse architecture supporting energy management of intelligent electricity system
Hoogsteen et al. On the scalability of decentralized energy management using profile steering
CN115776138A (en) Micro-grid capacity planning method and device considering multi-dimensional uncertainty and energy management strategy
CN102523300A (en) Data-intensive cloud storage model facing intelligent power grid
CN111768066B (en) Park electric heating load coupling relation analysis method and device based on fusion characteristics
CN114004393A (en) Double-layer multi-time-scale operation optimization method for power distribution network-natural gas combined system
Zhang et al. Research on massive data analysis of power utilization based on improved K-means algorithm and cloud computing

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
EXSB Decision made by sipo to initiate substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 310018, Zhejiang, Hangzhou economic and Technological Development Zone, Poplar Street science and technology road, No. 2, 2, 3, 09 units

Applicant after: HANGZHOU TIANKUAN TECHNOLOGY CO., LTD.

Address before: Hangzhou City, Zhejiang province 310000 Xihu District Tianmushan Road No. 398 building 7F Qianjiang refrigeration

Applicant before: HANGZHOU TIANKUAN TECHNOLOGY CO., LTD.

COR Change of bibliographic data
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20150819