CN112035715B

CN112035715B - User label design method and device

Info

Publication number: CN112035715B
Application number: CN202010663731.XA
Authority: CN
Inventors: 洪莹; 王凯; 吴思思; 黄玉珊; 韦国惠; 黄绪荣
Original assignee: Guangxi Power Grid Co Ltd
Current assignee: Guangxi Power Grid Co Ltd
Priority date: 2020-07-10
Filing date: 2020-07-10
Publication date: 2023-04-14
Anticipated expiration: 2040-07-10
Also published as: CN112035715A

Abstract

The invention discloses a user label design method and a device, wherein the method comprises the following steps: acquiring all user power consumption data, and performing data screening on all user power consumption data based on the basic attributes of users to obtain screened user power consumption data; performing correlation calculation between the power consumption data and the average temperature on the screened user power consumption data to obtain correlation characteristic data between the user power consumption data and the average temperature; and forming a corresponding user label according to the correlation characteristic data between the electricity utilization data and the average temperature of the user and the basic attribute of the user. In the embodiment of the invention, the user label is constructed according to the actual situation of the user, so that the subsequent user label adjustment of the power supply strategy of the user is facilitated, the power utilization requirement of the user is met, and the power utilization experience of the user is improved.

Description

User label design method and device

Technical Field

The invention relates to the technical field of power grid user power supply, in particular to a user tag design method and device.

Background

With the continuous improvement of the practicability degree of the information-based construction, a large amount of basic information of all aspects of customers is accumulated at present, and data support is provided for the development of all work. However, the existing data analysis and support mode can not realize multi-dimensional and three-dimensional customer feature depiction, and can not support the relationship between the power consumption and the temperature of the basic attributes of the user files of different users; when the weather changes suddenly, there is no way to adjust the power supply strategy through the corresponding tag, so that the power consumption requirement of the user cannot be effectively guaranteed.

Disclosure of Invention

The invention aims to overcome the defects of the prior art, and provides a user tag design method and device, which are used for constructing a user tag according to the actual situation of a user, facilitating the adjustment of a user power supply strategy for the user tag in the follow-up process, meeting the power utilization requirement of the user and improving the power utilization experience of the user.

In order to solve the above technical problem, an embodiment of the present invention provides a user tag design method, where the method includes:

acquiring all user electricity consumption data, and performing data screening on all user electricity consumption data based on the basic attributes of the users to obtain screened user electricity consumption data;

performing correlation calculation between the power consumption data and the average temperature on the screened user power consumption data to obtain correlation characteristic data between the user power consumption data and the average temperature;

forming a corresponding user label according to the correlation characteristic data between the electricity utilization data of the user and the average temperature and the basic attribute of the user;

clustering and identifying the user labels based on a clustering algorithm, and determining the proportion of the basic attribute of each user profile in the user labels after clustering and identification;

obtaining the correlation strength of the user electricity consumption data corresponding to the user tags after grouping and identification and the average temperature data;

and adjusting the user power supply service of the basic attribute of the relevant user profile corresponding to the corresponding parcel based on the proportion of the basic attribute of each user profile in the grouped and identified user tags, the correlation strength between the power consumption data of the users corresponding to the grouped and identified user tags and the average temperature data, and the average temperature information of the weather predicted in the future.

Optionally, the user electricity consumption data comprises quarterly and/or monthly and/or daily electricity consumption data of the user in the whole year;

the basic attributes of the user comprise a user profile basic attribute and a user state basic attribute;

the user profile basic attributes comprise social security attributes, electricity utilization categories, user categories, importance degrees, regional characteristics, electricity price types and load properties;

the user state basic attributes comprise new customers, long-term electricity-free customers and batch electricity-using customers.

Optionally, the performing of correlation calculation between the power consumption data and the average temperature on the screened user power consumption data includes:

and performing correlation calculation based on the screened user electricity utilization data of the user in the preset time period and the average temperature of the preset time period.

Optionally, the calculation formula of the correlation calculation is as follows:

wherein r (X, Y) represents the correlation between X and Y; cov (X, Y) represents the covariance between X and Y; var (X) Var (Y) represents the variance of X and the variance of Y, respectively; x represents the electricity consumption data of the user in the preset time period, and Y represents the average temperature in the preset time period.

Optionally, the clustering and identifying the user tags based on the clustering algorithm includes:

carrying out preliminary clustering on the user tags by using a Canopy clustering algorithm to obtain a preliminary clustering result;

clustering the primary clustering result by using a Kmeans clustering algorithm to obtain a clustering result;

and clustering and identifying the user tags according to each clustering center in the clustering result.

Optionally, the performing preliminary clustering on the user tag by using a Canopy clustering algorithm to obtain a preliminary clustering result includes:

initializing the user tag as a list data;

randomly selecting an object D from the list data as a clustering center of a Canopy clustering algorithm, marking the object D as C, and deleting the object D from the list data;

calculating the distances between all objects in the list data and C, adding the objects into a clustering center of a Canopy clustering algorithm and marking the objects as C when the distances are greater than a first preset distance, and deleting the objects in the list data when the distances are less than a second preset distance;

adding the clustering center of the Canopy clustering algorithm into a clustering list, and repeating until the list data is empty;

and taking the clustering list as a preliminary clustering result.

Optionally, the clustering the preliminary clustering result by using a Kmeans clustering algorithm to obtain a clustering result includes:

taking the preliminary clustering result as an initialized mass center of a Kmeans clustering algorithm, and distributing each user label to the corresponding mass center;

calculating the distance of each user label to each centroid, and distributing the user labels to the nearest clustering centroid;

carrying out mean value calculation on each clustering centroid, and updating the clustering centroids according to the mean value calculation;

and calculating variance error values of all the user tags to the corresponding updated clustering centroids, judging whether the variance error values are larger than a preset threshold value, if so, repeating the step of calculating the distance from each user tag to each centroid, otherwise, finishing clustering and obtaining a clustering result.

Optionally, the correlation strength includes: very strong correlation, moderate correlation, weak correlation, or irrelevant.

In addition, an embodiment of the present invention further provides a user tag design apparatus, where the apparatus includes:

the data screening module: the system comprises a data processing module, a data processing module and a data processing module, wherein the data processing module is used for acquiring all user electricity consumption data, and performing data screening on all the user electricity consumption data based on the basic attribute of a user to acquire screened user electricity consumption data;

a correlation calculation module: the correlation calculation module is used for carrying out correlation calculation between the power consumption data and the average temperature on the screened user power consumption data to obtain correlation characteristic data between the user power consumption data and the average temperature;

a tag generation module: the system comprises a data processing module, a data processing module and a data processing module, wherein the data processing module is used for forming a corresponding user tag according to correlation characteristic data between power utilization data and average temperature of a user and basic attributes of the user;

a grouping and identification module: the system comprises a clustering algorithm, a user label identification module, a user profile base attribute module and a user label identification module, wherein the clustering algorithm is used for clustering and identifying the user labels based on the clustering algorithm and determining the proportion of each user profile base attribute in the user labels after clustering and identification;

a correlation mild acquisition module: the correlation strength between the user electricity consumption data corresponding to the user tags after the grouping and the identification and the average temperature data is obtained;

a power supply adjustment module: and the user power supply service is used for adjusting the basic attributes of the related user profiles of the corresponding parcel based on the proportion of the basic attributes of each user profile in the grouped and identified user tags, the correlation strength between the power consumption data and the average temperature data of the users corresponding to the grouped and identified user tags and the average temperature information of the weather predicted in the future.

In the embodiment of the invention, the user label is constructed according to the actual situation of the user, so that the subsequent user label adjustment of the power supply strategy of the user is facilitated, the power utilization requirement of the user is met, and the power utilization experience of the user is improved.

Drawings

In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the embodiments or the prior art descriptions will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.

FIG. 1 is a flow chart diagram of a user tag design method in an embodiment of the invention;

fig. 2 is a schematic structural diagram of a user tag designing apparatus in an embodiment of the present invention.

Detailed Description

The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.

Examples

Referring to fig. 1, fig. 1 is a flow chart illustrating a user tag design method according to an embodiment of the present invention.

As shown in fig. 1, a user tag design method includes:

s11: acquiring all user electricity consumption data, and performing data screening on all user electricity consumption data based on the basic attributes of the users to obtain screened user electricity consumption data;

in the implementation process of the invention, the electricity consumption data of the user comprises the electricity consumption data of the user in each quarter and/or each month and/or each day in the whole year; the basic attributes of the user comprise a user profile basic attribute and a user state basic attribute; the user file basic attributes comprise social security attributes, electricity utilization categories, user categories, importance degrees, regional characteristics, electricity price types and load properties; the user state basic attributes comprise new customers, long-term electricity-free customers and batch electricity-using customers.

Specifically, the user electricity consumption data comprises quarterly and/or monthly and/or daily electricity consumption data of the user in the whole year; basic attributes of the user; the basic attributes comprise user profile basic attributes and user state basic attributes; user profile base attributes: social security attributes (low security, five security, etc.), electricity utilization categories, user categories, importance levels (important customers, important attention customers, etc.), regional characteristics (urban areas, towns, etc.), electricity price types (single-system electricity price, two-system electricity price), load properties, etc.; user state base attributes: the new customer, the long-term electricity-free customer and the batch electricity-using customer.

And screening the user electricity utilization data according to the basic attribute of the user, removing the electricity utilization data of the long-term electricity-non-utilization client in the basic attribute of the user state, performing simple user classification and other operations according to the basic data of the user, and then obtaining the screened user electricity utilization data.

S12: performing correlation calculation between the power consumption data and the average temperature on the screened user power consumption data to obtain correlation characteristic data between the user power consumption data and the average temperature;

in a specific implementation process of the present invention, the calculating a correlation between the power consumption data and the average temperature of the screened user power consumption data includes: and performing correlation calculation based on the screened user electricity utilization data of the user in the preset time period and the average temperature of the preset time period.

Further, the calculation formula of the correlation calculation is as follows:

Specifically, correlation calculation is carried out on the user electricity utilization data of the user in the screened user electricity utilization data in a preset time period and the average temperature of the preset time period, and correlation characteristic data between the user electricity utilization data and the average temperature are obtained; and the calculation formula of the correlation calculation is as follows:

S13: forming a corresponding user label according to the correlation characteristic data between the electricity utilization data of the user and the average temperature and the basic attribute of the user;

in the specific implementation process of the invention, the marking is carried out according to the correlation characteristic data between the electricity consumption data of the user and the average temperature and the basic attribute of the user, and then a corresponding user label is formed.

S14: clustering and identifying the user labels based on a clustering algorithm, and determining the proportion of the basic attribute of each user profile in the user labels after clustering and identification;

in the specific implementation process of the present invention, the clustering and identifying the user tags based on the clustering algorithm includes: carrying out preliminary clustering on the user tags by using a Canopy clustering algorithm to obtain a preliminary clustering result; clustering the primary clustering result by using a Kmeans clustering algorithm to obtain a clustering result; and clustering and identifying the user tags according to each clustering center in the clustering result.

Further, the performing preliminary clustering on the user tag by using a Canopy clustering algorithm to obtain a preliminary clustering result includes: initializing the user tag as a list data; randomly selecting an object D from the list data as a clustering center of a Canopy clustering algorithm, marking the object D as C, and deleting the object D from the list data; calculating the distances between all objects in the list data and C, when the distances are larger than a first preset distance, adding the objects into a clustering center of a Canopy clustering algorithm and marking the objects as C, and when the distances are smaller than a second preset distance, deleting the objects from the list data; adding the clustering center of the Canopy clustering algorithm into a clustering list, and repeating until the list data is empty; and taking the clustering list as a preliminary clustering result.

Further, the clustering the preliminary clustering result by using a Kmeans clustering algorithm to obtain a clustering result includes: taking the preliminary clustering result as an initialized mass center of a Kmeans clustering algorithm, and distributing each user label to the corresponding mass center; calculating the distance of each user label to each centroid, and distributing the user labels to the nearest clustering centroid; carrying out mean value calculation on each clustering centroid, and updating the clustering centroids according to the mean value calculation; and calculating variance error values of all the user tags to the corresponding updated clustering centroids, judging whether the variance error values are larger than a preset threshold value, if so, repeating the step of calculating the distance from each user tag to each centroid, otherwise, finishing clustering and obtaining a clustering result.

Specifically, clustering is carried out on the user labels through a Canopy-Kmeans clustering algorithm, clustering and identification are carried out according to clustering results, and after the clustering and identification are finished, the proportion of the basic attribute of each user profile in the user labels after the clustering and identification is determined.

When the user tags need to be grouped and identified, the user tags need to be clustered firstly, specifically, a Canopy clustering algorithm is adopted for primary clustering, then a Kmeans clustering algorithm is utilized for secondary clustering to obtain a clustering result, and then the user tags are grouped and identified according to each clustering center in the clustering result.

When the Canopy clustering algorithm is used for preliminary clustering, the clustering process comprises the following steps: initializing a user tag into a data list, and presetting two threshold values comprising a first preset distance and a second preset distance; randomly selecting an object D in the data list as a clustering center of a Canopy clustering algorithm, marking the object D as C, and deleting the object D from the list data; calculating the distances between all objects in the list data and C, adding the objects into a clustering center of a Canopy clustering algorithm and marking the objects as C when the distances are greater than a first preset distance, and deleting the objects in the list data when the distances are less than a second preset distance; adding the clustering center of the Canopy clustering algorithm into a clustering list, and repeating until the list data is empty; the cluster list is taken as a preliminary clustering result.

And assuming that all objects in the list data are A and the cluster center object is C, calculating the distance between A and C by adopting a cosine distance calculation formula, specifically:

wherein A = (a) ₁ ,a ₂ ,…,a _n )，C＝(c ₁ ,c ₂ ,…,c _n )，i＝1,2,…,n。

After the Canopy clustering algorithm is completed, obtaining a primary clustering result, and clustering the primary clustering result by adopting a Kmeans clustering algorithm; and classifying by taking k objects in the space as centers, classifying the objects closest to each center in the object space into one class respectively, and successively calculating and updating the value of each clustering centroid in a multi-iteration mode until the clustering centroid is stable and unchanged.

And (3) clustering by using a Kmeans clustering algorithm: taking the preliminary clustering result as an initialized mass center of a Kmeans clustering algorithm, and distributing each user label to the corresponding mass center; namely, the Canopy center generated by the Canopy clustering algorithm is used as the initialized centroid of the Kmeans algorithm, and each label is already distributed to the corresponding centroid; calculating the distance of each user label to each centroid, and distributing the user labels to the nearest clustering centroids; the distance calculation formula still adopts the cosine distance used in the Canopy clustering algorithm; carrying out mean value calculation on each clustering centroid, and updating the clustering centroids according to the mean value calculation; and calculating variance error values of all the user tags from the corresponding updated clustering centroids, judging whether the variance error values are larger than a preset threshold value, if so, repeating the step of calculating the distance from each user tag to each centroid, and if not, finishing clustering to obtain a clustering result.

After the clustering result is obtained, clustering and identifying the user labels according to each clustering center in the clustering result; and then calculating the proportion of the basic attribute of each user profile in the user tags after the grouping and the identification.

S15: obtaining the correlation strength of the user electricity consumption data corresponding to the user tags after grouping and identification and the average temperature data;

in the implementation process of the present invention, the correlation strength includes: very strong correlation, moderate correlation, weak correlation, or irrelevant.

The correlation coefficient between the power consumption data of the user and the temperature needs to be divided according to the numerical ranges, and the correlation strength in each numerical range is labeled, which is specifically shown in the following table:

magnitude of correlation coefficient	General explanation
		0.8～1.0	Very strong correlation
0.6～0.8	Strong correlation
		0.4～0.6	Moderate correlation
0.2～0.4	Weak correlation
		0～0.2	Weakly or not related

And obtaining the correlation strength of the user electricity consumption data corresponding to the user tags after grouping and identification and the average temperature data according to the table.

S16: and adjusting the user power supply service of the basic attribute of the relevant user profile corresponding to the parcel based on the proportion of the basic attribute of each user profile in the grouped and identified user tags, the correlation strength of the user power consumption data and the average temperature data corresponding to the grouped and identified user tags and the average temperature information of the future predicted weather.

In the specific implementation process of the invention, the user power supply service of the relevant user profile basic attribute of the corresponding parcel is adjusted according to the proportion of the basic attribute of each user profile in the grouped and identified user tags, the correlation strength of the user power consumption data and the average temperature data corresponding to the grouped and identified user tags and the future predicted weather average temperature information.

Examples

Referring to fig. 2, fig. 2 is a schematic structural diagram of a user tag designing apparatus according to an embodiment of the present invention.

As shown in fig. 2, a user tag designing apparatus, the apparatus comprising:

the data screening module 21: the system comprises a data acquisition module, a data processing module and a data processing module, wherein the data acquisition module is used for acquiring all user electricity consumption data, and performing data screening on all user electricity consumption data based on the basic attributes of users to acquire screened user electricity consumption data;

in the specific implementation process of the invention, the user electricity utilization data comprises quarterly and/or monthly and/or daily electricity utilization data of the user in the whole year; the basic attributes of the user comprise user profile basic attributes and user state basic attributes; the user profile basic attributes comprise social security attributes, electricity utilization categories, user categories, importance degrees, regional characteristics, electricity price types and load properties; the user state basic attribute is new customers, long-term electricity-free customers and batch electricity-using customers.

Specifically, the user electricity consumption data comprises quarterly and/or monthly and/or daily electricity consumption data of the user in the whole year; basic attributes of the user; the basic attributes comprise user profile basic attributes and user state basic attributes; user profile base attributes: social security attributes (low security, five security, etc.), electricity utilization categories, user categories, importance levels (important customers, important attention customers, etc.), regional characteristics (urban areas, towns, etc.), electricity price types (single-system electricity price, two-system electricity price), load properties, etc.; user state base attributes: new customers, long-term electricity-free customers and batch electricity-using customers.

The correlation calculation module 22: the correlation calculation module is used for carrying out correlation calculation between the power consumption data and the average temperature on the screened user power consumption data to obtain correlation characteristic data between the user power consumption data and the average temperature;

Further, the calculation formula of the correlation calculation is as follows:

wherein r (X, Y) represents the correlation between X and Y; cov (X, Y) represents the covariance between X and Y; var (X) Var (Y) each represents a variance of X and a variance of Y; x represents the electricity consumption data of the user in the preset time period, and Y represents the average temperature in the preset time period.

The label generation module 23: the system comprises a data processing module, a data processing module and a data processing module, wherein the data processing module is used for forming a corresponding user tag according to correlation characteristic data between power utilization data and average temperature of a user and basic attributes of the user;

in the specific implementation process of the invention, the electricity consumption data of the user and the average temperature are labeled according to the correlation characteristic data and the basic attribute of the user, and then a corresponding user label is formed.

Group and identification module 24: the system comprises a clustering algorithm, a user label identification module, a user profile base attribute module and a user label identification module, wherein the clustering algorithm is used for clustering and identifying the user labels based on the clustering algorithm and determining the proportion of each user profile base attribute in the user labels after clustering and identification;

Further, the performing preliminary clustering on the user tag by using a Canopy clustering algorithm to obtain a preliminary clustering result includes: initializing the user tag as a list data; randomly selecting an object D from the list data as a clustering center of a Canopy clustering algorithm, marking the object D as C, and deleting the object D from the list data; calculating the distances between all objects in the list data and C, adding the objects into a clustering center of a Canopy clustering algorithm and marking the objects as C when the distances are greater than a first preset distance, and deleting the objects in the list data when the distances are less than a second preset distance; adding the clustering center of the Canopy clustering algorithm into a clustering list, and repeating until the list data is empty; and taking the clustering list as a preliminary clustering result.

Further, the clustering the preliminary clustering result by using a Kmeans clustering algorithm to obtain a clustering result includes: taking the preliminary clustering result as an initialized mass center of a Kmeans clustering algorithm, and distributing each user label to the corresponding mass center; calculating the distance of each user label to each centroid, and distributing the user labels to the nearest clustering centroid; carrying out mean value calculation on each clustering centroid, and updating the clustering centroids according to the mean value calculation; and calculating variance error values of all the user tags from the corresponding updated clustering centroids, judging whether the variance error values are larger than a preset threshold value, if so, repeating the step of calculating the distance from each user tag to each centroid, and if not, finishing clustering to obtain a clustering result.

Specifically, clustering is carried out on the user labels through a Canopy-Kmeans clustering algorithm, clustering and identification are carried out according to clustering results, and after the clustering and identification are completed, the proportion of each user profile basic attribute in the user labels after clustering and identification is determined.

When the Canopy clustering algorithm is used for preliminary clustering, the clustering process comprises the following steps: initializing a user tag into a data list, and presetting two threshold values comprising a first preset distance and a second preset distance; randomly selecting an object D in the data list as a clustering center of a Canopy clustering algorithm, marking the object D as C, and deleting the object D from the list data; calculating the distances between all objects in the list data and C, adding the objects into a clustering center of a Canopy clustering algorithm and marking the objects as C when the distances are greater than a first preset distance, and deleting the objects in the list data when the distances are less than a second preset distance; adding the clustering center of the Canopy clustering algorithm into a clustering list, and repeating until the list data is empty; the cluster list is used as a preliminary clustering result.

After the Canopy clustering algorithm is completed, obtaining a primary clustering result, and clustering the primary clustering result by adopting a Kmeans clustering algorithm; and classifying k objects in the space as centers, classifying the objects closest to each center in the object space into one class, and gradually calculating and updating the value of each clustering centroid in a multi-iteration mode until the clustering centroids are stable and unchanged.

The correlation mild acquisition module 25: the correlation strength between the user electricity consumption data corresponding to the user tags after the grouping and the identification and the average temperature data is obtained;

magnitude of correlation coefficient	General explanation
		0.8～1.0	Very strong correlation
0.6～0.8	Strong correlation
		0.4～0.6	Moderate correlation
0.2～0.4	Weak correlation
		0～0.2	Weakly or not

The power supply adjustment module 26: and the user power supply service is used for adjusting the basic attributes of the related user profiles of the corresponding parcel based on the proportion of the basic attributes of each user profile in the grouped and identified user tags, the correlation strength between the power consumption data and the average temperature data of the users corresponding to the grouped and identified user tags and the average temperature information of the weather predicted in the future.

In the specific implementation process of the invention, the power supply service of the user corresponding to the basic attribute of the relevant user profile of the corresponding parcel is adjusted according to the proportion of the basic attribute of each user profile in the grouped and identified user tags, the correlation strength of the power consumption data and the average temperature data of the user corresponding to the grouped and identified user tags and the average temperature information of the weather predicted in the future.

Those skilled in the art will appreciate that all or part of the steps in the methods of the above embodiments may be implemented by hardware related to instructions of a program, and the program may be stored in a computer-readable storage medium, and the storage medium may include: read Only Memory (ROM), random Access Memory (RAM), magnetic or optical disks, and the like.

In addition, the user tag design method and apparatus provided by the embodiment of the present invention are described in detail above, a specific example should be adopted herein to explain the principle and the implementation manner of the present invention, and the description of the above embodiment is only used to help understanding the method and the core idea of the present invention; meanwhile, for a person skilled in the art, according to the idea of the present invention, the specific embodiments and the application range may be changed, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims

1. A method for designing a user tag, the method comprising:

and adjusting the user power supply service of the basic attribute of the relevant user profile corresponding to the parcel based on the proportion of the basic attribute of each user profile in the grouped and identified user tags, the correlation strength of the user power consumption data and the average temperature data corresponding to the grouped and identified user tags and the average temperature information of the future predicted weather.

2. The method of claim 1, wherein the user electricity usage data comprises user quarterly and/or monthly and/or daily electricity usage data throughout the year;

the user state basic attributes comprise new customers, long-term electricity-unused customers and batch electricity-used customers.

3. The method for designing the user tag according to claim 1, wherein the calculating the correlation between the power consumption data and the average temperature of the screened power consumption data of the user includes:

4. The method of claim 3, wherein the correlation calculation is calculated as follows:

5. The method of claim 1, wherein the clustering and identifying the user tags based on a clustering algorithm comprises:

6. The method according to claim 5, wherein the performing preliminary clustering on the user tags by using a Canopy clustering algorithm to obtain a preliminary clustering result comprises:

initializing the user tag as a list data;

calculating the distances between all objects in the list data and C, when the distances are larger than a first preset distance, adding the objects into a clustering center of a Canopy clustering algorithm and marking the objects as C, and when the distances are smaller than a second preset distance, deleting the objects from the list data;

and taking the clustering list as a preliminary clustering result.

7. The method according to claim 5, wherein the clustering the preliminary clustering result by using a Kmeans clustering algorithm to obtain a clustering result comprises:

8. The user tag design method of claim 1, wherein the correlation strength comprises: very strong correlation, moderate correlation, weak correlation, or irrelevant.

9. A user tag design apparatus, the apparatus comprising:

a power supply adjusting module: and the user power supply service is used for adjusting the basic attributes of the related user profiles of the corresponding parcel based on the proportion of the basic attributes of each user profile in the grouped and identified user tags, the correlation strength between the power consumption data and the average temperature data of the users corresponding to the grouped and identified user tags and the average temperature information of the weather predicted in the future.