CN113011472B - Multi-section electric power quotation curve similarity judging method and device - Google Patents

Multi-section electric power quotation curve similarity judging method and device Download PDF

Info

Publication number
CN113011472B
CN113011472B CN202110222513.7A CN202110222513A CN113011472B CN 113011472 B CN113011472 B CN 113011472B CN 202110222513 A CN202110222513 A CN 202110222513A CN 113011472 B CN113011472 B CN 113011472B
Authority
CN
China
Prior art keywords
points
quotation
point
cluster
core
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202110222513.7A
Other languages
Chinese (zh)
Other versions
CN113011472A (en
Inventor
赵越
刘思捷
白杨
林少华
蔡秋娜
高海翔
龚超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Electric Power Dispatch Control Center of Guangdong Power Grid Co Ltd
Original Assignee
Electric Power Dispatch Control Center of Guangdong Power Grid Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Electric Power Dispatch Control Center of Guangdong Power Grid Co Ltd filed Critical Electric Power Dispatch Control Center of Guangdong Power Grid Co Ltd
Priority to CN202110222513.7A priority Critical patent/CN113011472B/en
Publication of CN113011472A publication Critical patent/CN113011472A/en
Application granted granted Critical
Publication of CN113011472B publication Critical patent/CN113011472B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Economics (AREA)
  • Water Supply & Treatment (AREA)
  • Public Health (AREA)
  • General Health & Medical Sciences (AREA)
  • Human Resources & Organizations (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Strategic Management (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a method and a device for judging similarity of multi-section power quotation curves, wherein the method comprises the following steps: acquiring spot quotation data of a market main body and carrying out per unit on the spot quotation data to form a point set P; carrying out density clustering on the quotation points in the point set P according to a preset optimization variable constraint condition, and outputting a density clustering result; optimizing and calculating a preset objective function based on the density clustering result, and obtaining the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified point and the unclassified point corresponding to the optimal cluster; and screening out quotation similar unit clusters according to the neighborhood radius of the optimal clusters and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal clusters. According to the invention, through optimizing the clustering parameters, the condition that the number of unclassified points is excessive can be avoided when the quotation point set is optimally clustered, so that the accuracy of similarity judgment of the multi-section electric power quotation curves is effectively improved.

Description

Multi-section electric power quotation curve similarity judging method and device
Technical Field
The invention relates to the technical field of power data analysis, in particular to a method and a device for judging similarity of multi-section power quotation curves.
Background
With the promotion of new power market reform, the test operation work has been carried out on the spot market in China. In the power spot market, the market main body declares a plurality of sections of quotations according to the self-assembly cost and characteristics and in combination with the market boundary conditions and the prejudgment of the game pattern, so the spot quotations are often highly complex. In the process of adapting to the market for a long time, the spot quotation of each unit gradually changes from high differentiation to local differentiation, and collusion behavior that a plurality of market subjects form a substantial quotation alliance is intended to control the market price can occur. Quotation behavior analysis and collusion recognition will play an important role in power spot market monitoring.
The quotation curve similarity analysis of the electric power spot market is a basic work for judging whether collusion quotation exists, and mainly judges which quotation similar unit groups exist and the quotation similarity degree of the unit groups through cluster analysis. In collusion bidding behavior analysis based on density clustering, the clustering parameters are the neighborhood radius and the minimum number of bid points in the neighborhood. The setting of the parameters influences the number and the similarity of cluster types after density clustering. The similarity of density clusters is usually determined by the sum of the distances from all classified points to their core points after density clustering, but this approach may result in excessive unclassified points such that similar quotation behavior between certain units is ignored. How to reasonably set the clustering parameters and better classify the power quotation points is the key of using the clustering analysis to analyze the quotation pattern in the power spot market.
Disclosure of Invention
The invention aims to provide a method and a device for judging the similarity of a multi-section power quotation curve, which are used for solving the technical problems, so that the accuracy of judging the similarity of the multi-section power quotation curve can be effectively improved.
In order to solve the technical problems, the invention provides a method for judging similarity of multi-section power quotation curves, which comprises the following steps:
acquiring spot quotation data of a market main body and carrying out per unit on the spot quotation data to form a point set P;
carrying out density clustering on the quotation points in the point set P according to a preset optimization variable constraint condition, and outputting a density clustering result; the density clustering result comprises all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points;
optimizing and calculating a preset objective function based on the density clustering result, and obtaining the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified point and the unclassified point corresponding to the optimal cluster;
and screening out quotation similar unit clusters according to the neighborhood radius of the optimal clusters and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal clusters.
The density clustering is carried out on the quotation points in the point set P according to the preset optimization variable constraint condition, and a density clustering result is output, and the method specifically comprises the following steps:
core point judgment is carried out on the quotation points in the point set P according to the preset density clustering neighborhood radius and the preset minimum quotation point number in the neighborhood;
performing density clustering on a point set P based on each core point, calculating Hausdorff distance between the core point and unprocessed points in the point set P, classifying the points meeting preset conditions into the same cluster according to each Hausdorff distance, and marking the points of the same cluster as processed points;
repeating the core point judgment and density clustering steps until all points in the point set P are classified into clusters or are judged to be non-core points, and classifying points which do not belong to any clusters as unclassified points;
and outputting all the neighborhood radiuses conforming to the constraint conditions of the optimized variables and the cluster class, the core point, the classified points and the unclassified points corresponding to the neighborhood radiuses.
The objective function is:
wherein ,for the m-th classified point->,/>、/>Declaration price and Shen Baoliang, < > -of the r-th declaration section of the market subject represented by the m-th classification point, respectively>Core Point of cluster class where the mth classified Point is located +.>,/>Representation->To->M represents the number of classified quotation points, N is the total number of quotation points;
the constraint condition of the optimization variable d of the optimal cluster is as follows:
wherein ,neighborhood radius for density clustering, +.>For optimal clustering of the set of all N points, k denotes the number of significant digits, ++>Multidimensional vector P for the ith bid i And a j-th offer multidimensional vector P j A hausdorff distance therebetween.
Further, the classifying the points meeting the preset condition into the same cluster according to each hausdorff distance specifically includes:
if judging that the Hausdorff distance between each unprocessed point and the core point belongs to the condition that the direct density is reachable, the density is reachable or the densities are connected, classifying the core point and the unprocessed points corresponding to the core point into the same cluster.
Further, when the optimal clustering is performed, only other quotation points except the quotation points of the first reporting segments are optimally clustered.
Further, screening out a quotation similar unit cluster according to the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal cluster, specifically:
and selecting a unit corresponding to the core point of each cluster and units corresponding to other quotation points in the neighborhood of the core point according to the neighborhood radius of the optimal cluster and the clusters, the core points, the classified points and the unclassified points corresponding to the optimal cluster, so as to obtain the quotation similar unit cluster.
In order to solve the same technical problems, the invention also provides a device for judging the similarity of the multi-section power quotation curves, which comprises the following steps:
the spot set acquisition module is used for acquiring spot quotation data of a market main body and carrying out per unit on the spot quotation data to form a spot set P;
the density clustering module is used for carrying out density clustering on the quotation points in the point set P according to a preset optimization variable constraint condition and outputting a density clustering result; the density clustering result comprises all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points;
the optimization calculation module is used for carrying out optimization calculation on a preset objective function based on the density clustering result and obtaining the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified point and the unclassified point corresponding to the neighborhood radius;
and the similar quotation screening module is used for screening quotation similar unit clusters according to the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal cluster.
The density clustering module is specifically used for:
core point judgment is carried out on the quotation points in the point set P according to the preset density clustering neighborhood radius and the preset minimum quotation point number in the neighborhood;
performing density clustering on a point set P based on each core point, calculating Hausdorff distance between the core point and unprocessed points in the point set P, classifying the points meeting preset conditions into the same cluster according to each Hausdorff distance, and marking the points of the same cluster as processed points;
repeating the core point judgment and density clustering steps until all points in the point set P are classified into clusters or are judged to be non-core points, and classifying points which do not belong to any clusters as unclassified points;
and outputting all the neighborhood radiuses conforming to the constraint conditions of the optimized variables and the cluster class, the core point, the classified points and the unclassified points corresponding to the neighborhood radiuses.
The objective function is:
wherein ,for the m-th classified point->,/>、/>Declaration price and Shen Baoliang, < > -of the r-th declaration section of the market subject represented by the m-th classification point, respectively>Core Point of cluster class where the mth classified Point is located +.>,/>Representation->To->M represents the number of classified quotation points, N is the total number of quotation points;
the constraint condition of the optimization variable d of the optimal cluster is as follows:
wherein ,neighborhood radius for density clustering, +.>For optimal clustering of the set of all N points, k denotes the number of significant digits, ++>Multidimensional vector P for the ith bid i And a j-th offer multidimensional vector P j A hausdorff distance therebetween.
Further, the classifying the points meeting the preset condition into the same cluster according to each hausdorff distance specifically includes:
if judging that the Hausdorff distance between each unprocessed point and the core point belongs to the condition that the direct density is reachable, the density is reachable or the densities are connected, classifying the core point and the unprocessed points corresponding to the core point into the same cluster.
Further, when the optimal clustering is performed, only other quotation points except the quotation points of the first reporting segments are optimally clustered.
Further, the similar offer screening module is specifically configured to: and selecting a unit corresponding to the core point of each cluster and units corresponding to other quotation points in the neighborhood of the core point according to the neighborhood radius of the optimal cluster and the clusters, the core points, the classified points and the unclassified points corresponding to the optimal cluster, so as to obtain the quotation similar unit cluster.
Compared with the prior art, the invention has the following beneficial effects:
the invention provides a method and a device for judging similarity of multi-section power quotation curves, wherein the method comprises the following steps: acquiring spot quotation data of a market main body and carrying out per unit on the spot quotation data to form a point set P; carrying out density clustering on the quotation points in the point set P according to a preset optimization variable constraint condition, and outputting a density clustering result; the density clustering result comprises all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points; optimizing and calculating a preset objective function based on the density clustering result, and obtaining the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified point and the unclassified point corresponding to the optimal cluster; and screening out quotation similar unit clusters according to the neighborhood radius of the optimal clusters and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal clusters. According to the invention, through optimizing the clustering parameters, the condition that the number of unclassified points is excessive can be avoided when the quotation point set is optimally clustered, so that the accuracy of similarity judgment of the multi-section electric power quotation curves is effectively improved.
Drawings
FIG. 1 is a flow chart of a method for determining similarity of a multi-segment power quotation curve according to one embodiment of the invention;
FIG. 2 is a schematic diagram of another flow chart of a method for determining similarity of a multi-segment power quotation curve according to one embodiment of the invention;
fig. 3 is a schematic structural diagram of a multi-segment power quotation curve similarity determination device according to an embodiment of the invention.
Detailed Description
The following description of the embodiments of the present invention will be made clearly and completely with reference to the accompanying drawings, in which it is apparent that the embodiments described are only some embodiments of the present invention, but not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the invention without making any inventive effort, are intended to be within the scope of the invention.
Referring to fig. 1, an embodiment of the present invention provides a method for determining similarity of a multi-segment power quotation curve, including the steps of:
s1, acquiring spot quotation data of a market main body and carrying out per unit on the spot quotation data to form a point set P.
S2, performing density clustering on the quotation points in the point set P according to a preset optimization variable constraint condition, and outputting a density clustering result; the density clustering result comprises all neighborhood radiuses conforming to the constraint conditions of the optimization variables and corresponding cluster types, core points, classified points and unclassified points.
Further, step S2 specifically includes:
s201, core point judgment is carried out on the quotation points in the point set P according to a preset density clustering neighborhood radius and a preset minimum quotation point number in the neighborhood.
S202, carrying out density clustering on a point set P based on each core point, calculating Hausdorff distance between the core point and an unprocessed point in the point set P, classifying the points meeting the preset condition into the same cluster according to each Hausdorff distance, and marking the points of the same cluster as processed points.
Further, the classifying the points meeting the preset condition into the same cluster according to each hausdorff distance specifically includes:
if judging that the Hausdorff distance between each unprocessed point and the core point belongs to the condition that the direct density is reachable, the density is reachable or the densities are connected, classifying the core point and the unprocessed points corresponding to the core point into the same cluster.
S203, repeating the steps S201-S202 until all points in the point set P are classified into clusters or judged as non-core points, and classifying points which do not belong to any clusters as unclassified points.
S204, outputting all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points.
And S3, carrying out optimization calculation on a preset objective function based on the density clustering result, and obtaining the neighborhood radius of the optimal cluster, the cluster class, the core point, the classified point and the unclassified point corresponding to the optimal cluster.
S4, screening out quotation similar unit clusters according to the neighborhood radius of the optimal clusters and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal clusters.
Further, step S4 specifically includes:
and selecting a unit corresponding to the core point of each cluster and units corresponding to other quotation points in the neighborhood of the core point according to the neighborhood radius of the optimal cluster and the clusters, the core points, the classified points and the unclassified points corresponding to the optimal cluster, so as to obtain the quotation similar unit cluster.
Further, when the optimal clustering is performed, only other quotation points except the quotation points of the first reporting segments are optimally clustered.
Referring to fig. 2, based on the above scheme, in order to better understand the multi-segment power quotation curve similarity determination method provided by the embodiment of the invention, the following details are described:
1. reading spot quotation data of a market main body in a statistical period, forming quotation point data (multi-section quotation forms multidimensional vector points) after per unit conversion, and counting into a point setIs->. wherein />、/>The declaration price and declaration quantity of the r declaration segment of the market subject represented by the nth point. Since price games in the spot market are mainly concentrated on the next few segments of quotations, the first few segments of quotations of the generator may be lower than the generation cost declaration, so that the bid is marked for starting upAnd therefore select other bid points than the first two to cluster optimally.
2. And carrying out optimal clustering on N quotation points except the first two segments of quotation points, wherein the optimal variable is the neighborhood radius d of the density clustering, and the objective function is as follows:
(1)
where M represents the number of bid points classified,for the m-th classified point->、/>Declaration price and Shen Baoliang, < > -of the r-th declaration section of the market subject represented by the m-th classification point, respectively>Core Point of cluster class where the mth classified Point is located +.>,/>Representation->To the point ofThe Hausdorff (Hausdorff) distance is calculated as shown in formula (6). The optimization variable d of the optimal clustering model has the following constraint conditions:
(2)
in the formula ,to optimally cluster a set of all N points; />K effective numbers are reserved for the neighborhood radius of the density cluster, so that the dispersion of the objective function is ensured, and the optimization model has a certain solution. />Is the hausdorff distance between the bid multidimensional vectors.
3. When the optimization model objective function is calculated in the step 2, density clustering is needed to be carried out on all N points in the point set P, so that constraint conditions conforming to the formula (2) are obtainedCorresponding cluster class, core point, classified point and unclassified point. The specific process of density clustering is as follows:
3.1 inputting the density clustering neighborhood radius of the optimization iterationAnd setting the minimum number MinNum of quotation points in the neighborhood. The MinNum value can be set according to the number of units in the spot market and the demand for density clustering. For example, there are 50 units in the market, minNum can be set to 5, that is, in each cluster after density clustering, 5 other units corresponding to the core point are judged to be similar to the quotation.
3.2 taking a point from the Point set PAnd judging whether the point is a core point or not.
If the neighborhood of the quotation pointAt least MinNum quotation points are included, and the quotation points are core points. All unprocessed points in a point setGathering and calculating unprocessed point and core point +.>Is a hausdorff distance.
If the quotation point is not the core point, the cycle is jumped out, and the next point in the point set P is judged as the core point.
The Haoskov distance calculation method is as follows:
(3)
in the formula ,is a dot set->Arrival point setIs one-way Haosdorf distance, +.>Is a dot set->To the point set->Is calculated as follows
(4)
(5)
in the formula ,representing two-dimensional points +.>And (4) point->Euclidean distance of (c):
(6)
in the formula ,meaning of (1) is that the point set->R in total) and the point set +.>All points (s in total) are subjected to Euclidean distance calculation, the minimum value is taken from the s Euclidean distances, and the maximum value is taken from all the r minimum Euclidean distances.
And 3.3, judging whether the Hausdorff distance between the unprocessed point and the core point belongs to the conditions of direct density accessibility, density accessibility and density connection, if so, classifying the points meeting the conditions into the same cluster class and marking the points (including the core point) in the same cluster class as processed. The judging method for the direct density is reachable, the density is reachable and the density is connected is as follows:
1) The direct density can be achieved: if the quotation point x is at a core point yWithin the neighborhood (also including boundary points), x and y are considered to be directly reachable densities.
2) The density can be achieved: if there are quotation points x, y, z, wherein the direct densities of x and y are reachable, the direct densities of y and z are reachable, but z is not at xIn the neighborhood. In this case, x and z cannot be reached directly in density, but are +.>The y-point in the neighborhood can be directly density reachable to the z-point, then x and z are defined as density reachable.
3) Density connection: if the quotation point w can not be directly reached or reached with the core point x, but the quotation point w can be reachedIn the neighborhood, x and w are defined to be connected in density.
3.4 repeating the steps 3.2 and 3.3 until all N points in the point set P are classified into clusters or judged as non-core points, and classifying points not belonging to any clusters as unclassified points. Outputting the density clustering neighborhood radius of the optimized iterationAll corresponding cluster classes, core points, classified points and unclassified points.
4. Optimizing the objective function and obtaining the neighborhood radius of the optimal clusterAnd its corresponding cluster class, core point, classified point, and unclassified point. When the method is applied to collusion analysis, the core point of each cluster class is +.>Corresponding units and other quotation points in the neighborhood of the units>The corresponding units are the unit clusters with the screened stock quotes being highly similar, and the possibility of collusion quotes exists. In addition, the method can be used for analyzing the quotation mode of a specific market body in a certain time period or summarizing the price and capacity characteristics of the quotation points of each cluster to form a general quotation strategy library and the like.
It should be noted that, for simplicity of description, the above method or flow embodiments are all described as a series of combinations of acts, but it should be understood by those skilled in the art that the embodiments of the present invention are not limited by the order of acts described, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are all alternative embodiments and that the actions involved are not necessarily required for the embodiments of the present invention.
It should be noted that, in the embodiment of the invention, on the basis of spot price report per unit, similarity judgment is performed on the multi-section power quotation curves through an optimal clustering model. When classifying the power quotation curves based on density clustering, the method of the invention creatively optimizes the clustering parameters. The neighborhood radius in the density cluster is used as an optimization variable in the optimization model, and the sum of the distances from all points to the core point is minimum as an objective function. The neighborhood radius only takes k-bit effective numbers, so that the optimal clustering model is converted into a discrete optimization model, the optimization model is guaranteed to have an extremum, and the process of infinite optimization is avoided. The method of the invention satisfies the optimization solution under the condition of certain precision, and meets the engineering requirement.
The method sets the distance from the unclassified point to the core point as the average value of the distances from all points in other classified clusters to the core point of the cluster, instead of 0 in the traditional density clustering. The inventive arrangement may trade off the number of classified and unclassified points, avoiding the situation where unclassified points are too many in order to minimize the sum of all point-to-center point distances. When the distances from the core point to other points are calculated, the accuracy of judging the similarity of the quotation curve of the method can be better exerted under the condition of multi-section quotation (five sections and above) by using the Haoskov distance instead of the Euclidean distance.
Referring to fig. 3, in order to solve the same technical problem, the present invention further provides a device for determining similarity of multi-segment power quotation curves, including:
the point set acquisition module 1 is used for acquiring spot quotation data of a market main body and carrying out per unit on the spot quotation data to form a point set P;
the density clustering module 2 is used for carrying out density clustering on the quotation points in the point set P according to a preset optimization variable constraint condition and outputting a density clustering result; the density clustering result comprises all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points;
the optimization calculation module 3 is used for carrying out optimization calculation on a preset objective function based on the density clustering result and obtaining the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified point and the unclassified point corresponding to the neighborhood radius;
and the similar quotation screening module 4 is used for screening quotation similar unit clusters according to the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal cluster.
Further, the density clustering module 2 is specifically configured to:
core point judgment is carried out on the quotation points in the point set P according to the preset density clustering neighborhood radius and the preset minimum quotation point number in the neighborhood;
performing density clustering on a point set P based on each core point, calculating Hausdorff distance between the core point and unprocessed points in the point set P, classifying the points meeting preset conditions into the same cluster according to each Hausdorff distance, and marking the points of the same cluster as processed points;
repeating the core point judgment and density clustering steps until all points in the point set P are classified into clusters or are judged to be non-core points, and classifying points which do not belong to any clusters as unclassified points;
and outputting all the neighborhood radiuses conforming to the constraint conditions of the optimized variables and the cluster class, the core point, the classified points and the unclassified points corresponding to the neighborhood radiuses.
Further, the classifying the points meeting the preset condition into the same cluster according to each hausdorff distance specifically includes:
if judging that the Hausdorff distance between each unprocessed point and the core point belongs to the condition that the direct density is reachable, the density is reachable or the densities are connected, classifying the core point and the unprocessed points corresponding to the core point into the same cluster.
Further, when the optimal clustering is performed, only other quotation points except the quotation points of the first reporting segments are optimally clustered.
Further, the similar offer screening module 4 is specifically configured to: and selecting a unit corresponding to the core point of each cluster and units corresponding to other quotation points in the neighborhood of the core point according to the neighborhood radius of the optimal cluster and the clusters, the core points, the classified points and the unclassified points corresponding to the optimal cluster, so as to obtain the quotation similar unit cluster.
It can be understood that the embodiment of the device item corresponds to the embodiment of the method item of the present invention, and the device for judging the similarity of the multi-segment power quotation curves provided by the embodiment of the present invention can implement the method for judging the similarity of the multi-segment power quotation curves provided by any one of the embodiments of the method item of the present invention.
While the foregoing is directed to the preferred embodiments of the present invention, it will be appreciated by those skilled in the art that changes and modifications may be made without departing from the principles of the invention, such changes and modifications are also intended to be within the scope of the invention.

Claims (8)

1. A multi-section electric power quotation curve similarity judging method is characterized by comprising the following steps:
acquiring spot quotation data of a market main body and carrying out per unit on the spot quotation data to form a point set P;
carrying out density clustering on the quotation points in the point set P according to a preset optimization variable constraint condition, and outputting a density clustering result; the density clustering result comprises all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points;
optimizing and calculating a preset objective function based on the density clustering result, and obtaining the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified point and the unclassified point corresponding to the optimal cluster;
screening out quotation similar unit clusters according to the neighborhood radius of the optimal clusters and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal clusters;
the density clustering is carried out on the quotation points in the point set P according to the preset optimization variable constraint condition, and a density clustering result is output, and the method specifically comprises the following steps:
core point judgment is carried out on the quotation points in the point set P according to the preset density clustering neighborhood radius and the preset minimum quotation point number in the neighborhood;
performing density clustering on a point set P based on each core point, calculating Hausdorff distance between the core point and unprocessed points in the point set P, classifying the points meeting preset conditions into the same cluster according to each Hausdorff distance, and marking the points of the same cluster as processed points;
repeating the core point judgment and density clustering steps until all points in the point set P are classified into clusters or are judged to be non-core points, and classifying points which do not belong to any clusters as unclassified points;
outputting all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points;
the objective function is:
wherein ,for the m-th classified point->,/>、/>Declaration price and Shen Baoliang, < > -of the r-th declaration section of the market subject represented by the m-th classification point, respectively>Core Point of cluster class where the mth classified Point is located +.>,/>Representation->To->M represents the number of classified quotation points, N is the total number of quotation points;
the constraint condition of the optimization variable d of the optimal cluster is as follows:
wherein ,neighborhood radius for density clustering, +.>For optimal clustering of the set of all N points, k denotes the number of significant digits, ++>Multidimensional vector P for the ith bid i And a j-th offer multidimensional vector P j A hausdorff distance therebetween.
2. The method for determining similarity of multiple segments of power quotation curves according to claim 1, wherein the classifying points satisfying the preset condition into the same cluster according to each hausdorff distance comprises:
if judging that the Hausdorff distance between each unprocessed point and the core point belongs to the condition that the direct density is reachable, the density is reachable or the densities are connected, classifying the core point and the unprocessed points corresponding to the core point into the same cluster.
3. The method for determining similarity of multiple segments of power quotation curves according to claim 1, wherein only the points other than the first ones of the declared segments are optimally clustered when the optimal clustering is performed.
4. The method for judging similarity of multi-segment power quotation curves according to claim 1, wherein the screening of the quotation similar unit clusters according to the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified points and the unclassified points corresponding to the neighborhood radius, specifically comprises:
and selecting a unit corresponding to the core point of each cluster and units corresponding to other quotation points in the neighborhood of the core point according to the neighborhood radius of the optimal cluster and the clusters, the core points, the classified points and the unclassified points corresponding to the optimal cluster, so as to obtain the quotation similar unit cluster.
5. A multi-segment power quotation curve similarity determination device, comprising:
the spot set acquisition module is used for acquiring spot quotation data of a market main body and carrying out per unit on the spot quotation data to form a spot set P;
the density clustering module is used for carrying out density clustering on the quotation points in the point set P according to a preset optimization variable constraint condition and outputting a density clustering result; the density clustering result comprises all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points;
the optimization calculation module is used for carrying out optimization calculation on a preset objective function based on the density clustering result and obtaining the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified point and the unclassified point corresponding to the neighborhood radius;
the similar quotation screening module is used for screening quotation similar unit clusters according to the neighborhood radius of the optimal cluster and the cluster class, the core point, the classified points and the unclassified points corresponding to the optimal cluster;
the density clustering module is specifically used for:
core point judgment is carried out on the quotation points in the point set P according to the preset density clustering neighborhood radius and the preset minimum quotation point number in the neighborhood;
performing density clustering on a point set P based on each core point, calculating Hausdorff distance between the core point and unprocessed points in the point set P, classifying the points meeting preset conditions into the same cluster according to each Hausdorff distance, and marking the points of the same cluster as processed points;
repeating the core point judgment and density clustering steps until all points in the point set P are classified into clusters or are judged to be non-core points, and classifying points which do not belong to any clusters as unclassified points;
outputting all neighborhood radiuses conforming to the constraint conditions of the optimized variables and corresponding cluster types, core points, classified points and unclassified points;
the objective function is:
wherein ,for the m-th classified point->,/>、/>Declaration price and Shen Baoliang, < > -of the r-th declaration section of the market subject represented by the m-th classification point, respectively>For the m-th classified pointIn the core point of cluster->,/>Representation->To->M represents the number of classified quotation points, N is the total number of quotation points;
the constraint condition of the optimization variable d of the optimal cluster is as follows:
wherein ,neighborhood radius for density clustering, +.>For optimal clustering of the set of all N points, k denotes the number of significant digits, ++>Multidimensional vector P for the ith bid i And a j-th offer multidimensional vector P j A hausdorff distance therebetween.
6. The apparatus for determining similarity of multiple segments of power quotation curves according to claim 5, wherein the classifying points satisfying a predetermined condition into the same cluster according to each hausdorff distance comprises:
if judging that the Hausdorff distance between each unprocessed point and the core point belongs to the condition that the direct density is reachable, the density is reachable or the densities are connected, classifying the core point and the unprocessed points corresponding to the core point into the same cluster.
7. The multi-segment power quotation curve similarity judging device according to claim 5, wherein when the optimal clustering is performed, only the quotation points other than the preceding claims segment quotations are optimally clustered.
8. The multi-segment power quotation curve similarity determination device of claim 5, wherein the similarity quotation screening module is specifically configured to: and selecting a unit corresponding to the core point of each cluster and units corresponding to other quotation points in the neighborhood of the core point according to the neighborhood radius of the optimal cluster and the clusters, the core points, the classified points and the unclassified points corresponding to the optimal cluster, so as to obtain the quotation similar unit cluster.
CN202110222513.7A 2021-02-26 2021-02-26 Multi-section electric power quotation curve similarity judging method and device Active CN113011472B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110222513.7A CN113011472B (en) 2021-02-26 2021-02-26 Multi-section electric power quotation curve similarity judging method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110222513.7A CN113011472B (en) 2021-02-26 2021-02-26 Multi-section electric power quotation curve similarity judging method and device

Publications (2)

Publication Number Publication Date
CN113011472A CN113011472A (en) 2021-06-22
CN113011472B true CN113011472B (en) 2023-09-01

Family

ID=76386848

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110222513.7A Active CN113011472B (en) 2021-02-26 2021-02-26 Multi-section electric power quotation curve similarity judging method and device

Country Status (1)

Country Link
CN (1) CN113011472B (en)

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109816398A (en) * 2018-12-29 2019-05-28 昆明电力交易中心有限责任公司 A kind of method, apparatus and medium for screening Power Generation collusion behavior
CN110135450A (en) * 2019-03-26 2019-08-16 中电莱斯信息系统有限公司 A kind of hotspot path analysis method based on Density Clustering
JP2019139651A (en) * 2018-02-14 2019-08-22 Kddi株式会社 Program, device and method for classifying unknown multi-dimensional vector data groups into classes
CN111582406A (en) * 2020-05-31 2020-08-25 重庆大学 Power equipment state monitoring data clustering method and system
CN111861397A (en) * 2020-07-22 2020-10-30 亿景智联(北京)科技有限公司 Intelligent scheduling platform for client visit
CN111931839A (en) * 2020-08-04 2020-11-13 西门子电力自动化有限公司 Method and device for on-line monitoring of switch equipment
CN112328728A (en) * 2020-11-30 2021-02-05 浙江师范大学 Clustering method and device for mining traveler track, electronic device and storage medium

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2019139651A (en) * 2018-02-14 2019-08-22 Kddi株式会社 Program, device and method for classifying unknown multi-dimensional vector data groups into classes
CN109816398A (en) * 2018-12-29 2019-05-28 昆明电力交易中心有限责任公司 A kind of method, apparatus and medium for screening Power Generation collusion behavior
CN110135450A (en) * 2019-03-26 2019-08-16 中电莱斯信息系统有限公司 A kind of hotspot path analysis method based on Density Clustering
CN111582406A (en) * 2020-05-31 2020-08-25 重庆大学 Power equipment state monitoring data clustering method and system
CN111861397A (en) * 2020-07-22 2020-10-30 亿景智联(北京)科技有限公司 Intelligent scheduling platform for client visit
CN111931839A (en) * 2020-08-04 2020-11-13 西门子电力自动化有限公司 Method and device for on-line monitoring of switch equipment
CN112328728A (en) * 2020-11-30 2021-02-05 浙江师范大学 Clustering method and device for mining traveler track, electronic device and storage medium

Also Published As

Publication number Publication date
CN113011472A (en) 2021-06-22

Similar Documents

Publication Publication Date Title
CN111199016B (en) Daily load curve clustering method for improving K-means based on DTW
CN111724278A (en) Fine classification method and system for power multi-load users
CN110827924B (en) Clustering method and device for gene expression data, computer equipment and storage medium
CN110826618A (en) Personal credit risk assessment method based on random forest
CN112819299A (en) Differential K-means load clustering method based on center optimization
WO2018006631A1 (en) User level automatic segmentation method and system
CN113077444A (en) CNN-based ultrasonic nondestructive detection image defect classification method
CN110929399A (en) Wind power output typical scene generation method based on BIRCH clustering and Wasserstein distance
CN110738232A (en) grid voltage out-of-limit cause diagnosis method based on data mining technology
CN110796159A (en) Power data classification method and system based on k-means algorithm
CN110704710A (en) Chinese E-commerce emotion classification method based on deep learning
CN114925767A (en) Scene generation method and device based on variational self-encoder
CN116128544A (en) Active auditing method and system for electric power marketing abnormal business data
CN114549897A (en) Training method and device for classification model and storage medium
CN113011472B (en) Multi-section electric power quotation curve similarity judging method and device
CN113255752A (en) Solid material consistency sorting method based on feature clustering
CN117170979A (en) Energy consumption data processing method, system, equipment and medium for large-scale equipment
CN112149052A (en) Daily load curve clustering method based on PLR-DTW
Liu et al. A weight-incorporated similarity-based clustering ensemble method
CN116719714A (en) Training method and corresponding device for screening model of test case
CN111931861B (en) Anomaly detection method for heterogeneous data set and computer-readable storage medium
CN115017988A (en) Competitive clustering method for state anomaly diagnosis
CN109543930B (en) Dispatching method and system based on machine multi-level steady-state production rate
CN113535527A (en) Load shedding method and system for real-time flow data predictive analysis
CN111933226A (en) Artificial intelligence material screening method based on classification model

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant