CN104376057A - Self-adaptation clustering method based on maximum distance, minimum distance and K-means - Google Patents
Self-adaptation clustering method based on maximum distance, minimum distance and K-means Download PDFInfo
- Publication number
- CN104376057A CN104376057A CN201410621601.4A CN201410621601A CN104376057A CN 104376057 A CN104376057 A CN 104376057A CN 201410621601 A CN201410621601 A CN 201410621601A CN 104376057 A CN104376057 A CN 104376057A
- Authority
- CN
- China
- Prior art keywords
- bunch
- distance
- center
- self
- clustering
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/28—Databases characterised by their database models, e.g. relational or object models
- G06F16/284—Relational databases
- G06F16/285—Clustering or classification
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23211—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with adaptive number of clusters
Landscapes
- Engineering & Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Theoretical Computer Science (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Probability & Statistics with Applications (AREA)
- Artificial Intelligence (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
The invention discloses a self-adaptation clustering method based on the maximum distance, the minimum distance and the K-means. The method solves the problems that a traditional K-means clustering algorithm is sensitive to an initial center, and the number of clusters needs to be determined in advance. The self-adaptation clustering method is a strategic method. According to the self-adaptation clustering method, selection of the initial center and initial centers of the newly-added clusters is not random and is obtained through calculation, two points serve as the initial centers, the distances between the two points and the two points with the maximum distance in data sets (subsets) are minimum, and isolated points can be effectively prevented from being selected to serve as the initial centers; meanwhile, the number of iterations in the clustering process can be effectively decreased, and the better clustering results can be obtained. According to the self-adaptation clustering method, whether the number of the clusters is increased or not and whether clustering is finished or not are determined according to SSE (sum of squared errors) conditions of the clusters and the total SSE variation trend, the number of the clusters can be additionally determined in a self-adaptation mode, and the self-adaptation clustering method is particularly suitable for the application scenes with the difficultly-determined number of clusters.
Description
Technical field
The present invention relates to a kind of adaptive clustering scheme based on minimax Distance geometry K-means clustering algorithm, belong to data mining technology field.
Background technology
Data Placement is become meaningful or useful group (bunch) by cluster analysis, and its target is: the object in group is similar each other, and in different groups to as if different.Similarity in group is larger, and between group, difference is larger, and cluster is better.Under certain meaning, cluster analysis just solves the starting point of other problems.In information retrieval, WWW comprises hundreds of millions of Web page, and network search engines may return thousands of pages.Can use cluster that Search Results is divided into some bunches, each bunch of certain particular aspects of catching inquiry, each classification (bunch) can be divided into again some subclass (submanifold), thus produces a hierarchical structure, helps user to explore Query Result further.In weather, cluster analysis has been used for finding to have land weather polar region and the marine atmosphere pressure pattern of appreciable impact.Learn within and medical science aspect, cluster has been used for identifying dissimilar depression, and cluster analysis simultaneously also can be used for the spatio-temporal distribution pattern of detection disease.Therefore no matter be intended to understand or practicality, cluster analysis all plays important role at numerous areas, and these fields comprise: statistics, pattern-recognition, information retrieval, machine learning and data mining.
Internal authority academic conference IEEE International Conference on Data Mining (ICDM) has chosen the ten communication classics algorithm of Data Mining in Dec, 2006, and K-means clustering algorithm is one of them.K-means method comparison is simple, and first, selects K initial center, wherein K is the parameter that user specifies, namely bunch number.Each point is divided into nearest center, and the point set being divided into a center is one bunch.Then, calculate the barycenter of each bunch, it can be used as the center of each bunch.Repeat division points and upgrade bunch central step, until bunch no longer to change, or equivalently, until center does not change.But, select K-means clustering algorithm initial center randomly, cluster not only can be made to be absorbed in locally optimal solution, and optimum cluster result may be can not get.Select suitable initial center, not only can reduce the iterations of cluster process, and the effect of cluster can be improved, and random selecting initial center may choose isolated point as initial center, cause iterations too much, or obtain irrational cluster result.K-means clustering algorithm is not only responsive to initial center, and the selection of bunch number is also the key factor affecting cluster result simultaneously.And the present invention can solve problem above preferably.
Summary of the invention
The object of the invention is to provide a kind of adaptive clustering scheme based on minimax Distance geometry K-means clustering algorithm, this method solve the responsive to initial center of traditional K-means clustering algorithm existence, and bunch number needs pre-determined problem.The method can be avoided choosing isolated point as initial center effectively, effectively can reduce the iterations of cluster process simultaneously, and can obtain good cluster result.
The technical solution adopted for the present invention to solve the technical problems is: the present invention is a kind of tactic method.K-means is based on the clustering technique of prototype, division, be widely applied with its simple algorithm, faster cluster speed and stable cluster result, but also there are some problems in basic K-means algorithm, such as K-means difficult aspheric bunch and different size bunch, and by the impact of noise and outlier.Cluster result is also very by the impact that cluster centre number and initial center are selected simultaneously.
The shortcoming that the present invention is directed to K-means clustering algorithm proposes one based on minimax Distance geometry K-means algorithm, the adaptive clustering scheme of flex point as cluster termination condition is there is with the SSE of data lump (Sum of Square Error, error sum of squares).The method is not random to the selection of initial center, and is through and calculates, and more effectively can avoid choosing isolated point as initial center, effectively can reduce the iterations of cluster process simultaneously, and obtain good cluster result; In addition determine whether to increase bunch for each bunch of SSE situation and total SSE variation tendency and whether terminate cluster, thus can self-adaptation determination number of clusters, be particularly suitable for the application scenarios that those number of clusters are difficult to determine.
Tradition K-means clustering algorithm n data point is divided in K bunch, makes each data point minimum to the distance sum at its bunch of center, algorithm process process:
(1) Stochastic choice K data point is as initial center
(2) each data point is divided into nearest center, forms K bunch
(3) barycenter of each bunch is calculated, the center to it can be used as bunch
(4) step (2) and (3) is repeated until center no longer changes
Following definition and computing formula is used in the present invention:
(1) similarity degree between data point can be determined by the distance calculated between two between data, Euclidean distance is the most known distance measure, tie up in theorem in Euclid space at n, each point is that n ties up real number vector, and the Euclidean distance in space between x and y 2 is defined as:
(2) cluster objective function is used for measuring clustering result quality, and use error quadratic sum of the present invention is as the objective function of tolerance clustering result quality, and the error sum of squares SSE of data lump is defined as:
Wherein c
ithe i-th bunch of S
icenter.
Method flow:
The present invention proposes a kind of adaptive clustering scheme based on minimax Distance geometry K-means clustering algorithm, and the method is to data set S={x
1, x
2..., x
ncluster comprise the steps:
Step 1: calculate the Euclidean distance between any two data points in data set S;
Step 2: two the some x finding lie farthest away in data set S
iand x
j, then find distance x
inearest some x
pwith distance x
jnearest some x
q;
Step 3: by x
pand x
qas initial clustering (bunch) center, now cluster (bunch) centralization C
(0)={ x
p, x
q, separately establish t=1, SSE
(0)=∞;
Step 4: adopt K-means clustering algorithm, dividing data collection S, upgrades each bunch of center, obtains new bunch centralization C
(t)with | C
(t)| individual bunch, now
Step 5: calculate respectively each point in each bunch to the square distance at bunch center and
k=1,2 ..., | C
(t)|, and cumulative obtain total error sum of squares SSE
{ t}if,
wherein δ is threshold value, goes to step 9; Otherwise continue;
Step 6: select
maximum bunch, be designated as S
max, its center is designated as c
max, remove C
(t)in the cluster centre of this bunch, even C
(t)=C
(t)-{ c
max;
Step 7: find data subset S
maxtwo some x of middle lie farthest away
iand x
j, then find distance x
inearest some x
pwith distance x
jnearest some x
q;
Step 8: by x
pand x
qbe incorporated to C
(t)even, C
(t)=C
(t)∪ { x
p, x
q, then make t=t+1, go to step 4;
Step 9: getting last cluster result is final cluster result, namely final cluster centre collection C=C
(t-1).
First step 2 of the present invention, step 3 find two some x of lie farthest away in data set S
iand x
j, avoid problem likely too contiguous when K-means algorithm initial center is chosen; Find distance x again
inearest some x
pwith distance x
jnearest some x
q, the nearest neighbor point choosing lie farthest away two points is in order to avoid lie farthest away two points are isolated points; And with these two closest approach x farthest
pand x
qfor initial center, carry out first time K-means cluster.Whether step 5, step 6 determine to increase bunch for each bunch of SSE situation and total SSE variation tendency and whether terminate cluster, thus can self-adaptation determination number of clusters, are particularly suitable for the application scenarios that those number of clusters are difficult to determine.Step 6 ~ 8 divide bunch and adopt the present invention's distinctive minimax distance method be new bunch select initial center.
Beneficial effect:
1, the selection of the present invention to the initial center of initial center and newly-increased bunch is not random, and be through and calculate, can effectively avoid choosing isolated point as initial center, effectively can reduce the iterations of cluster process simultaneously, and good cluster result can be obtained.
2, the present invention is directed to each bunch of SSE situation and total SSE variation tendency determine whether to increase bunch and whether terminate cluster, thus can self-adaptation determination number of clusters, be particularly suitable for the application scenarios that those number of clusters are difficult to determine.
3, the present invention is applied to data mining technology field.
Accompanying drawing explanation
Fig. 1 is method flow diagram of the present invention.
Embodiment
Below in conjunction with Figure of description, the invention is described in further detail.
For convenience of description, give one example below and carry out brief description:
Given data collection: x
1=(0,0), x
2=(1,1), x
3=(2,2), x
4=(4,4), x
5=(5,5), x
6=(2,1), x
7=(5,4), x
8=(3,6), x
9=(7,4), x
10=(8,5), threshold value σ=0.3
As shown in Figure 1, the invention provides a kind of adaptive clustering scheme based on minimax Distance geometry K-means, the method comprises the steps:
(1) according to above-mentioned formula (1), calculate the distance of data centralization between two between data, select two points farthest, for given data collection known some x
1=(0,0) and some x
10=(8,5) are distance two points farthest;
(2) with time point x
2=(1,1) is x
1x, apart from minimum point, is put in=(0,0)
9=(7,4) are x
10=(8,5) are apart from minimum point;
(3) initial center point c is made
1and c
2store minimax range points x respectively
2=(1,1) and some x
9=(7,4)
(4) by initial center point c
1and c
2substitute into K-means clustering algorithm, obtain two bunches of S
1and S
2, S
1={ x
1, x
2, x
3, x
6and S
2={ x
4, x
5, x
7, x
8, x
9, x
10, calculate SSE
1, SSE
2, SSE
(1)=SSE
1+ SSE
2;
(5) because of SSE
1/ | S
1| <SSE
2/ | S
2|, by bunch S
2in data regard new data set as, calculating now minimax range points is x
5=(5,5) and x
9c is also used in=(7,4)
21and c
22store;
(6) new initial center C={c
1, c
21, c
22and substitute into K-means clustering algorithm, obtain 3 bunches of S
21, S
22and S
23, S
21={ x
1, x
2, x
3, x
6, S
22={ x
4, x
5, x
7, x
8, S
23={ x
9, x
10, calculate SSE
21, SSE
22, SSE
23, SSE
(2)=SSE
21+ SSE
22+ SSE
23, because of
cluster terminates.
Claims (3)
1. based on an adaptive clustering scheme for minimax Distance geometry K-means clustering algorithm, it is characterized in that, described method is to data set S={x
1, x
2..., x
ncluster comprise the steps:
Step 1: calculate the Euclidean distance between any two data points in data set S;
Step 2: two the some x finding lie farthest away in data set S
iand x
j, then find distance x
inearest some x
pwith distance x
jnearest some x
q;
Step 3: by x
pand x
qas initial clustering (bunch) center, now cluster (bunch) centralization C
(0)={ x
p, x
q, separately establish t=1, SSE
(0)=∞;
Step 4: adopt K-means clustering algorithm, dividing data collection S, upgrades each bunch of center, obtains new bunch centralization C
(t)with | C
(t)| individual bunch, now
Step 5: calculate respectively each point in each bunch to the square distance at bunch center and
and cumulative obtain total error sum of squares SSE
{ t}if,
wherein δ is threshold value, goes to step 9; Otherwise continue;
Step 6: select
maximum bunch, be designated as S
max, its center is designated as c
max, remove C
(t)in the cluster centre of this bunch, even C
(t)=C
(t)-{ c
max;
Step 7: find data subset S
maxtwo some x of middle lie farthest away
iand x
j, then find distance x
inearest some x
pwith distance x
jnearest some x
q;
Step 8: by x
pand x
qbe incorporated to C
(t)even, C
(t)=C
(t)∪ { x
p, x
q, then make t=t+1, go to step 4;
Step 9: getting last cluster result is final cluster result, namely final cluster centre collection C=C
(t-1).
2. a kind of adaptive clustering scheme based on minimax Distance geometry K-means clustering algorithm according to claim 1, to it is characterized in that: the selection of described method to the initial center of initial center and newly-increased bunch is not random, and be through and calculate.
3. a kind of adaptive clustering scheme based on minimax Distance geometry K-means clustering algorithm according to claim 1, is characterized in that: described method is applied to data mining technology field.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410621601.4A CN104376057A (en) | 2014-11-06 | 2014-11-06 | Self-adaptation clustering method based on maximum distance, minimum distance and K-means |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410621601.4A CN104376057A (en) | 2014-11-06 | 2014-11-06 | Self-adaptation clustering method based on maximum distance, minimum distance and K-means |
Publications (1)
Publication Number | Publication Date |
---|---|
CN104376057A true CN104376057A (en) | 2015-02-25 |
Family
ID=52554964
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410621601.4A Pending CN104376057A (en) | 2014-11-06 | 2014-11-06 | Self-adaptation clustering method based on maximum distance, minimum distance and K-means |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN104376057A (en) |
Cited By (28)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104850629A (en) * | 2015-05-21 | 2015-08-19 | 杭州天宽科技有限公司 | Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm |
CN105426387A (en) * | 2015-10-23 | 2016-03-23 | 北京锐安科技有限公司 | K-means algorithm based map aggregation method |
CN106650228A (en) * | 2016-11-08 | 2017-05-10 | 浙江理工大学 | Noise data removal method through improved k-means algorithm and implementation system |
WO2017181660A1 (en) * | 2016-04-21 | 2017-10-26 | 华为技术有限公司 | K-means algorithm-based data clustering method and device |
CN107784597A (en) * | 2017-09-19 | 2018-03-09 | 平安科技(深圳)有限公司 | Trip mode recognition methods, device, terminal device and storage medium |
CN107886071A (en) * | 2017-11-10 | 2018-04-06 | 南京航空航天大学 | A kind of processing method of fibre reinforced composites damage acoustic emission signal |
CN108268611A (en) * | 2017-12-29 | 2018-07-10 | 天津南大通用数据技术股份有限公司 | A kind of method and device of the k-means text clusters based on MapReduce |
CN108733003A (en) * | 2017-04-20 | 2018-11-02 | 南京理工大学 | Slewing parts process working hour prediction technique based on kmeans clustering algorithms and system |
CN108804520A (en) * | 2018-04-27 | 2018-11-13 | 厦门快商通信息技术有限公司 | A kind of visitor's behavior sorting technique and system |
CN108983251A (en) * | 2018-07-27 | 2018-12-11 | 杭州电子科技大学 | A kind of portable standing long jump measurement method |
CN109271462A (en) * | 2018-11-23 | 2019-01-25 | 河北航天信息技术有限公司 | A kind of taxpayer's tax registration registered address information cluster method based on K-means algorithm model |
CN109615426A (en) * | 2018-12-05 | 2019-04-12 | 重庆锐云科技有限公司 | A kind of marketing method based on Customer clustering, system |
CN109685092A (en) * | 2018-08-21 | 2019-04-26 | 中国平安人寿保险股份有限公司 | Clustering method, equipment, storage medium and device based on big data |
CN109948657A (en) * | 2019-02-25 | 2019-06-28 | 湖北同诚通用航空有限公司 | Withered trees recognition methods and equipment based on visible images |
CN109977803A (en) * | 2019-03-07 | 2019-07-05 | 北京超维度计算科技有限公司 | A kind of face identification method based on Kmeans supervised learning |
CN110717551A (en) * | 2019-10-18 | 2020-01-21 | 中国电子信息产业集团有限公司第六研究所 | Training method and device of flow identification model and electronic equipment |
CN110738272A (en) * | 2019-10-23 | 2020-01-31 | 智洋创新科技股份有限公司 | method for labeling visualized mechanical continuous alarm samples of power transmission line channel |
CN110781963A (en) * | 2019-10-28 | 2020-02-11 | 西安电子科技大学 | K-means clustering-based aerial target clustering method |
CN110837873A (en) * | 2019-11-18 | 2020-02-25 | 上海吉七物联网科技有限公司 | Three-dimensional point cloud clustering algorithm |
CN111061865A (en) * | 2018-10-17 | 2020-04-24 | 武汉斗鱼网络科技有限公司 | Method and computing device for text mining of session scene |
CN111476449A (en) * | 2019-10-09 | 2020-07-31 | 北京交通大学 | Subway station operation period division method based on improved K-means clustering algorithm |
CN111966951A (en) * | 2020-07-06 | 2020-11-20 | 东南数字经济发展研究院 | User group hierarchy dividing method based on social e-commerce transaction data |
CN112100497A (en) * | 2020-09-14 | 2020-12-18 | 北京嘀嘀无限科技发展有限公司 | Data processing method and device, electronic equipment and readable storage medium |
CN112383932A (en) * | 2021-01-12 | 2021-02-19 | 华东交通大学 | User association and resource allocation joint optimization method based on clustering |
CN112699913A (en) * | 2020-11-25 | 2021-04-23 | 国网湖南省电力有限公司 | Transformer area household variable relation abnormity diagnosis method and device |
CN113301600A (en) * | 2021-07-27 | 2021-08-24 | 南京中网卫星通信股份有限公司 | Abnormal data detection method and device for performance of satellite and wireless communication converged network |
US11664129B2 (en) | 2019-08-13 | 2023-05-30 | International Business Machines Corporation | Mini-batch top-k-medoids for extracting specific patterns from CGM data |
CN118212001A (en) * | 2024-03-19 | 2024-06-18 | 涅生科技(广州)股份有限公司 | Electronic commerce market trend prediction system based on big data analysis |
-
2014
- 2014-11-06 CN CN201410621601.4A patent/CN104376057A/en active Pending
Cited By (39)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104850629A (en) * | 2015-05-21 | 2015-08-19 | 杭州天宽科技有限公司 | Analysis method of massive intelligent electricity-consumption data based on improved k-means algorithm |
CN105426387B (en) * | 2015-10-23 | 2020-02-07 | 北京锐安科技有限公司 | Map aggregation method based on K-means algorithm |
CN105426387A (en) * | 2015-10-23 | 2016-03-23 | 北京锐安科技有限公司 | K-means algorithm based map aggregation method |
WO2017181660A1 (en) * | 2016-04-21 | 2017-10-26 | 华为技术有限公司 | K-means algorithm-based data clustering method and device |
CN106650228A (en) * | 2016-11-08 | 2017-05-10 | 浙江理工大学 | Noise data removal method through improved k-means algorithm and implementation system |
CN106650228B (en) * | 2016-11-08 | 2019-02-26 | 浙江理工大学 | It improves the noise data minimizing technology of k-means algorithm and implements system |
CN108733003A (en) * | 2017-04-20 | 2018-11-02 | 南京理工大学 | Slewing parts process working hour prediction technique based on kmeans clustering algorithms and system |
CN107784597A (en) * | 2017-09-19 | 2018-03-09 | 平安科技(深圳)有限公司 | Trip mode recognition methods, device, terminal device and storage medium |
CN107886071A (en) * | 2017-11-10 | 2018-04-06 | 南京航空航天大学 | A kind of processing method of fibre reinforced composites damage acoustic emission signal |
CN108268611A (en) * | 2017-12-29 | 2018-07-10 | 天津南大通用数据技术股份有限公司 | A kind of method and device of the k-means text clusters based on MapReduce |
CN108268611B (en) * | 2017-12-29 | 2021-03-26 | 天津南大通用数据技术股份有限公司 | K-means text clustering method and device based on MapReduce |
CN108804520A (en) * | 2018-04-27 | 2018-11-13 | 厦门快商通信息技术有限公司 | A kind of visitor's behavior sorting technique and system |
CN108983251A (en) * | 2018-07-27 | 2018-12-11 | 杭州电子科技大学 | A kind of portable standing long jump measurement method |
CN109685092A (en) * | 2018-08-21 | 2019-04-26 | 中国平安人寿保险股份有限公司 | Clustering method, equipment, storage medium and device based on big data |
CN109685092B (en) * | 2018-08-21 | 2024-02-06 | 中国平安人寿保险股份有限公司 | Clustering method, equipment, storage medium and device based on big data |
CN111061865A (en) * | 2018-10-17 | 2020-04-24 | 武汉斗鱼网络科技有限公司 | Method and computing device for text mining of session scene |
CN109271462A (en) * | 2018-11-23 | 2019-01-25 | 河北航天信息技术有限公司 | A kind of taxpayer's tax registration registered address information cluster method based on K-means algorithm model |
CN109615426A (en) * | 2018-12-05 | 2019-04-12 | 重庆锐云科技有限公司 | A kind of marketing method based on Customer clustering, system |
CN109948657A (en) * | 2019-02-25 | 2019-06-28 | 湖北同诚通用航空有限公司 | Withered trees recognition methods and equipment based on visible images |
CN109977803A (en) * | 2019-03-07 | 2019-07-05 | 北京超维度计算科技有限公司 | A kind of face identification method based on Kmeans supervised learning |
US11664129B2 (en) | 2019-08-13 | 2023-05-30 | International Business Machines Corporation | Mini-batch top-k-medoids for extracting specific patterns from CGM data |
CN111476449A (en) * | 2019-10-09 | 2020-07-31 | 北京交通大学 | Subway station operation period division method based on improved K-means clustering algorithm |
CN111476449B (en) * | 2019-10-09 | 2022-05-24 | 北京交通大学 | Subway station operation time interval division method based on improved k-means clustering algorithm |
CN110717551B (en) * | 2019-10-18 | 2023-01-20 | 中国电子信息产业集团有限公司第六研究所 | Training method and device of flow identification model and electronic equipment |
CN110717551A (en) * | 2019-10-18 | 2020-01-21 | 中国电子信息产业集团有限公司第六研究所 | Training method and device of flow identification model and electronic equipment |
CN110738272A (en) * | 2019-10-23 | 2020-01-31 | 智洋创新科技股份有限公司 | method for labeling visualized mechanical continuous alarm samples of power transmission line channel |
CN110781963B (en) * | 2019-10-28 | 2022-03-04 | 西安电子科技大学 | K-means clustering-based aerial target clustering method |
CN110781963A (en) * | 2019-10-28 | 2020-02-11 | 西安电子科技大学 | K-means clustering-based aerial target clustering method |
CN110837873A (en) * | 2019-11-18 | 2020-02-25 | 上海吉七物联网科技有限公司 | Three-dimensional point cloud clustering algorithm |
CN111966951A (en) * | 2020-07-06 | 2020-11-20 | 东南数字经济发展研究院 | User group hierarchy dividing method based on social e-commerce transaction data |
CN112100497B (en) * | 2020-09-14 | 2021-10-19 | 北京嘀嘀无限科技发展有限公司 | Data processing method and device, electronic equipment and readable storage medium |
CN112100497A (en) * | 2020-09-14 | 2020-12-18 | 北京嘀嘀无限科技发展有限公司 | Data processing method and device, electronic equipment and readable storage medium |
CN112699913A (en) * | 2020-11-25 | 2021-04-23 | 国网湖南省电力有限公司 | Transformer area household variable relation abnormity diagnosis method and device |
CN112699913B (en) * | 2020-11-25 | 2023-08-29 | 国网湖南省电力有限公司 | Method and device for diagnosing abnormal relationship of household transformer in transformer area |
CN112383932B (en) * | 2021-01-12 | 2021-04-09 | 华东交通大学 | User association and resource allocation joint optimization method based on clustering |
CN112383932A (en) * | 2021-01-12 | 2021-02-19 | 华东交通大学 | User association and resource allocation joint optimization method based on clustering |
CN113301600A (en) * | 2021-07-27 | 2021-08-24 | 南京中网卫星通信股份有限公司 | Abnormal data detection method and device for performance of satellite and wireless communication converged network |
CN118212001A (en) * | 2024-03-19 | 2024-06-18 | 涅生科技(广州)股份有限公司 | Electronic commerce market trend prediction system based on big data analysis |
CN118212001B (en) * | 2024-03-19 | 2024-11-08 | 涅生科技(广州)股份有限公司 | Electronic commerce market trend prediction system based on big data analysis |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN104376057A (en) | Self-adaptation clustering method based on maximum distance, minimum distance and K-means | |
Elbatta et al. | A dynamic method for discovering density varied clusters | |
Nagpal et al. | Review based on data clustering algorithms | |
Amsaleg et al. | Intrinsic dimensionality estimation within tight localities | |
CN108428015B (en) | Wind power prediction method based on historical meteorological data and random simulation | |
CN109359135B (en) | Time sequence similarity searching method based on segment weight | |
CN110879856A (en) | Social group classification method and system based on multi-feature fusion | |
Lin et al. | A SOM-based approach to estimating design hyetographs of ungauged sites | |
CN109151727B (en) | WLAN fingerprint positioning database construction method based on improved DBN | |
Si et al. | Dynamic topology representing networks | |
CN102938161A (en) | Three-dimensional shape automatic partition method based on Mean Shift | |
CN104850712A (en) | Method for finding topological neighbors in sampled data of physical surface | |
CN111027574A (en) | Building mode identification method based on graph convolution | |
CN111709478A (en) | Fuzzy clustering method and device based on anchor graph | |
CN102110173A (en) | Improved multi-path spectral clustering method for affinity matrix | |
Purnawansyah et al. | K-Means clustering implementation in network traffic activities | |
Dai et al. | Clustering of DOA data in radar pulse based on SOFM and CDbw | |
Fang et al. | Quasi-cluster centers clustering algorithm based on potential entropy and t-distributed stochastic neighbor embedding | |
Zhang et al. | Semi-supervised community detection via constraint matrix construction and active node selection | |
Meng et al. | Passage method for nonlinear dimensionality reduction of data on multi-cluster manifolds | |
Canales et al. | Modification of the growing neural gas algorithm for cluster analysis | |
Amsaleg et al. | Intrinsic dimensionality estimation within tight localities: A theoretical and experimental analysis | |
Zhang et al. | Color clustering using self-organizing maps | |
CN113672975B (en) | Privacy protection method and device for user track | |
Zhao et al. | An improved multidimensional scaling localisation algorithm |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20150225 |
|
RJ01 | Rejection of invention patent application after publication |