WO2022088390A1 - Image incremental clustering method and apparatus, electronic device, storage medium and program product - Google Patents
Image incremental clustering method and apparatus, electronic device, storage medium and program product Download PDFInfo
- Publication number
- WO2022088390A1 WO2022088390A1 PCT/CN2020/134074 CN2020134074W WO2022088390A1 WO 2022088390 A1 WO2022088390 A1 WO 2022088390A1 CN 2020134074 W CN2020134074 W CN 2020134074W WO 2022088390 A1 WO2022088390 A1 WO 2022088390A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cluster
- clusters
- image data
- center
- similarity
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 79
- 239000011159 matrix material Substances 0.000 claims description 24
- 238000004590 computer program Methods 0.000 claims description 17
- 230000011218 segmentation Effects 0.000 claims description 5
- 238000010586 diagram Methods 0.000 description 15
- 239000000523 sample Substances 0.000 description 15
- 230000000694 effects Effects 0.000 description 14
- 230000009286 beneficial effect Effects 0.000 description 11
- 230000006870 function Effects 0.000 description 7
- 238000012545 processing Methods 0.000 description 7
- 101100481876 Danio rerio pbk gene Proteins 0.000 description 5
- 101100481878 Mus musculus Pbk gene Proteins 0.000 description 5
- 238000004364 calculation method Methods 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 4
- 230000018109 developmental process Effects 0.000 description 4
- 238000004422 calculation algorithm Methods 0.000 description 3
- 238000010521 absorption reaction Methods 0.000 description 2
- 230000007423 decrease Effects 0.000 description 2
- 238000013135 deep learning Methods 0.000 description 2
- 238000013461 design Methods 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000003203 everyday effect Effects 0.000 description 2
- 230000007774 longterm Effects 0.000 description 2
- 238000011160 research Methods 0.000 description 2
- 238000013473 artificial intelligence Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000001815 facial effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 238000003064 k means clustering Methods 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 238000012423 maintenance Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/762—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
- G06V10/763—Non-hierarchical techniques, e.g. based on statistics of modelling distributions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
- G06F18/232—Non-hierarchical techniques
- G06F18/2321—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
- G06F18/23213—Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/764—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using classification, e.g. of video objects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/77—Processing image or video features in feature spaces; using data integration or data reduction, e.g. principal component analysis [PCA] or independent component analysis [ICA] or self-organising maps [SOM]; Blind source separation
- G06V10/774—Generating sets of training patterns; Bootstrap methods, e.g. bagging or boosting
Definitions
- the present disclosure is based on a Chinese patent application with an application number of 202011185911.8 and an application date of October 30, 2020, and claims the priority of the Chinese patent application, the entire contents of which are hereby incorporated by reference into the present disclosure.
- the embodiments of the present disclosure relate to the technical field of computer vision, and in particular, to a method and apparatus for incremental clustering of images, an electronic device, a storage medium, and a program product.
- the present disclosure provides an incremental clustering method, device, electronic device, storage medium and program product for images, which are beneficial to solve the problem that the clustering effect is affected by the drift of the clustering center in the incremental clustering .
- a first aspect of the embodiments of the present disclosure provides an incremental clustering method for images, the method comprising:
- a cluster center the M is an integer greater than or equal to 1; a second image data set is obtained, and the first cluster center is used to combine the second image data set and the first cluster.
- the first cluster includes a first cluster A, a first cluster B, and a first cluster C;
- the cluster center merges the second image data set with the first cluster, including:
- the second image data set includes a plurality of image data
- cluster the plurality of image data to obtain isolated image data and a second cluster
- Merging the isolated image data with the first cluster A and merging the second cluster with the first cluster B using the first cluster center; in the second image data
- the single image data is merged with the first cluster C by using the first cluster center.
- the plurality of image data in the second image data set is clustered, and the isolated image data and the second cluster are obtained by using the obtained isolated image data and the first cluster A and the first cluster included in the first cluster respectively.
- the cluster can absorb a single sample and merge between the clusters.
- the first cluster has a corresponding second cluster center; when using the first cluster center to combine the second image data set with the Before the first cluster is merged, the method further includes:
- K first clusters are determined from the first clusters by using the second cluster centers.
- the second cluster has a corresponding third cluster center; the second cluster center is determined from the first cluster by using the second cluster center Get K first clusters, including:
- the first cluster is screened, which is beneficial to determine the image in the second image data set.
- the first cluster with more similar data cluster categories.
- using the first cluster center to combine the isolated image data with the first cluster A includes:
- the first cluster center D is each of the first clusters in the K first clusters the first cluster center corresponding to the first sub-cluster; for each of the K first clusters, determine the fourth similarity in each of the first clusters
- the first number of the first cluster centers D whose degree is greater than the first threshold; the first cluster with the largest first number among the K first clusters is determined as the first cluster Cluster A; merge the isolated image data with the first cluster cluster A.
- using the first cluster center to merge the second cluster with the first cluster B includes:
- the second cluster Divide the second cluster into N second subclusters, and obtain a fourth cluster center corresponding to each second subcluster in the N second subclusters; the N is greater than or equal to 1 obtain the fifth similarity between the fourth cluster center and the first cluster center E; the first cluster center E is each of the K first clusters The first cluster center corresponding to each first sub-cluster of ; for each first cluster in the K first clusters, determine the The second number of the first cluster centers E whose fifth similarity is greater than the second threshold; the first cluster with the largest second number among the K first clusters is determined as the first cluster a cluster B; the second cluster is merged with the first cluster B.
- the first cluster B has at most first clusters that are closer to the second subcluster of the second cluster. sub-cluster, merging the second cluster into the first cluster B can make the clustering result more accurate.
- the use of the first cluster center to combine the single image data with the first cluster C includes:
- the first cluster center F is each of the first clusters in the K first clusters the first cluster center corresponding to the first sub-cluster; for each of the K first clusters, determine the sixth similarity in each of the first clusters
- the third number of the first cluster centers F whose degree is greater than the third threshold; the first cluster with the largest third number among the K first clusters is determined as the first cluster Cluster C; merge the single image data with the first cluster cluster C.
- the M is less than or equal to a fourth threshold; when using the first cluster center to combine the second image data set with the first cluster Afterwards, the method further includes:
- the first cluster center is updated; when the R is greater than the fourth threshold, obtain the fourth quantity of image data in each of the R third sub-clusters; according to the Sorting the R third subclusters from large to small with the fourth number to obtain a fourth clustering cluster sequence, selecting the first P third subclusters in the fourth clustering cluster sequence, and using the P third subclusters.
- the fifth cluster center corresponding to the three sub-clusters updates the first cluster center; the P is less than or equal to the fourth threshold.
- the first cluster is obtained by clustering the image data in the first image data set; the first cluster is divided into M first subclusters, including:
- the first cluster can be divided into the M first sub-clusters by using the similarity matrix.
- the dividing the first cluster into the M first sub-clusters based on the similarity matrix includes:
- the plurality of vertices with the seventh similarity greater than the fifth threshold can be divided into a first sub-cluster by using the connectivity graph.
- a second aspect of the embodiments of the present disclosure provides an apparatus for incremental clustering of images, and the apparatus includes:
- a first obtaining module configured to obtain a first cluster of a first image data set
- a first segmentation module configured to divide the first cluster into M first sub-clusters, and obtain the M first sub-clusters the first cluster center corresponding to each first sub-cluster in the first sub-cluster; the M is an integer greater than or equal to 1
- the merging module is configured to obtain a second image data set, using the first cluster center The second image dataset is merged with the first cluster.
- a third aspect of the embodiments of the present disclosure provides an electronic device, the electronic device includes an input device and an output device, and further includes a processor adapted to implement one or more instructions; and a computer storage medium, the computer storage medium storing There is one or more instructions adapted to be loaded by the processor and to perform the steps in any of the embodiments of the first aspect above.
- a fourth aspect of the embodiments of the present disclosure provides a computer storage medium, where the computer storage medium stores one or more instructions, and the one or more instructions are suitable for being loaded by a processor and executing any one of the foregoing first aspects steps in the implementation.
- a fifth aspect of the embodiments of the present disclosure provides a computer program product, the computer program product includes one or more instructions, and the one or more instructions are suitable for being loaded by a processor and executing any one of the implementations of the first aspect above steps in the method.
- the embodiment of the present disclosure obtains the first cluster of the first image data set; divides the first cluster into M first sub-clusters, and obtains the M first sub-clusters.
- the first cluster is divided into a plurality of first sub-clusters, and the second image data set is merged by the first cluster based on the first cluster center of the first sub-cluster.
- the cluster center (the cluster center of the first cluster, that is, the main center) will be affected by the new image data and cause drift, which is beneficial to Make the clustering results more accurate to improve the clustering effect.
- the second image data set does not need to perform similarity calculation with the first image data set as a whole, which is beneficial to reduce the computational complexity.
- FIG. 1 is a schematic diagram of an application environment provided by an embodiment of the present disclosure
- FIG. 2 is a schematic flowchart of a method for incremental clustering of images according to an embodiment of the present disclosure
- 3A is a schematic diagram of a connectivity graph of a first cluster according to an embodiment of the present disclosure
- 3B is a schematic diagram of dividing a first cluster into first sub-clusters according to an embodiment of the present disclosure
- FIG. 4A is a schematic diagram of a clustering result of a second image data set according to an embodiment of the present disclosure
- 4B is a schematic diagram of merging isolated image data with a first cluster according to an embodiment of the present disclosure
- 4C is a schematic diagram of merging a second cluster with a first cluster according to an embodiment of the present disclosure
- FIG. 5 is a schematic flowchart of updating a first cluster center according to an embodiment of the present disclosure
- FIG. 6 is a schematic flowchart of another method for incremental clustering of images according to an embodiment of the present disclosure
- FIG. 7 is a schematic structural diagram of an apparatus for incremental clustering of images according to an embodiment of the present disclosure.
- FIG. 8 is a schematic structural diagram of an electronic device according to an embodiment of the present disclosure.
- An embodiment of the present disclosure proposes an incremental clustering method for image data, which can be implemented based on the application environment shown in FIG. 1 .
- the application environment mainly includes an image processing center 101 and an image acquisition device 102 .
- the processing center 101 includes but is not limited to a server 1011, a terminal and a database.
- the image acquisition device 102 may be a camera or a camera deployed in scenes such as gate passages, shopping malls, and residential areas, and is used to collect images, such as face images, video surveillance images, and the image processing center 101 may be The monitoring center, the image processing center 101 can introduce a video cloud node (Video Cloud Node, VCN) 1012 to manage the video monitoring, for example: display the images on the display 1013, and store the images in the database 1014 after clustering.
- VCN Video Cloud Node
- the image collection device 102 may also be a user terminal, and the images it collects may be photos taken by the user, for example, photos posted by the user on social media, and the image processing center may be the processing background of social media.
- the image acquisition device 102 can upload the collected images to the image processing center 101, and the image processing center 101 performs operations such as feature extraction, cluster classification, face recognition, etc. Since the images on the image acquisition device side are generated incrementally every day , and incremental clustering needs to maintain some clusters. With the continuous increase of image data and the continuous progress of incremental clustering, the cluster center of the original maintained cluster will have the risk of drift, which makes the clustering The effect gradually deteriorates, so the server 1011 can be used to execute the incremental clustering method proposed by the embodiment of the present disclosure, so as to solve the problem that the clustering effect is affected by the drift of the cluster center in the incremental clustering.
- the above-mentioned server 1011 may be an independent physical server, a server cluster or a distributed system, or may provide cloud services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, and middleware services , domain name services, security services, and cloud servers for basic cloud computing services such as big data and artificial intelligence platforms.
- FIG. 2 is a schematic flowchart of an image incremental clustering method provided by an embodiment of the present disclosure.
- the image incremental clustering method is applied to a server, as shown in FIG. 2 , including steps S21 to S23:
- the first image dataset refers to an image dataset that has been clustered into multiple clusters before the current batch of image data. ) is the current batch of data, then the data of the face image that has been uploaded to the server before this is the first image data set.
- the first cluster is a cluster obtained by clustering the image data in the first image data set, and the clustering algorithm used may be a K-means clustering algorithm. It should be understood that each cluster exists The corresponding cluster center, that is, the second cluster center.
- S22 Divide the first cluster into M first subclusters, and obtain a first cluster center corresponding to each first subcluster in the M first subclusters; the M is greater than or An integer equal to 1.
- FIG. 3A is a schematic diagram of a connectivity graph of a first cluster according to an embodiment of the present disclosure.
- the connectivity graph of the first cluster includes a first cluster 301 and a second cluster center 302 , wherein the first clustering cluster 301 is a clustering cluster obtained by clustering the image data in the first image data set; the second clustering center 302 is that each clustering cluster has a corresponding clustering center.
- FIG. 3B is a schematic diagram of dividing a first cluster into first sub-clusters according to an embodiment of the present disclosure.
- the division of the first cluster into first sub-clusters includes a first cluster 301 , the second cluster center 302, the first sub-cluster 303 and the first cluster center 304, wherein the first sub-cluster 303 is a sub-cluster obtained by dividing the first cluster cluster 301; the first cluster center 304 is the cluster center of each first subcluster.
- the first sub-cluster is the sub-cluster obtained by dividing the first cluster. For each first cluster in the first data set, the similarity between the image data in the first cluster is obtained, that is, the first sub-cluster is obtained. Seven degrees of similarity, get a similarity matrix, and then obtain a connected graph with the image data in the first cluster as vertices, as shown in Figure 3A, for every two vertices in the connected graph, query from the similarity matrix Its similarity, in the case of clustering the first image data set, the threshold used is X, that is, the fifth threshold, then the multiple image data whose similarity is greater than this X is divided into a more compact first sub-cluster , so that M first subclusters are obtained. As shown in FIG.
- the first cluster shown in FIG. 3A is divided into M first subclusters through the analysis of the connected graph.
- the cluster center of each first sub-cluster in the M first sub-clusters is obtained, that is, the first cluster center, then each first cluster cluster can be composed of a main cluster Center and M sub-cluster center descriptions. Describing the first cluster with a more compact sub-cluster is beneficial to solve the problem that the expression ability of a single main cluster center is weakened with the incorporation of new image data.
- S23 Acquire a second image data set, and combine the second image data set with the first cluster by using the first cluster center.
- FIG. 4A is a schematic diagram of a clustering result of a second image dataset provided by an embodiment of the present disclosure.
- the clustering result of the second image dataset includes a second image dataset 401 , a second cluster Cluster 402, isolated image data 403 and third cluster center 404, wherein the second image data set 401 is the data set of the current batch of images uploaded by the image acquisition device; The image data is clustered by clustering; the isolated image data 403 is the isolated image data that has not been clustered; the third cluster center 404 is the cluster center where each second cluster exists.
- FIG. 4B is a schematic diagram of merging isolated image data with a first cluster according to an embodiment of the present disclosure. As shown in FIG. 4B , merging isolated image data with a first cluster includes a first cluster A 405 and an isolated cluster A 405 . Image data 403, wherein the first cluster A 405 is the first cluster A determined in the first cluster.
- FIG. 4C is a schematic diagram of merging a second cluster with a first cluster according to an embodiment of the present disclosure. As shown in FIG. 4C , the combination of the second cluster and the first cluster includes the first cluster B 406 and the second cluster 407, wherein the first cluster B 406 and the second cluster 407 belong to the same cluster category.
- the second image data set is the data set of the current batch of images uploaded by the image acquisition device, and is obtained from the images uploaded by the image acquisition device.
- the first cluster includes a first cluster A, a first cluster B, and a first cluster C, and when the second image data set includes multiple image data, cluster to get the clustering result.
- the clustering result includes unclustered isolated image data and several second clusters, and each of the several second clusters has a corresponding cluster center, that is, the third cluster center, see Figure 4A.
- the first cluster A is determined from the first cluster, and the first cluster center is used to merge it with the first cluster A, that is, as shown in FIG.
- the isolated image data Absorbed into the first cluster A, the first cluster A and the isolated image data belong to the same cluster category.
- For each second cluster determine the first cluster B from the first cluster, and use the first cluster center to merge it with the first cluster B, that is, as shown in FIG. 4C .
- the first cluster B and the second cluster belong to the same cluster category. Similar to the isolated image data, in the case where there is only a single image data in the second image data set, that is, the newly added image data is only a single image data, and there is no need to perform a clustering operation on the second image data set.
- the first cluster C is determined, and the first cluster C is merged with the first cluster C by using the first cluster center, and the first cluster C and the single image data belong to the same cluster category.
- the method before using the first cluster center to combine the second image data set with the first cluster, the method further includes:
- K first clusters are determined from the first clusters by using the second cluster centers.
- all the first clusters need to be preliminarily screened by using the second cluster center of the first cluster, and from all the first clusters K first clusters are determined, and then the above-mentioned first cluster A and first cluster B, or first cluster C are selected from the K clusters.
- the K first clusters may be the top K after sorting all the first clusters by using the second cluster center, for example: the top 20 of the 100 first clusters after sorting
- the K first clusters may also be all sorted first clusters, for example, 100 first clusters are still selected after sorting.
- Using the second cluster center to preliminarily screen the first cluster is beneficial to determine the first cluster that is more similar to the image data clustering category in the second image data set, such as the above-mentioned first cluster A, the first cluster B and the first cluster C.
- the determining K first clusters from the first clusters by using the second cluster center includes:
- the second image data set when the second image data set is clustered to obtain isolated image data and multiple second clusters, for the isolated sample image data, calculate the difference between it and the second cluster center of each first cluster.
- the second cluster calculate the second similarity between the corresponding third cluster center and the second cluster center of each first cluster, respectively according to the first similarity Sort all the first clusters from high to low degree and the second similarity to obtain the corresponding first and second cluster sequences, and then from the first and second clusters
- the first K first cluster clusters are respectively selected from the cluster sequence.
- the third similarity between the single image data and the second cluster center of each first cluster is calculated, and the third similarity is from high to low.
- the first clusters are sorted to obtain a corresponding third cluster sequence, and then the top K first clusters are selected from the third cluster sequence.
- the using the first cluster center to combine the isolated image data with the first cluster A includes:
- the first cluster center D is each of the first clusters in the K first clusters the first cluster center corresponding to the first sub-cluster; for each of the K first clusters, determine the fourth similarity in each of the first clusters
- the first number of the first cluster centers D whose degree is greater than the first threshold; the first cluster with the largest first number among the K first clusters is determined as the first cluster Cluster A; merge the isolated image data with the first cluster cluster A.
- the first cluster A needs to be determined from the first K first clusters selected. It should be noted that the first K first clusters may be sorted All the first clusters of . First, the similarity between the isolated image data and the cluster center (ie, the first cluster center D) of each first sub-cluster of each first cluster in the K first clusters is calculated, and is determined as the first cluster center D.
- the K first clusters Four similarity degrees, and then analyze the K first clusters to determine the number of first cluster centers D in each first cluster that satisfy the fourth similarity greater than the first threshold, and determine it as the first number, Determine the first cluster with the largest first number as the first cluster A, for example, among the K first clusters, the first cluster 1 has 20 such first cluster centers D, The first cluster 2 has 18 such first cluster centers D, ..., the first cluster K has 15 such first cluster centers D, and the first cluster 1 has the largest number, then it is It is determined to be the first cluster A, that is to say, the first sub-cluster A that is most similar to the isolated image data exists in the first cluster A. Merging the isolated image data into the first cluster A can make the clustering The results are more accurate.
- the merging the second cluster cluster and the first cluster cluster B by using the first cluster center includes:
- the second cluster Divide the second cluster into N second subclusters, and obtain a fourth cluster center corresponding to each second subcluster in the N second subclusters; the N is greater than or equal to 1 obtain the fifth similarity between the fourth cluster center and the first cluster center E; the first cluster center E is each of the K first clusters The first cluster center corresponding to each first sub-cluster of ; for each first cluster in the K first clusters, determine the The second number of the first cluster centers E whose fifth similarity is greater than the second threshold; the first cluster with the largest second number among the K first clusters is determined as the first cluster a cluster B; the second cluster is merged with the first cluster B.
- the first cluster B needs to be determined from the first K first clusters selected.
- the clusters can be all the first cluster clusters after sorting. First, divide each second cluster into N second sub-clusters according to the method of dividing the first cluster, and calculate the cluster center of each second sub-cluster, that is, the fourth cluster center, and then calculate The similarity between the fourth cluster center and the cluster center (ie, the first cluster center E) of each first sub-cluster of each first cluster in the K first clusters is determined as eh
- the fifth similarity and then analyze the K first clusters to determine the number of first cluster centers E that satisfy the fifth similarity greater than the second threshold in each first cluster, and determine it as the second number , determine the first cluster with the second largest number as the first cluster B, for example: among the K first clusters, the first cluster 1 has 30 such first cluster centers E , the first cluster 2 has 15 such first cluster centers E, ..., the first cluster K has
- the combining the single image data with the first cluster C by using the first cluster center includes:
- the first cluster center F is each of the first clusters in the K first clusters the first cluster center corresponding to the first sub-cluster; for each of the K first clusters, determine the sixth similarity in each of the first clusters
- the third number of the first cluster centers F whose degree is greater than the third threshold; the first cluster with the largest third number among the K first clusters is determined as the first cluster Cluster C; merge the single image data with the first cluster cluster C.
- the first cluster C For the merging of single image data, it is necessary to determine the first cluster C from the selected top K first clusters. It should be noted that the top K first clusters may be sorted All first cluster clusters. First, the similarity between the single image data and the cluster center (ie, the first cluster center F) of each first sub-cluster of each first cluster in the K first clusters is calculated, and it is determined as the first cluster center.
- the cluster center ie, the first cluster center F
- the M is less than or equal to a fourth threshold; after the second image data set and the first cluster are merged by using the first cluster center, as shown in FIG. 5 As shown, the method further includes:
- the merged first cluster is divided into R third sub-clusters according to the method of dividing the first cluster, and the fifth cluster center of each third sub-cluster is calculated, and the third sub-cluster is determined according to R.
- the number of three sub-clusters if the number of third sub-clusters is less than or equal to the fourth threshold, for example: 20, the R third sub-clusters are reserved, and the fifth cluster center of these R third sub-clusters is used as The new sub-center of the merged first cluster to update the original first cluster center, then the merged first cluster is described by the second cluster center and the R fifth cluster centers .
- the fourth threshold for example: 20
- the R third sub-clusters are reserved, and the fifth cluster center of these R third sub-clusters is used as The new sub-center of the merged first cluster to update the original first cluster center, then the merged first cluster is described by the second cluster center and the R fifth cluster centers .
- the R third sub-clusters are sorted according to the number of image data in each third sub-cluster (that is, the fourth number) from large to small to obtain the fourth cluster.
- Cluster-like sequence select the first P third sub-clusters to keep, for example: only keep the first 20 third sub-clusters, discard the rest of the third sub-clusters, and use the fifth cluster center of the P third sub-clusters as the merge Then, the merged first cluster is described by using the second cluster center and the P fifth cluster centers. It should be understood that each time a cluster is divided into sub-clusters, only a preset number of sub-clusters are reserved.
- both M and N are less than or equal to the fourth threshold, so that when there are many sub-clusters.
- the embodiment of the present disclosure obtains the first cluster of the first image data set; divides the first cluster into M first sub-clusters, and obtains the M first sub-clusters.
- the first cluster is divided into a plurality of first sub-clusters, and the second image data set is merged by the first cluster based on the first cluster center of the first sub-cluster.
- the cluster center (the cluster center of the first cluster, that is, the main center) will be affected by the new image data and cause drift, which is beneficial to Make the clustering results more accurate to improve the clustering effect.
- the second image data set does not need to perform similarity calculation with the first image data set as a whole, which is beneficial to reduce the computational complexity.
- FIG. 6 is a schematic flowchart of another image incremental clustering method provided by an embodiment of the present disclosure, as shown in FIG. 6, including steps S61 to S66:
- S62 Divide the first cluster into M first subclusters, and obtain a first cluster center corresponding to each first subcluster in the M first subclusters; the M is greater than or an integer equal to 1;
- the incremental clustering method needs to maintain some clusters in the clustering process.
- the traditional clustering algorithm uses a single cluster center to describe a cluster, such as taking the mean of all sample features in the cluster to obtain the cluster center , but different clusters have different degrees of sparseness, so simply adopting a single cluster center with mean value is easy to lose the rich sample information inside the cluster. As the process of incremental clustering continues, the clustering effect will be gradually affected. .
- the similarity matrix S can be obtained. Assuming that the threshold used for clustering is ⁇ , a higher threshold ⁇ ' needs to be set, that is, ⁇ '> ⁇ is satisfied to cluster a cluster. The cluster is split into several tighter subclusters.
- Clusters can be analyzed using methods based on connectivity graph analysis to obtain the polycentricity of clusters.
- the similarity matrix is calculated for the clusters.
- a cluster can be divided into several more compact sub-clusters, so that multiple sub-cluster centers can be obtained, plus as The center of the cluster in the main center constitutes the multi-center description of the cluster.
- using the design analysis of cluster multi-center based on connected graph analysis to obtain multiple sub-centers includes: first, for each cluster, by setting a higher threshold (needs to be higher than the clustering threshold), the cluster is Scatter into several more compact connected sub-graphs, and calculate the sub-centers for each connected sub-graph, so that multiple sub-centers can be obtained.
- step S69 generating a number of clusters and unclustered isolated samples, and obtaining the existing clustering results in step S67 for cluster merging.
- Multi-center incremental clustering method based on a single main center and multiple sub-centers On the basis of obtaining the main center and multiple sub-centers, in the process of incremental clustering, first use the main center and new data to perform a TopK search. sieve, and then further determine whether to absorb new samples or other clusters based on multiple sub-centers.
- the process of cluster merging involves merging between clusters and absorbing individual isolated samples into clusters. For the absorption of isolated sample points, based on the multi-center design, a lower threshold is first set, and the main center is used to search for TopK, and then according to whether the sub-center and the sample point meet the clustering threshold ⁇ . In this case, there may be multiple clusters and isolated sample points to meet such requirements, and the cluster with the largest number of sub-centers that meet the requirements is used as the target cluster.
- a lower threshold is also used to filter and retrieve TopK, and then according to whether there are sub-center pairs between clusters that meet the threshold requirements, when there are multiple clusters that meet the requirements, take the threshold that meets the requirements.
- the cluster with the largest number of sub-centers is used as the target cluster.
- a single main center and multiple sub-centers in the multi-center mechanism are comprehensively utilized.
- the main center is used to participate in the calculation of similarity, and then through multiple sub-centers and pending
- the similarity of a single sample or cluster of clusters is calculated to further determine whether the absorption of a single sample or the merging of clusters is completed.
- This architecture comprehensively utilizes the advantages of multi-center representation, which can improve the clustering effect without increasing too much computational complexity.
- each sub-center can be sorted from large to small according to the number of sample points represented, for example, only the first 20 sub-centers are taken at most.
- a multi-center construction method of face clusters is proposed, which can be used to obtain the description of a single main center and multiple sub-centers of a face cluster. It solves the problem that the description of a cluster is to maintain a cluster center, ignoring some compact sub-cluster information inside the cluster, and as the data continues to increase, due to the maintenance of a single cluster center, the cluster center will continue to be subject to new changes.
- the influence of the samples has a certain risk of center drift, and the influence of the existing samples in the cluster will continue to weaken, reducing the expression ability of the center.
- a single cluster center will lose the sample information inside the cluster during the incremental clustering process.
- a single cluster center is usually maintained for each cluster, and data is continuously added.
- the clustering center is used to calculate the similarity between new samples or clusters to merge and update the clusters, and the clustering center will also be updated continuously.
- a single multi-center will gradually lose the rich sample information within the cluster, and it is also prone to drift, which will affect the clustering effect over time.
- an incremental clustering architecture based on multi-center is proposed. Using this architecture, the computational complexity and clustering accuracy of incremental clustering using multi-center representation can be well balanced. The merging of samples and clusters solves the problem that the multi-center setting of the prior art will have a great impact on the computing speed and storage of clustering in large-scale data scenarios.
- an embodiment of the present disclosure further provides an apparatus for incremental clustering of images. Please refer to FIG. 7 .
- FIG. 7 provides an image increment according to an embodiment of the present disclosure.
- a schematic diagram of the structure of the clustering device, as shown in Figure 7, the device includes:
- the first acquisition module 71 is configured to acquire the first cluster of the first image data set
- a first segmentation module 72 configured to segment the first cluster into M first sub-clusters, and obtain a first cluster center corresponding to each first sub-cluster in the M first sub-clusters;
- the M is an integer greater than or equal to 1;
- the merging module 73 is configured to obtain a second image data set, and use the first cluster center to merge the second image data set and the first cluster.
- the first cluster includes a first cluster A, a first cluster B and a first cluster C;
- the merging module 73 is configured to: if the second image data set includes a plurality of image data, cluster the plurality of image data, obtaining isolated image data and a second cluster; using the first cluster center to merge the isolated image data with the first cluster A; and using the first cluster center to combine the first cluster The two clusters are merged with the first cluster B; in the case that there is only a single image data in the second image data set, the first cluster center is used to combine the single image data with the second image data A cluster C is merged.
- the first cluster has a corresponding second cluster center; when using the first cluster center to associate the second image data set with the first cluster Before merging, the merging module 73 is further configured to: determine K first clusters from the first clusters by using the second cluster center.
- the second cluster has a corresponding third cluster center; after using the second cluster center to determine K first clusters from the first cluster
- the merging module 73 is configured to: obtain a first similarity between the isolated image data and the second cluster center; Sorting the clusters to obtain a first cluster sequence, and selecting the top K first clusters in the first cluster sequence; and obtaining the distance between the third cluster center and the second cluster center the second similarity; sort the first clusters according to the second similarity from high to low to obtain a second cluster sequence, and select the top K first clusters in the second cluster sequence Clustering; or, obtaining a third similarity between the single image data and the second cluster center; sorting the first clusters according to the third similarity from high to low to obtain For the third cluster sequence, the top K first clusters in the third cluster sequence are selected.
- the merging module 73 is configured to: obtain the isolated image data and the first cluster A.
- a fourth similarity between cluster centers D; the first cluster center D is the The first cluster center; for each first cluster in the K first clusters, determine that the fourth similarity in each first cluster is greater than the first threshold.
- the first number of the first cluster centers D; the first cluster with the largest number of the K first clusters is determined as the first cluster A; the isolated image The data is merged with the first cluster A.
- the merging module 73 is configured to: combine the second cluster The cluster is divided into N second subclusters, and the fourth cluster center corresponding to each second subcluster in the N second subclusters is obtained; the N is an integer greater than or equal to 1; obtain the The fifth similarity between the fourth cluster center and the first cluster center E; the first cluster center E is each first child of each first cluster in the K first clusters the first cluster center corresponding to the cluster; for each first cluster in the K first clusters, determine that the fifth similarity in each first cluster is greater than the first cluster
- the second number of the first cluster centers E with two thresholds; the first cluster with the largest second number among the K first clusters is determined as the first cluster B; The second cluster is merged with the first cluster B.
- the merging module 73 is configured to: obtain the single image data and the first cluster C. a sixth degree of similarity between cluster centers F; the first cluster center F is the The first cluster center; for each first cluster in the K first clusters, determine the sixth similarity greater than the third threshold in each first cluster. the third number of the first cluster centers F; the first cluster with the third largest number of the K first clusters is determined as the first cluster C; the single image The data is merged with the first cluster C.
- the M is less than or equal to a fourth threshold; the first dividing module 72 is further configured to: divide the merged first cluster into R third sub-clusters, and obtain the The fifth cluster center of each third sub-cluster in the R third sub-clusters; the R is an integer greater than or equal to 1; when the R is less than or equal to the fourth threshold, keep the R third sub-clusters, and the first cluster center is updated with the fifth cluster center corresponding to the R third sub-clusters; when the R is greater than the fourth threshold, Obtain the fourth quantity of image data in each of the R third sub-clusters; sort the R third sub-clusters according to the fourth quantity in descending order to obtain a fourth cluster Cluster-like sequence, select the first P third subclusters in the fourth clustering cluster sequence, and use the fifth clustering centers corresponding to the P third subclusters to update the first clustering center ; the P is less than or equal to the fourth threshold.
- the first dividing module 72 is configured to: acquire between the image data in the first cluster The seventh similarity is obtained, and a similarity matrix is obtained; the first cluster is divided into the M first sub-clusters based on the similarity matrix.
- the first dividing module 72 is configured to: obtain the first sub-cluster with the first sub-cluster
- the image data in the cluster is a connected graph composed of vertices; the seventh similarity between the vertices in the connected graph is obtained by querying the similarity matrix; the seventh similarity is greater than the fifth similarity
- the multiple vertices of the threshold are divided into a first sub-cluster to obtain the M first sub-clusters.
- each unit in the apparatus for incremental clustering of images shown in FIG. 7 may be respectively or all merged into one or several other units to form, or some of the unit(s) may be further It can be further divided into multiple units with smaller functions, which can realize the same operation without affecting the realization of the technical effects of the embodiments of the present disclosure.
- the above-mentioned units are divided based on logical functions. In practical applications, the function of one unit may also be implemented by multiple units, or the functions of multiple units may be implemented by one unit. In other embodiments of the present disclosure, the image-based incremental clustering apparatus may also include other units, and in practical applications, these functions may also be implemented with the assistance of other units, and may be implemented by a plurality of units in cooperation.
- a general-purpose computing device such as a computer
- a general-purpose computing device such as a computer
- a general-purpose computing device such as a computer
- processing elements such as a central processing unit (CPU), random access storage medium (RAM), read-only storage medium (ROM), etc.
- storage elements such as a central processing unit (CPU), random access storage medium (RAM), read-only storage medium (ROM), etc.
- Running a computer program capable of executing the steps involved in the corresponding method as shown in FIG. 2 or FIG. 6, to construct the incremental clustering apparatus of the image as shown in FIG. 7, and to realize the present invention.
- the computer program can be recorded on, for example, a computer-readable recording medium, and loaded in the above-mentioned computing device through the computer-readable recording medium, and executed therein.
- the embodiments of the present disclosure further provide an electronic device.
- the electronic device includes at least a processor 81 , an input device 82 , an output device 83 and a computer storage medium 84 .
- the processor 81 , the input device 82 , the output device 83 and the computer storage medium 84 in the electronic device may be connected through a bus or other means.
- the computer storage medium 84 may be stored in the memory of the electronic device, the computer storage medium 84 configured to store a computer program including program instructions, the processor 81 configured to execute the program stored by the computer storage medium 84 instruction.
- the processor 81 (or called CPU (Central Processing Unit, central processing unit)) is the computing core and the control core of the electronic device, which is suitable for implementing one or more instructions, and is suitable for loading and executing one or more instructions to achieve the corresponding Method flow or corresponding function.
- CPU Central Processing Unit, central processing unit
- the processor 81 of the electronic device provided by the embodiment of the present disclosure may be configured to perform incremental clustering processing of a series of images:
- the first cluster includes a first cluster A, a first cluster B and a first cluster C; the processor 81 executes the process of using the first cluster center to Combining the second image data set with the first cluster includes: in the case that the second image data set includes a plurality of image data, clustering the plurality of image data to obtain an isolated image data and a second cluster; combining the isolated image data with the first cluster A using the first cluster center; and combining the second cluster using the first cluster center The cluster is merged with the first cluster cluster B; in the case that there is only a single image data in the second image data set, the single image data is combined with the first cluster by using the first cluster center Cluster C is merged.
- the first cluster has a corresponding second cluster center; before using the first cluster center to merge the second image data set and the first cluster, The processor 81 is further configured to perform: using the second cluster center to determine K first clusters from the first clusters.
- the second cluster has a corresponding third cluster center; the processor 81 executes the process of determining the Kth cluster from the first cluster by using the second cluster center.
- a cluster including: acquiring a first similarity between the isolated image data and the second cluster center; sorting the first clusters according to the first similarity from high to low Obtain the first cluster sequence, and select the top K first clusters in the first cluster sequence; and, obtain the second cluster center between the third cluster center and the second cluster center.
- the processor 81 performing the process of using the first cluster center to merge the isolated image data with the first cluster A includes: acquiring the isolated image data and the first cluster A.
- the fourth similarity between centers D; the first cluster center D is the first cluster corresponding to each first sub-cluster of each first cluster in the K first clusters Class center; for each of the K first clusters, determine the first cluster whose fourth similarity is greater than a first threshold in each of the first clusters the first number of cluster centers D; determine the first cluster with the largest first number among the K first clusters as the first cluster A; combine the isolated image data with all The first cluster cluster A is merged.
- the processor 81 performing the using the first cluster center to merge the second cluster with the first cluster B includes: dividing the second cluster is N second subclusters, and obtains the fourth cluster center corresponding to each second subcluster in the N second subclusters; the N is an integer greater than or equal to 1; obtain the fourth cluster center
- the processor 81 performing the merging of the single image data and the first cluster C by using the first cluster center includes: acquiring the single image data and the first cluster C.
- the sixth similarity between centers F; the first cluster center F is the first cluster corresponding to each first sub-cluster of each first cluster in the K first clusters class center; for each first cluster in the K first clusters, determine the first cluster whose sixth similarity is greater than a third threshold in each first cluster the third number of class centers F; determine the first cluster with the largest third number among the K first clusters as the first cluster C; combine the single image data with all The first cluster cluster C is merged.
- the M is less than or equal to a fourth threshold; after merging the second image data set and the first cluster by using the first cluster center, the processor 81 is further configured to: Execute: divide the merged first cluster into R third subclusters, and obtain the fifth cluster center of each third subcluster in the R third subclusters; the R is greater than or an integer equal to 1; when the R is less than or equal to the fourth threshold, the R third subclusters are reserved, and the fifth cluster center pair corresponding to the R third subclusters is used The first cluster center is updated; when the R is greater than the fourth threshold, obtain the fourth quantity of image data in each of the R third sub-clusters; according to Sorting the R third subclusters in descending order of the fourth number to obtain a fourth clustering cluster sequence, selecting the first P third subclusters in the fourth clustering clustering sequence, and using the P The fifth cluster center corresponding to the third sub-cluster updates the first cluster center; the P is less than or equal to the fourth threshold.
- the first cluster is obtained by clustering the image data in the first image data set; the processor 81 executes the process of dividing the first cluster into M first clusters. sub-cluster, including: acquiring the seventh similarity between the image data in the first cluster to obtain a similarity matrix; dividing the first cluster into the M based on the similarity matrix first subcluster.
- the processor 81 performing the dividing the first cluster into the M first sub-clusters based on the similarity matrix includes: obtaining a The image data is a connected graph composed of vertices; the seventh similarity between the vertices in the connected graph is obtained by querying the similarity matrix; the plurality of vertices whose seventh similarity is greater than the fifth threshold are obtained Divide into a first sub-cluster to obtain the M first sub-clusters.
- the above-mentioned electronic devices may be computers, computer hosts, servers, cloud servers, server clusters, etc.
- the electronic devices may include, but are not limited to, a processor 81, an input device 82, an output device 83, and a computer storage medium 84.
- the input device 82 It can be a keyboard, a touch screen, etc.
- the output device 83 can be a speaker, a display, a radio frequency transmitter, and the like.
- the schematic diagram may be an example of an electronic device, and does not constitute a limitation on the electronic device, and may include more or less components than the one shown, or combine some components, or different components.
- the processor 81 of the electronic device implements the steps in the above-mentioned incremental image clustering method when executing the computer program, the above-mentioned embodiments of the incremental image clustering method are all applicable to the electronic device, and can achieve the same or similar beneficial effects.
- Embodiments of the present disclosure further provide a computer program product, which implements any one of the methods in the foregoing embodiments when the computer program product is executed by a processor.
- the computer program product can be implemented in hardware, software or a combination thereof.
- the computer program product is embodied as a computer storage medium, and in other embodiments of the present disclosure, the computer program product is embodied as a software product, such as a software development kit (Software Development Kit, SDK), etc. Wait.
- Embodiments of the present disclosure also provide a computer storage medium (Memory), where the computer storage medium is a memory device in an electronic device and is configured to store programs and data.
- the computer storage medium here may include both a built-in storage medium in the terminal, and certainly also an extended storage medium supported by the terminal.
- the computer storage medium provides storage space, and the storage space stores the operating system of the terminal.
- one or more instructions suitable for being loaded and executed by the processor 81 are also stored in the storage space, and these instructions may be one or more computer programs (including program codes).
- the computer storage medium here may be a high-speed RAM memory, or a non-volatile memory (Non-Volatile Memory), such as at least one disk memory; in some embodiments of the present disclosure, it may also be at least one disk memory.
- the computer program of the computer storage medium includes computer program code, which may be in source code form, object code form, executable file or some intermediate form, and the like.
- the computer-readable medium may include: any entity or device capable of carrying the computer program code, recording medium, U disk, removable hard disk, magnetic disk, optical disk, computer memory, read-only memory (ROM, Read-Only Memory) , Random Access Memory (RAM, Random Access Memory), electric carrier signal, telecommunication signal and software distribution medium, etc.
- the first cluster is divided into a plurality of first sub-clusters, and the first cluster cluster is combined with the second image data set based on the first cluster center of the first sub-cluster.
- the first clustering center is used to solve the problem that with the increase of image data, the clustering center will drift due to the influence of the newly added image data, which is conducive to making the clustering result more accurate and improving the clustering effect.
Abstract
Description
Claims (23)
- 一种图像的增量聚类方法,所述方法包括:A method for incremental clustering of images, the method comprising:获取第一图像数据集的第一聚类簇;obtaining the first cluster of the first image data set;将所述第一聚类簇分割为M个第一子簇,并获取所述M个第一子簇中每个第一子簇对应的第一聚类中心;所述M为大于或等于1的整数;Divide the first cluster into M first subclusters, and obtain the first cluster center corresponding to each first subcluster in the M first subclusters; the M is greater than or equal to 1 the integer;获取第二图像数据集,利用所述第一聚类中心将所述第二图像数据集与所述第一聚类簇合并。Acquire a second image data set, and combine the second image data set with the first cluster by using the first cluster center.
- 根据权利要求1所述的方法,其中,所述第一聚类簇包括第一聚类簇A、第一聚类簇B和第一聚类簇C;所述利用所述第一聚类中心将所述第二图像数据集与所述第一聚类簇合并,包括:The method according to claim 1, wherein the first cluster includes a first cluster A, a first cluster B and a first cluster C; the use of the first cluster center Combining the second image dataset with the first cluster includes:在所述第二图像数据集中包括多个图像数据的情况下,对所述多个图像数据进行聚类,得到孤立图像数据和第二聚类簇;In the case that the second image data set includes a plurality of image data, clustering the plurality of image data to obtain isolated image data and a second cluster;利用所述第一聚类中心将所述孤立图像数据与所述第一聚类簇A合并;以及,利用所述第一聚类中心将所述第二聚类簇与所述第一聚类簇B合并;combining the isolated image data with the first cluster A using the first cluster center; and combining the second cluster with the first cluster using the first cluster center Cluster B merge;在所述第二图像数据集中只存在单个图像数据的情况下,利用所述第一聚类中心将所述单个图像数据与所述第一聚类簇C合并。When only a single image data exists in the second image data set, the single image data is merged with the first cluster C by using the first cluster center.
- 根据权利要求2所述的方法,其中,所述第一聚类簇存在对应的第二聚类中心;在利用所述第一聚类中心将所述第二图像数据集与所述第一聚类簇合并之前,所述方法还包括:The method according to claim 2, wherein the first cluster has a corresponding second cluster center; when using the first cluster center to combine the second image data set with the first cluster Before the cluster merging, the method further includes:利用所述第二聚类中心从所述第一聚类簇中确定出K个第一聚类簇。K first clusters are determined from the first clusters by using the second cluster centers.
- 根据权利要求3所述的方法,其中,所述第二聚类簇存在对应的第三聚类中心;所述利用所述第二聚类中心从所述第一聚类簇中确定出K个第一聚类簇,包括:The method according to claim 3, wherein the second cluster has a corresponding third cluster center; the second cluster center is used to determine K from the first cluster The first cluster, including:获取所述孤立图像数据与所述第二聚类中心之间的第一相似度;obtaining the first similarity between the isolated image data and the second cluster center;根据所述第一相似度从高到低对所述第一聚类簇进行排序得到第一聚类簇序列,选取所述第一聚类簇序列中前K个第一聚类簇;以及,Sort the first clusters from high to low according to the first similarity to obtain a first cluster sequence, and select the top K first clusters in the first cluster sequence; and,获取所述第三聚类中心与所述第二聚类中心之间的第二相似度;obtaining the second similarity between the third cluster center and the second cluster center;根据所述第二相似度从高到低对所述第一聚类簇进行排序得到第二聚类簇序列,选取所述第二聚类簇序列中前K个第一聚类簇;或者,Sort the first clusters from high to low according to the second similarity to obtain a second cluster sequence, and select the top K first clusters in the second cluster sequence; or,获取所述单个图像数据与所述第二聚类中心之间的第三相似度;obtaining the third similarity between the single image data and the second cluster center;根据所述第三相似度从高到低对所述第一聚类簇进行排序得到第三聚类簇序列,选取所述第三聚类簇序列中前K个第一聚类簇。Sort the first clusters from high to low according to the third similarity to obtain a third cluster sequence, and select the top K first clusters in the third cluster sequence.
- 根据权利要求3所述的方法,其中,所述利用所述第一聚类中心将所述孤立图像数据与所述第一聚类簇A合并,包括:The method according to claim 3, wherein the combining the isolated image data with the first cluster A by using the first cluster center comprises:获取所述孤立图像数据与第一聚类中心D之间的第四相似度;所述第一聚类中心D为所述K个第一聚类簇中每个第一聚类簇的每个第一子簇对应的所述第一聚类中心;Obtain the fourth similarity between the isolated image data and the first cluster center D; the first cluster center D is each of the first clusters in the K first clusters the first cluster center corresponding to the first sub-cluster;对于所述K个第一聚类簇中的每个第一聚类簇,确定所述每个第一聚类簇中所述第四相似度大于第一阈值的所述第一聚类中心D的第一数量;For each first cluster in the K first clusters, determine the first cluster center D whose fourth similarity is greater than a first threshold in each first cluster the first quantity;将所述K个第一聚类簇中所述第一数量最大的第一聚类簇确定为所述第一聚类簇A;Determining the first cluster with the largest first number among the K first clusters as the first cluster A;将所述孤立图像数据与所述第一聚类簇A合并。The isolated image data is merged with the first cluster A.
- 根据权利要求3所述的方法,其中,所述利用所述第一聚类中心将所述第二聚类簇与所述第一聚类簇B合并,包括:The method according to claim 3, wherein the combining the second cluster cluster with the first cluster cluster B by using the first cluster center comprises:将所述第二聚类簇分割为N个第二子簇,并获取所述N个第二子簇中每个第二子簇对应的第四聚类中心;所述N为大于或等于1的整数;Divide the second cluster into N second subclusters, and obtain a fourth cluster center corresponding to each second subcluster in the N second subclusters; the N is greater than or equal to 1 the integer;获取所述第四聚类中心与第一聚类中心E之间的第五相似度;所述第一聚类中心E为K个第一聚类簇中每个第一聚类簇的每个第一子簇对应的所述第一聚类中心;Obtain the fifth similarity between the fourth cluster center and the first cluster center E; the first cluster center E is each of the first clusters in the K first clusters the first cluster center corresponding to the first sub-cluster;对于所述K个第一聚类簇中的每个第一聚类簇,确定所述每个第一聚类簇中所述第五相似度大于第二阈值的所述第一聚类中心E的第二数量;For each first cluster in the K first clusters, determine the first cluster center E whose fifth similarity is greater than a second threshold in each first cluster the second quantity;将所述K个第一聚类簇中所述第二数量最大的第一聚类簇确定为所述第一聚类簇B;Determining the first cluster with the largest second number among the K first clusters as the first cluster B;将所述第二聚类簇与所述第一聚类簇B合并。The second cluster is merged with the first cluster B.
- 根据权利要求3所述的方法,其中,所述利用所述第一聚类中心将所述单个图像数据与所述第一聚类簇C合并,包括:The method according to claim 3, wherein the combining the single image data with the first cluster C using the first cluster center comprises:获取所述单个图像数据与第一聚类中心F之间的第六相似度;所述第一聚类中心F为所述K个第一聚类簇中每个第一聚类簇的每个第一子簇对应的所述第一聚类中心;Obtain the sixth similarity between the single image data and the first cluster center F; the first cluster center F is each of the first clusters in the K first clusters the first cluster center corresponding to the first sub-cluster;对于所述K个第一聚类簇中的每个第一聚类簇,确定所述每个第一聚类簇中所述第六相似度大于第三阈值的所述第一聚类中心F的第三数量;For each first cluster in the K first clusters, determine the first cluster center F whose sixth similarity is greater than a third threshold in each first cluster the third quantity;将所述K个第一聚类簇中所述第三数量最大的第一聚类簇确定为所述第一聚类簇C;Determining the first cluster with the third largest number of the K first clusters as the first cluster C;将所述单个图像数据与所述第一聚类簇C合并。The single image data is merged with the first cluster C.
- 根据权利要求1至7任一项所述的方法,其中,所述M小于或等于第四阈值;在利用所述第一聚类中心将所述第二图像数据集与所述第一聚类簇合并之后,所述方法还包括:The method according to any one of claims 1 to 7, wherein the M is less than or equal to a fourth threshold; when using the first cluster center to associate the second image data set with the first cluster After the clusters are merged, the method further includes:将合并后的第一聚类簇分割为R个第三子簇,并获取所述R个第三子簇中每个第三子簇的第五聚类中心;所述R为大于或等于1的整数;Divide the merged first cluster into R third subclusters, and obtain the fifth cluster center of each third subcluster in the R third subclusters; the R is greater than or equal to 1 the integer;在所述R小于或等于所述第四阈值的情况下,保留所述R个第三子簇,并用所述R个第三子簇对应的所述第五聚类中心对所述第一聚类中心进行更新;In the case that the R is less than or equal to the fourth threshold, the R third subclusters are reserved, and the first clustering The class center is updated;在所述R大于所述第四阈值的情况下,获取所述R个第三子簇中每个第三子簇中的图像数据的第四数量;In the case that the R is greater than the fourth threshold, acquiring a fourth quantity of image data in each of the R third sub-clusters;根据所述第四数量从大到小对所述R个第三子簇进行排序得到第四聚类簇序列,选取所述第四聚类簇序列中前P个第三子簇,并用所述P个第三子簇对应的所述第五聚类中心对所述第一聚类中心进行更新;所述P小于或等于所述第四阈值。Sort the R third sub-clusters from large to small according to the fourth number to obtain a fourth cluster sequence, select the first P third sub-clusters in the fourth cluster sequence, and use the The fifth cluster centers corresponding to the P third subclusters update the first cluster centers; the P is less than or equal to the fourth threshold.
- 根据权利要求1至7任一项所述的方法,其中,所述第一聚类簇通过对所述第一图像数据集中的图像数据进行聚类得到;所述将所述第一聚类簇分割为M个第一子簇,包括:The method according to any one of claims 1 to 7, wherein the first cluster is obtained by clustering image data in the first image data set; Divide into M first subclusters, including:获取所述第一聚类簇中的图像数据之间的第七相似度,得到相似度矩阵;obtaining the seventh similarity between the image data in the first cluster to obtain a similarity matrix;基于所述相似度矩阵将所述第一聚类簇分割为所述M个第一子簇。The first cluster is divided into the M first sub-clusters based on the similarity matrix.
- 根据权利要求9所述的方法,其中,所述基于所述相似度矩阵将所述第一聚类簇分割为所述M个第一子簇,包括:The method according to claim 9, wherein the dividing the first cluster into the M first sub-clusters based on the similarity matrix comprises:获取以所述第一聚类簇中的图像数据为顶点构成的连通图;obtaining a connected graph composed of image data in the first cluster as vertices;从所述相似度矩阵中查询得到所述连通图中的顶点之间的所述第七相似度;Obtain the seventh similarity between vertices in the connected graph by querying the similarity matrix;将所述第七相似度大于第五阈值的多个顶点分割为一个第一子簇,得到所述M个第一子簇。The plurality of vertices with the seventh similarity greater than the fifth threshold are divided into a first subcluster to obtain the M first subclusters.
- 一种图像的增量聚类装置,所述装置包括:A device for incremental clustering of images, the device comprising:第一获取模块,配置为获取第一图像数据集的第一聚类簇;a first acquisition module, configured to acquire the first cluster of the first image data set;第一分割模块,配置为将所述第一聚类簇分割为M个第一子簇,并获取所述M个第一子簇中每个第一子簇对应的第一聚类中心;所述M为大于或等于1的整数;a first dividing module, configured to divide the first cluster into M first sub-clusters, and obtain a first cluster center corresponding to each first sub-cluster in the M first sub-clusters; The above M is an integer greater than or equal to 1;合并模块,配置为获取第二图像数据集,利用所述第一聚类中心将所述第二图像数据集与所述第一聚类簇合并。The merging module is configured to obtain a second image data set, and use the first cluster center to merge the second image data set and the first cluster.
- 根据权利要求11所述的装置,其中,所述第一聚类簇包括第一聚类簇A、第一聚类簇B和第一聚类簇C;所述合并模块包括:The apparatus according to claim 11, wherein the first cluster includes a first cluster A, a first cluster B and a first cluster C; the merging module includes:聚类子模块,配置为在所述第二图像数据集中包括多个图像数据的情况下,对所述多个图像数据进行聚类,得到孤立图像数据和第二聚类簇;a clustering submodule, configured to perform clustering on the plurality of image data when the second image data set includes a plurality of image data to obtain isolated image data and a second cluster;第一合并子模块,配置为利用所述第一聚类中心将所述孤立图像数据与所述第一聚类簇A合并;以及,a first merging submodule configured to use the first cluster center to merge the isolated image data with the first cluster A; and,第二合并子模块,配置为利用所述第一聚类中心将所述第二聚类簇与所述第一聚类簇B合并;a second merging submodule, configured to use the first cluster center to merge the second cluster with the first cluster B;第三合并子模块,配置为在所述第二图像数据集中只存在单个图像数据的情况下,利用所述第一聚类中心将所述单个图像数据与所述第一聚类簇C合并。A third merging submodule is configured to merge the single image data with the first cluster C by using the first cluster center when only a single image data exists in the second image data set.
- 根据权利要求12所述的装置,所述第一聚类簇存在对应的第二聚类中心;所述合并模块还包括:The device according to claim 12, wherein the first cluster has a corresponding second cluster center; the merging module further comprises:第一确定子模块,配置为利用所述第二聚类中心从所述第一聚类簇中确定出K个第一聚类簇。The first determination submodule is configured to determine K first clusters from the first clusters by using the second cluster centers.
- 根据权利要求13所述的装置,所述第二聚类簇存在对应的第三聚类中心;所述第一确定子模块包括:The device according to claim 13, wherein the second cluster has a corresponding third cluster center; the first determination submodule comprises:第一获取单元,配置为获取所述孤立图像数据与所述第二聚类中心之间的第一相似度;a first obtaining unit, configured to obtain a first similarity between the isolated image data and the second cluster center;第一排序单元,配置为根据所述第一相似度从高到低对所述第一聚类簇进行排序得到第一聚类簇序列,选取所述第一聚类簇序列中前K个第一聚类簇;以及,A first sorting unit, configured to sort the first clusters according to the first similarity from high to low to obtain a first cluster sequence, and select the first Kth in the first cluster sequence a cluster; and,第二获取单元,配置为获取所述第三聚类中心与所述第二聚类中心之间的第二相似度;a second obtaining unit, configured to obtain a second similarity between the third cluster center and the second cluster center;第二排序单元,配置为根据所述第二相似度从高到低对所述第一聚类簇进行排序得到第二聚类簇序列,选取所述第二聚类簇序列中前K个第一聚类簇;或者,The second sorting unit is configured to sort the first clusters according to the second similarity from high to low to obtain a second cluster sequence, and select the top K th cluster in the second cluster sequence a cluster; or,第三获取单元,配置为获取所述单个图像数据与所述第二聚类中心之间的第三相似度;a third obtaining unit, configured to obtain a third similarity between the single image data and the second cluster center;第三排序单元,配置为根据所述第三相似度从高到低对所述第一聚类簇进行排序得到第三聚类簇序列,选取所述第三聚类簇序列中前K个第一聚类簇。A third sorting unit, configured to sort the first clusters according to the third similarity from high to low to obtain a third cluster sequence, and select the top K th cluster in the third cluster sequence A cluster of clusters.
- 根据权利要求13所述的装置,所述第一合并子模块包括:The apparatus according to claim 13, the first merging submodule comprises:第四获取单元,配置为获取所述孤立图像数据与第一聚类中心D之间的第四相似度;所述第一聚类中心D为所述K个第一聚类簇中每个第一聚类簇的每个第一子簇对应的所述第一聚类中心;A fourth obtaining unit, configured to obtain a fourth similarity between the isolated image data and the first cluster center D; the first cluster center D is each of the K first clusters. the first cluster center corresponding to each first sub-cluster of a cluster;第一确定单元,配置为对于所述K个第一聚类簇中的每个第一聚类簇,确定所述每个第一聚类簇中所述第四相似度大于第一阈值的所述第一聚类中心D的第一数量;A first determining unit, configured to, for each of the K first clusters, determine all of the first clusters whose fourth similarity is greater than a first threshold in each of the first clusters; Describe the first quantity of the first cluster center D;第二确定单元,配置为将所述K个第一聚类簇中所述第一数量最大的第一聚类簇确定为所述第一聚类簇A;a second determining unit, configured to determine the first cluster with the largest number of first clusters among the K first clusters as the first cluster A;第一合并单元,配置为将所述孤立图像数据与所述第一聚类簇A合并。a first merging unit, configured to merge the isolated image data with the first cluster A.
- 根据权利要求13所述的装置,所述第二合并子模块包括:The apparatus according to claim 13, the second merging submodule comprises:第一分割单元,配置为将所述第二聚类簇分割为N个第二子簇,并获取所述N 个第二子簇中每个第二子簇对应的第四聚类中心;所述N为大于或等于1的整数;a first dividing unit, configured to divide the second cluster into N second subclusters, and obtain a fourth cluster center corresponding to each second subcluster in the N second subclusters; Said N is an integer greater than or equal to 1;第五获取单元,配置为获取所述第四聚类中心与第一聚类中心E之间的第五相似度;所述第一聚类中心E为K个第一聚类簇中每个第一聚类簇的每个第一子簇对应的所述第一聚类中心;A fifth obtaining unit, configured to obtain the fifth similarity between the fourth cluster center and the first cluster center E; the first cluster center E is each of the K first clusters. the first cluster center corresponding to each first sub-cluster of a cluster;第三确定单元,配置为对于所述K个第一聚类簇中的每个第一聚类簇,确定所述每个第一聚类簇中所述第五相似度大于第二阈值的所述第一聚类中心E的第二数量;A third determination unit, configured to, for each of the K first clusters, determine all of the first clusters whose fifth similarity is greater than the second threshold in each of the first clusters Describe the second quantity of the first cluster center E;第四确定单元,配置为将所述K个第一聚类簇中所述第二数量最大的第一聚类簇确定为所述第一聚类簇B;a fourth determination unit, configured to determine the first cluster with the second largest number of the K first clusters as the first cluster B;第二合并单元,配置为将所述第二聚类簇与所述第一聚类簇B合并。The second merging unit is configured to merge the second cluster with the first cluster B.
- 根据权利要求13所述的装置,所述第三合并子模块包括:The apparatus according to claim 13, the third merging submodule comprises:第六获取单元,配置为获取所述单个图像数据与第一聚类中心F之间的第六相似度;所述第一聚类中心F为所述K个第一聚类簇中每个第一聚类簇的每个第一子簇对应的所述第一聚类中心;A sixth obtaining unit, configured to obtain a sixth degree of similarity between the single image data and the first cluster center F; the first cluster center F is each of the K first clusters. the first cluster center corresponding to each first sub-cluster of a cluster;第五确定单元,配置为对于所述K个第一聚类簇中的每个第一聚类簇,确定所述每个第一聚类簇中所述第六相似度大于第三阈值的所述第一聚类中心F的第三数量;A fifth determining unit, configured to, for each of the K first clusters, determine all of the first clusters whose sixth similarity is greater than a third threshold in each of the first clusters. Describe the third quantity of the first cluster center F;第六确定单元,配置为将所述K个第一聚类簇中所述第三数量最大的第一聚类簇确定为所述第一聚类簇C;a sixth determining unit, configured to determine the first cluster with the third largest number of the K first clusters as the first cluster C;第三合并单元,配置为将所述单个图像数据与所述第一聚类簇C合并。A third merging unit configured to merge the single image data with the first cluster C.
- 根据权利要求11至17任一项所述的装置,其中,所述M小于或等于第四阈值;所述装置还包括:The apparatus according to any one of claims 11 to 17, wherein the M is less than or equal to a fourth threshold; the apparatus further comprises:第二分割模块,配置为将合并后的第一聚类簇分割为R个第三子簇,并获取所述R个第三子簇中每个第三子簇的第五聚类中心;所述R为大于或等于1的整数;The second dividing module is configured to divide the merged first cluster into R third sub-clusters, and obtain the fifth cluster center of each third sub-cluster in the R third sub-clusters; The above R is an integer greater than or equal to 1;第一更新模块,配置为在所述R小于或等于所述第四阈值的情况下,保留所述R个第三子簇,并用所述R个第三子簇对应的所述第五聚类中心对所述第一聚类中心进行更新;a first update module, configured to retain the R third subclusters when the R is less than or equal to the fourth threshold, and use the fifth cluster corresponding to the R third subclusters The center updates the first cluster center;第二获取模块,配置为在所述R大于所述第四阈值的情况下,获取所述R个第三子簇中每个第三子簇中的图像数据的第四数量;a second acquisition module, configured to acquire a fourth quantity of image data in each of the R third subclusters in the case that the R is greater than the fourth threshold;第二更新模块,配置为根据所述第四数量从大到小对所述R个第三子簇进行排序得到第四聚类簇序列,选取所述第四聚类簇序列中前P个第三子簇,并用所述P个第三子簇对应的所述第五聚类中心对所述第一聚类中心进行更新;所述P小于或等于所述第四阈值。The second update module is configured to sort the R third sub-clusters from large to small according to the fourth number to obtain a fourth cluster sequence, and select the first P-th cluster sequence in the fourth cluster sequence. Three sub-clusters, and the first cluster centers are updated with the fifth cluster centers corresponding to the P third sub-clusters; the P is less than or equal to the fourth threshold.
- 根据权利要求11至17任一项所述的装置,其中,所述第一聚类簇通过对所述第一图像数据集中的图像数据进行聚类得到;所述第一分割模块包括:The apparatus according to any one of claims 11 to 17, wherein the first cluster is obtained by clustering image data in the first image data set; the first segmentation module comprises:获取子模块,配置为获取所述第一聚类簇中的图像数据之间的第七相似度,得到相似度矩阵;an obtaining submodule, configured to obtain the seventh similarity between the image data in the first cluster to obtain a similarity matrix;分割子模块,配置为基于所述相似度矩阵将所述第一聚类簇分割为所述M个第一子簇。A segmentation sub-module, configured to segment the first cluster into the M first sub-clusters based on the similarity matrix.
- 根据权利要求19所述的装置,所述分割子模块包括:The apparatus according to claim 19, the segmentation submodule comprises:第七获取单元,配置为获取以所述第一聚类簇中的图像数据为顶点构成的连通图;a seventh obtaining unit, configured to obtain a connected graph formed by taking the image data in the first cluster as vertices;查询单元,配置为从所述相似度矩阵中查询得到所述连通图中的顶点之间的所述第七相似度;a query unit, configured to query the similarity matrix to obtain the seventh similarity between the vertices in the connected graph;第二分割单元,配置为将所述第七相似度大于第五阈值的多个顶点分割为一个第一子簇,得到所述M个第一子簇。The second dividing unit is configured to divide a plurality of vertices with the seventh similarity greater than the fifth threshold into a first sub-cluster to obtain the M first sub-clusters.
- 一种电子设备,包括输入设备和输出设备,还包括:An electronic device includes an input device and an output device, and also includes:处理器,适于实现一条或多条指令;以及,a processor adapted to implement one or more instructions; and,计算机存储介质,所述计算机存储介质存储有一条或多条指令,所述一条或多条指令适于由所述处理器加载并执行如权利要求1至10任一项所述的方法。A computer storage medium having stored thereon one or more instructions adapted to be loaded by the processor and to perform the method of any one of claims 1 to 10.
- 一种计算机存储介质,所述计算机存储介质存储有一条或多条指令,所述一条或多条指令适于由处理器加载并执行如权利要求1至10任一项所述的方法。A computer storage medium having stored thereon one or more instructions adapted to be loaded by a processor and to perform the method of any one of claims 1 to 10.
- 一种计算机程序产品,所述计算机程序产品包括一条或多条指令,所述一条或多条指令适于由处理器加载并执行如权利要求1至10任一项所述的方法。A computer program product comprising one or more instructions adapted to be loaded by a processor and to perform the method of any one of claims 1 to 10.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2022524182A JP2023502863A (en) | 2020-10-30 | 2020-12-04 | Image incremental clustering method and apparatus, electronic device, storage medium and program product |
KR1020227013791A KR20220070482A (en) | 2020-10-30 | 2020-12-04 | Image incremental clustering method, apparatus, electronic device, storage medium and program product |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011185911.8 | 2020-10-30 | ||
CN202011185911.8A CN112257801B (en) | 2020-10-30 | 2020-10-30 | Incremental clustering method and device for images, electronic equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2022088390A1 true WO2022088390A1 (en) | 2022-05-05 |
Family
ID=74268958
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/134074 WO2022088390A1 (en) | 2020-10-30 | 2020-12-04 | Image incremental clustering method and apparatus, electronic device, storage medium and program product |
Country Status (5)
Country | Link |
---|---|
JP (1) | JP2023502863A (en) |
KR (1) | KR20220070482A (en) |
CN (1) | CN112257801B (en) |
TW (1) | TW202217597A (en) |
WO (1) | WO2022088390A1 (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117152543A (en) * | 2023-10-30 | 2023-12-01 | 山东浪潮科学研究院有限公司 | Image classification method, device, equipment and storage medium |
CN117333926A (en) * | 2023-11-30 | 2024-01-02 | 深圳须弥云图空间科技有限公司 | Picture aggregation method and device, electronic equipment and readable storage medium |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113327195A (en) * | 2021-04-09 | 2021-08-31 | 中科创达软件股份有限公司 | Image processing method and device, image processing model training method and device, and image pattern recognition method and device |
CN113743533B (en) * | 2021-09-17 | 2023-08-01 | 重庆紫光华山智安科技有限公司 | Picture clustering method and device and storage medium |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110103700A1 (en) * | 2007-12-03 | 2011-05-05 | National University Corporation Hokkaido University | Image classification device and image classification program |
CN102129451A (en) * | 2011-02-17 | 2011-07-20 | 上海交通大学 | Method for clustering data in image retrieval system |
CN110866555A (en) * | 2019-11-11 | 2020-03-06 | 广州国音智能科技有限公司 | Incremental data clustering method, device and equipment and readable storage medium |
CN111242040A (en) * | 2020-01-15 | 2020-06-05 | 佳都新太科技股份有限公司 | Dynamic face clustering method, device, equipment and storage medium |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2012140315A1 (en) * | 2011-04-15 | 2012-10-18 | Nokia Corporation | Method, apparatus and computer program product for providing incremental clustering of faces in digital images |
CN103718190B (en) * | 2011-07-29 | 2017-05-24 | 惠普发展公司,有限责任合伙企业 | incremental image clustering |
CN103886048B (en) * | 2014-03-13 | 2017-04-26 | 浙江大学 | Cluster-based increment digital book recommendation method |
US11176206B2 (en) * | 2015-12-01 | 2021-11-16 | International Business Machines Corporation | Incremental generation of models with dynamic clustering |
CN107798354B (en) * | 2017-11-16 | 2022-11-01 | 腾讯科技(深圳)有限公司 | Image clustering method and device based on face image and storage equipment |
CN109886311B (en) * | 2019-01-25 | 2021-08-20 | 北京奇艺世纪科技有限公司 | Incremental clustering method and device, electronic equipment and computer readable medium |
CN111062407B (en) * | 2019-10-15 | 2023-12-19 | 深圳市商汤科技有限公司 | Image processing method and device, electronic equipment and storage medium |
CN110781957B (en) * | 2019-10-24 | 2023-05-30 | 深圳市商汤科技有限公司 | Image processing method and device, electronic equipment and storage medium |
CN111460153B (en) * | 2020-03-27 | 2023-09-22 | 深圳价值在线信息科技股份有限公司 | Hot topic extraction method, device, terminal equipment and storage medium |
-
2020
- 2020-10-30 CN CN202011185911.8A patent/CN112257801B/en active Active
- 2020-12-04 JP JP2022524182A patent/JP2023502863A/en not_active Withdrawn
- 2020-12-04 KR KR1020227013791A patent/KR20220070482A/en active Search and Examination
- 2020-12-04 WO PCT/CN2020/134074 patent/WO2022088390A1/en active Application Filing
- 2020-12-28 TW TW109146540A patent/TW202217597A/en unknown
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20110103700A1 (en) * | 2007-12-03 | 2011-05-05 | National University Corporation Hokkaido University | Image classification device and image classification program |
CN102129451A (en) * | 2011-02-17 | 2011-07-20 | 上海交通大学 | Method for clustering data in image retrieval system |
CN110866555A (en) * | 2019-11-11 | 2020-03-06 | 广州国音智能科技有限公司 | Incremental data clustering method, device and equipment and readable storage medium |
CN111242040A (en) * | 2020-01-15 | 2020-06-05 | 佳都新太科技股份有限公司 | Dynamic face clustering method, device, equipment and storage medium |
Cited By (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN117152543A (en) * | 2023-10-30 | 2023-12-01 | 山东浪潮科学研究院有限公司 | Image classification method, device, equipment and storage medium |
CN117333926A (en) * | 2023-11-30 | 2024-01-02 | 深圳须弥云图空间科技有限公司 | Picture aggregation method and device, electronic equipment and readable storage medium |
CN117333926B (en) * | 2023-11-30 | 2024-03-15 | 深圳须弥云图空间科技有限公司 | Picture aggregation method and device, electronic equipment and readable storage medium |
Also Published As
Publication number | Publication date |
---|---|
JP2023502863A (en) | 2023-01-26 |
CN112257801B (en) | 2022-04-29 |
CN112257801A (en) | 2021-01-22 |
KR20220070482A (en) | 2022-05-31 |
TW202217597A (en) | 2022-05-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2022088390A1 (en) | Image incremental clustering method and apparatus, electronic device, storage medium and program product | |
Han et al. | Semisupervised feature selection via spline regression for video semantic recognition | |
CN106528874B (en) | The CLR multi-tag data classification method of big data platform is calculated based on Spark memory | |
WO2021109464A1 (en) | Personalized teaching resource recommendation method for large-scale users | |
CN104392250A (en) | Image classification method based on MapReduce | |
WO2019080908A1 (en) | Image processing method and apparatus for implementing image recognition, and electronic device | |
CN110751027B (en) | Pedestrian re-identification method based on deep multi-instance learning | |
CN112529068B (en) | Multi-view image classification method, system, computer equipment and storage medium | |
Yao et al. | Spatio-temporal information for human action recognition | |
Yadav et al. | Vid-win: Fast video event matching with query-aware windowing at the edge for the internet of multimedia things | |
WO2023155508A1 (en) | Graph convolutional neural network and knowledge base-based paper correlation analysis method | |
Bartolini et al. | A general framework for real-time analysis of massive multimedia streams | |
Ye et al. | Efficient point cloud segmentation with geometry-aware sparse networks | |
Liu et al. | Application of gcForest to visual tracking using UAV image sequences | |
CN109934852B (en) | Video description method based on object attribute relation graph | |
KR102039244B1 (en) | Data clustering method using firefly algorithm and the system thereof | |
Ła̧giewka et al. | Distributed image retrieval with colour and keypoint features | |
Lu et al. | Video person re-identification using key frame screening with index and feature reorganization based on inter-frame relation | |
Cao et al. | A parallel Adaboost-backpropagation neural network for massive image dataset classification | |
Han et al. | Real-time adversarial GAN-based abnormal crowd behavior detection | |
Wang et al. | MIC-KMeans: a maximum information coefficient based high-dimensional clustering algorithm | |
Tan et al. | A novel image matting method using sparse manual clicks | |
Yan et al. | Alpha matting with image pixel correlation | |
Dhoot et al. | Efficient Dimensionality Reduction for Big Data Using Clustering Technique | |
Hanif et al. | Re-ranking person re-identification using distance aggregation of k-nearest neighbors hierarchical tree |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 2022524182 Country of ref document: JP Kind code of ref document: A Ref document number: 20227013791 Country of ref document: KR Kind code of ref document: A |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20959549 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20959549 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20959549 Country of ref document: EP Kind code of ref document: A1 |