WO2020199773A1 - 图像检索方法及装置和计算机可读存储介质 - Google Patents
图像检索方法及装置和计算机可读存储介质 Download PDFInfo
- Publication number
- WO2020199773A1 WO2020199773A1 PCT/CN2020/075685 CN2020075685W WO2020199773A1 WO 2020199773 A1 WO2020199773 A1 WO 2020199773A1 CN 2020075685 W CN2020075685 W CN 2020075685W WO 2020199773 A1 WO2020199773 A1 WO 2020199773A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- cluster
- image
- vector
- local visual
- sub
- Prior art date
Links
- 238000000034 method Methods 0.000 title claims abstract description 28
- 230000000007 visual effect Effects 0.000 claims abstract description 120
- 239000013598 vector Substances 0.000 claims abstract description 112
- 239000013604 expression vector Substances 0.000 claims abstract description 49
- 238000004422 calculation algorithm Methods 0.000 claims description 43
- 230000001186 cumulative effect Effects 0.000 claims description 18
- 230000009467 reduction Effects 0.000 claims description 17
- 238000012545 processing Methods 0.000 claims description 16
- 238000009825 accumulation Methods 0.000 claims description 13
- 238000007621 cluster analysis Methods 0.000 claims description 13
- 238000012549 training Methods 0.000 claims description 12
- 238000004458 analytical method Methods 0.000 claims description 10
- 238000010606 normalization Methods 0.000 claims description 6
- 230000035508 accumulation Effects 0.000 description 11
- 238000010586 diagram Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 4
- 238000012360 testing method Methods 0.000 description 4
- 230000009286 beneficial effect Effects 0.000 description 3
- 238000001514 detection method Methods 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 230000003044 adaptive effect Effects 0.000 description 2
- 230000011218 segmentation Effects 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000000513 principal component analysis Methods 0.000 description 1
- 230000003068 static effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2411—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
Definitions
- the present disclosure relates to the field of image processing technology, and in particular to an image retrieval method and device, electronic equipment, and computer-readable storage media.
- image retrieval retrieves a list of images similar to the image to be recognized and corresponding content such as text, video or web page link, which has extremely promising application prospects.
- the existing related image retrieval algorithm workflow includes: extracting the visual features of the image to be recognized; constructing a feature index or image expression; making similarity judgments based on the image features or the distance of the image expression (such as Euclidean distance, cosine distance, etc.); and Give a list of similar images.
- an image retrieval method including:
- a list of candidate images whose similarity with the expression vector exceeds the similarity threshold is retrieved from the candidate image library.
- determining the expression vector of the image to be recognized based on the at least one local visual feature includes:
- the image classification quantizer determines the cluster and subclass to which each of the at least one local visual feature belongs, and the image classification quantizer includes K1 clusters and corresponding K1 cluster centers, and the The K2 sub-categories cut out by each of the K1 clusters and the distance threshold of each sub-category; and
- the accumulated sum vector is the expression vector of the image to be recognized, where K1 And K2 are positive integers.
- determining the cluster and subcategory to which each of the at least one local visual feature belongs includes:
- Determining that the cluster cluster where the cluster center closest to the local visual feature is located is the cluster cluster to which the local visual feature belongs;
- the image retrieval method further includes:
- the dimensionality reduction algorithm is used to perform dimensionality reduction processing on the long vector to obtain a dimensionality-reduced long vector, and the dimensionality-reduced long vector is the expression vector of the image to be recognized.
- the image classification quantizer is trained through the following steps, including:
- the distance between the local visual feature in the cluster and the cluster center of the cluster and the maximum distance value are obtained; based on the maximum distance value, the cluster is divided into K2 sub-categories, get the distance threshold of each sub-category,
- the number of obtained distance thresholds is K1*K2, and the sub-categories in the same cluster do not overlap each other.
- the cluster cluster is divided into K2 sub-categories based on the maximum distance value, and the distance boundary threshold of each sub-category is obtained, including:
- the maximum distance value is divided into K2 segments, the local visual features corresponding to each segment constitute a sub-category, and K2 sub-categories are obtained; and the maximum distance between each segment and the cluster center of the cluster The distance is the distance threshold of the corresponding sub-category of each segment.
- the cluster analysis algorithm includes the K-means algorithm.
- an image retrieval device including:
- the memory is coupled to the processor, and stores executable instructions, which when executed by the processor enable the processor to be configured as:
- a list of candidate images whose similarity with the expression vector exceeds the similarity threshold is retrieved from the candidate image library.
- the processor is further configured to:
- the image classification quantizer determines the cluster and subclass to which each of the at least one local visual feature belongs, and the image classification quantizer includes K1 clusters and corresponding K1 cluster centers, and the The K2 sub-categories cut out by each of the K1 clusters and the distance threshold of each sub-category; and
- the cumulative sum vector is the expression vector of the image to be recognized, where K1 and K2 are positive integers.
- the processor is further configured to:
- Determining that the cluster cluster where the cluster center closest to the local visual feature is located is the cluster cluster to which the local visual feature belongs;
- the processor is further configured to:
- the dimensionality reduction algorithm is used to perform dimensionality reduction processing on the long vector to obtain a dimensionality-reduced long vector, and the dimensionality-reduced long vector is the expression vector of the image to be recognized.
- the image classification quantizer is trained through the following steps, including:
- the number of obtained distance thresholds is K1*K2, and the sub-categories in the same cluster do not overlap each other.
- a computer-readable storage medium having computer-executable instructions stored thereon, which is characterized in that, when the instructions are executed by a processor, the instructions in any one of claims 1 to 7 are implemented. The steps of the method.
- Fig. 1 is a block diagram of an image retrieval method according to an embodiment of the present disclosure
- FIG. 2 is a flowchart of obtaining an expression vector according to an embodiment of the present disclosure
- FIG. 3 is a flowchart of obtaining a distance threshold according to an embodiment of the present disclosure
- FIG. 4 is a flowchart of obtaining cluster cluster sub-categories according to an embodiment of the present disclosure
- FIG. 5 is a schematic diagram showing the effect of dividing K1 clusters and K2 sub-categories according to an embodiment of the present disclosure
- Fig. 6 is another flow chart for obtaining expression vectors according to an embodiment of the present disclosure.
- FIGS. 7 to 10 are block diagrams of an image detection device shown in embodiments of the present disclosure.
- FIG. 11 is a block diagram of an electronic device according to an embodiment of the present disclosure.
- FIG. 12 is a computer-readable storage medium 1200 according to a disclosed embodiment.
- image retrieval can retrieve a list of images similar to the image to be recognized and corresponding content such as text, video or web page links by inputting an image to be recognized, which has extremely promising application prospects.
- the existing related image retrieval algorithm workflow includes: extracting the visual features of the image to be recognized; constructing a feature index or image expression; making similarity judgments based on the image features or the distance of the image expression (such as Euclidean distance, cosine distance, etc.); and Give a list of similar images.
- the related technology only uses part of the visual features and distance information in the image to be recognized, and does not make full use of the visual features of the image to be recognized, which is not conducive to improving the accuracy and efficiency of retrieval of the image to be recognized.
- the embodiments of the present disclosure provide an image retrieval method.
- the public idea is to obtain the local visual features of the image to be recognized, and then determine the residual vector of the local visual feature, and use the residual vector to form the The expression vector of the image, and then the candidate image list can be determined based on the expression vector.
- the residual vector can be used to make full use of the features of the image to be recognized, which is beneficial to improve the accuracy of image retrieval.
- FIG. 1 is a block diagram of an image retrieval method shown in an embodiment of the present disclosure.
- an image retrieval method can be applied to electronic devices, such as smart phones, tablet computers, personal computers, etc., including steps 101 to 103 ,among them:
- step 101 at least one local visual feature of the image to be recognized is acquired.
- the electronic device can acquire the image to be recognized input by the user, and then the electronic device can call the visual feature acquisition algorithm to extract the local visual features of the image to be recognized.
- the visual feature acquisition algorithm may include Scale-invariant Feature Transform (SIFT) algorithm, SURF (Speed Up Robust Features, SURF) algorithm, ORB (Oriented FAST and Rotated BRIE) algorithm, etc.
- SIFT Scale-invariant Feature Transform
- SURF Speed Up Robust Features
- ORB Oriented FAST and Rotated BRIE
- obtaining local visual features includes:
- Direction determination Based on the local gradient direction of the image to be recognized, one or more directions are assigned to each key point position. In the subsequent process, the image to be recognized is transformed based on the direction, scale and position of the key points.
- step 102 an expression vector of the image to be recognized is determined based on the at least one local visual feature.
- the electronic device may determine the residual vector of the local visual feature based on the local visual feature, and then the expression vector of the image to be recognized is formed by the residual vector.
- the electronic device can call a pre-trained and stored image classification quantizer.
- the image classification quantizer is a quantitative model constructed by clustering analysis of all visual features using, for example, the k-means algorithm. After the image classification quantizer obtains the local visual features, it can determine the cluster and sub-categories to which each local visual feature belongs.
- the image classification quantizer includes K1 clusters and the corresponding K1 cluster centers, as well as K2 sub-categories divided from each cluster in the K1 clusters and the distance boundary threshold of each sub-category (corresponding step 201). Among them, K1 and K2 are positive integers.
- the local visual features are described by vector representation, and the center of each cluster can also be described by vector representation. Therefore, the difference between the local visual feature and the cluster center corresponding to the cluster cluster can get the residual of the local visual feature vector.
- the electronic device determining the cluster cluster and subcategory to which each local visual feature belongs may include: the electronic device can use a clustering analysis algorithm to obtain the distance between the local visual feature and each cluster center And the residual vector (corresponding to step 301). Then, the electronic device can determine that the cluster cluster where the cluster center closest to the local visual feature is located is the cluster cluster to which the local visual feature belongs (corresponding to step 302). After that, the electronic device may determine the sub-category to which the local visual feature belongs based on the distance and the distance boundary threshold of each sub-category in the cluster (corresponding to step 303).
- the training step of the image classification quantizer may include: the electronic device may obtain an image training set. Then, the electronic device can obtain the local visual features of each image in the image training set (corresponding to step 401).
- the electronic device uses a clustering analysis algorithm to perform cluster analysis on the local visual features to obtain K1 clusters and cluster centers of each cluster (corresponding to step 402).
- the clustering analysis algorithm may include the K-means algorithm, of course, other clustering algorithms may also be selected, which is not limited here.
- the electronic device obtains the distance between the local visual feature in the cluster and the cluster center of the cluster and the maximum distance value; based on the maximum distance value, the cluster is divided into K2 sub Class, get the distance threshold of each subclass.
- the number of distance thresholds obtained here is K1*K2, and the sub-categories in the same cluster do not overlap each other (corresponding to step 403).
- each distance is equally divided into K2 categories according to the maximum distance.
- K2 is formed by taking the cluster center of the cluster as the center and the segmentation position of the maximum distance as the radius. Concentric circles, the local visual features located in each circle area are regarded as a sub-category, and the segmentation position is the distance threshold. That is, in this embodiment, the maximum distance value can be divided into K2 segments, and the local visual features in the area formed by each segment are regarded as a sub-category. The boundary between the area corresponding to each segment and the area corresponding to other segments can be used as the distance boundary threshold. Refer to Fig.
- the cluster K11 can be further divided into K2 sub-categories.
- K2 is equal to 3
- the 3 sub-categories can include sub-categories K21, K22, and K23.
- the distance thresholds of each sub-category are respectively L1, L2 and L3.
- the electronic device can obtain and accumulate residual vectors of all local visual features belonging to the same subcategory, thereby obtaining the accumulated sum vector of K1*K2 residual vectors.
- the accumulated sum vector is the waiting Identify the expression vector of the image (corresponding to step 202).
- each cluster can get K2 sub-categories, and the more detailed local visual features of the image to be recognized can be obtained, that is, the obtained expression vector can reflect more accurately The more detailed local visual features of the image to be recognized help to improve the retrieval accuracy.
- the electronic device may perform normalization processing on the accumulation sum vector, respectively, to obtain a normalized accumulation sum vector (corresponding to step 601). Then, the electronic device connects the K1*K2 normalized accumulation and vectors end to end to obtain a long vector (corresponding to step 602). After that, the electronic device uses the dimensionality reduction algorithm to perform dimensionality reduction processing on the long vector to obtain the dimensionality-reduced long vector, and the dimensionality-reduced long vector is the expression vector of the image to be recognized (corresponding to step 602). In this way, by processing the cumulative sum vector in this embodiment, the dimension of the vector can be reduced, which is beneficial to reduce the amount of data processing and improve the retrieval efficiency.
- quantifying each local visual feature of the electronic device may include: using the k-means quantity, that is, calculating the closest one of each local visual feature to K1 cluster centers, indicating that the local visual feature belongs to the cluster cluster.
- the electronic device also calculates the residual vector and distance value between the local visual feature and the cluster center. Then, according to the distance value, it can be determined which of the K2 areas the local visual feature is located, so as to determine the subclass to which the local visual feature belongs.
- the above residual cumulative sum vector is the cumulative sum vector.
- the dimensionality reduction algorithm may include a principal component analysis algorithm (Principal Components Analysis, PCA). Of course, technical personnel may also choose other dimensionality reduction algorithms, which are not limited here.
- the training step of the image classification quantizer by the electronic device has been completed.
- the electronic device obtains the K1 cluster centers, K2 distance thresholds obtained in the training process, and the long vector after PCA dimensionality reduction.
- the electronic device obtains the local visual features of each test image and obtains its expression vector, and uses the PCA algorithm to reduce the dimensionality of the expression vector.
- the electronic device calculates the similarity between the test image and the training image, using calculation methods such as cosine distance, Euclidean distance, etc., and selects the training image with the highest similarity to the test image to complete the test process.
- step 103 a list of candidate images whose similarity with the expression vector exceeds the similarity threshold is retrieved from the candidate image library.
- the electronic device respectively calculates the similarity between the expression vector of the image to be recognized and the expression vector of each candidate image in the candidate image library, and compares the similarity with the similarity threshold. If the similarity exceeds the similarity threshold, then Add candidate images to the candidate image list until the candidate image library is filtered out or the number of candidate images set in advance is reached.
- the local visual features of the image to be recognized can be acquired, and then the expression vector of the image to be recognized composed of the residual vector of the local visual feature can be determined, and then the similarity with the expression vector can be retrieved A list of candidate images that exceed the similarity threshold.
- the residual vector of the local visual feature is used in this embodiment, which is beneficial to further express the visual feature of the image to be recognized, thereby improving the accuracy of searching for candidate images.
- FIG. 7 is a block diagram of an image detection device shown in an embodiment of the present disclosure.
- an image retrieval device 700 includes:
- the visual feature acquisition module 701 is configured to acquire at least one local visual feature of the image to be recognized
- the expression vector determining module 702 is configured to determine an expression vector of the image to be recognized based on the at least one local visual feature, the expression vector being composed of a residual vector of the at least one local visual feature;
- the image retrieval module 703 is configured to retrieve a list of candidate images whose similarity with the expression vector exceeds the similarity threshold from the candidate image library.
- the expression vector determining module 702 includes:
- the cluster determining unit 801 is configured to call an image classification quantizer, and the image classification quantizer determines the cluster cluster and subclass to which each of the at least one local visual feature belongs; the image classification quantizer includes K1 clusters and corresponding K1 cluster centers, as well as K2 sub-categories divided from each cluster in the K1 clusters and the distance dividing threshold of each sub-category;
- the sum vector obtaining unit 802 is configured to obtain and accumulate residual vectors of all local visual features belonging to the same subcategory, thereby obtaining the cumulative sum vector of K1*K2 residual vectors, and the cumulative sum vector is the waiting Identify the expression vector of the image, where K1 and K2 are positive integers.
- the cluster determination unit 801 includes:
- the distance obtaining subunit 901 is configured to obtain the distance and residual vector between the local visual feature and each cluster center by using a cluster analysis algorithm
- a cluster cluster obtaining subunit 902 configured to determine that the cluster cluster with the closest cluster center to the local visual feature is the cluster cluster to which the local visual feature belongs;
- the sub-category obtaining sub-unit 903 is configured to determine the sub-category to which the local visual feature belongs based on the distance and the distance boundary threshold of each sub-category in the cluster.
- the sum vector acquiring unit 802 further includes:
- the sum vector obtaining subunit 1001 is configured to perform normalization processing on the accumulation sum vector respectively to obtain a normalized accumulation sum vector;
- the long vector connection subunit 1002 is used to connect the normalized accumulation and vector end to end to obtain a long vector
- the expression vector obtaining subunit 1003 is configured to perform dimensionality reduction processing on the long vector by using a dimensionality reduction algorithm to obtain a dimensionality-reduced long vector, and the dimensionality-reduced long vector is the expression vector of the image to be recognized.
- the image classification quantizer is trained through the following steps, including:
- the distance between the local visual feature in the cluster and the cluster center of the cluster and the maximum distance value are obtained; based on the maximum distance value, the cluster is divided into K2 sub-categories, get the distance threshold of each sub-category.
- the number of the obtained distance thresholds is K1*K2, and the sub-categories in the same cluster do not overlap each other.
- FIG. 11 is a block diagram of an electronic device shown in an embodiment of the present disclosure.
- an electronic device 1100 includes a processor 1101 and a memory 1102 for storing executable instructions; the processor 1101 uses a communication bus 1103 It is connected to the memory 1102 and is used to read executable instructions from the memory 1102 to implement the steps of the image retrieval method shown in FIGS. 1 to 6.
- an embodiment of the present disclosure also provides a computer-readable storage medium 1200 on which computer-executable instructions are stored.
- the image retrieval method shown in FIGS. 1 to 6 is implemented. A step of.
- Computer-readable media include permanent/non-permanent, volatile/non-volatile, removable/non-removable media, and information storage can be achieved by any method or technology.
- the information can be computer-readable instructions, data structures, program modules, or other data.
- Examples of computer storage media include, but are not limited to, phase change memory (PRAM), static random access memory (SRAM), dynamic random access memory (DRAM), other types of random access memory (RAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disc (DVD) or other optical storage, Magnetic cassettes, magnetic tape magnetic disk storage or other magnetic storage devices or any other non-transmission media can be used to store information that can be accessed by computing devices. According to the definition in this article, computer-readable media does not include transitory media, such as modulated data signals and carrier waves.
- PRAM phase change memory
- SRAM static random access memory
- DRAM dynamic random access memory
- RAM random access memory
- ROM read-only memory
- EEPROM electrically erasable programmable read-only memory
- flash memory or other memory technology
- CD-ROM compact disc
- DVD digital versatile disc
- Magnetic cassettes magnetic tape magnetic disk storage or other magnetic storage devices or any other non
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Bioinformatics & Computational Biology (AREA)
- Artificial Intelligence (AREA)
- Evolutionary Biology (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Image Analysis (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (13)
- 一种图像检索方法,包括:获取待识别图像的至少一个局部视觉特征;基于所述至少一个局部视觉特征确定所述待识别图像的表达向量,所述表达向量由所述至少一个局部视觉特征的残差向量构成;以及从候选图像库中检索出与所述表达向量相似度超过相似度阈值的候选图像列表。
- 根据权利要求1所述的图像检索方法,其中,基于所述至少一个局部视觉特征确定所述待识别图像的表达向量,包括:由图像分类量化器确定所述至少一个局部视觉特征中的每一个所属的聚类簇和子类,所述图像分类量化器包括K1个聚类簇和对应的K1个聚类中心,以及所述K1个聚类簇中的每一个聚类簇切分出的K2个子类和每个子类的距离分界阈值;以及获取属于同一个子类的所有局部视觉特征的残差向量并累加,从而得到K1*K2个残差向量的累加和向量,所述累加和向量即为所述待识别图像的表达向量,其中K1和K2为正整数。
- 根据权利要求2所述的图像检索方法,其中,确定所述至少一个局部视觉特征中的每一个所属的聚类簇和子类包括:利用聚类分析算法获取该局部视觉特征与各聚类中心的距离和残差向量;确定与所述局部视觉特征距离最近的聚类中心所在的聚类簇为所述局部视觉特征所属的聚类簇;以及基于所述距离和所述聚类簇内各子类的距离分界阈值确定所述局部视觉特征所属的子类。
- 根据权利要求2所述的图像检索方法,还包括:对所述累加和向量分别进行归一化处理,得到归一化后的累加和向量;将归一化后的累加和向量首尾连接得到一个长向量;以及利用降维算法对所述长向量进行降维处理,得到降维后的长向量,所述降维后 的长向量即为所述待识别图像的表达向量。
- 根据权利要求2所述的图像检索方法,其中,所述图像分类量化器通过以下步骤训练,包括:获取图像训练集中每一个图像的局部视觉特征;利用聚类分析算法对所述局部视觉特征进行聚类分析,得到所述K1个聚类簇和各聚类簇的聚类中心;以及针对K1个聚类簇中的每一个,获取该聚类簇内局部视觉特征与该聚类簇的聚类中心的距离以及最大距离值;基于所述最大距离值将该聚类簇切分为K2个子类,得到每个子类的距离分界阈值,其中,得到的距离分界阈值的数量为K1*K2个,同一个聚类簇中的各子类互不重合。
- 根据权利要求5所述的图像检索方法,其中,基于所述最大距离值将该聚类簇切分为K2个子类,得到每个子类的距离分界阈值,包括:将所述最大距离值划分为K2个分段,每个分段对应的局部视觉特征构成一个子类,得到K2个子类;并且,每个分段与所述聚类簇的聚类中心的最大距离为所述每个分段对应子类的距离分界阈值。
- 根据权利要求5所述的图像检索方法,其中,所述聚类分析算法包括K-means算法。
- 一种图像检索装置,包括:处理器;以及存储器,与所述处理器耦连,并存储可执行指令,所述可执行指令在由所述处理器执行时使得所述处理器被配置为:获取待识别图像的至少一个局部视觉特征;基于所述至少一个局部视觉特征确定所述待识别图像的表达向量,所述表达向量由所述至少一个局部视觉特征的残差向量构成;以及从候选图像库中检索出与所述表达向量相似度超过相似度阈值的候选图像列表。
- 根据权利要求8所述的图像检索装置,其中,所述处理器还被配置为:由图像分类量化器确定所述至少一个局部视觉特征中的每一个所属的聚类簇和子类,所述图像分类量化器包括K1个聚类簇和对应的K1个聚类中心,以及所述K1个聚类簇中的每一个聚类簇切分出的K2个子类和每个子类的距离分界阈值;以及获取属于同一个子类的所有局部视觉特征的残差向量,并累加,从而得到K1*K2个残差向量的累加和向量,所述累加和向量即为所述待识别图像的表达向量,其中K1和K2为正整数。
- 根据权利要求9所述的图像检索装置,其中,所述处理器还被配置为:利用聚类分析算法获取该局部视觉特征与各聚类中心的距离和残差向量;确定与所述局部视觉特征距离最近的聚类中心所在的聚类簇为所述局部视觉特征所属的聚类簇;以及基于所述距离和所述聚类簇内各子类的距离分界阈值确定所述局部视觉特征所属的子类。
- 根据权利要求9所述的图像检索装置,其中,所述处理器还被配置为:对所述累加和向量分别进行归一化处理,得到归一化后的累加和向量;将归一化后的累加和向量首尾连接得到一个长向量;以及利用降维算法对所述长向量进行降维处理,得到降维后的长向量,所述降维后的长向量即为所述待识别图像的表达向量。
- 根据权利要求9所述的图像检索装置,其中,所述图像分类量化器通过以下步骤训练,包括:获取图像训练集中每个图像的局部视觉特征;利用聚类分析算法对所述局部视觉特征进行聚类分析,得到所述K1个聚类簇和各聚类簇的聚类中心;以及针对K1个聚类簇中的每一个,获取该聚类簇内局部视觉特征与所述聚类簇的聚类中心的距离以及最大距离值;基于所述最大距离值将该聚类簇切分为K2个子类,得到各子类的距离分界阈值,其中,得到的距离分界阈值的数量为K1*K2个,同一个聚类簇中的各子类互不 重合。
- 一种计算机可读存储介质,其上存储有计算机可执行指令,其特征在于,该指令被处理器执行时实现权利要求1~7任一项所述方法的步骤。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910273228.0A CN111783805B (zh) | 2019-04-04 | 2019-04-04 | 图像检索方法及装置、电子设备、可读存储介质 |
CN201910273228.0 | 2019-04-04 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2020199773A1 true WO2020199773A1 (zh) | 2020-10-08 |
Family
ID=72664894
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/075685 WO2020199773A1 (zh) | 2019-04-04 | 2020-02-18 | 图像检索方法及装置和计算机可读存储介质 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN111783805B (zh) |
WO (1) | WO2020199773A1 (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112926406A (zh) * | 2021-02-02 | 2021-06-08 | 广东嘉铭智能科技有限公司 | 一种自动化免疫荧光检测方法、装置、设备及系统 |
CN113506284A (zh) * | 2021-07-26 | 2021-10-15 | 电子科技大学 | 一种眼底图像微血管瘤检测装置、方法及存储介质 |
CN115035556A (zh) * | 2021-03-03 | 2022-09-09 | 北京迈格威科技有限公司 | 人脸检索方法、装置、电子设备及存储介质 |
CN115841488A (zh) * | 2023-02-21 | 2023-03-24 | 聊城市飓风工业设计有限公司 | 一种基于计算机视觉的pcb板的检孔方法 |
WO2023222091A1 (zh) * | 2022-05-18 | 2023-11-23 | 华为技术有限公司 | 一种向量检索方法及装置 |
Families Citing this family (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113204665B (zh) * | 2021-04-28 | 2023-09-22 | 北京百度网讯科技有限公司 | 图像检索方法、装置、电子设备及计算机可读存储介质 |
CN113536020B (zh) * | 2021-07-23 | 2022-05-24 | 贝壳找房(北京)科技有限公司 | 数据查询的方法、存储介质和计算机程序产品 |
CN113792171B (zh) * | 2021-11-15 | 2022-02-18 | 西安热工研究院有限公司 | 基于内存管理的图像检索方法、系统、设备及存储介质 |
Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105335757A (zh) * | 2015-11-03 | 2016-02-17 | 电子科技大学 | 一种基于局部特征聚合描述符的车型识别方法 |
CN106326395A (zh) * | 2016-08-18 | 2017-01-11 | 北京大学 | 一种局部视觉特征选择方法及装置 |
CN108563777A (zh) * | 2018-04-24 | 2018-09-21 | 京东方科技集团股份有限公司 | 一种获得图像表示的方法和装置 |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2003256429A (ja) * | 2002-03-01 | 2003-09-12 | Kyodo Printing Co Ltd | 模様検索装置及びその方法 |
CN103092861B (zh) * | 2011-11-02 | 2016-01-06 | 阿里巴巴集团控股有限公司 | 一种商品代表图的选取方法和系统 |
GB2516037A (en) * | 2013-07-08 | 2015-01-14 | Univ Surrey | Compact and robust signature for large scale visual search, retrieval and classification |
CN104216949A (zh) * | 2014-08-13 | 2014-12-17 | 中国科学院计算技术研究所 | 一种融合空间信息的图像特征聚合表示方法及系统 |
CN108805183B (zh) * | 2018-05-28 | 2022-07-26 | 南京邮电大学 | 一种融合局部聚合描述符和局部线性编码的图像分类方法 |
-
2019
- 2019-04-04 CN CN201910273228.0A patent/CN111783805B/zh active Active
-
2020
- 2020-02-18 WO PCT/CN2020/075685 patent/WO2020199773A1/zh active Application Filing
Patent Citations (3)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105335757A (zh) * | 2015-11-03 | 2016-02-17 | 电子科技大学 | 一种基于局部特征聚合描述符的车型识别方法 |
CN106326395A (zh) * | 2016-08-18 | 2017-01-11 | 北京大学 | 一种局部视觉特征选择方法及装置 |
CN108563777A (zh) * | 2018-04-24 | 2018-09-21 | 京东方科技集团股份有限公司 | 一种获得图像表示的方法和装置 |
Cited By (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112926406A (zh) * | 2021-02-02 | 2021-06-08 | 广东嘉铭智能科技有限公司 | 一种自动化免疫荧光检测方法、装置、设备及系统 |
CN115035556A (zh) * | 2021-03-03 | 2022-09-09 | 北京迈格威科技有限公司 | 人脸检索方法、装置、电子设备及存储介质 |
CN113506284A (zh) * | 2021-07-26 | 2021-10-15 | 电子科技大学 | 一种眼底图像微血管瘤检测装置、方法及存储介质 |
CN113506284B (zh) * | 2021-07-26 | 2023-05-09 | 电子科技大学 | 一种眼底图像微血管瘤检测装置、方法及存储介质 |
WO2023222091A1 (zh) * | 2022-05-18 | 2023-11-23 | 华为技术有限公司 | 一种向量检索方法及装置 |
CN115841488A (zh) * | 2023-02-21 | 2023-03-24 | 聊城市飓风工业设计有限公司 | 一种基于计算机视觉的pcb板的检孔方法 |
Also Published As
Publication number | Publication date |
---|---|
CN111783805B (zh) | 2024-08-23 |
CN111783805A (zh) | 2020-10-16 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2020199773A1 (zh) | 图像检索方法及装置和计算机可读存储介质 | |
CN108920580B (zh) | 图像匹配方法、装置、存储介质及终端 | |
US8805117B2 (en) | Methods for improving image search in large-scale databases | |
Tabia et al. | Covariance-based descriptors for efficient 3D shape matching, retrieval, and classification | |
US10474872B2 (en) | Fingerprint matching using virtual minutiae | |
CN111914921A (zh) | 一种基于多特征融合的相似性图像检索方法及系统 | |
US20190205693A1 (en) | Scale-Invariant Feature Point Extraction in Edge Map | |
CN104199842A (zh) | 一种基于局部特征邻域信息的相似图片检索方法 | |
CN110941989A (zh) | 图像校验、视频校验方法、装置、设备及存储介质 | |
US20210312215A1 (en) | Method for book recognition and book reading device | |
US20180307940A1 (en) | A method and a device for image matching | |
US9870517B2 (en) | Image object retrieval | |
CN113343920A (zh) | 人脸识别照片的分类方法、装置、电子设备和存储介质 | |
US11449704B2 (en) | Searching in multilevel clustered vector-based data | |
CN112445926B (zh) | 一种图像检索方法以及装置 | |
WO2019100348A1 (zh) | 图像检索方法和装置以及图像库的生成方法和装置 | |
CN110083731B (zh) | 图像检索方法、装置、计算机设备及存储介质 | |
CN109829383B (zh) | 掌纹识别方法、装置和计算机设备 | |
Zhang et al. | Handwritten word image matching based on Heat Kernel Signature | |
JP6017277B2 (ja) | 特徴ベクトルの集合で表されるコンテンツ間の類似度を算出するプログラム、装置及び方法 | |
Vinay et al. | A double filtered GIST descriptor for face recognition | |
CN111008294B (zh) | 交通图像处理、图像检索方法及装置 | |
CN113688263B (zh) | 用于搜索图像的方法、计算设备和存储介质 | |
CN113657378B (zh) | 车辆跟踪方法、车辆跟踪系统和计算设备 | |
CN115482398A (zh) | 一种外观侵权的检测方法、装置及存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20782417 Country of ref document: EP Kind code of ref document: A1 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20782417 Country of ref document: EP Kind code of ref document: A1 |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20782417 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 04.05.2022) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20782417 Country of ref document: EP Kind code of ref document: A1 |