WO2021031645A1 - 图像处理方法及装置、电子设备和存储介质 - Google Patents
图像处理方法及装置、电子设备和存储介质 Download PDFInfo
- Publication number
- WO2021031645A1 WO2021031645A1 PCT/CN2020/092391 CN2020092391W WO2021031645A1 WO 2021031645 A1 WO2021031645 A1 WO 2021031645A1 CN 2020092391 W CN2020092391 W CN 2020092391W WO 2021031645 A1 WO2021031645 A1 WO 2021031645A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- feature
- image
- category
- processed
- class center
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/168—Feature extraction; Face representation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/50—Information retrieval; Database structures therefor; File system structures therefor of still image data
- G06F16/58—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/583—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/23—Clustering techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/24—Classification techniques
- G06F18/241—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
- G06F18/2413—Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on distances to training or reference patterns
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/70—Arrangements for image or video recognition or understanding using pattern recognition or machine learning
- G06V10/762—Arrangements for image or video recognition or understanding using pattern recognition or machine learning using clustering, e.g. of similar faces in social networks
- G06V10/763—Non-hierarchical techniques, e.g. based on statistics of modelling distributions
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/172—Classification, e.g. identification
Definitions
- the present disclosure relates to the field of computer technology, and in particular to an image processing method and device, electronic equipment, and storage medium.
- face retrieval has been widely used, especially when solving a case in the public security industry, it is necessary to retrieve images of unidentified suspects in massive portrait databases.
- the commonly used face retrieval method is to compare the retrieved pictures with the database pictures one by one.
- the present disclosure proposes an image processing technical solution.
- an image processing method including: performing feature extraction on an image to be processed to obtain a first feature of the image to be processed; according to the first feature and multiple reference image categories in a feature library Determine the image category of the image to be processed; in the case that the image category of the image to be processed is the first category of the multiple reference image categories, according to the first feature and the The multiple feature information of the first category in the feature database is updated to update the category center feature of the first category.
- the image category of the image to be processed can be determined according to the first feature of the image to be processed and the class center feature of the reference image category; when the image to be processed is an existing first category, according to the first feature And multiple feature information of the first category, update the class center feature of the first category to realize the clustering of the image to be processed, so that the image retrieval can be clustered and then retrieved, which improves the accuracy and recall rate of image retrieval, and Through the comparison of the image to be processed and the center feature of the class, the number of image comparisons during retrieval is reduced, and the speed of image retrieval is increased.
- the features of the image to be processed can be added to the feature library, which increases the number of images and features corresponding to each category, and further improves Retrieval accuracy rate.
- the method further includes: in the case that the image category of the image to be processed is not a category of the multiple reference image categories, performing the first feature of the image to be processed Class center extraction, to obtain the class center feature of the second category of the image to be processed; add the first feature and the class center feature of the second category to the feature library, and add the second category Added to the multiple reference image categories.
- the feature information and image category in the feature library can be updated as new images increase, thereby The feature information and image categories in the feature library can be continuously enriched, and the accuracy of image retrieval can be improved.
- the method further includes: in the case that the image category of the image to be processed is not a category of the multiple reference image categories, performing the first feature of the image to be processed Clustering to obtain one or more third categories; respectively perform class center extraction on each third category to obtain the class center features of the third category; combine the class center features of the first feature and the third category Adding to the feature library, and adding the third category to the multiple reference image categories.
- the image to be processed when the image to be processed is a plurality of pictures, the image to be processed can be clustered when the image category matching fails to obtain one or more new image categories, and the new image category and class center feature Adding to the feature library allows the feature information and image categories in the feature library to be updated as new images increase, so that the feature information and image categories in the feature library can be continuously enriched, and the accuracy of image retrieval can be improved.
- determining the image category of the image to be processed according to the first feature and the class center features of multiple reference image categories in the feature library includes: acquiring the first feature and multiple reference image categories. A plurality of first distances between class center features; in the case that the second distance with the smallest distance value among the plurality of first distances is less than or equal to the distance threshold, the image category of the image to be processed is determined to be the same as that of the first distance The first category corresponding to the two distances.
- the image category of the image to be processed is determined by the relationship between the second distance and the distance threshold, and when the second distance is less than or equal to the distance threshold, it is determined that the image category of the image to be processed is relative to the object at the second distance.
- the first category simple and fast, can improve the efficiency and accuracy of image classification.
- determining the image category of the image to be processed according to the first feature and the class center features of multiple reference image categories in the feature library includes: when the second distance is greater than the In the case of the distance threshold, it is determined that the image category of the image to be processed is not the category of the multiple reference image categories.
- the second distance when the second distance is greater than the distance threshold, it can be considered that the image to be processed does not belong to any one of the multiple reference image categories in the feature library, and a new image category needs to be determined for the image to be processed, so Can improve the accuracy of image classification.
- the class center feature includes N class center features, where N is a positive integer
- acquiring multiple first distances between the first feature and the multiple class center features includes : Perform quantization processing on the N class center features to obtain N feature vectors; obtain N third distances between the first feature and the N feature vectors; determine the N third distances K class center features corresponding to the smallest K approximate distances; determine the K first distances between the first feature and the K class center features, K is a positive integer and K ⁇ N.
- the method further includes: extracting the class centers of the feature information of each reference image category in the feature library to obtain the class center features of each image category.
- the class center features of each reference image category are obtained, which can improve the accuracy of the class center features.
- performing feature extraction on the image to be processed to obtain the first feature of the image to be processed includes: performing feature extraction on the image to be processed to obtain the second feature of the image to be processed; The second feature is normalized to obtain the first feature of the image to be processed.
- the second feature of the image to be processed is normalized, and the normalized feature value is used as the first feature of the image to be processed, so that the feature values of the first feature are all within a certain range, whereby, the complexity of calculation can be reduced and the calculation efficiency can be improved.
- the method further includes: re-clustering the feature information of the multiple fourth categories in the case that the multiple fourth categories in the feature library correspond to the same object, Obtain the fifth category; perform class center extraction on the fifth category to obtain the class center features of the fifth category; add the class center features of the fifth category to the feature library, and add the first Five categories are added to multiple reference image categories.
- multiple image categories of the same object in the feature library can be merged by re-clustering, so as to improve the accuracy of image classification, and thus the accuracy of image retrieval.
- the method further includes: deleting the multiple fourth category center features from the feature library, and deleting the multiple first category features from the multiple reference image categories. Four categories.
- the efficiency of image retrieval can be improved by deleting the class center features that do not exist in the feature library and the image categories that do not exist in the multiple reference image categories.
- an image processing device including: a first feature extraction module, configured to perform feature extraction on an image to be processed, to obtain the first feature of the image to be processed;
- the first feature and the class center feature of the multiple reference image categories in the feature library determine the image category of the image to be processed; the first update module is used to determine the image category of the image to be processed
- the class center feature of the first category is updated according to the first feature and the multiple feature information of the first category in the feature library.
- the device further includes: a second feature extraction module, which is configured to: if the image category of the image to be processed is not one of the multiple reference image categories, Perform class center extraction on the first feature of the image to be processed to obtain the class center feature of the second category of the image to be processed; the second update module is used to combine the first feature and the class center feature of the second category Adding to the feature library, and adding the second category to the multiple reference image categories.
- a second feature extraction module which is configured to: if the image category of the image to be processed is not one of the multiple reference image categories, Perform class center extraction on the first feature of the image to be processed to obtain the class center feature of the second category of the image to be processed
- the second update module is used to combine the first feature and the class center feature of the second category Adding to the feature library, and adding the second category to the multiple reference image categories.
- the device further includes: a first clustering module, configured to perform a comparison of the image category of the image to be processed if the category of the multiple reference image categories Clustering the first feature of the image to be processed to obtain one or more third categories; the third feature extraction module is configured to perform class center extraction on each third category to obtain the class center feature of the third category; The third update module is configured to add the class center features of the first feature and the third category to the feature library, and add the third category to the multiple reference image categories.
- a first clustering module configured to perform a comparison of the image category of the image to be processed if the category of the multiple reference image categories Clustering the first feature of the image to be processed to obtain one or more third categories
- the third feature extraction module is configured to perform class center extraction on each third category to obtain the class center feature of the third category
- the third update module is configured to add the class center features of the first feature and the third category to the feature library, and add the third category to the multiple reference image categories.
- the category determining module includes: a distance determining submodule, configured to obtain multiple first distances between the first feature and multiple category center features; The module is configured to determine the image category of the image to be processed as the first category corresponding to the second distance when the second distance with the smallest distance value among the plurality of first distances is less than or equal to the distance threshold.
- the category determining module includes: a second category determining submodule, configured to determine the image category of the image to be processed when the second distance is greater than the distance threshold It is not a category among the multiple reference image categories.
- the class center features include N class center features, where N is a positive integer
- the distance determination submodule is used to: quantize the N class center features to obtain N feature vectors; respectively obtain N third distances between the first feature and the N feature vectors; determine K class centers corresponding to the smallest K approximate distances among the N third distances Features; determine the K first distances between the first feature and the K class center features, K is a positive integer and K ⁇ N.
- the device further includes: a fourth feature extraction module, configured to perform class center extraction on the feature information of each reference image category in the feature library to obtain the class center features of each image category.
- the first feature extraction module includes: a feature extraction sub-module, configured to perform feature extraction on the image to be processed to obtain the second feature of the image to be processed; and a normalization sub-module, It is used to normalize the second feature to obtain the first feature of the image to be processed.
- the device further includes: a second clustering module, which is configured to: when the multiple fourth categories in the feature library correspond to the same object, compare the multiple fourth categories Re-clustering the feature information of the fifth category to obtain the fifth category; the fifth feature extraction module is used to extract the category center of the fifth category to obtain the category center feature of the fifth category; the fourth update module is used to The class center feature of the fifth category is added to the feature library, and the fifth category is added to multiple reference image categories.
- a second clustering module which is configured to: when the multiple fourth categories in the feature library correspond to the same object, compare the multiple fourth categories Re-clustering the feature information of the fifth category to obtain the fifth category
- the fifth feature extraction module is used to extract the category center of the fifth category to obtain the category center feature of the fifth category
- the fourth update module is used to The class center feature of the fifth category is added to the feature library, and the fifth category is added to multiple reference image categories.
- the device further includes: a deletion module, configured to delete the plurality of class center features of the fourth category from the feature library, and delete from the plurality of reference image categories The plurality of fourth categories.
- a deletion module configured to delete the plurality of class center features of the fourth category from the feature library, and delete from the plurality of reference image categories The plurality of fourth categories.
- an electronic device including: a processor; a memory for storing executable instructions of the processor; wherein the processor is configured to execute the above-mentioned image processing method.
- a computer-readable storage medium having computer program instructions stored thereon, and the computer program instructions implement the above-mentioned image processing method when executed by a processor.
- a computer program including computer readable code, and when the computer readable code runs in an electronic device, a processor in the electronic device performs the above-mentioned image processing method.
- the image category of the image to be processed can be determined based on the first feature of the image to be processed and the class center feature of the reference image category; when the image to be processed is an existing first category, Features and multiple feature information of the first category, update the class center feature of the first category, realize the clustering of the image to be processed, so that the image retrieval can be clustered and then retrieved, which improves the accuracy and recall rate of image retrieval. And through the comparison of the image to be processed and the center feature of the class, the number of image comparisons during retrieval is reduced, and the speed of image retrieval is increased. At the same time, the features of the image to be processed can be added to the feature library to increase the number of images and features corresponding to each category. Improve retrieval accuracy.
- Fig. 1 shows a flowchart of an image processing method according to an embodiment of the present disclosure.
- Fig. 2 shows a schematic diagram of an application scenario of an image processing method according to an embodiment of the present disclosure.
- Fig. 3 shows a block diagram of an image processing apparatus according to an embodiment of the present disclosure.
- Fig. 4 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
- Fig. 5 shows a block diagram of an electronic device according to an embodiment of the present disclosure.
- Fig. 1 shows a flowchart of an image processing method according to an embodiment of the present disclosure. As shown in Fig. 1, the image processing method includes:
- Step S11 performing feature extraction on the image to be processed to obtain the first feature of the image to be processed
- Step S12 Determine the image category of the image to be processed according to the first feature and the class center features of multiple reference image categories in the feature library;
- Step S13 in the case that the image category of the image to be processed is the first category of the multiple reference image categories, according to the first feature and the multiple of the first category in the feature library
- the feature information updates the class center feature of the first category.
- the image category of the image to be processed can be determined according to the first feature of the image to be processed and the class center feature of the reference image category; when the image to be processed is an existing first category, according to the first feature Features and multiple feature information of the first category, update the class center feature of the first category, realize the clustering of the image to be processed, so that the image retrieval can be clustered and then retrieved, which improves the accuracy and recall rate of image retrieval. And through the comparison of the image to be processed and the center feature of the class, the number of image comparisons during retrieval is reduced, and the speed of image retrieval is increased. At the same time, the features of the image to be processed can be added to the feature library to increase the number of images and features corresponding to each category. Improve retrieval accuracy.
- the image processing method can be executed by electronic equipment such as a terminal device or a server.
- the terminal device can be a user equipment (UE), a mobile device, a user terminal, a terminal, a cellular phone, or a cordless
- UE user equipment
- PDAs personal digital assistants
- the method can be implemented by a processor calling computer-readable instructions stored in a memory.
- the method can be executed by a server.
- the image to be processed may include one or more pictures or video frames, where the pictures or video frames include a human face.
- the pictures or video frames include a human face.
- they can be classified into one or more clusters according to factors such as face, time, and location.
- a cluster is a preliminary classification of the image to be processed, and a cluster can contain one or more to be processed image.
- multiple images of the same person are collected at different times. According to different collection times, the pictures of the same person can be divided into multiple clusters.
- the image processing method can be used to process the image to be processed in real time or periodically. For example, you can perform image processing once a day or a week, or start image processing after a certain number of images to be processed are collected, or perform image processing before image retrieval.
- the present disclosure does not limit the start timing of image processing.
- step S11 feature extraction of the image to be processed may be performed to obtain the first feature of the image to be processed.
- the first feature may include one or more feature information of the image to be processed, for example, the first feature includes multiple feature information of a human face.
- the first feature may include multiple feature information of the multiple pictures.
- the extracted feature information can be used as its first feature.
- the present disclosure does not limit the method of feature extraction.
- step S11 may include: performing feature extraction on the image to be processed to obtain the second feature of the image to be processed; performing normalization processing on the second feature to obtain the image to be processed The first feature.
- the normalization process can generalize and unify the feature information, and unify the feature value within a certain range.
- the normalization processing may include, for example, regularization processing, and the present disclosure does not limit the specific manner of the normalization processing.
- the to-be-processed image may be determined according to the first feature and the class center features of multiple reference image categories in the feature library.
- the image category of the processed image may be determined according to the first feature and the class center features of multiple reference image categories in the feature library.
- the reference image category can be an image category classified in the feature library, and an image category can be a collection of images of a certain type, for example, a collection of images of the same person.
- the center feature of the category can be determined.
- a variety of clustering algorithms (such as k-average algorithm, mean shift algorithm, hierarchical clustering algorithm, etc.) can be used to determine the class center feature of the reference image category.
- the clustering algorithm is different, the calculation method of the corresponding cluster center feature is also different.
- Those skilled in the art can determine the method for determining the quasi-center feature according to the actual situation, which is not limited in this disclosure.
- the image category of the image to be processed can be determined according to the first feature of the image to be processed and the category center features of the multiple reference image categories in the feature library. That is, the first feature of the image to be processed can be compared with the category center features of multiple reference image categories to determine the image category of the image to be processed. For example, when the k-average algorithm is used, the distance between the first feature of the image to be processed and the class center features of multiple reference image categories can be calculated respectively, and the image category of the image to be processed can be determined according to the distance.
- the method may further include: extracting the class centers of the feature information of each reference image category in the feature library to obtain the class center features of each image category. That is to say, for each reference image category in the feature library, the feature information can be extracted by class center, and the extracted feature information can be used as the class center feature of each reference image category.
- the clustering algorithm can be used to determine the extraction method of the cluster center. For example, when using the k-average algorithm to extract the cluster center, you can first determine the feature information of each image in the reference image category, and calculate the distance between each feature information (such as European Distance), and then determine the average value of each distance, and determine the feature information corresponding to the average value as the class center feature of the reference image category.
- the present disclosure does not limit the method of class center extraction.
- the class center features of each reference image category are obtained, which can improve the accuracy of the class center features.
- step S12 may include: acquiring a plurality of first distances between the first feature and a plurality of class center features; among the plurality of first distances, the second distance with the smallest distance value is less than If it is equal to the distance threshold, the image category of the image to be processed is determined as the first category corresponding to the second distance.
- the distance threshold can be set in advance, and the present disclosure does not limit the value of the distance threshold.
- the distances between the first feature of the image to be processed and multiple center-like features can be calculated separately to obtain multiple first distances.
- the first distance with the smallest distance value is taken as the second distance.
- the relationship between the second distance and the preset distance threshold is determined. If the second distance is less than or equal to the distance threshold, the image category of the image to be processed may be determined as the first category corresponding to the second distance.
- the image category of the image to be processed is determined by the relationship between the second distance and the distance threshold, and when the second distance is less than or equal to the distance threshold, it is determined that the image category of the image to be processed is relative to the object at the second distance.
- the first category simple and fast, can improve the efficiency and accuracy of image classification.
- the class center feature includes N class center features, where N is a positive integer
- acquiring multiple first distances between the first feature and the multiple class center features includes : Perform quantization processing on the N class center features to obtain N feature vectors; obtain N third distances between the first feature and the N feature vectors; determine the N third distances K class center features corresponding to the smallest K third distances; determine the K first distances between the first feature and the K class center features, K is a positive integer and K ⁇ N.
- the N class central features can be quantized separately to obtain N feature vectors.
- the IVFADC algorithm in Faiss can be used to quantify N class central features, where the IVFADC algorithm includes a coarse quantizer (such as k-average). Algorithm) and product quantizer.
- a rough quantizer such as k-average algorithm
- N third distances between the first feature and the N feature vectors can be obtained respectively.
- the asymmetric distance may be used to calculate N third distances between the first feature and the N feature vectors, where the third distance is an approximate distance (for example, approximate Euclidean distance).
- x represents the first feature
- y represents the class center feature
- q represents the quantization process
- q 1 represents the coarse quantizer
- q 1 (y) represents the quantization result (quantization center) of the coarse quantizer
- q 2 Represents the product quantizer
- q 2 (yq 1 (y)) represents the result of product quantization
- its input yq 1 (y) represents the residual of y and the quantization center.
- the smallest K third distances can be selected from the N third distances, and K class center features corresponding to the K third distances can be determined.
- K class center features the precise distances between the first feature and each class center feature (such as inner product distance) can be calculated separately, and the calculation result is taken as the K first distances between the first feature and the K class center features .
- step S12 may include: in a case where the second distance is greater than the distance threshold, determining that the image category of the image to be processed is not a category among the multiple reference image categories. In other words, when the second distance is greater than the distance threshold, it can be considered that the image to be processed does not belong to any one of the multiple reference image categories in the feature library, and a new image category needs to be determined for the image to be processed, which can improve The accuracy of image classification.
- step S13 in the case that the image category of the image to be processed is the first category of the multiple reference image categories, according to the first feature and the The multiple feature information of the first category in the feature database is updated to update the category center feature of the first category.
- the first feature can be added to the first category as new feature information, and the distance between each feature information in the first category is calculated separately, and each distance is determined Use the feature information corresponding to the average value to update the class center feature of the first category.
- Other clustering algorithms can also be used to update the class center features of the first category, which is not limited in the present disclosure. In this way, the class center feature of the reference image category in the feature library can be updated when a new image is added.
- the method may further include: in a case where the image category of the image to be processed is not the category of the multiple reference image categories, performing the first feature of the image to be processed Class center extraction, to obtain the class center feature of the second category of the image to be processed; add the first feature and the class center feature of the second category to the feature library, and add the second category Added to the multiple reference image categories.
- the uncategorized images to be processed can be aggregated
- the category is the new category, the second category.
- the first feature of the image to be processed can be extracted by class center to obtain the class center feature of the second category.
- the extraction method of the class center is similar to the above, so I won't repeat it here.
- the first feature and the class center features of the second category can be added to the feature library, and the second category can be added to multiple reference image categories , So that the new feature information and image categories can be updated to the feature library in time.
- the feature information and image category in the feature library can be updated as new images increase, thereby The feature information and image categories in the feature library can be continuously enriched, and the accuracy of image retrieval can be improved.
- the method may further include: in the case that the image category of the image to be processed is not a category of the multiple reference image categories, comparing the first feature of the image to be processed Perform clustering to obtain one or more third categories; perform class center extraction on each third category to obtain the class center features of the third category; combine the first features and the class centers of the third category
- the feature is added to the feature library, and the third category is added to the multiple reference image categories.
- the first feature of the image to be processed can be clustered to obtain one Or multiple third categories.
- a third category may be obtained; when the image to be processed is multiple pictures of multiple objects, after clustering, multiple third categories may be obtained. category.
- search in other images to be processed for example, search through Faiss
- K similarity results you can draw an affinity map Search for link fluxes to determine clusters, or DFS (Deep First Search) recursive staining to determine clusters, where staining can be determined according to the similarity threshold. For example, if the similarity threshold is 0.7, then the pair is greater than The similarity results of the similarity threshold are colored, and the similarity results less than the similarity threshold are skipped.
- DFS Deep First Search
- each third category can be extracted separately to obtain the characteristics of the third category;
- the first feature of the image and the class center feature of the third category are added to the feature library, and the third category is added to multiple reference image categories, so that the newly added feature information and image categories can be updated to the feature library in time.
- the extraction method of the class center is similar to the above, and will not be repeated here.
- the image to be processed when the image to be processed is a plurality of pictures, the image to be processed can be clustered when the image category matching fails to obtain one or more new image categories, and the new image category and class center feature Adding to the feature library allows the feature information and image categories in the feature library to be updated as new images increase, so that the feature information and image categories in the feature library can be continuously enriched, and the accuracy of image retrieval can be improved.
- the method further includes: re-clustering the feature information of the multiple fourth categories in the case that the multiple fourth categories in the feature library correspond to the same object, Obtain the fifth category; perform class center extraction on the fifth category to obtain the class center features of the fifth category; add the class center features of the fifth category to the feature library, and add the first Five categories are added to multiple reference image categories.
- multiple fourth categories corresponding to the same object means that the same object (for example, the same face) has multiple image categories in the feature database, that is, the fourth category.
- the feature information of multiple fourth categories can be re-clustered to obtain the fifth category, that is, multiple image categories of the same object are merged into one image category through re-clustering.
- the class center extraction of the fifth category can be performed to obtain the class center feature of the fifth category, and the class center feature of the fifth category can be added to the feature library, and the fifth category can be added to multiple reference images in.
- multiple image categories of the same object in the feature library can be merged by re-clustering, so as to improve the accuracy of image classification, and thus the accuracy of image retrieval.
- the method further includes: deleting the multiple fourth category center features from the feature library, and deleting the multiple first category features from the multiple reference image categories. Four categories.
- the class center features of multiple fourth categories that have been re-clustered into the fifth category can be deleted from the feature library, and multiple fourth categories can be deleted from multiple reference image categories.
- the class center features that do not exist in the feature library and the image categories that do not exist in multiple reference image categories can be deleted in time, thereby improving the efficiency of image retrieval.
- Fig. 2 shows a schematic diagram of an application scenario of an image processing method according to an embodiment of the present disclosure.
- feature extraction can be performed first to obtain its first feature 22; then, based on the first feature 21 and the class center features 23 of multiple reference image categories in the feature library, the similar
- the algorithm in the sexual search library 29 determines the image category 24 of the image to be processed.
- the image category 24 of the image 21 to be processed is the first category 26 among the multiple reference image categories
- the first category 26 can be updated according to the first feature 21 and multiple feature information of the first category 26 in the feature library. Class center feature.
- the processed image 21 can be clustered 27 by the algorithm in the similarity search library 29 (ie Faiss), and a new image category 28 can be determined according to the clustering result. , And add the first feature and the class center feature of the new image category 28 to the feature library, and add the new image category 28 to multiple reference image categories.
- the similarity search library 29 ie Faiss
- the image processing method of the embodiment of the present disclosure can determine the image category of the image to be processed according to the feature information of the image to be processed and the class center feature in the feature library, and update the feature library and image category to achieve clustering of the image to be processed , Not only can improve the retrieval speed and recall rate of image retrieval, especially face retrieval; it can also automatically build personnel files to improve image utilization.
- clustering the image to be processed can improve the retrieval speed and recall rate of image retrieval, especially face retrieval.
- face retrieval is an important scene for solving cases in the public security industry, and a series of information such as the identity of the suspect needs to be retrieved in a massive portrait database based on the picture of an unidentified suspect.
- the image processing method of the embodiment of the present disclosure can realize the clustering of the image to be processed, and can automatically construct the personnel file, thereby improving the utilization rate of the image.
- the captured pictures can be grouped into one category in a human dimension to achieve the integration of massive and scattered pictures, which can be seen in the system All the captured pictures related to the same person form a personal track, realize big data analysis, and assist case study and judgment.
- the image processing method of the embodiment of the present disclosure can realize the automatic iteration of the image system through clustering.
- the image category and its center can be updated, so that the system can continuously obtain Incremental training forms a positive feedback loop to improve system capabilities.
- the present disclosure also provides image processing devices, electronic equipment, computer-readable storage media, and programs, all of which can be used to implement any image processing method provided in the present disclosure.
- image processing devices electronic equipment, computer-readable storage media, and programs, all of which can be used to implement any image processing method provided in the present disclosure.
- the writing order of the steps does not mean a strict execution order but constitutes any limitation on the implementation process.
- the specific execution order of each step should be based on its function and possibility.
- the inner logic is determined.
- Fig. 3 shows a block diagram of an image processing device according to an embodiment of the present disclosure. As shown in Fig. 3, the image processing device includes:
- the first feature extraction module 31 is configured to perform feature extraction on the image to be processed to obtain the first feature of the image to be processed;
- the category determining module 32 is configured to determine the image category of the image to be processed according to the first feature and the category center features of multiple reference image categories in the feature library;
- the first update module 33 is configured to, when the image category of the image to be processed is the first category among the multiple reference image categories, according to the first feature and the first category in the feature
- the multiple feature information in the library updates the class center feature of the first category.
- the device further includes: a second feature extraction module, which is configured to: if the image category of the image to be processed is not one of the multiple reference image categories, Perform class center extraction on the first feature of the image to be processed to obtain the class center feature of the second category of the image to be processed; the second update module is used to combine the first feature and the class center feature of the second category Adding to the feature library, and adding the second category to the multiple reference image categories.
- a second feature extraction module which is configured to: if the image category of the image to be processed is not one of the multiple reference image categories, Perform class center extraction on the first feature of the image to be processed to obtain the class center feature of the second category of the image to be processed
- the second update module is used to combine the first feature and the class center feature of the second category Adding to the feature library, and adding the second category to the multiple reference image categories.
- the device further includes: a first clustering module, configured to perform a comparison of the image category of the image to be processed if the category of the multiple reference image categories Clustering the first feature of the image to be processed to obtain one or more third categories; the third feature extraction module is configured to perform class center extraction on each third category to obtain the class center feature of the third category; The third update module is configured to add the class center features of the first feature and the third category to the feature library, and add the third category to the multiple reference image categories.
- a first clustering module configured to perform a comparison of the image category of the image to be processed if the category of the multiple reference image categories Clustering the first feature of the image to be processed to obtain one or more third categories
- the third feature extraction module is configured to perform class center extraction on each third category to obtain the class center feature of the third category
- the third update module is configured to add the class center features of the first feature and the third category to the feature library, and add the third category to the multiple reference image categories.
- the category determining module 32 includes: a distance determining sub-module for obtaining multiple first distances between the first feature and multiple category center features; the first category determining A sub-module, configured to determine the image category of the image to be processed as the first category corresponding to the second distance when the second distance with the smallest distance value among the multiple first distances is less than or equal to the distance threshold .
- the category determining module 32 includes: a second category determining submodule, configured to determine the image of the image to be processed when the second distance is greater than the distance threshold The category is not among the multiple reference image categories.
- the class center features include N class center features, where N is a positive integer
- the distance determination submodule is used to: quantize the N class center features to obtain N feature vectors; respectively obtain N third distances between the first feature and the N feature vectors; determine K class centers corresponding to the smallest K approximate distances among the N third distances Features; determine the K first distances between the first feature and the K class center features, K is a positive integer and K ⁇ N.
- the device further includes: a fourth feature extraction module, configured to perform class center extraction on the feature information of each reference image category in the feature library to obtain the class center features of each image category.
- the first feature extraction module 31 includes: a feature extraction sub-module, configured to perform feature extraction on the image to be processed to obtain the second feature of the image to be processed; a normalization sub-module , Used to normalize the second feature to obtain the first feature of the image to be processed.
- the device further includes: a second clustering module, which is configured to: when the multiple fourth categories in the feature library correspond to the same object, compare the multiple fourth categories Re-clustering the feature information of the fifth category to obtain the fifth category; the fifth feature extraction module is used to extract the category center of the fifth category to obtain the category center feature of the fifth category; the fourth update module is used to The class center feature of the fifth category is added to the feature library, and the fifth category is added to multiple reference image categories.
- a second clustering module which is configured to: when the multiple fourth categories in the feature library correspond to the same object, compare the multiple fourth categories Re-clustering the feature information of the fifth category to obtain the fifth category
- the fifth feature extraction module is used to extract the category center of the fifth category to obtain the category center feature of the fifth category
- the fourth update module is used to The class center feature of the fifth category is added to the feature library, and the fifth category is added to multiple reference image categories.
- the device further includes: a deletion module, configured to delete the plurality of class center features of the fourth category from the feature library, and delete from the plurality of reference image categories The plurality of fourth categories.
- a deletion module configured to delete the plurality of class center features of the fourth category from the feature library, and delete from the plurality of reference image categories The plurality of fourth categories.
- the functions or modules contained in the device provided in the embodiments of the present disclosure can be used to execute the methods described in the above method embodiments.
- the functions or modules contained in the device provided in the embodiments of the present disclosure can be used to execute the methods described in the above method embodiments.
- the embodiment of the present disclosure also proposes a computer-readable storage medium on which computer program instructions are stored, and the computer program instructions implement the above method when executed by a processor.
- the computer-readable storage medium may be a non-volatile computer-readable storage medium or a volatile computer-readable storage medium.
- An embodiment of the present disclosure also provides an electronic device, including: a processor; a memory for storing executable instructions of the processor; wherein the processor is configured as the above method.
- the embodiments of the present disclosure also provide a computer program product, the computer program includes computer readable code, and when the computer readable code runs in an electronic device, the processor in the electronic device executes to realize the above An image processing method provided by any embodiment.
- the electronic device can be provided as a terminal, server or other form of device.
- FIG. 4 shows a block diagram of an electronic device 800 according to an embodiment of the present disclosure.
- the electronic device 800 may be a mobile phone, a computer, a digital broadcasting terminal, a messaging device, a game console, a tablet device, a medical device, a fitness device, a personal digital assistant, and other terminals.
- the electronic device 800 may include one or more of the following components: a processing component 802, a memory 804, a power supply component 806, a multimedia component 808, an audio component 810, an input/output (I/O) interface 812, and a sensor component 814 , And communication component 816.
- the processing component 802 generally controls the overall operations of the electronic device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
- the processing component 802 may include one or more processors 820 to execute instructions to complete all or part of the steps of the foregoing method.
- the processing component 802 may include one or more modules to facilitate the interaction between the processing component 802 and other components.
- the processing component 802 may include a multimedia module to facilitate the interaction between the multimedia component 808 and the processing component 802.
- the memory 804 is configured to store various types of data to support operations in the electronic device 800. Examples of these data include instructions for any application or method operating on the electronic device 800, contact data, phone book data, messages, pictures, videos, etc.
- the memory 804 can be implemented by any type of volatile or nonvolatile storage device or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Magnetic Disk or Optical Disk.
- SRAM static random access memory
- EEPROM electrically erasable programmable read-only memory
- EPROM erasable Programmable Read Only Memory
- PROM Programmable Read Only Memory
- ROM Read Only Memory
- Magnetic Memory Flash Memory
- Magnetic Disk Magnetic Disk or Optical Disk.
- the power supply component 806 provides power for various components of the electronic device 800.
- the power supply component 806 may include a power management system, one or more power supplies, and other components associated with the generation, management, and distribution of power for the electronic device 800.
- the multimedia component 808 includes a screen that provides an output interface between the electronic device 800 and the user.
- the screen may include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive input signals from the user.
- the touch panel includes one or more touch sensors to sense touch, sliding, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure related to the touch or slide operation.
- the multimedia component 808 includes a front camera and/or a rear camera. When the electronic device 800 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front camera and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
- the audio component 810 is configured to output and/or input audio signals.
- the audio component 810 includes a microphone (MIC).
- the microphone is configured to receive external audio signals.
- the received audio signal may be further stored in the memory 804 or transmitted via the communication component 816.
- the audio component 810 further includes a speaker for outputting audio signals.
- the I/O interface 812 provides an interface between the processing component 802 and a peripheral interface module.
- the peripheral interface module may be a keyboard, a click wheel, a button, and the like. These buttons may include but are not limited to: home button, volume button, start button, and lock button.
- the sensor component 814 includes one or more sensors for providing the electronic device 800 with various aspects of state evaluation.
- the sensor component 814 can detect the on/off status of the electronic device 800 and the relative positioning of the components.
- the component is the display and the keypad of the electronic device 800.
- the sensor component 814 can also detect the electronic device 800 or the electronic device 800.
- the position of the component changes, the presence or absence of contact between the user and the electronic device 800, the orientation or acceleration/deceleration of the electronic device 800, and the temperature change of the electronic device 800.
- the sensor component 814 may include a proximity sensor configured to detect the presence of nearby objects when there is no physical contact.
- the sensor component 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
- the sensor component 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor or a temperature sensor.
- the communication component 816 is configured to facilitate wired or wireless communication between the electronic device 800 and other devices.
- the electronic device 800 can access a wireless network based on a communication standard, such as WiFi, 2G, or 3G, or a combination thereof.
- the communication component 816 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel.
- the communication component 816 further includes a near field communication (NFC) module to facilitate short-range communication.
- the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology and other technologies.
- RFID radio frequency identification
- IrDA infrared data association
- UWB ultra-wideband
- Bluetooth Bluetooth
- the electronic device 800 can be implemented by one or more application specific integrated circuits (ASIC), digital signal processors (DSP), digital signal processing devices (DSPD), programmable logic devices (PLD), field A programmable gate array (FPGA), controller, microcontroller, microprocessor, or other electronic components are implemented to implement the above methods.
- ASIC application specific integrated circuits
- DSP digital signal processors
- DSPD digital signal processing devices
- PLD programmable logic devices
- FPGA field A programmable gate array
- controller microcontroller, microprocessor, or other electronic components are implemented to implement the above methods.
- a non-volatile computer-readable storage medium such as a memory 804 including computer program instructions, which can be executed by the processor 820 of the electronic device 800 to complete the foregoing method.
- FIG. 5 shows a block diagram of an electronic device 1900 according to an embodiment of the present disclosure.
- the electronic device 1900 may be provided as a server. 5
- the electronic device 1900 includes a processing component 1922, which further includes one or more processors, and a memory resource represented by a memory 1932 for storing instructions executable by the processing component 1922, such as application programs.
- the application program stored in the memory 1932 may include one or more modules each corresponding to a set of instructions.
- the processing component 1922 is configured to execute instructions to perform the above-described methods.
- the electronic device 1900 may also include a power supply component 1926 configured to perform power management of the electronic device 1900, a wired or wireless network interface 1950 configured to connect the electronic device 1900 to a network, and an input output (I/O) interface 1958 .
- the electronic device 1900 can operate based on an operating system stored in the memory 1932, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
- a non-volatile computer-readable storage medium such as the memory 1932 including computer program instructions, which can be executed by the processing component 1922 of the electronic device 1900 to complete the foregoing method.
- the present disclosure may be a system, method, and/or computer program product.
- the computer program product may include a computer-readable storage medium loaded with computer-readable program instructions for enabling a processor to implement various aspects of the present disclosure.
- the computer-readable storage medium may be a tangible device that can hold and store instructions used by the instruction execution device.
- the computer-readable storage medium may be, for example, but not limited to, an electrical storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing.
- Computer-readable storage media include: portable computer disks, hard disks, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM) Or flash memory), static random access memory (SRAM), portable compact disk read-only memory (CD-ROM), digital versatile disk (DVD), memory stick, floppy disk, mechanical encoding device, such as a printer with instructions stored thereon
- RAM random access memory
- ROM read-only memory
- EPROM erasable programmable read-only memory
- flash memory flash memory
- SRAM static random access memory
- CD-ROM compact disk read-only memory
- DVD digital versatile disk
- memory stick floppy disk
- mechanical encoding device such as a printer with instructions stored thereon
- the computer-readable storage medium used here is not interpreted as a transient signal itself, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through waveguides or other transmission media (for example, light pulses through fiber optic cables), or through wires Transmission of electrical signals.
- the computer-readable program instructions described herein can be downloaded from a computer-readable storage medium to various computing/processing devices, or downloaded to an external computer or external storage device via a network, such as the Internet, a local area network, a wide area network, and/or a wireless network.
- the network may include copper transmission cables, optical fiber transmission, wireless transmission, routers, firewalls, switches, gateway computers, and/or edge servers.
- the network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network, and forwards the computer-readable program instructions for storage in the computer-readable storage medium in each computing/processing device .
- the computer program instructions used to perform the operations of the present disclosure may be assembly instructions, instruction set architecture (ISA) instructions, machine instructions, machine-related instructions, microcode, firmware instructions, status setting data, or in one or more programming languages.
- Source code or object code written in any combination, the programming language includes object-oriented programming languages such as Smalltalk, C++, etc., and conventional procedural programming languages such as "C" language or similar programming languages.
- Computer-readable program instructions can be executed entirely on the user's computer, partly on the user's computer, executed as a stand-alone software package, partly on the user's computer and partly executed on a remote computer, or entirely on the remote computer or server carried out.
- the remote computer can be connected to the user's computer through any kind of network, including a local area network (LAN) or a wide area network (WAN), or it can be connected to an external computer (for example, using an Internet service provider to access the Internet connection).
- LAN local area network
- WAN wide area network
- an electronic circuit such as a programmable logic circuit, a field programmable gate array (FPGA), or a programmable logic array (PLA), can be customized by using the status information of the computer-readable program instructions.
- the computer-readable program instructions are executed to realize various aspects of the present disclosure.
- These computer-readable program instructions can be provided to the processor of a general-purpose computer, a special-purpose computer, or other programmable data processing device, thereby producing a machine such that when these instructions are executed by the processor of the computer or other programmable data processing device , A device that implements the functions/actions specified in one or more blocks in the flowchart and/or block diagram is produced. It is also possible to store these computer-readable program instructions in a computer-readable storage medium. These instructions make computers, programmable data processing apparatuses, and/or other devices work in a specific manner, so that the computer-readable medium storing instructions includes An article of manufacture, which includes instructions for implementing various aspects of the functions/actions specified in one or more blocks in the flowchart and/or block diagram.
- each block in the flowchart or block diagram may represent a module, program segment, or part of an instruction, and the module, program segment, or part of an instruction contains one or more functions for implementing the specified logical function.
- Executable instructions may also occur in a different order from the order marked in the drawings. For example, two consecutive blocks can actually be executed in parallel, or they can sometimes be executed in the reverse order, depending on the functions involved.
- each block in the block diagram and/or flowchart, and the combination of the blocks in the block diagram and/or flowchart can be implemented by a dedicated hardware-based system that performs the specified functions or actions Or it can be realized by a combination of dedicated hardware and computer instructions.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Data Mining & Analysis (AREA)
- General Physics & Mathematics (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Health & Medical Sciences (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Human Computer Interaction (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Life Sciences & Earth Sciences (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Software Systems (AREA)
- Databases & Information Systems (AREA)
- Library & Information Science (AREA)
- Computing Systems (AREA)
- Probability & Statistics with Applications (AREA)
- Medical Informatics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Image Analysis (AREA)
Abstract
Description
Claims (23)
- 一种图像处理方法,其特征在于,包括:对待处理图像进行特征提取,得到所述待处理图像的第一特征;根据所述第一特征以及特征库中多个参考图像类别的类中心特征,确定所述待处理图像的图像类别;在所述待处理图像的图像类别为所述多个参考图像类别中的第一类别的情况下,根据所述第一特征以及所述第一类别在所述特征库中的多个特征信息,更新所述第一类别的类中心特征。
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:在所述待处理图像的图像类别不是所述多个参考图像类别中的类别的情况下,对所述待处理图像的第一特征进行类中心提取,得到所述待处理图像的第二类别的类中心特征;将所述第一特征及所述第二类别的类中心特征添加到所述特征库中,并将所述第二类别添加到所述多个参考图像类别中。
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:在所述待处理图像的图像类别不是所述多个参考图像类别中的类别的情况下,对所述待处理图像的第一特征进行聚类,得到一个或多个第三类别;分别对各个第三类别进行类中心提取,得到所述第三类别的类中心特征;将所述第一特征及所述第三类别的类中心特征添加到所述特征库中,并将所述第三类别添加到所述多个参考图像类别中。
- 根据权利要求1所述的方法,其特征在于,根据所述第一特征以及特征库中多个参考图像类别的类中心特征,确定所述待处理图像的图像类别,包括:获取所述第一特征与多个类中心特征之间的多个第一距离;在多个第一距离中距离值最小的第二距离小于或等于距离阈值的情况下,将所述待处理图像的图像类别确定为与所述第二距离对应的第一类别。
- 根据权利要求4所述的方法,其特征在于,根据所述第一特征以及特征库中多个参考图像类别的类中心特征,确定所述待处理图像的图像类别,包括:在所述第二距离大于所述距离阈值的情况下,确定所述待处理图像的图像类别不是所述多个参考图像类别中的类别。
- 根据权利要求4或5所述的方法,其特征在于,所述类中心特征包括N个类中心特征,N为正整数,其中,获取所述第一特征与多个类中心特征之间的多个第一距离,包括:对N个类中心特征分别进行量化处理,得到N个特征向量;分别获取所述第一特征与所述N个特征向量之间的N个第三距离;确定与所述N个第三距离中最小的K个近似距离对应的K个类中心特征;确定所述第一特征与所述K个类中心特征之间的K个第一距离,K为正整数且K<N。
- 根据权利要求1-6中任意一项所述的方法,其特征在于,所述方法还包括:对特征库中各个参考图像类别的特征信息分别进行类中心提取,得到各个图像类别的类中心特征。
- 根据权利要求1-7中任意一项所述的方法,其特征在于,对待处理图像进行特征提取,得到所述待处理图像的第一特征,包括:对待处理图像进行特征提取,得到所述待处理图像的第二特征;对所述第二特征进行归一化处理,得到所述待处理图像的第一特征。
- 根据权利要求1-8中任意一项所述的方法,其特征在于,所述方法还包括:在所述特征库中的多个第四类别对应同一对象的情况下,对所述多个第四类别的特征信息进行重新聚类,得到第五类别;对所述第五类别进行类中心提取,得到所述第五类别的类中心特征;将所述第五类别的类中心特征添加到所述特征库中,并将所述第五类别添加到多个参考图像类别中。
- 根据权利要求9所述的方法,其特征在于,所述方法还包括:从所述特征库中删除所述多个第四类别的类中心特征,并从所述多个参考图像类别中删除所述多个第四类别。
- 一种图像处理装置,其特征在于,包括:第一特征提取模块,用于对待处理图像进行特征提取,得到所述待处理图像的第一特征;类别确定模块,用于根据所述第一特征以及特征库中多个参考图像类别的类中心特征,确定所述待处理图像的图像类别;第一更新模块,用于在所述待处理图像的图像类别为所述多个参考图像类别中的第一类别的情况下,根据所述第一特征以及所述第一类别在所述特征库中的多个特征信息,更新所述第一类别的类中心特征。
- 根据权利要求11所述的装置,其特征在于,所述装置还包括:第二特征提取模块,用于在所述待处理图像的图像类别不是所述多个参考图像类别中的类别的情况下,对所述待处理图像的第一特征进行类中心提取,得到所述待处理图 像的第二类别的类中心特征;第二更新模块,用于将所述第一特征及所述第二类别的类中心特征添加到所述特征库中,并将所述第二类别添加到所述多个参考图像类别中。
- 根据权利要求11所述的装置,其特征在于,所述装置还包括:第一聚类模块,用于在所述待处理图像的图像类别不是所述多个参考图像类别中的类别的情况下,对所述待处理图像的第一特征进行聚类,得到一个或多个第三类别;第三特征提取模块,用于分别对各个第三类别进行类中心提取,得到所述第三类别的类中心特征;第三更新模块,用于将所述第一特征及所述第三类别的类中心特征添加到所述特征库中,并将所述第三类别添加到所述多个参考图像类别中。
- 根据权利要求11所述的装置,其特征在于,所述类别确定模块,包括:距离确定子模块,用于获取所述第一特征与多个类中心特征之间的多个第一距离;第一类别确定子模块,用于在多个第一距离中距离值最小的第二距离小于或等于距离阈值的情况下,将所述待处理图像的图像类别确定为与所述第二距离对应的第一类别。
- 根据权利要求14所述的装置,其特征在于,所述类别确定模块,包括:第二类别确定子模块,用于在所述第二距离大于所述距离阈值的情况下,确定所述待处理图像的图像类别不是所述多个参考图像类别中的类别。
- 根据权利要求14或15所述的装置,其特征在于,所述类中心特征包括N个类中心特征,N为正整数,其中,所述距离确定子模块,用于:对N个类中心特征分别进行量化处理,得到N个特征向量;分别获取所述第一特征与所述N个特征向量之间的N个第三距离;确定与所述N个第三距离中最小的K个近似距离对应的K个类中心特征;确定所述第一特征与所述K个类中心特征之间的K个第一距离,K为正整数且K<N。
- 根据权利要求11-16中任意一项所述的装置,其特征在于,所述装置还包括:第四特征提取模块,用于对特征库中各个参考图像类别的特征信息分别进行类中心提取,得到各个图像类别的类中心特征。
- 根据权利要求11-17中任意一项所述的装置,其特征在于,所述第一特征提取模块,包括:特征提取子模块,用于对待处理图像进行特征提取,得到所述待处理图像的第二特征;归一化子模块,用于对所述第二特征进行归一化处理,得到所述待处理图像的第一特征。
- 根据权利要求11-18中任意一项所述的装置,其特征在于,所述装置还包括:第二聚类模块,用于在所述特征库中的多个第四类别对应同一对象的情况下,对所述多个第四类别的特征信息进行重新聚类,得到第五类别;第五特征提取模块,用于对所述第五类别进行类中心提取,得到所述第五类别的类中心特征;第四更新模块,用于将所述第五类别的类中心特征添加到所述特征库中,并将所述第五类别添加到多个参考图像类别中。
- 根据权利要求19所述的装置,其特征在于,所述装置还包括:删除模块,用于从所述特征库中删除所述多个第四类别的类中心特征,并从所述多个参考图像类别中删除所述多个第四类别。
- 一种电子设备,其特征在于,包括:处理器;用于存储处理器可执行指令的存储器;其中,所述处理器被配置为:执行权利要求1至10中任意一项所述的方法。
- 一种计算机可读存储介质,其上存储有计算机程序指令,其特征在于,所述计算机程序指令被处理器执行时实现权利要求1至10中任意一项所述的方法。
- 一种计算机程序,其特征在于,所述计算机程序包括计算机可读代码,当所述计算机可读代码在电子设备中运行时,所述电子设备中的处理器执行用于实现权利要求1至10中任意一项所述的方法。
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2021558009A JP2022526381A (ja) | 2019-08-22 | 2020-05-26 | 画像処理方法及び装置、電子機器並びに記憶媒体 |
US17/488,634 US20220019838A1 (en) | 2019-08-22 | 2021-09-29 | Image Processing Method and Device, and Storage Medium |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201910779555.3A CN110472091B (zh) | 2019-08-22 | 2019-08-22 | 图像处理方法及装置、电子设备和存储介质 |
CN201910779555.3 | 2019-08-22 |
Related Child Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US17/488,634 Continuation US20220019838A1 (en) | 2019-08-22 | 2021-09-29 | Image Processing Method and Device, and Storage Medium |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2021031645A1 true WO2021031645A1 (zh) | 2021-02-25 |
Family
ID=68513418
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2020/092391 WO2021031645A1 (zh) | 2019-08-22 | 2020-05-26 | 图像处理方法及装置、电子设备和存储介质 |
Country Status (5)
Country | Link |
---|---|
US (1) | US20220019838A1 (zh) |
JP (1) | JP2022526381A (zh) |
CN (1) | CN110472091B (zh) |
TW (1) | TW202109314A (zh) |
WO (1) | WO2021031645A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP7504236B2 (ja) | 2021-06-25 | 2024-06-21 | エルアンドティー テクノロジー サービシズ リミテッド | データサンプルをクラスタ化する方法およびシステム |
Families Citing this family (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110472091B (zh) * | 2019-08-22 | 2022-01-11 | 深圳市商汤科技有限公司 | 图像处理方法及装置、电子设备和存储介质 |
CN110929072A (zh) * | 2019-11-29 | 2020-03-27 | 深圳市商汤科技有限公司 | 聚类系统及方法、电子设备和存储介质 |
CN111079653B (zh) * | 2019-12-18 | 2024-03-22 | 中国工商银行股份有限公司 | 数据库自动分库方法及装置 |
CN111309946B (zh) * | 2020-02-10 | 2023-04-07 | 浙江大华技术股份有限公司 | 一种已建立档案优化方法及装置 |
CN111177435B (zh) * | 2019-12-31 | 2023-03-31 | 重庆邮电大学 | 一种基于改进pq算法的cbir方法 |
CN111242217A (zh) * | 2020-01-13 | 2020-06-05 | 支付宝实验室(新加坡)有限公司 | 图像识别模型的训练方法、装置、电子设备及存储介质 |
CN112668632B (zh) * | 2020-12-25 | 2022-04-08 | 浙江大华技术股份有限公司 | 一种数据处理方法、装置、计算机设备及存储介质 |
CN112949710B (zh) * | 2021-02-26 | 2023-06-13 | 北京百度网讯科技有限公司 | 一种图像的聚类方法和装置 |
CN116881485A (zh) * | 2023-06-19 | 2023-10-13 | 北京百度网讯科技有限公司 | 生成图像检索索引的方法及装置、电子设备和介质 |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080101705A1 (en) * | 2006-10-31 | 2008-05-01 | Motorola, Inc. | System for pattern recognition with q-metrics |
CN102009879A (zh) * | 2010-11-18 | 2011-04-13 | 无锡中星微电子有限公司 | 电梯自动按键控制系统、方法和人脸模型训练系统、方法 |
CN108664920A (zh) * | 2018-05-10 | 2018-10-16 | 深圳市深网视界科技有限公司 | 一种实时的大规模级联人脸聚类方法和装置 |
CN109726674A (zh) * | 2018-12-28 | 2019-05-07 | 上海依图网络科技有限公司 | 一种人脸识别方法及装置 |
CN109753577A (zh) * | 2018-12-29 | 2019-05-14 | 深圳云天励飞技术有限公司 | 一种搜索人脸的方法及相关装置 |
CN110472091A (zh) * | 2019-08-22 | 2019-11-19 | 深圳市商汤科技有限公司 | 图像处理方法及装置、电子设备和存储介质 |
Family Cites Families (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2016006090A1 (ja) * | 2014-07-10 | 2016-01-14 | 株式会社東芝 | 電子機器、方法及びプログラム |
CN106547744B (zh) * | 2015-09-16 | 2020-11-06 | 杭州海康威视数字技术股份有限公司 | 一种图像检索方法及系统 |
JP2018120527A (ja) * | 2017-01-27 | 2018-08-02 | 株式会社リコー | 画像処理装置、画像処理方法及び画像処理システム |
US10621416B2 (en) * | 2017-10-02 | 2020-04-14 | Microsoft Technology Licensing, Llc | Image processing for person recognition |
JP2019082959A (ja) * | 2017-10-31 | 2019-05-30 | キヤノン株式会社 | 情報処理装置、情報処理方法及びプログラム |
CN109213732B (zh) * | 2018-06-28 | 2022-03-18 | 努比亚技术有限公司 | 一种改善相册分类的方法、移动终端及计算机可读存储介质 |
CN109657711A (zh) * | 2018-12-10 | 2019-04-19 | 广东浪潮大数据研究有限公司 | 一种图像分类方法、装置、设备及可读存储介质 |
CN111260032A (zh) * | 2020-01-14 | 2020-06-09 | 北京迈格威科技有限公司 | 神经网络训练方法、图像处理方法及装置 |
-
2019
- 2019-08-22 CN CN201910779555.3A patent/CN110472091B/zh active Active
-
2020
- 2020-05-26 JP JP2021558009A patent/JP2022526381A/ja active Pending
- 2020-05-26 WO PCT/CN2020/092391 patent/WO2021031645A1/zh active Application Filing
- 2020-07-16 TW TW109124023A patent/TW202109314A/zh unknown
-
2021
- 2021-09-29 US US17/488,634 patent/US20220019838A1/en not_active Abandoned
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20080101705A1 (en) * | 2006-10-31 | 2008-05-01 | Motorola, Inc. | System for pattern recognition with q-metrics |
CN102009879A (zh) * | 2010-11-18 | 2011-04-13 | 无锡中星微电子有限公司 | 电梯自动按键控制系统、方法和人脸模型训练系统、方法 |
CN108664920A (zh) * | 2018-05-10 | 2018-10-16 | 深圳市深网视界科技有限公司 | 一种实时的大规模级联人脸聚类方法和装置 |
CN109726674A (zh) * | 2018-12-28 | 2019-05-07 | 上海依图网络科技有限公司 | 一种人脸识别方法及装置 |
CN109753577A (zh) * | 2018-12-29 | 2019-05-14 | 深圳云天励飞技术有限公司 | 一种搜索人脸的方法及相关装置 |
CN110472091A (zh) * | 2019-08-22 | 2019-11-19 | 深圳市商汤科技有限公司 | 图像处理方法及装置、电子设备和存储介质 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP7504236B2 (ja) | 2021-06-25 | 2024-06-21 | エルアンドティー テクノロジー サービシズ リミテッド | データサンプルをクラスタ化する方法およびシステム |
Also Published As
Publication number | Publication date |
---|---|
US20220019838A1 (en) | 2022-01-20 |
JP2022526381A (ja) | 2022-05-24 |
CN110472091A (zh) | 2019-11-19 |
CN110472091B (zh) | 2022-01-11 |
TW202109314A (zh) | 2021-03-01 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2021031645A1 (zh) | 图像处理方法及装置、电子设备和存储介质 | |
TWI754855B (zh) | 人臉圖像識別方法、裝置、電子設備及儲存介質 | |
JP7171884B2 (ja) | 歩行者認識方法及び装置 | |
US11120078B2 (en) | Method and device for video processing, electronic device, and storage medium | |
WO2021036382A1 (zh) | 图像处理方法及装置、电子设备和存储介质 | |
WO2021027343A1 (zh) | 一种人脸图像识别方法及装置、电子设备和存储介质 | |
KR20200113195A (ko) | 이미지 클러스터링 방법 및 장치, 전자 기기 및 저장 매체 | |
CN110781957B (zh) | 图像处理方法及装置、电子设备和存储介质 | |
WO2017028416A1 (zh) | 分类器训练方法、类型识别方法及装置 | |
TWI761851B (zh) | 圖像處理方法、圖像處理裝置、電子設備和電腦可讀儲存媒體 | |
CN111259967B (zh) | 图像分类及神经网络训练方法、装置、设备及存储介质 | |
CN112101238A (zh) | 聚类方法及装置、电子设备和存储介质 | |
CN110659690B (zh) | 神经网络的构建方法及装置、电子设备和存储介质 | |
CN110532956B (zh) | 图像处理方法及装置、电子设备和存储介质 | |
TWI785267B (zh) | 影像處理方法、電子設備和儲存介質 | |
CN112926310B (zh) | 一种关键词提取方法及装置 | |
TWI779449B (zh) | 對象計數方法、電子設備、電腦可讀儲存介質 | |
CN109101542B (zh) | 图像识别结果输出方法及装置、电子设备和存储介质 | |
CN112307281A (zh) | 一种实体推荐方法及装置 | |
WO2021164100A1 (zh) | 图像处理方法及装置、电子设备和存储介质 | |
CN114332503A (zh) | 对象重识别方法及装置、电子设备和存储介质 | |
CN114168798A (zh) | 文本存储管理与检索方法及装置 | |
CN111062407B (zh) | 图像处理方法及装置、电子设备和存储介质 | |
WO2023092975A1 (zh) | 图像处理方法及装置、电子设备、存储介质及计算机程序产品 | |
CN112131999B (zh) | 身份确定方法及装置、电子设备和存储介质 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 20855738 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 2021558009 Country of ref document: JP Kind code of ref document: A |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20855738 Country of ref document: EP Kind code of ref document: A1 |
|
32PN | Ep: public notification in the ep bulletin as address of the adressee cannot be established |
Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205A DATED 25.08.2022) |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 20855738 Country of ref document: EP Kind code of ref document: A1 |