WO2015070678A1 - Image recognition method, and method and device for mining main body information about image - Google Patents

Image recognition method, and method and device for mining main body information about image Download PDF

Info

Publication number
WO2015070678A1
WO2015070678A1 PCT/CN2014/087954 CN2014087954W WO2015070678A1 WO 2015070678 A1 WO2015070678 A1 WO 2015070678A1 CN 2014087954 W CN2014087954 W CN 2014087954W WO 2015070678 A1 WO2015070678 A1 WO 2015070678A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
training data
data
intermediate data
information
Prior art date
Application number
PCT/CN2014/087954
Other languages
French (fr)
Chinese (zh)
Inventor
陶哲
薛红霞
白明
韩玉刚
Original Assignee
北京奇虎科技有限公司
奇智软件(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN201310575290.8A external-priority patent/CN103631889B/en
Priority claimed from CN201310575292.7A external-priority patent/CN103631890B/en
Application filed by 北京奇虎科技有限公司, 奇智软件(北京)有限公司 filed Critical 北京奇虎科技有限公司
Publication of WO2015070678A1 publication Critical patent/WO2015070678A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/583Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content

Definitions

  • the present application relates to the technical field of data processing, and in particular, to an image recognition method, a method and an apparatus for mining image body information.
  • the image resources on the Internet are increasingly rich, and the image resources acquired from the network often contain a variety of information, such as background, time, place, subject, etc., and so much information.
  • information such as background, time, place, subject, etc.
  • users can get a wide variety of images from a variety of sources, but not all images are accompanied by clear instructions or notes; for example, for users who are viewing sports news pages, in some cases, users
  • the exact information of the image is not known; in addition, the user cannot obtain other images associated with the image based on the known image.
  • the image resources acquired by the user from the network often only have annotations or annotation information of the image, and the annotation or annotation information cannot accurately give the subject information of the image due to the massive information contained in the acquired image; for example, When a user browses a sports news webpage, the user can only guess the content to be expressed by the news headline and the article summary, and cannot accurately know the character information of the map.
  • the present application has been made in order to provide an image recognition method, a method for mining image body information, and a corresponding device that overcome the above problems or at least partially solve or alleviate the above problems.
  • a method for image recognition including:
  • Obtaining an image to be identified searching for N other images similar to the image similarity degree; acquiring body information corresponding to the other images; and determining a weight value of each other image according to the similarity order, according to the weight information of each body information and the corresponding image, The weight information is separately calculated for each subject information, and the subject information corresponding to the largest accumulated value is extracted as the subject information of the image to be identified.
  • a method for mining image body information including:
  • an apparatus for image recognition including:
  • a search module configured to obtain an image to be recognized, and find N other images similar to the image similarity
  • the sorting module is configured to acquire body information corresponding to other images found by the searching module, and determine each order according to similarity ranking The weighting value of the other images
  • the calculating module is configured to perform weight accumulation calculation on each body information according to the weight information of each body information and the corresponding image
  • the identifying module is configured to extract the body information corresponding to the largest accumulated value, as the waiting Identify the subject information of the image.
  • an apparatus for mining image body information includes:
  • Obtaining a module configured to acquire an image and its annotation information; a generation module, configured to acquire a support information list of the image annotation information by using training data; and an extraction module configured to extract body information of the image from the support information list .
  • a computer program comprising computer readable code, when said computer readable code is run on a computing device, causing said computing device to perform according to claims 1-15 Any of the methods described.
  • the beneficial effects of the present application are: obtaining the image to be recognized and searching for N other images similar to each other according to the image similarity, and then acquiring the body information corresponding to the other images and determining the weight of each other image according to the similarity order, according to Each subject information and the weight of the corresponding image are respectively subjected to weight accumulation calculation for each subject information, and the subject information corresponding to the largest accumulated value is extracted as the subject information of the image to be identified; thereby, an accurate description of the unknown image can be relatively accurately searched.
  • the user can provide the accurate result of the unknown image search, and effectively improve the efficiency of the image data processing; in addition, by acquiring the image and its annotation information, and using the training data to obtain the support information list of the image annotation information, and then from the The main body information of the image is extracted from the support information list; thereby, the main body information of the image can be excavated relatively accurately, and the unnecessary interference description in the image annotation information or the annotation information is excluded, thereby improving the accuracy of the data search.
  • FIG. 1 is a flow chart showing the steps of an embodiment of a method for image recognition according to an embodiment of the present application
  • FIG. 2 is a flow chart showing the steps of an embodiment of a method for mining image body information according to an embodiment of the present application
  • FIG. 3 is a block diagram showing the structure of an apparatus for image recognition according to an embodiment of the present application.
  • FIG. 4 is a block diagram showing the structure of an embodiment of a sorting module according to an embodiment of the present application.
  • FIG. 5 is a schematic block diagram showing an embodiment of an apparatus for mining image body information according to an embodiment of the present application.
  • Figure 6 shows schematically a block diagram of a computing device for performing the method according to the present application
  • Fig. 7 schematically shows a storage module for holding or carrying program code implementing the method according to the present application.
  • Embodiment 1 is a flow chart showing the steps of Embodiment 1 of an image recognition method according to an embodiment of the present application, which may specifically include the following steps:
  • Step 110 Acquire an image to be identified, and find N other images similar to the image according to the image similarity
  • the remote computing device extracts the image to be identified from the image recognition request; constructs an inverted index by using the similar feature of the image, and then performs similar search on the image to be identified to obtain a similar image.
  • N other images can be used to find other N images similar to each other according to the image similarity.
  • the face recognition technology can be used to extract the character frame of the image to be recognized, and then the character is used.
  • the framework finds N other images similar to each other; in addition, we can also aggregate the existing image similarity to form a cluster of aggregated images, and obtain a similar N according to the aggregated image cluster to which the image to be identified belongs.
  • Other images Image similarity aggregation techniques such as sift/surf, phash, haar, or some CSD-7 CSD, SCD, CLD, DCD, HTD, EHD.
  • Step 120 Acquire main body information corresponding to other images, and determine weight values of each other image according to similarity order;
  • the method of the following embodiment 2 can be used to obtain the body information corresponding to the other images.
  • the method of the following embodiment 2 can be used to obtain the body information corresponding to the other images.
  • those skilled in the art can easily understand that there are other methods for acquiring the image body information. This will not be repeated here;
  • x is the reference value of the image pixel to be identified
  • Step 130 Perform weight calculation on each subject information according to the weight information of each subject information and the corresponding image;
  • the weights of the image whose subject information is Name are cumulatively processed as:
  • Step 140 Extract body information corresponding to the maximum accumulated value as the body information of the image to be identified.
  • the object After obtaining the weight accumulation result of all the N other images, the object is sorted according to the weight value, and the body information corresponding to the image with the largest weight accumulation result is extracted therefrom, and the body information is used as the body information of the image to be identified.
  • the embodiment of the present application obtains an image to be identified and searches for N other images similar to each other according to the image similarity, and then acquires the body information corresponding to the other images, and determines the weight of each other image according to the similarity order, according to each subject information. And the weight of the corresponding image, respectively performing weight calculation on each subject information, extracting the subject information corresponding to the largest accumulated value, as the subject information of the image to be identified; thereby accurately searching for accurate description information of the unknown image, and further In the network environment where massive image data exists, it can provide users with accurate results of unknown image search, and effectively improve the efficiency of image data processing.
  • FIG. 2 a flow chart of steps of an embodiment of a method for acquiring body information corresponding to another image according to another embodiment of the present application is shown, which may specifically include the following steps:
  • Step 210 Acquire an image and its annotation information
  • the information is the annotation information of the picture; in addition, some images are actually inconsistent with the annotation information, such as the picture "Some movie festival is amazing," where "Some movie festival is amazing” is the picture
  • the information is marked, but the image may show a lot of images of other people in the film festival. In this case, the user cannot accurately know the subject information of the image from the annotation information. Therefore, when the remote computing device receives such an image of the external input and its annotation information, the annotation information of the image is directly extracted for subsequent processing.
  • the remote computing device only receives the externally input image but does not receive the annotation information of the image, it searches for the annotation information of the image according to the URL of the image, the image source, the text surrounding the image, and the like, and stores the image. For subsequent processing;
  • Step 220 Obtain a support information list of the image annotation information by using training data.
  • the embodiment proposes to obtain a specific image body by acquiring the support information list of the image tagging information. Information, which can effectively improve the accuracy of image description;
  • the following information is used to obtain a list of support information for acquiring image annotation information by using training data, but is not limited thereto:
  • the annotation information of the image is often a collection of multiple information, but usually the collection of the multiple information contains intermediate data; in practical applications, the intermediate data is usually the subject of the image annotation information or the annotation, and the part of speech is often For nouns, such as names of people, places, etc.; for example, if an annotated information is an image of "someone's girlfriend", the intermediate data is "some”; at the same time, because the selection of the intermediate data of the image is usually
  • the words around the word are determined, for example, in the image tagging information "someone's girlfriend", “someone” should be selected separately, which is the intermediate data, and the "girlfriend” should not be selected, which is As a supporting word; specifically, in the annotation information of an image, the entropy of the left and right words of the intermediate data is much smaller than the entropy of the left and right words of the supporting words, so by comparing the entropy of the words of any word in the annotation information, Determine if the word is intermediate data.
  • an image database is preset in the embodiment, and a plurality of intermediate data and supporting words are stored in the image database, and all supporting words are collectively referred to as training.
  • Data; and the plurality of intermediate data and the supporting words have a corresponding matching relationship in the image database, that is, an intermediate data corresponds to multiple supporting words, and the same data may exist in the supporting words corresponding to the plurality of intermediate data; of course, the field
  • the image database can be extracted from the existing network database without a preset image database.
  • the matching relationship of the data in the preset image database can also be diverse. Let me repeat.
  • the correlation score between the training data and the intermediate data can be calculated by using this rule.
  • S224 Generate a support information list of the image annotation information by using the correlation score.
  • the intermediate data often corresponds to multiple supporting words
  • the correlation scores of the related supporting words and the intermediate data are calculated, the correspondence between the training data and the intermediate data relevance scores can be matched, and then the vocabulary is supported.
  • the list consisting of the plurality of sets of supporting vocabularies is the support information list of the image annotation information.
  • Step 230 Extract body information of the image from the support information list.
  • the correlation between the intermediate data and the supporting words composing the training data is a normal distribution relationship.
  • This embodiment also proposes a method for calculating the correlation score between the training data and the intermediate data, but it is not Limited to this, the specific method includes:
  • the score of each support word can be obtained, and then the scores of each support word can be added to obtain the total score of the training data, as follows:
  • is the step size of the unit support word and the intermediate data distance
  • the sum of the correlation weights of all the training data and the intermediate data is E2:
  • S2233 Determine a first correlation score of the training data and the intermediate data by calculating a ratio of the E2 to the E1;
  • the first correlation score of each set of training data and the intermediate data is E3:
  • this embodiment also proposes to extract the body information of the image from the support information list by the following manner, but is not limited thereto;
  • S231 Acquire all intermediate data of the image annotation information and related training data
  • the score of the intermediate data Name is:
  • S233 determining a size of each intermediate data and a preset threshold. When the score of an intermediate data is not less than the preset threshold, determining that the intermediate data is a subject name of the image; otherwise, determining There is no subject name for this image.
  • the embodiment further provides another method for acquiring subject information corresponding to other images, and the method further includes the following steps on the basis of the foregoing method:
  • Step 240 Perform denoising processing of the training data after determining a first correlation score of the training data and the intermediate data;
  • support words that have no practical meaning but are related to the intermediate data in the image database.
  • These support words are often just ordinary data, which are used as the subject name. The probability is very low; for example, vocabulary such as person name and place name can often be used as supporting words, but words such as "" and "and" are supporting words of adverbs, and the probability of being the subject name is very low; in this embodiment, this is the case.
  • Section Support words are defined as background noise, which affects the accuracy of supporting vocabularies.
  • the denoising process in this embodiment can be implemented in the following manner, including:
  • S241 Calculate the correlation weights of the other training data that are synchronous with any training data and all the training data, and perform the summation processing and summing to determine the noise weight B1 of the any training data;
  • the calculation of the background noise value in this embodiment is similar to the calculation method of the weight of the support word, and of course, there may be other ways of calculating; using ⁇ as the unit step size, the support word with the distance of any supporting word step is i ⁇ .
  • the weights are as follows:
  • S243 Determine a second relevance score of the training data and the intermediate data by acquiring a difference between the first correlation score and a noise value of the training data; wherein, the F1 and the F2 The ratio is the noise value of the training data.
  • the background noise value of each supported word is obtained:
  • the denoising process in this embodiment can be implemented by making the basis score of each supporting word and the background noise value of the supporting word, that is, the score of the supporting word after denoising is:
  • the embodiment of the present application obtains the image and the annotation information thereof, and obtains the support information list of the image annotation information by using the training data, and then extracts the body information of the image from the support information list;
  • the subject information of the image excludes unnecessary interference descriptions in the image annotation information or annotation information, and improves the accuracy of the data search.
  • the present application further discloses an apparatus for image recognition, including the following module: a search module 310, configured to acquire an image to be recognized, and find N other images similar to the image according to the image similarity; the sorting module 320, set Obtaining the body information corresponding to the other images found by the searching module, and determining the weights of each of the other images according to the similarity ranking; the calculating module 330 is configured to respectively perform the objects according to the weights of the main body information and the corresponding images. The information is subjected to weight accumulation calculation; and the identification module 340 is configured to extract the body information corresponding to the maximum accumulated value as the body information of the image to be identified.
  • a search module 310 configured to acquire an image to be recognized, and find N other images similar to the image according to the image similarity
  • the sorting module 320 set Obtaining the body information corresponding to the other images found by the searching module, and determining the weights of each of the other images according to the similarity ranking
  • the calculating module 330 is configured to
  • the searching module 310 in this embodiment includes (not shown in the figure): a receiving module that receives an image recognition request, and an extracting module that extracts an image to be recognized from an image recognition request received by the receiving module. .
  • lookup module 310 can include (not shown):
  • An indexing module configured to build an inverted index by using similar features of the image
  • the comparison module performs the similar retrieval on the image to be identified, and acquires N other images similar thereto.
  • the sorting module 320 may specifically include the following modules: an acquiring module 410 configured to acquire an image and its annotation information; and a generating module 420 configured to The support information list of the image annotation information is acquired by using the training data; and the extraction module 430 is configured to extract the body information of the image from the support information list.
  • the generating module 420 includes: (not shown in the figure): a first processing module configured to acquire intermediate data of the image annotation information acquired by the acquiring module; and a second processing module configured to extract from the image database a training data associated with the intermediate data; a third processing module configured to calculate a first correlation score of the training data and the intermediate data; and a fourth processing module configured to utilize the first correlation score The value generates a list of support information for the image annotation information.
  • the third processing module may also include (not shown): the first calculator is configured to calculate and calculate the correlation weights of the training data and the intermediate data. E1; a second calculator configured to accumulate correlation weights of all training data and the intermediate data and sum E2; and a third calculator configured to determine by calculating a ratio of the E2 to the E1 The first correlation score of the training data and the intermediate data.
  • the apparatus for mining image body information of the embodiment may further include (not shown): a denoising module, configured to perform the training data after the third processing module determines the first relevance score Denoising processing.
  • the denoising module may further include (not shown): a fourth calculator configured to calculate correlation between other training data that is synchronous with any training data and all training data. Weight, the correlation weight is cumulatively processed and summed to determine the noise weight F1 of any training data; the fifth calculator is set to accumulate the noise of all training data and summed to determine all a total noise weight F2 of the training data; a sixth calculator configured to determine a second correlation between the training data and the intermediate data by obtaining a difference between the first correlation score and a noise value of the training data a sex score; wherein the ratio of the F1 to the F2 is a noise value of the training data.
  • the extraction module of the embodiment may further include (not shown): a fifth processing module, configured to acquire all intermediate data of the image annotation information and the phase thereof And a sixth processing module, configured to calculate a score of each intermediate data by counting scores of the same training data in the support information list; and a seventh processing module configured to determine a score of each of the intermediate data And the size of the preset threshold, when the score of an intermediate data is not less than the preset threshold, determining that the intermediate data is the body information of the image.
  • the method may include the following modules: an acquiring module 510 configured to acquire an image and labeling information thereof; and a generating module 520. And a set of support information for acquiring the image annotation information by using the training data; and an extraction module 530, configured to extract the body information of the image from the support information list.
  • the generating module 520 includes: (not shown in the figure): the first processing module is configured to acquire intermediate data of the image annotation information acquired by the acquiring module; and the second processing module is configured to extract from the image database a training data associated with the intermediate data; a third processing module configured to calculate a first correlation score of the training data and the intermediate data; and a fourth processing module configured to utilize the first correlation score The value generates a list of support information for the image annotation information.
  • the third processing module may also include (not shown): the first calculator is configured to calculate and calculate the correlation weights of the training data and the intermediate data. E1; a second calculator configured to accumulate correlation weights of all training data and the intermediate data and sum E2; and a third calculator configured to determine by calculating a ratio of the E2 to the E1 The first correlation score of the training data and the intermediate data.
  • the apparatus for mining image body information of the embodiment may further include (not shown): a denoising module, configured to perform the training data after the third processing module determines the first relevance score Denoising processing.
  • the denoising module may further include (not shown): a fourth calculator configured to calculate correlation between other training data that is synchronous with any training data and all training data. Weight, the correlation weight is cumulatively processed and summed to determine the noise weight F1 of any training data; the fifth calculator is set to accumulate the noise of all training data and summed to determine all a total noise weight F2 of the training data; a sixth calculator configured to determine a second correlation between the training data and the intermediate data by obtaining a difference between the first correlation score and a noise value of the training data a sex score; wherein the ratio of the F1 to the F2 is a noise value of the training data.
  • the extraction module may further include (not shown): a fifth processing module configured to acquire all intermediate data of the image annotation information and Corresponding training data; a sixth processing module, configured to calculate a score of each intermediate data by counting scores of the same training data in the support information list; and a seventh processing module configured to determine each of the intermediate data The score is equal to the size of the preset threshold. When the score of an intermediate data is not less than the preset threshold, it is determined that the intermediate data is the body information of the image.
  • the various component embodiments of the present application can be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof. It will be understood by those skilled in the art that a ⁇ processor or a digital signal processor (DSP) may be used in practice to implement some or all of the components of the image recognition device or the device for exchanging image body information according to embodiments of the present application. Some or all of the features.
  • the application can also be implemented as a device or device program (eg, a computer program and a computer program product) configured to perform some or all of the methods described herein.
  • Such a program implementing the present application may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
  • FIG. 6 illustrates a computing device, such as a client terminal device or server, that may implement an image recognition method or a method of mining image body information in accordance with the present application.
  • the computing device conventionally includes a processor 610 and a computer program product or computer readable medium in the form of a memory 620.
  • the memory 620 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM.
  • Memory 620 has a memory space 630 of program code 631 that is configured to perform any of the method steps described above.
  • the storage space 630 for program code may include respective program codes 631 that are respectively set to implement various steps in the above method.
  • the program code can be read from or written to one or more computer program products.
  • These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks.
  • Such computer program products are typically portable or fixed storage modules as described with reference to FIG.
  • the storage module can have storage segments, storage spaces, and the like that are similarly arranged to memory 620 in the computing device of FIG.
  • the program code can be compressed, for example, in an appropriate form.
  • the storage module includes computer readable code 631', i.e., a generation that can be read by a processor such as 610. Codes that, when executed by a computing device, cause the computing device to perform various steps in the methods described above.
  • "an embodiment," or "an embodiment," or "one or more embodiments" as used herein means that the particular features, structures, or characteristics described in connection with the embodiments are included in at least one embodiment of the present application.
  • phrase "in one embodiment" is not necessarily referring to the same embodiment.

Abstract

An image recognition method and device. The method comprises: acquiring an image to be recognized, and searching for N other images similar thereto according to an image similarity; and acquiring main body information corresponding to other images, and, according to a similarity sequence, determining the weight of each of the other images, according to each piece of main body information and the weight of each corresponding image, respectively conducting a weight accumulation calculation on each piece of main body information, extracting main body information corresponding to the maximum accumulated value, and taking same as main body information about the image to be recognized. The solution of the present application can relatively accurately search out accurate description information about unknown images, and can then provide an accurate search result of unknown images for a user when mass image data exists in a network environment, thereby effectively improving the efficiency of processing of image data.

Description

一种图像识别方法、挖掘图像主体信息方法及装置Image recognition method, method and device for mining image body information 技术领域Technical field
本申请涉及数据处理的技术领域,尤其涉及一种图像识别方法、挖掘图像主体信息的方法及装置。The present application relates to the technical field of data processing, and in particular, to an image recognition method, a method and an apparatus for mining image body information.
背景技术Background technique
随着互联网和多媒体技术的飞速发展,互联网上的图像资源日益丰富,从网络上获取的图像资源也往往包含多种多样的信息,如背景、时间、地点、主体等等,而如此多的信息在通常情况下并非是用户真正所要关注的内容;例如,在浏览时事新闻网页时往往会出现多个图像,而用户对于新闻中的图像可能只关注时间和地点;而用户在浏览体育新闻网页时,可能只关注出现的多个图像中的人物和背景等。With the rapid development of the Internet and multimedia technologies, the image resources on the Internet are increasingly rich, and the image resources acquired from the network often contain a variety of information, such as background, time, place, subject, etc., and so much information. Under normal circumstances, it is not what the user really wants to pay attention to; for example, multiple images appear when browsing current news pages, and users may only pay attention to time and place for images in news; while users are browsing sports news pages. It may only focus on the characters and backgrounds in multiple images that appear.
同时,用户能够从多种渠道获取到多种多样的图像,但不是所有图像都附带明确的说明或注释;例如,对于用户在浏览体育新闻网页时出现的配图,在某些情况下,用户并无法知晓该图像的准确信息;此外,用户也无法根据已知图像获取到其他与该图像相关联的图像。同时,用户从网络上获取的图像资源往往只有该图像的注释或标注信息等,并且由于所获取图像包含的海量信息,其注释或标注信息并不能准确给出该图像的主体信息;例如,对于用户在浏览体育新闻网页时出现的配图,用户只能通过新闻标题和文章概要猜测配图所要表达的内容,并不能准确获知该配图的人物信息。At the same time, users can get a wide variety of images from a variety of sources, but not all images are accompanied by clear instructions or notes; for example, for users who are viewing sports news pages, in some cases, users The exact information of the image is not known; in addition, the user cannot obtain other images associated with the image based on the known image. At the same time, the image resources acquired by the user from the network often only have annotations or annotation information of the image, and the annotation or annotation information cannot accurately give the subject information of the image due to the massive information contained in the acquired image; for example, When a user browses a sports news webpage, the user can only guess the content to be expressed by the news headline and the article summary, and cannot accurately know the character information of the map.
因此,如何在网络环境下实现对于图像的识别及图像主体信息的挖掘,从而准确获取该图像的准确描述或其相关联的主体信息就变得十分必要和迫切。Therefore, how to realize image recognition and image body information mining in a network environment, so as to accurately obtain an accurate description of the image or its associated subject information becomes very necessary and urgent.
发明内容Summary of the invention
鉴于上述问题,提出了本申请以便提供一种克服上述问题或者至少部分地解决或者减缓上述问题的一种图像识别方法、挖掘图像主体信息方法及相应的装置。In view of the above problems, the present application has been made in order to provide an image recognition method, a method for mining image body information, and a corresponding device that overcome the above problems or at least partially solve or alleviate the above problems.
根据本申请的一个方面,提供了一种图像识别的方法,包括: According to an aspect of the present application, a method for image recognition is provided, including:
获取待识别图像,根据图像相似度查找与其相似的N张其他图像;获取其他图像对应的主体信息以及根据相似度排序确定每张其他图像的权值,根据各主体信息及对应图像的权值,分别对各主体信息进行权值累加计算,提取最大累加值对应的主体信息,作为待识别图像的主体信息。Obtaining an image to be identified, searching for N other images similar to the image similarity degree; acquiring body information corresponding to the other images; and determining a weight value of each other image according to the similarity order, according to the weight information of each body information and the corresponding image, The weight information is separately calculated for each subject information, and the subject information corresponding to the largest accumulated value is extracted as the subject information of the image to be identified.
根据本申请的另一个方面,提供了一种挖掘图像主体信息的方法,包括:According to another aspect of the present application, a method for mining image body information is provided, including:
获取图像及其标注信息;利用训练数据获取所述图像标注信息的支持信息列表;从所述支持信息列表中提取所述图像的主体信息。Acquiring an image and its annotation information; acquiring a support information list of the image annotation information by using the training data; and extracting the body information of the image from the support information list.
根据本申请的又一个方面,提供了一种图像识别的装置,包括:According to still another aspect of the present application, an apparatus for image recognition is provided, including:
查找模块,设置为获取待识别图像,并根据图像相似度查找与其相似的N张其他图像;排序模块,设置为获取所述查找模块查找到的其他图像对应的主体信息以及根据相似度排序确定每张其他图像的权值;计算模块,设置为根据各主体信息及对应图像的权值,分别对各主体信息进行权值累加计算;识别模块,设置为提取最大累加值对应的主体信息,作为待识别图像的主体信息。a search module, configured to obtain an image to be recognized, and find N other images similar to the image similarity; the sorting module is configured to acquire body information corresponding to other images found by the searching module, and determine each order according to similarity ranking The weighting value of the other images; the calculating module is configured to perform weight accumulation calculation on each body information according to the weight information of each body information and the corresponding image; and the identifying module is configured to extract the body information corresponding to the largest accumulated value, as the waiting Identify the subject information of the image.
根据本申请的又一个方面,提供了一种挖掘图像主体信息的装置,包括:According to still another aspect of the present application, an apparatus for mining image body information includes:
获取模块,设置为获取图像及其标注信息;生成模块,设置为利用训练数据获取所述图像标注信息的支持信息列表;提取模块,设置为从所述支持信息列表中提取所述图像的主体信息。Obtaining a module, configured to acquire an image and its annotation information; a generation module, configured to acquire a support information list of the image annotation information by using training data; and an extraction module configured to extract body information of the image from the support information list .
根据本申请的又一个方面,提供了一种计算机程序,其包括计算机可读代码,当所述计算机可读代码在计算设备上运行时,导致所述计算设备执行根据权利要求1-15中的任一个所述的方法。According to still another aspect of the present application, there is provided a computer program comprising computer readable code, when said computer readable code is run on a computing device, causing said computing device to perform according to claims 1-15 Any of the methods described.
根据本申请的再一个方面,提供了一种计算机可读介质,其中存储了如权利要求31所述的计算机程序。According to still another aspect of the present application, there is provided a computer readable medium storing the computer program of claim 31.
本申请的有益效果为:通过获取待识别图像并根据图像相似度查找与其相似的N张其他图像,然后再获取其他图像对应的主体信息以及根据相似度排序确定每张其他图像的权值,根据各主体信息及对应图像的权值,分别对各主体信息进行权值累加计算,提取最大累加值对应的主体信息,作为待识别图像的主体信息;从而可以相对准确的搜索出未知图像的准确描述信息,进而在海量图像数据存在网络环境 下,能够为用户提供未知图像搜索的准确结果,有效提高了图像数据处理的效率;此外,通过获取图像及其标注信息,并利用训练数据获取所述图像标注信息的支持信息列表,然后从所述支持信息列表中提取所述图像的主体信息;从而可以相对准确的挖掘出图像的主体信息,排除了该图像标注信息或注释信息中不必要的干扰描述,提高了数据搜索的准确度。The beneficial effects of the present application are: obtaining the image to be recognized and searching for N other images similar to each other according to the image similarity, and then acquiring the body information corresponding to the other images and determining the weight of each other image according to the similarity order, according to Each subject information and the weight of the corresponding image are respectively subjected to weight accumulation calculation for each subject information, and the subject information corresponding to the largest accumulated value is extracted as the subject information of the image to be identified; thereby, an accurate description of the unknown image can be relatively accurately searched. Information, and thus in the presence of massive image data in a network environment The user can provide the accurate result of the unknown image search, and effectively improve the efficiency of the image data processing; in addition, by acquiring the image and its annotation information, and using the training data to obtain the support information list of the image annotation information, and then from the The main body information of the image is extracted from the support information list; thereby, the main body information of the image can be excavated relatively accurately, and the unnecessary interference description in the image annotation information or the annotation information is excluded, thereby improving the accuracy of the data search.
上述说明仅是本申请技术方案的概述,为了能够更清楚了解本申请的技术手段,而可依照说明书的内容予以实施,并且为了让本申请的上述和其它目的、特征和优点能够更明显易懂,以下特举本申请的具体实施方式。The above description is only an overview of the technical solutions of the present application, and the technical means of the present application can be more clearly understood, and the above and other objects, features and advantages of the present application can be more clearly understood. The following is a specific embodiment of the present application.
附图说明DRAWINGS
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本申请的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those skilled in the art from a The drawings are only for the purpose of illustrating the preferred embodiments and are not intended to be limiting. Throughout the drawings, the same reference numerals are used to refer to the same parts. In the drawing:
图1示意性示出了根据本申请一个实施例的图像识别的方法实施例的步骤流程图;1 is a flow chart showing the steps of an embodiment of a method for image recognition according to an embodiment of the present application;
图2示意性示出了根据本申请一个实施例的挖掘图像主体信息的方法实施例的步骤流程图;2 is a flow chart showing the steps of an embodiment of a method for mining image body information according to an embodiment of the present application;
图3示意性示出了根据本申请一个实施例的图像识别的装置实施例的结构框图;3 is a block diagram showing the structure of an apparatus for image recognition according to an embodiment of the present application;
图4示意性示出了根据本申请一个实施例的排序模块实施例的结构框图;4 is a block diagram showing the structure of an embodiment of a sorting module according to an embodiment of the present application;
图5示意性示出了根据本申请一个实施例的一种挖掘图像主体信息的装置实施例的结构框图;FIG. 5 is a schematic block diagram showing an embodiment of an apparatus for mining image body information according to an embodiment of the present application; FIG.
图6示意性地示出了用于执行根据本申请的方法的计算设备的框图;以及Figure 6 shows schematically a block diagram of a computing device for performing the method according to the present application;
图7示意性地示出了用于保持或者携带实现根据本申请的方法的程序代码的存储模块。Fig. 7 schematically shows a storage module for holding or carrying program code implementing the method according to the present application.
具体实施例 Specific embodiment
下面结合附图和具体的实施方式对本申请作进一步的描述。The present application is further described below in conjunction with the drawings and specific embodiments.
参照图1,示出了根据本申请一个实施例的一种图像识别方法实施例1的步骤流程图,具体可以包括如下步骤:1 is a flow chart showing the steps of Embodiment 1 of an image recognition method according to an embodiment of the present application, which may specifically include the following steps:
步骤110:获取待识别图像,根据图像相似度查找与其相似的N张其他图像;Step 110: Acquire an image to be identified, and find N other images similar to the image according to the image similarity;
具体的,远端计算设备接收到图像识别请求后,从该图像识别请求中提取出待识别图像;通过图片相似特征建成倒排索引,然后将待识别的图像去进行相似检索,获取与其相似的N张其他图像。当然,本领域普通技术人员很容易了解,还可以通过其他方式来根据图像相似度查找与其相似的N张其他图像,例如可采用人脸识别技术提取出待识别图像的人物框架,再利用该人物框架查找与之类似的N张其他图像;另外,我们还可以通过现有的图像相似性聚合到一起的技术,形成聚合图片簇,根据待识别图片所属的聚合图片簇,从而获取与其相似的N张其他图像。图像相似性聚合技术比如sift/surf,phash,haar这些,或者mpeg-7的一些CSD,SCD,CLD,DCD,HTD,EHD。Specifically, after receiving the image recognition request, the remote computing device extracts the image to be identified from the image recognition request; constructs an inverted index by using the similar feature of the image, and then performs similar search on the image to be identified to obtain a similar image. N other images. Of course, those skilled in the art can easily understand that other methods can be used to find other N images similar to each other according to the image similarity. For example, the face recognition technology can be used to extract the character frame of the image to be recognized, and then the character is used. The framework finds N other images similar to each other; in addition, we can also aggregate the existing image similarity to form a cluster of aggregated images, and obtain a similar N according to the aggregated image cluster to which the image to be identified belongs. Other images. Image similarity aggregation techniques such as sift/surf, phash, haar, or some CSD-7 CSD, SCD, CLD, DCD, HTD, EHD.
步骤120:获取其他图像对应的主体信息以及根据相似度排序确定每张其他图像的权值;Step 120: Acquire main body information corresponding to other images, and determine weight values of each other image according to similarity order;
本实施例中,在查找到其他图像后,可利用下述实施例2的方法获取其他图像对应的主体信息,当然本领域普通技术人员很容易了解还可以有其他获取图像主体信息的方法,在此不再赘述;In this embodiment, after the other images are found, the method of the following embodiment 2 can be used to obtain the body information corresponding to the other images. Of course, those skilled in the art can easily understand that there are other methods for acquiring the image body information. This will not be repeated here;
由于查找出的其他N章图像与待识别图像存在相似度关系为The similarity relationship between the other N chapter images found and the image to be recognized is
Figure PCTCN2014087954-appb-000001
Figure PCTCN2014087954-appb-000001
其中x为待识别图像像素组成参考值,μ为其他图像像素组成参考值,当x=μ时,f(x)=1;Where x is the reference value of the image pixel to be identified, μ is the reference value of other image pixels, and when x=μ, f(x)=1;
上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。The above formula is only an example of an embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
令每张图像与待识别图像的相似度在所有图像中与待识别图像的相似度的排序位置为σμ,则该图像的权值为:Let the similarity of each image to the image to be recognized be σμ in the similarity of the similarity of the image to be recognized in all the images, then the weight of the image is:
Figure PCTCN2014087954-appb-000002
Figure PCTCN2014087954-appb-000002
上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。The above formula is only an example of an embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
步骤130:根据各主体信息及对应图像的权值,分别对各主体信息进行权值累加计算;Step 130: Perform weight calculation on each subject information according to the weight information of each subject information and the corresponding image;
假设N张其他图像中主体信息为Name的图像数为M,且已确定该图像的权值为Weighti后,则对主体信息为Name的图像的权值进行累加处理为:Assuming that the number of images in which the subject information is Name is M in N other images, and the weight of the image is determined to be Weight i , the weights of the image whose subject information is Name are cumulatively processed as:
Figure PCTCN2014087954-appb-000003
Figure PCTCN2014087954-appb-000003
上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。The above formula is only an example of an embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
步骤140:提取最大累加值对应的主体信息,作为待识别图像的主体信息。Step 140: Extract body information corresponding to the maximum accumulated value as the body information of the image to be identified.
在获取所有N张其他图像的权值累加结果后,按照权值大小进行排序,并从中提取出权值累加结果最大的图像对应的主体信息,以该主体信息作为待识别图像的主体信息。After obtaining the weight accumulation result of all the N other images, the object is sorted according to the weight value, and the body information corresponding to the image with the largest weight accumulation result is extracted therefrom, and the body information is used as the body information of the image to be identified.
本申请实施例通过获取待识别图像并根据图像相似度查找与其相似的N张其他图像,然后再获取其他图像对应的主体信息以及根据相似度排序确定每张其他图像的权值,根据各主体信息及对应图像的权值,分别对各主体信息进行权值累加计算,提取最大累加值对应的主体信息,作为待识别图像的主体信息;从而可以相对准确的搜索出未知图像的准确描述信息,进而在海量图像数据存在网络环境下,能够为用户提供未知图像搜索的准确结果,有效提高了图像数据处理的效率。The embodiment of the present application obtains an image to be identified and searches for N other images similar to each other according to the image similarity, and then acquires the body information corresponding to the other images, and determines the weight of each other image according to the similarity order, according to each subject information. And the weight of the corresponding image, respectively performing weight calculation on each subject information, extracting the subject information corresponding to the largest accumulated value, as the subject information of the image to be identified; thereby accurately searching for accurate description information of the unknown image, and further In the network environment where massive image data exists, it can provide users with accurate results of unknown image search, and effectively improve the efficiency of image data processing.
参照图2,示出了根据本申请另一个实施例的获取其他图像对应的主体信息的方法实施例的步骤流程图,具体可以包括如下步骤:Referring to FIG. 2, a flow chart of steps of an embodiment of a method for acquiring body information corresponding to another image according to another embodiment of the present application is shown, which may specifically include the following steps:
步骤210,获取图像及其标注信息;Step 210: Acquire an image and its annotation information;
众所周知的,用户获取的图像中往往包含了多种信息,例如一张图片中就可包含N个名字(N>=0),而其中只有一个名字是用户所需要的,同时该名字也往往被包含在该图片的描述信息中,则该描述信 息即为该图片的标注信息;另外,有一些图像实际上与其标注信息是不相符合的,比如图片“某某电影节惊艳四方”,其中“某某电影节惊艳四方”即为该图片的标注信息,但该图片可能会显示出很多其他人在电影节的图像,此种情况下用户既无法从标注信息中准确获知该图片的主体信息。因此,当远端计算设备接收到外部输入的此类图像及其标注信息后,会直接提取出该图像的标注信息,以备后续处理。而如果远端计算设备只接收到外部输入的图像,却并未接收到该图像的标注信息后,会根据该图像的URL、图像源、图像周围文本等内容查找该图像的标注信息,并存储以备后续处理;As is well known, the image acquired by the user often contains a variety of information, for example, a picture may contain N names (N>=0), and only one of the names is required by the user, and the name is often Included in the description of the picture, the description letter The information is the annotation information of the picture; in addition, some images are actually inconsistent with the annotation information, such as the picture "Some movie festival is amazing," where "Some movie festival is amazing" is the picture The information is marked, but the image may show a lot of images of other people in the film festival. In this case, the user cannot accurately know the subject information of the image from the annotation information. Therefore, when the remote computing device receives such an image of the external input and its annotation information, the annotation information of the image is directly extracted for subsequent processing. If the remote computing device only receives the externally input image but does not receive the annotation information of the image, it searches for the annotation information of the image according to the URL of the image, the image source, the text surrounding the image, and the like, and stores the image. For subsequent processing;
当然,本领域普通技术人员很容易了解,当接收到图像后,还可以通过其他方式来获取该图像的标注信息,本实施例在此不再赘述。Of course, it is easy for those skilled in the art to know that the image information of the image can be obtained by other means after the image is received.
步骤220:利用训练数据获取所述图像标注信息的支持信息列表;Step 220: Obtain a support information list of the image annotation information by using training data.
由于图像标注信息中所包含的信息较多,为避免提取该图像的主体信息出现误差,本实施例提出通过获取该图像标注信息的支持信息列表,再从该支持信息列表中提取具体的图像主体信息,从而可以有效提高图像描述的准确性;Since the information included in the image tagging information is large, in order to avoid the error of extracting the body information of the image, the embodiment proposes to obtain a specific image body by acquiring the support information list of the image tagging information. Information, which can effectively improve the accuracy of image description;
具体的,本实施例可通过以下步骤来实现利用训练数据获取图像标注信息的支持信息列表,但并不局限于此:Specifically, in this embodiment, the following information is used to obtain a list of support information for acquiring image annotation information by using training data, but is not limited thereto:
S221:获取所述图像标注信息的中间数据;S221: Acquire intermediate data of the image annotation information;
其中,图像的标注信息往往是多信息的集合,但通常该多信息的集合中包含有中间数据;在实际应用中,该中间数据通常是图像标注信息或注释中的主语,其词性也往往是以名词为主,如人名、地名等;例如,如果一标注信息为“某某的女友”的图像,其中间数据即为“某某”;同时,由于图像的中间数据的选择通常是由该词周围的词所决定的,比如图像标注信息“某某的女友”中,“某某”单独看就应该被选中,其即为中间数据,而“的女友”就不应该被选中,其即为支持词;具体的,在一图像的标注信息中,中间数据左右两边词的熵比支持词左右两边词的熵小很多,因此通过比较标注信息中任意词左右词的熵的大小,即可判断该词是否为中间数据。The annotation information of the image is often a collection of multiple information, but usually the collection of the multiple information contains intermediate data; in practical applications, the intermediate data is usually the subject of the image annotation information or the annotation, and the part of speech is often For nouns, such as names of people, places, etc.; for example, if an annotated information is an image of "someone's girlfriend", the intermediate data is "some"; at the same time, because the selection of the intermediate data of the image is usually The words around the word are determined, for example, in the image tagging information "someone's girlfriend", "someone" should be selected separately, which is the intermediate data, and the "girlfriend" should not be selected, which is As a supporting word; specifically, in the annotation information of an image, the entropy of the left and right words of the intermediate data is much smaller than the entropy of the left and right words of the supporting words, so by comparing the entropy of the words of any word in the annotation information, Determine if the word is intermediate data.
S222:从图像数据库中提取与所述中间数据相关的训练数据;S222: Extract training data related to the intermediate data from an image database;
需要说明的是,为了查找方便本实施例预设一图像数据库,在该图像数据库中存储有若干中间数据与支持词,所有支持词统称为训练 数据;而该若干中间数据与支持词在该图像数据库中存在对应匹配关系,即一中间数据对应多支持词,同时多个中间数据所对应的支持词中也可以存在相同数据;当然,本领域普通技术人员很容易了解,还可以不预设图像数据库,而从现有网络数据库中进行提取;当然,预设的图像数据库中的数据对应匹配关系也可以是多样的,本实施例在此不再赘述。It should be noted that, in order to facilitate the search, an image database is preset in the embodiment, and a plurality of intermediate data and supporting words are stored in the image database, and all supporting words are collectively referred to as training. Data; and the plurality of intermediate data and the supporting words have a corresponding matching relationship in the image database, that is, an intermediate data corresponds to multiple supporting words, and the same data may exist in the supporting words corresponding to the plurality of intermediate data; of course, the field It is easy for a person skilled in the art to understand that the image database can be extracted from the existing network database without a preset image database. Of course, the matching relationship of the data in the preset image database can also be diverse. Let me repeat.
具体的,当获取所述图像标注信息中的中间数据后,以该中间数据为目标查找所述预设图像数据库,若该图像数据库中存在该中间数据,则利用所述匹配关系提取出该中间数据对应的相关支持词;否则,结束操作并向外部反馈该图像无主体信息的指示。Specifically, after acquiring the intermediate data in the image annotation information, searching the preset image database with the intermediate data as a target, and if the intermediate data exists in the image database, extracting the middle by using the matching relationship The relevant supporting word corresponding to the data; otherwise, the operation is ended and the indication of the image without the subject information is externally fed back.
S223:计算所述训练数据与所述中间数据的相关性分值;S223: Calculate a correlation score of the training data and the intermediate data;
需要说明的是,由于中间数据与其训练数据的相关性是一个正太分布的关系,因此利用这个规律即可计算出训练数据与中间数据的相关性分值。It should be noted that since the correlation between the intermediate data and its training data is a positively distributed relationship, the correlation score between the training data and the intermediate data can be calculated by using this rule.
S224:利用所述相关性分值生成所述图像标注信息的支持信息列表。S224: Generate a support information list of the image annotation information by using the correlation score.
具体的,由于中间数据往往对应多个支持词,所以当计算出相关支持词与中间数据的相关性分值后,即可匹配出训练数据与中间数据相关性分值对应关系支持词表,再获取多组训练数据与中间数据相关性分值的对应关系后,由所述多组支持词表组成的列表即为图像标注信息的支持信息列表。Specifically, since the intermediate data often corresponds to multiple supporting words, when the correlation scores of the related supporting words and the intermediate data are calculated, the correspondence between the training data and the intermediate data relevance scores can be matched, and then the vocabulary is supported. After obtaining the correspondence between the plurality of sets of training data and the intermediate data relevance scores, the list consisting of the plurality of sets of supporting vocabularies is the support information list of the image annotation information.
步骤230:从所述支持信息列表中提取所述图像的主体信息。Step 230: Extract body information of the image from the support information list.
值得注意的是,基于中间数据与组成训练数据的支持词的相关性是一个正态分布的关系,本实施例还提出了一种计算训练数据与中间数据相关性分值的方法,但并不局限于此,具体该方法包括:It is worth noting that the correlation between the intermediate data and the supporting words composing the training data is a normal distribution relationship. This embodiment also proposes a method for calculating the correlation score between the training data and the intermediate data, but it is not Limited to this, the specific method includes:
S2231:计算训练数据与所述中间数据的相关性权值并求和E1;S2231: Calculate the correlation weight of the training data and the intermediate data and sum E1;
具体的,由于中间数据与支持词的相关性的正态分布关系为Specifically, the normal distribution relationship between the intermediate data and the supporting words is
Figure PCTCN2014087954-appb-000004
Figure PCTCN2014087954-appb-000004
其中x为中间数据,μ为支持词;Where x is the intermediate data and μ is the supporting word;
上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围 内。The above formula is only an example of an embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also in the protection scope of the present invention. Inside.
因此,将μ与x的相关性作为权值,即可得到每个支持词的分值,再将每个支持词的分值相加即可获得训练数据的总分值,如下:Therefore, by using the correlation between μ and x as the weight, the score of each support word can be obtained, and then the scores of each support word can be added to obtain the total score of the training data, as follows:
当x=μ时,f(x)=1,则与中间数据x距离步长为iσμ的支持词的相关性权值为:When x=μ, f(x)=1, the correlation weight of the support words with the intermediate data x distance step iσμ is:
Figure PCTCN2014087954-appb-000005
其中σμ为单位支持词与中间数据距离的步长;
Figure PCTCN2014087954-appb-000005
Where σμ is the step size of the unit support word and the intermediate data distance;
需要说明,上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。It should be noted that the above formula is only an example of implementing one embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
由此也可看出,距离中间数据越远的支持词其相关性越差,权值也越小;It can also be seen that the farther the support word from the intermediate data is, the worse the correlation is, and the smaller the weight is;
再将N个支持词的分值进行累加即可获得所有训练数据与所述中间数据的相关性权值E1:Then, the scores of the N support words are accumulated to obtain the correlation weight E1 of all the training data and the intermediate data:
Figure PCTCN2014087954-appb-000006
Figure PCTCN2014087954-appb-000006
需要说明,上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。It should be noted that the above formula is only an example of implementing one embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
S2232:将所有训练数据与所述中间数据的相关性权值进行累加处理并求和E2;S2232: accumulating the correlation weights of all the training data and the intermediate data and summing E2;
令所述中间数据存在多组训练数据M,则所有训练数据与所述中间数据的相关性权值之和为E2:If the intermediate data has multiple sets of training data M, the sum of the correlation weights of all the training data and the intermediate data is E2:
Figure PCTCN2014087954-appb-000007
Figure PCTCN2014087954-appb-000007
需要说明,上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。It should be noted that the above formula is only an example of implementing one embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
S2233:通过计算所述E2与所述E1的比值确定该训练数据与所述中间数据的第一相关性分值;S2233: Determine a first correlation score of the training data and the intermediate data by calculating a ratio of the E2 to the E1;
具体的,每组训练数据与所述中间数据的第一相关性分值为E3: Specifically, the first correlation score of each set of training data and the intermediate data is E3:
Figure PCTCN2014087954-appb-000008
Figure PCTCN2014087954-appb-000008
需要说明,上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。It should be noted that the above formula is only an example of implementing one embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
此外,本实施例也提出了通过如下方式从支持信息列表中提取所述图像的主体信息,但并不局限于此;包括:In addition, this embodiment also proposes to extract the body information of the image from the support information list by the following manner, but is not limited thereto;
S231:获取所述图像标注信息的所有中间数据及其相关训练数据;S231: Acquire all intermediate data of the image annotation information and related training data;
S232:通过统计相同训练数据在所述支持信息列表中的分值计算每个中间数据的得分;S232: Calculate a score of each intermediate data by counting scores of the same training data in the support information list;
具体的,设所有中间数据中每个中间数据Name的相关训练数据的总数为P,而训练数据P中支持词Wordi在支持信息词表中的分值为Scorei,其权重为Weighti,则该中间数据Name的得分为:Specifically, it is assumed that the total number of related training data of each intermediate data Name in all the intermediate data is P, and the score of the supporting word Word i in the training information word table in the supporting information word table is Score i , and the weight thereof is Weight i . Then the score of the intermediate data Name is:
Figure PCTCN2014087954-appb-000009
Figure PCTCN2014087954-appb-000009
需要说明,上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。It should be noted that the above formula is only an example of implementing one embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
S233:判断所述每个中间数据的得分与预设阈值的大小,当一中间数据的得分不小于所述预设阈值,则确定该中间数据为所述图像的主体名称;否则,即可确定该图像不存在主体名称。S233: determining a size of each intermediate data and a preset threshold. When the score of an intermediate data is not less than the preset threshold, determining that the intermediate data is a subject name of the image; otherwise, determining There is no subject name for this image.
除此之外,本实施例还提供了另一种获取其他图像对应的主体信息的方法,该方法在上述方法的基础上还包括以下步骤:In addition, the embodiment further provides another method for acquiring subject information corresponding to other images, and the method further includes the following steps on the basis of the foregoing method:
步骤240:在确定训练数据与中间数据的第一相关性分值之后进行所述训练数据的去噪处理;Step 240: Perform denoising processing of the training data after determining a first correlation score of the training data and the intermediate data;
需要说明的是,在获取的相关训练数据中,通常存在一些并无实际意义但却在图像数据库中与中间数据存在相关性的支持词,这部分支持词往往只是普通数据,其作为主体名称的概率很低;例如,人名、地名等词汇往往可以作为支持词,但是诸如“的”“而”等词性为副词的支持词,其作为主体名称的概率就很低;在本实施例中将这部分 支持词定义为背景噪声,其会影响支持词表的准确性。It should be noted that in the relevant training data obtained, there are usually some support words that have no practical meaning but are related to the intermediate data in the image database. These support words are often just ordinary data, which are used as the subject name. The probability is very low; for example, vocabulary such as person name and place name can often be used as supporting words, but words such as "" and "and" are supporting words of adverbs, and the probability of being the subject name is very low; in this embodiment, this is the case. section Support words are defined as background noise, which affects the accuracy of supporting vocabularies.
具体的,本实施例的去噪处理可通过以下方式实现,包括:Specifically, the denoising process in this embodiment can be implemented in the following manner, including:
S241:计算与任一训练数据相同步长的其他训练数据与所有训练数据的相关性权值,将该相关性权值进行累加处理并求和后确定该任一训练数据的噪声权值B1;S241: Calculate the correlation weights of the other training data that are synchronous with any training data and all the training data, and perform the summation processing and summing to determine the noise weight B1 of the any training data;
具体的,本实施例中背景噪声值的计算与支持词的权值计算方式相似,当然还可以有其他方式计算;以σμ作为单位步长,计算与任意支持词步长距离为iσμ的支持词的权值如下:Specifically, the calculation of the background noise value in this embodiment is similar to the calculation method of the weight of the support word, and of course, there may be other ways of calculating; using σμ as the unit step size, the support word with the distance of any supporting word step is iσμ. The weights are as follows:
Figure PCTCN2014087954-appb-000010
Figure PCTCN2014087954-appb-000010
需要说明,上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。It should be noted that the above formula is only an example of implementing one embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
将与任意支持词相同步长的同一支持词的权值进行累加处理,即可得到该任意支持词的背景噪声权值B1=BackNoisewordThe weight of the same supporting word that is synchronized with any supporting word is accumulated, and the background noise weight B1=BackNoise word of the arbitrary supporting word is obtained;
S242:将所有训练数据的噪声进行累加处理并求和后确定所有训练数据的总噪声权值B2=TotalBackNoiseS242: Accumulating the noise of all the training data and summing to determine the total noise weight of all the training data B2=Total BackNoise ;
S243:通过获取所述第一相关性分值与所述训练数据的噪声值之差确定所述训练数据与所述中间数据的第二相关性分值;其中,所述F1与所述F2的比值为所述训练数据的噪声值。S243: Determine a second relevance score of the training data and the intermediate data by acquiring a difference between the first correlation score and a noise value of the training data; wherein, the F1 and the F2 The ratio is the noise value of the training data.
具体的,在计算出任意支持词的背景噪声权值F1和所有训练数据的总噪声权值F2后,即可获知每个支持词词的背景噪声值为:Specifically, after calculating the background noise weight F1 of any support word and the total noise weight F2 of all training data, the background noise value of each supported word is obtained:
Figure PCTCN2014087954-appb-000011
Figure PCTCN2014087954-appb-000011
需要说明,上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。It should be noted that the above formula is only an example of implementing one embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
而本实施例中的去噪声处理即可通过将每个支持词的基础得分与此支持词的背景噪声值做差来实现,即该支持词去噪后的分值为:The denoising process in this embodiment can be implemented by making the basis score of each supporting word and the background noise value of the supporting word, that is, the score of the supporting word after denoising is:
Score=Scoresup port-ScoreBackNoise Score=Score sup port -Score BackNoise
然后再记录所有支持词的分值,最终生成支持词表。Then record the scores of all supporting words and finally generate a supporting vocabulary.
需要说明,上述的公式仅是实现本发明技术方案的一种实施方式举例,本领域技术人员通过对公式进行各种变形,其实质也在本发明的保护范围内。It should be noted that the above formula is only an example of implementing one embodiment of the technical solution of the present invention, and various modifications are made to the formula by those skilled in the art, and the essence thereof is also within the protection scope of the present invention.
当然,上述特种信息及其判断方式只是作为示例,在实施本申请实施例时,可以根据实际情况设置其他特种信息及其判断方式,本申请实施例对此不加以限制。另外,除了上述特种信息及其判断方式外,本领域技术人员还可以根据实际需要采用其他特种信息及其判断方式,本申请实施例对此也不加以限制。Of course, the above-mentioned special information and its judgment manner are only examples. In the implementation of the embodiment of the present application, other special information and its judgment manner may be set according to actual conditions, which is not limited by the embodiment of the present application. In addition, in addition to the above-mentioned special information and its judgment manner, the person skilled in the art can also adopt other special information and its judgment manner according to actual needs, and the embodiment of the present application does not limit this.
本申请实施例通过获取图像及其标注信息,并利用训练数据获取所述图像标注信息的支持信息列表,然后从所述支持信息列表中提取所述图像的主体信息;从而可以相对准确的挖掘出图像的主体信息,排除了该图像标注信息或注释信息中不必要的干扰描述,提高了数据搜索的准确度。The embodiment of the present application obtains the image and the annotation information thereof, and obtains the support information list of the image annotation information by using the training data, and then extracts the body information of the image from the support information list; The subject information of the image excludes unnecessary interference descriptions in the image annotation information or annotation information, and improves the accuracy of the data search.
对于方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本申请实施例并不受所描述的动作顺序的限制,因为依据本申请实施例,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作并不一定是本申请实施例所必须的。For the method embodiments, for the sake of brevity, they are all described as a series of action combinations, but those skilled in the art should understand that the embodiments of the present application are not limited by the described action sequence, because the embodiment according to the present application Some steps can be performed in other orders or at the same time. In the following, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions involved are not necessarily required in the embodiments of the present application.
参照图3,本申请还公开了一种图像识别的装置,包括以下模块:查找模块310,设置为获取待识别图像,并根据图像相似度查找与其相似的N张其他图像;排序模块320,设置为获取所述查找模块查找到的其他图像对应的主体信息以及根据相似度排序确定每张其他图像的权值;计算模块330,设置为根据各主体信息及对应图像的权值,分别对各主体信息进行权值累加计算;以及识别模块340,设置为提取最大累加值对应的主体信息,作为待识别图像的主体信息。Referring to FIG. 3, the present application further discloses an apparatus for image recognition, including the following module: a search module 310, configured to acquire an image to be recognized, and find N other images similar to the image according to the image similarity; the sorting module 320, set Obtaining the body information corresponding to the other images found by the searching module, and determining the weights of each of the other images according to the similarity ranking; the calculating module 330 is configured to respectively perform the objects according to the weights of the main body information and the corresponding images. The information is subjected to weight accumulation calculation; and the identification module 340 is configured to extract the body information corresponding to the maximum accumulated value as the body information of the image to be identified.
其中,本实施例中的所述查找模块310包括(图中未示出):接收图像识别请求的接收模块,及从所述接收模块接收到的图像识别请求中提取出待识别图像的提取模块。The searching module 310 in this embodiment includes (not shown in the figure): a receiving module that receives an image recognition request, and an extracting module that extracts an image to be recognized from an image recognition request received by the receiving module. .
除此之外,所述查找模块310可包括(图中未示出):In addition, the lookup module 310 can include (not shown):
索引模块,设置为通过图片相似特征建成倒排索引; An indexing module, configured to build an inverted index by using similar features of the image;
比较模块,将待识别的图像去进行相似检索,获取与其相似的N张其他图像。The comparison module performs the similar retrieval on the image to be identified, and acquires N other images similar thereto.
值得注意的是,本实施例图像识别的装置中,如图4所示,所述排序模块320具体可以包括如下模块:获取模块410,设置为获取图像及其标注信息;生成模块420,设置为利用训练数据获取所述图像标注信息的支持信息列表;提取模块430,设置为从所述支持信息列表中提取所述图像的主体信息。It is to be noted that, in the apparatus for image recognition in this embodiment, as shown in FIG. 4, the sorting module 320 may specifically include the following modules: an acquiring module 410 configured to acquire an image and its annotation information; and a generating module 420 configured to The support information list of the image annotation information is acquired by using the training data; and the extraction module 430 is configured to extract the body information of the image from the support information list.
其中,所述生成模块420包括(图中未示出):第一处理模块,设置为获取所述获取模块获取到的图像标注信息的中间数据;第二处理模块,设置为从图像数据库中提取与所述中间数据相关的训练数据;第三处理模块,设置为计算所述训练数据与所述中间数据的第一相关性分值;第四处理模块,设置为利用所述第一相关性分值生成所述图像标注信息的支持信息列表。The generating module 420 includes: (not shown in the figure): a first processing module configured to acquire intermediate data of the image annotation information acquired by the acquiring module; and a second processing module configured to extract from the image database a training data associated with the intermediate data; a third processing module configured to calculate a first correlation score of the training data and the intermediate data; and a fourth processing module configured to utilize the first correlation score The value generates a list of support information for the image annotation information.
需要说明的是,在本实施例中所述第三处理模块也可包括(图中未示出):第一计算器,设置为计算训练数据与所述中间数据的相关性权值并求和E1;第二计算器,设置为将所有训练数据与所述中间数据的相关性权值进行累加处理并求和E2;第三计算器,设置为通过计算所述E2与所述E1的比值确定该训练数据与所述中间数据的第一相关性分值。It should be noted that, in the embodiment, the third processing module may also include (not shown): the first calculator is configured to calculate and calculate the correlation weights of the training data and the intermediate data. E1; a second calculator configured to accumulate correlation weights of all training data and the intermediate data and sum E2; and a third calculator configured to determine by calculating a ratio of the E2 to the E1 The first correlation score of the training data and the intermediate data.
此外,本实施例的挖掘图像主体信息的装置还可包括(图中未示出):去噪模块,设置为所述第三处理模块确定所述第一相关性分值之后进行所述训练数据的去噪处理。In addition, the apparatus for mining image body information of the embodiment may further include (not shown): a denoising module, configured to perform the training data after the third processing module determines the first relevance score Denoising processing.
具体的,本实施例提出所述去噪模块还可包括(图中未示出):第四计算器,设置为计算与任一训练数据相同步长的其他训练数据与所有训练数据的相关性权值,将该相关性权值进行累加处理并求和后确定该任一训练数据的噪声权值F1;第五计算器,设置为将所有训练数据的噪声进行累加处理并求和后确定所有训练数据的总噪声权值F2;第六计算器,设置为通过获取所述第一相关性分值与所述训练数据的噪声值之差确定所述训练数据与所述中间数据的第二相关性分值;其中,所述F1与所述F2的比值为所述训练数据的噪声值。Specifically, the embodiment provides that the denoising module may further include (not shown): a fourth calculator configured to calculate correlation between other training data that is synchronous with any training data and all training data. Weight, the correlation weight is cumulatively processed and summed to determine the noise weight F1 of any training data; the fifth calculator is set to accumulate the noise of all training data and summed to determine all a total noise weight F2 of the training data; a sixth calculator configured to determine a second correlation between the training data and the intermediate data by obtaining a difference between the first correlation score and a noise value of the training data a sex score; wherein the ratio of the F1 to the F2 is a noise value of the training data.
除此之外,本实施例的所述提取模块还可包括(图中未示出):第五处理模块,设置为获取所述图像标注信息的所有中间数据及其相 关训练数据;第六处理模块,设置为通过统计相同训练数据在所述支持信息列表中的分值计算每个中间数据的得分;第七处理模块,设置为判断所述每个中间数据的得分与预设阈值的大小,当一中间数据的得分不小于所述预设阈值,则确定该中间数据为所述图像的主体信息。In addition, the extraction module of the embodiment may further include (not shown): a fifth processing module, configured to acquire all intermediate data of the image annotation information and the phase thereof And a sixth processing module, configured to calculate a score of each intermediate data by counting scores of the same training data in the support information list; and a seventh processing module configured to determine a score of each of the intermediate data And the size of the preset threshold, when the score of an intermediate data is not less than the preset threshold, determining that the intermediate data is the body information of the image.
参照图5,示出了根据本申请一个实施例的一种挖掘图像主体信息的装置实施例的结构框图,具体可以包括如下模块:获取模块510,设置为获取图像及其标注信息;生成模块520,设置为利用训练数据获取所述图像标注信息的支持信息列表;提取模块530,设置为从所述支持信息列表中提取所述图像的主体信息。Referring to FIG. 5, a structural block diagram of an apparatus for mining image body information according to an embodiment of the present application is shown. Specifically, the method may include the following modules: an acquiring module 510 configured to acquire an image and labeling information thereof; and a generating module 520. And a set of support information for acquiring the image annotation information by using the training data; and an extraction module 530, configured to extract the body information of the image from the support information list.
其中,所述生成模块520包括(图中未示出):第一处理模块,设置为获取所述获取模块获取到的图像标注信息的中间数据;第二处理模块,设置为从图像数据库中提取与所述中间数据相关的训练数据;第三处理模块,设置为计算所述训练数据与所述中间数据的第一相关性分值;第四处理模块,设置为利用所述第一相关性分值生成所述图像标注信息的支持信息列表。The generating module 520 includes: (not shown in the figure): the first processing module is configured to acquire intermediate data of the image annotation information acquired by the acquiring module; and the second processing module is configured to extract from the image database a training data associated with the intermediate data; a third processing module configured to calculate a first correlation score of the training data and the intermediate data; and a fourth processing module configured to utilize the first correlation score The value generates a list of support information for the image annotation information.
需要说明的是,在本实施例中所述第三处理模块也可包括(图中未示出):第一计算器,设置为计算训练数据与所述中间数据的相关性权值并求和E1;第二计算器,设置为将所有训练数据与所述中间数据的相关性权值进行累加处理并求和E2;第三计算器,设置为通过计算所述E2与所述E1的比值确定该训练数据与所述中间数据的第一相关性分值。It should be noted that, in the embodiment, the third processing module may also include (not shown): the first calculator is configured to calculate and calculate the correlation weights of the training data and the intermediate data. E1; a second calculator configured to accumulate correlation weights of all training data and the intermediate data and sum E2; and a third calculator configured to determine by calculating a ratio of the E2 to the E1 The first correlation score of the training data and the intermediate data.
此外,本实施例的挖掘图像主体信息的装置还可包括(图中未示出):去噪模块,设置为所述第三处理模块确定所述第一相关性分值之后进行所述训练数据的去噪处理。In addition, the apparatus for mining image body information of the embodiment may further include (not shown): a denoising module, configured to perform the training data after the third processing module determines the first relevance score Denoising processing.
具体的,本实施例提出所述去噪模块还可包括(图中未示出):第四计算器,设置为计算与任一训练数据相同步长的其他训练数据与所有训练数据的相关性权值,将该相关性权值进行累加处理并求和后确定该任一训练数据的噪声权值F1;第五计算器,设置为将所有训练数据的噪声进行累加处理并求和后确定所有训练数据的总噪声权值F2;第六计算器,设置为通过获取所述第一相关性分值与所述训练数据的噪声值之差确定所述训练数据与所述中间数据的第二相关性分值;其中,所述F1与所述F2的比值为所述训练数据的噪声值。 Specifically, the embodiment provides that the denoising module may further include (not shown): a fourth calculator configured to calculate correlation between other training data that is synchronous with any training data and all training data. Weight, the correlation weight is cumulatively processed and summed to determine the noise weight F1 of any training data; the fifth calculator is set to accumulate the noise of all training data and summed to determine all a total noise weight F2 of the training data; a sixth calculator configured to determine a second correlation between the training data and the intermediate data by obtaining a difference between the first correlation score and a noise value of the training data a sex score; wherein the ratio of the F1 to the F2 is a noise value of the training data.
除此之外,本实施例的挖掘图像主体信息的装置中,所述提取模块还可包括(图中未示出):第五处理模块,设置为获取所述图像标注信息的所有中间数据及其相关训练数据;第六处理模块,设置为通过统计相同训练数据在所述支持信息列表中的分值计算每个中间数据的得分;第七处理模块,设置为判断所述每个中间数据的得分与预设阈值的大小,当一中间数据的得分不小于所述预设阈值,则确定该中间数据为所述图像的主体信息。In addition, in the apparatus for mining image body information of the embodiment, the extraction module may further include (not shown): a fifth processing module configured to acquire all intermediate data of the image annotation information and Corresponding training data; a sixth processing module, configured to calculate a score of each intermediate data by counting scores of the same training data in the support information list; and a seventh processing module configured to determine each of the intermediate data The score is equal to the size of the preset threshold. When the score of an intermediate data is not less than the preset threshold, it is determined that the intermediate data is the body information of the image.
本申请的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用徼处理器或者数字信号处理器(DSP)来实现根据本申请实施例的图像识别设备或挖据图像主体信息的设备中的一些或者全部部件的一些或者全部功能。本申请还可以实现为设置为执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本申请的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。The various component embodiments of the present application can be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof. It will be understood by those skilled in the art that a 徼 processor or a digital signal processor (DSP) may be used in practice to implement some or all of the components of the image recognition device or the device for exchanging image body information according to embodiments of the present application. Some or all of the features. The application can also be implemented as a device or device program (eg, a computer program and a computer program product) configured to perform some or all of the methods described herein. Such a program implementing the present application may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
例如,图6示出了可以实现根据本申请的图像识别方法或挖掘图像主体信息的方法的计算设备,例如客户终端设备或服务器。该计算设备传统上包括处理器610和以存储器620形式的计算机程序产品或者计算机可读介质。存储器620可以是诸如闪存、EEPROM(电可擦除可编程只读存储器)、EPROM、硬盘或者ROM之类的电子存储器。存储器620具有设置为执行上述方法中的任何方法步骤的程序代码631的存储空间630。例如,用于程序代码的存储空间630可以包括分别设置为实现上面的方法中的各种步骤的各个程序代码631。这些程序代码可以从一个或者多个计算机程序产品中读出或者写入到这一个或者多个计算机程序产品中。这些计算机程序产品包括诸如硬盘,紧致盘(CD)、存储卡或者软盘之类的程序代码载体。这样的计算机程序产品通常为如参考图7所述的便携式或者固定存储模块。该存储模块可以具有与图6的计算设备中的存储器620类似布置的存储段、存储空间等。程序代码可以例如以适当形式进行压缩。通常,存储模块包括计算机可读代码631’,即可以由例如诸如610之类的处理器读取的代 码,这些代码当由计算设备运行时,导致该计算设备执行上面所描述的方法中的各个步骤。For example, FIG. 6 illustrates a computing device, such as a client terminal device or server, that may implement an image recognition method or a method of mining image body information in accordance with the present application. The computing device conventionally includes a processor 610 and a computer program product or computer readable medium in the form of a memory 620. The memory 620 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, a hard disk, or a ROM. Memory 620 has a memory space 630 of program code 631 that is configured to perform any of the method steps described above. For example, the storage space 630 for program code may include respective program codes 631 that are respectively set to implement various steps in the above method. The program code can be read from or written to one or more computer program products. These computer program products include program code carriers such as hard disks, compact disks (CDs), memory cards or floppy disks. Such computer program products are typically portable or fixed storage modules as described with reference to FIG. The storage module can have storage segments, storage spaces, and the like that are similarly arranged to memory 620 in the computing device of FIG. The program code can be compressed, for example, in an appropriate form. Typically, the storage module includes computer readable code 631', i.e., a generation that can be read by a processor such as 610. Codes that, when executed by a computing device, cause the computing device to perform various steps in the methods described above.
本文中所称的“一个实施例”、“实施例”或者“一个或者多个实施例”意味着,结合实施例描述的特定特征、结构或者特性包括在本申请的至少一个实施例中。此外,请注意,这里“在一个实施例中”的词语例子不一定全指同一个实施例。"an embodiment," or "an embodiment," or "one or more embodiments" as used herein means that the particular features, structures, or characteristics described in connection with the embodiments are included in at least one embodiment of the present application. In addition, it is noted that the phrase "in one embodiment" is not necessarily referring to the same embodiment.
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本申请的实施例可以在没有这些具体细节的情况下被实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。In the description provided herein, numerous specific details are set forth. However, it is understood that the embodiments of the present application can be practiced without these specific details. In some instances, well-known methods, structures, and techniques are not shown in detail so as not to obscure the understanding of the description.
应该注意的是上述实施例对本申请进行说明而不是对本申请进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本申请可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。It should be noted that the above-described embodiments are illustrative of the present application and are not intended to limit the scope of the application, and those skilled in the art can devise alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as a limitation. The word "comprising" does not exclude the presence of the elements or steps that are not recited in the claims. The word "a" or "an" The application can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means can be embodied by the same hardware item. The use of the words first, second, and third does not indicate any order. These words can be interpreted as names.
此外,还应当注意,本说明书中使用的语言主要是为了可读性和教导的目的而选择的,而不是为了解释或者限定本申请的主题而选择的。因此,在不偏离所附权利要求书的范围和精神的情况下,对于本技术领域的普通技术人员来说许多修改和变更都是显而易见的。对于本申请的范围,对本申请所做的公开是说明性的,而非限制性的,本申请的范围由所附权利要求书限定。 In addition, it should be noted that the language used in the specification has been selected for the purpose of readability and teaching, and is not intended to be interpreted or limited. Therefore, many modifications and changes will be apparent to those skilled in the art without departing from the scope of the invention. The disclosure of the present application is intended to be illustrative, and not restrictive, and the scope of the application is defined by the appended claims.

Claims (32)

  1. 一种图像识别的方法,包括:A method of image recognition, comprising:
    获取待识别图像,根据图像相似度查找与其相似的N张其他图像;Obtaining an image to be identified, and finding N other images similar to the image according to image similarity;
    获取其他图像对应的主体信息以及根据相似度排序确定每张其他图像的权值,Obtaining body information corresponding to other images and determining weights of each other image according to similarity ordering,
    根据各主体信息及对应图像的权值,分别对各主体信息进行权值累加计算,According to the weight information of each subject information and the corresponding image, weight calculation is performed on each subject information,
    提取最大累加值对应的主体信息,作为待识别图像的主体信息。The body information corresponding to the largest accumulated value is extracted as the body information of the image to be identified.
  2. 如权利要求1所述的方法,所述获取待识别图像包括:The method of claim 1, wherein the obtaining the image to be identified comprises:
    接收图像识别请求;Receiving an image recognition request;
    从所述图像识别请求中提取出待识别图像。An image to be identified is extracted from the image recognition request.
  3. 如权利要求1或2所述的方法,所述根据图像相似度查找与其相似的N张其他图像包括:The method according to claim 1 or 2, wherein the finding N other images similar to the image similarity includes:
    通过图片相似特征建成倒排索引,然后将待识别的图像去进行相似检索,获取与其相似的N张其他图像。The inverted index is built by the similar feature of the picture, and then the image to be recognized is similarly searched to obtain N other images similar thereto.
  4. 如权利要求1所述的方法,通过以下方式获取其他图像对应的主体信息:The method according to claim 1, wherein the subject information corresponding to the other images is obtained by:
    获取图像及其标注信息;Obtain images and their annotation information;
    利用训练数据获取所述图像标注信息的支持信息列表;Obtaining a support information list of the image annotation information by using training data;
    从所述支持信息列表中提取所述图像的主体信息。The body information of the image is extracted from the list of support information.
  5. 如权利要求4所述的方法,所述利用训练数据获取图像标注信息的支持信息列表包括:The method of claim 4, wherein the obtaining the support information list of the image annotation information by using the training data comprises:
    获取所述图像标注信息的中间数据;Obtaining intermediate data of the image annotation information;
    从图像数据库中提取与所述中间数据相关的训练数据;Extracting training data related to the intermediate data from an image database;
    计算所述训练数据与所述中间数据的第一相关性分值;Calculating a first relevance score of the training data and the intermediate data;
    利用所述第一相关性分值生成所述图像标注信息的支持信息列表。Generating a support information list of the image annotation information by using the first correlation score.
  6. 如权利要求5所述的方法,所述计算训练数据与所述中间数据的相关性分值包括:The method of claim 5, wherein the calculating the relevance score of the training data and the intermediate data comprises:
    计算训练数据与所述中间数据的相关性权值并求和E1;Calculating a correlation weight of the training data and the intermediate data and summing E1;
    将所有训练数据与所述中间数据的相关性权值进行累加处理并求和E2; All the training data and the intermediate data correlation weights are cumulatively processed and summed E2;
    通过计算所述E2与所述E1的比值确定该训练数据与所述中间数据的第一相关性分值。A first correlation score of the training data and the intermediate data is determined by calculating a ratio of the E2 to the E1.
  7. 如权利要求6所述的方法,还包括:The method of claim 6 further comprising:
    在确定训练数据与中间数据的第一相关性分值之后进行所述训练数据的去噪处理。Denoising processing of the training data is performed after determining a first correlation score of the training data and the intermediate data.
  8. 如权利要求7所述的方法,所述进行训练数据的去噪处理包括:The method of claim 7, wherein the performing the denoising processing of the training data comprises:
    计算与任一训练数据相同步长的其他训练数据与所有训练数据的相关性权值,将该相关性权值进行累加处理并求和后确定该任一训练数据的噪声权值F1;Calculating a correlation weight of the other training data that is synchronous with any training data and all the training data, accumulating the correlation weights and summing the noise weight F1 of the any training data;
    将所有训练数据的噪声进行累加处理并求和后确定所有训练数据的总噪声权值F2;The noise of all the training data is accumulated and summed to determine the total noise weight F2 of all the training data;
    通过获取所述第一相关性分值与所述训练数据的噪声值之差确定所述训练数据与所述中间数据的第二相关性分值;其中,所述F1与所述F2的比值为所述训练数据的噪声值。Determining, by acquiring a difference between the first correlation score and a noise value of the training data, a second relevance score of the training data and the intermediate data; wherein, a ratio of the F1 to the F2 is The noise value of the training data.
  9. 如权利要求5所述的方法,所述从支持信息列表中提取所述图像的主体信息包括:The method of claim 5, the extracting the body information of the image from the support information list comprises:
    获取所述图像标注信息的所有中间数据及其相关训练数据;Obtaining all intermediate data of the image annotation information and related training data;
    通过统计相同训练数据在所述支持信息列表中的分值计算每个中间数据的得分;Calculating the score of each intermediate data by counting the scores of the same training data in the support information list;
    判断所述每个中间数据的得分与预设阈值的大小,当一中间数据的得分不小于所述预设阈值,则确定该中间数据为所述图像的主体信息。Determining the score of each of the intermediate data and the size of the preset threshold. When the score of an intermediate data is not less than the preset threshold, determining that the intermediate data is the body information of the image.
  10. 一种挖掘图像主体信息的方法,包括:A method for mining image body information, comprising:
    获取图像及其标注信息;Obtain images and their annotation information;
    利用训练数据获取所述图像标注信息的支持信息列表;Obtaining a support information list of the image annotation information by using training data;
    从所述支持信息列表中提取所述图像的主体信息。The body information of the image is extracted from the list of support information.
  11. 如权利要求10所述的方法,所述利用训练数据获取图像标注信息的支持信息列表包括:The method of claim 10, wherein the obtaining the support information list of the image annotation information by using the training data comprises:
    获取所述图像标注信息的中间数据;Obtaining intermediate data of the image annotation information;
    从图像数据库中提取与所述中间数据相关的训练数据;Extracting training data related to the intermediate data from an image database;
    计算所述训练数据与所述中间数据的第一相关性分值;Calculating a first relevance score of the training data and the intermediate data;
    利用所述第一相关性分值生成所述图像标注信息的支持信息列表。Generating a support information list of the image annotation information by using the first correlation score.
  12. 如权利要求11所述的方法,所述计算训练数据与所述中间数 据的第一相关性分值包括:The method of claim 11 wherein said calculating training data and said intermediate number The first relevance scores are:
    计算训练数据与所述中间数据的相关性权值并求和E1;Calculating a correlation weight of the training data and the intermediate data and summing E1;
    将所有训练数据与所述中间数据的相关性权值进行累加处理并求和E2;All the training data and the intermediate data correlation weights are cumulatively processed and summed E2;
    通过计算所述E2与所述E1的比值确定该训练数据与所述中间数据的第一相关性分值。A first correlation score of the training data and the intermediate data is determined by calculating a ratio of the E2 to the E1.
  13. 如权利要求11所述的方法,还包括:The method of claim 11 further comprising:
    在确定训练数据与中间数据的第一相关性分值之后进行所述训练数据的去噪处理。Denoising processing of the training data is performed after determining a first correlation score of the training data and the intermediate data.
  14. 如权利要求13所述的方法,所述进行训练数据的去噪处理包括:The method of claim 13, wherein the performing the denoising processing of the training data comprises:
    计算与任一训练数据相同步长的其他训练数据与所有训练数据的相关性权值,将该相关性权值进行累加处理并求和后确定该任一训练数据的噪声权值F1;Calculating a correlation weight of the other training data that is synchronous with any training data and all the training data, accumulating the correlation weights and summing the noise weight F1 of the any training data;
    将所有训练数据的噪声进行累加处理并求和后确定所有训练数据的总噪声权值F2;The noise of all the training data is accumulated and summed to determine the total noise weight F2 of all the training data;
    通过获取所述第一相关性分值与所述训练数据的噪声值之差确定所述训练数据与所述中间数据的第二相关性分值;其中,所述F1与所述F2的比值为所述训练数据的噪声值。Determining, by acquiring a difference between the first correlation score and a noise value of the training data, a second relevance score of the training data and the intermediate data; wherein, a ratio of the F1 to the F2 is The noise value of the training data.
  15. 如权利要求11所述的方法,其特征在于,所述从支持信息列表中提取所述图像的主体信息包括:The method according to claim 11, wherein the extracting the body information of the image from the support information list comprises:
    获取所述图像标注信息的所有中间数据及其相关训练数据;Obtaining all intermediate data of the image annotation information and related training data;
    通过统计相同训练数据在所述支持信息列表中的分值计算每个中间数据的得分;Calculating the score of each intermediate data by counting the scores of the same training data in the support information list;
    判断所述每个中间数据的得分与预设阈值的大小,当一中间数据的得分不小于所述预设阈值,则确定该中间数据为所述图像的主体信息。Determining the score of each of the intermediate data and the size of the preset threshold. When the score of an intermediate data is not less than the preset threshold, determining that the intermediate data is the body information of the image.
  16. 一种图像识别的装置,包括:An image recognition device comprising:
    查找模块,设置为获取待识别图像,并根据图像相似度查找与其相似的N张其他图像;a search module, configured to acquire an image to be recognized, and find N other images similar to the image according to the image similarity;
    排序模块,设置为获取所述查找模块查找到的其他图像对应的主体信息以及根据相似度排序确定每张其他图像的权值,a sorting module, configured to obtain body information corresponding to other images found by the searching module, and determine weights of each other image according to similarity ordering,
    计算模块,设置为根据各主体信息及对应图像的权值,分别对各主 体信息进行权值累加计算,The calculation module is set to respectively according to the weight information of each subject information and the corresponding image The body information is calculated by adding weights,
    识别模块,设置为提取最大累加值对应的主体信息,作为待识别图像的主体信息。The identification module is configured to extract body information corresponding to the maximum accumulated value as the body information of the image to be identified.
  17. 如权利要求16所述的装置,所述查找模块包括:The apparatus of claim 16, the lookup module comprising:
    接收图像识别请求的接收模块,及a receiving module that receives an image recognition request, and
    从所述接收模块接收到的图像识别请求中提取出待识别图像的提取模块。An extraction module that extracts an image to be recognized from an image recognition request received by the receiving module.
  18. 如权利要求16所述的装置,所述查找模块包括:The apparatus of claim 16, the lookup module comprising:
    索引模块,设置为通过图片相似特征建成倒排索引;An indexing module, configured to build an inverted index by using similar features of the image;
    比较模块,将待识别的图像去进行相似检索,获取与其相似的N张其他图像。The comparison module performs the similar retrieval on the image to be identified, and acquires N other images similar thereto.
  19. 如权利要求16所述的装置,所述排序模块包括:The apparatus of claim 16, the ordering module comprising:
    获取模块,设置为获取图像及其标注信息;Obtain a module, set to obtain an image and its annotation information;
    生成模块,设置为利用训练数据获取所述图像标注信息的支持信息列表;Generating a module, configured to obtain a support information list of the image annotation information by using training data;
    提取模块,设置为从所述支持信息列表中提取所述图像的主体信息。An extraction module is configured to extract body information of the image from the list of support information.
  20. 如权利要求19所述的装置,所述生成模块包括:The apparatus of claim 19, the generating module comprising:
    第一处理模块,设置为获取所述获取模块获取到的图像标注信息的中间数据;a first processing module, configured to acquire intermediate data of the image annotation information acquired by the acquiring module;
    第二处理模块,设置为从图像数据库中提取与所述中间数据相关的训练数据;a second processing module configured to extract training data related to the intermediate data from an image database;
    第三处理模块,设置为计算所述训练数据与所述中间数据的第一相关性分值;a third processing module, configured to calculate a first correlation score of the training data and the intermediate data;
    第四处理模块,设置为利用所述第一相关性分值生成所述图像标注信息的支持信息列表。The fourth processing module is configured to generate a support information list of the image annotation information by using the first correlation score.
  21. 如权利要求20所述的装置,所述第三处理模块包括:The apparatus of claim 20, the third processing module comprising:
    第一计算器,设置为计算训练数据与所述中间数据的相关性权值并求和E1;a first calculator, configured to calculate a correlation weight of the training data and the intermediate data and sum E1;
    第二计算器,设置为将所有训练数据与所述中间数据的相关性权值进行累加处理并求和E2;a second calculator, configured to accumulate the correlation weights of all the training data and the intermediate data and sum E2;
    第三计算器,设置为通过计算所述E2与所述E1的比值确定该训 练数据与所述中间数据的第一相关性分值。a third calculator configured to determine the training by calculating a ratio of the E2 to the E1 A first correlation score of the training data and the intermediate data.
  22. 如权利要求20所述的装置,还包括:The apparatus of claim 20, further comprising:
    去噪模块,设置为所述第三处理模块确定所述第一相关性分值之后进行所述训练数据的去噪处理。And a denoising module, configured to perform denoising processing of the training data after the third processing module determines the first correlation score.
  23. 如权利要求22所述的装置,所述去噪模块包括:The apparatus of claim 22, the denoising module comprising:
    第四计算器,设置为计算与任一训练数据相同步长的其他训练数据与所有训练数据的相关性权值,将该相关性权值进行累加处理并求和后确定该任一训练数据的噪声权值F1;a fourth calculator configured to calculate a correlation weight of the other training data and all the training data that are synchronous with any of the training data, perform the cumulative processing of the correlation weights, and determine the training data after the summation Noise weight F1;
    第五计算器,设置为将所有训练数据的噪声进行累加处理并求和后确定所有训练数据的总噪声权值F2;a fifth calculator, configured to accumulate the noise of all the training data and summed to determine the total noise weight F2 of all the training data;
    第六计算器,设置为通过获取所述第一相关性分值与所述训练数据的噪声值之差确定所述训练数据与所述中间数据的第二相关性分值;其中,所述F1与所述F2的比值为所述训练数据的噪声值。a sixth calculator configured to determine a second relevance score of the training data and the intermediate data by obtaining a difference between the first correlation score and a noise value of the training data; wherein the F1 The ratio to the F2 is the noise value of the training data.
  24. 如权利要求19所述的装置,所述提取模块包括:The apparatus of claim 19, the extraction module comprising:
    第五处理模块,设置为获取所述图像标注信息的所有中间数据及其相关训练数据;a fifth processing module, configured to acquire all intermediate data of the image annotation information and related training data;
    第六处理模块,设置为通过统计相同训练数据在所述支持信息列表中的分值计算每个中间数据的得分;a sixth processing module, configured to calculate a score of each intermediate data by counting scores of the same training data in the support information list;
    第七处理模块,设置为判断所述每个中间数据的得分与预设阈值的大小,当一中间数据的得分不小于所述预设阈值,则确定该中间数据为所述图像的主体信息。The seventh processing module is configured to determine a score of each of the intermediate data and a size of a preset threshold. When a score of the intermediate data is not less than the preset threshold, determining that the intermediate data is the body information of the image.
  25. 一种挖掘图像主体信息的装置,包括:An apparatus for mining image body information, comprising:
    获取模块,设置为获取图像及其标注信息;Obtain a module, set to obtain an image and its annotation information;
    生成模块,设置为利用训练数据获取所述图像标注信息的支持信息列表;Generating a module, configured to obtain a support information list of the image annotation information by using training data;
    提取模块,设置为从所述支持信息列表中提取所述图像的主体信息。An extraction module is configured to extract body information of the image from the list of support information.
  26. 如权利要求25所述的装置,其特征在于,所述生成模块包括:The device of claim 25, wherein the generating module comprises:
    第一处理模块,设置为获取所述获取模块获取到的图像标注信息的中间数据;a first processing module, configured to acquire intermediate data of the image annotation information acquired by the acquiring module;
    第二处理模块,设置为从图像数据库中提取与所述中间数据相关的训练数据; a second processing module configured to extract training data related to the intermediate data from an image database;
    第三处理模块,设置为计算所述训练数据与所述中间数据的第一相关性分值;a third processing module, configured to calculate a first correlation score of the training data and the intermediate data;
    第四处理模块,设置为利用所述第一相关性分值生成所述图像标注信息的支持信息列表。The fourth processing module is configured to generate a support information list of the image annotation information by using the first correlation score.
  27. 如权利要求26所述的装置,其特征在于,所述第三处理模块包括:The device of claim 26, wherein the third processing module comprises:
    第一计算器,设置为计算训练数据与所述中间数据的相关性权值并求和E1;a first calculator, configured to calculate a correlation weight of the training data and the intermediate data and sum E1;
    第二计算器,设置为将所有训练数据与所述中间数据的相关性权值进行累加处理并求和E2;a second calculator, configured to accumulate the correlation weights of all the training data and the intermediate data and sum E2;
    第三计算器,设置为通过计算所述E2与所述E1的比值确定该训练数据与所述中间数据的第一相关性分值。And a third calculator configured to determine a first relevance score of the training data and the intermediate data by calculating a ratio of the E2 to the E1.
  28. 如权利要求26所述的装置,其特征在于,还包括:The device of claim 26, further comprising:
    去噪模块,设置为所述第三处理模块确定所述第一相关性分值之后进行所述训练数据的去噪处理。And a denoising module, configured to perform denoising processing of the training data after the third processing module determines the first correlation score.
  29. 如权利要求28所述的装置,所述去噪模块包括:The apparatus of claim 28, the denoising module comprising:
    第四计算器,设置为计算与任一训练数据相同步长的其他训练数据与所有训练数据的相关性权值,将该相关性权值进行累加处理并求和后确定该任一训练数据的噪声权值F1;a fourth calculator configured to calculate a correlation weight of the other training data and all the training data that are synchronous with any of the training data, perform the cumulative processing of the correlation weights, and determine the training data after the summation Noise weight F1;
    第五计算器,设置为将所有训练数据的噪声进行累加处理并求和后确定所有训练数据的总噪声权值F2;a fifth calculator, configured to accumulate the noise of all the training data and summed to determine the total noise weight F2 of all the training data;
    第六计算器,设置为通过获取所述第一相关性分值与所述训练数据的噪声值之差确定所述训练数据与所述中间数据的第二相关性分值;其中,所述F1与所述F2的比值为所述训练数据的噪声值。a sixth calculator configured to determine a second relevance score of the training data and the intermediate data by obtaining a difference between the first correlation score and a noise value of the training data; wherein the F1 The ratio to the F2 is the noise value of the training data.
  30. 如权利要求25所述的装置,所述提取模块包括:The apparatus of claim 25, the extraction module comprising:
    第五处理模块,设置为获取所述图像标注信息的所有中间数据及其相关训练数据;a fifth processing module, configured to acquire all intermediate data of the image annotation information and related training data;
    第六处理模块,设置为通过统计相同训练数据在所述支持信息列表中的分值计算每个中间数据的得分;a sixth processing module, configured to calculate a score of each intermediate data by counting scores of the same training data in the support information list;
    第七处理模块,设置为判断所述每个中间数据的得分与预设阈值的大小,当一中间数据的得分不小于所述预设阈值,则确定该中间数据为所述图像的主体信息。 The seventh processing module is configured to determine a score of each of the intermediate data and a size of a preset threshold. When a score of the intermediate data is not less than the preset threshold, determining that the intermediate data is the body information of the image.
  31. 一种计算机程序,包括计算机可读代码,当所述计算机可读代码在计算设备上运行时,导致所述计算设备执行根据权利要求1-15中的任一个所述方法。A computer program comprising computer readable code that, when executed on a computing device, causes the computing device to perform the method of any of claims 1-15.
  32. 一种计算机可读介质,其中存储了如权利要求31所述的计算机程序。 A computer readable medium storing the computer program of claim 31.
PCT/CN2014/087954 2013-11-15 2014-09-30 Image recognition method, and method and device for mining main body information about image WO2015070678A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN201310575290.8 2013-11-15
CN201310575292.7 2013-11-15
CN201310575290.8A CN103631889B (en) 2013-11-15 2013-11-15 Image recognizing method and device
CN201310575292.7A CN103631890B (en) 2013-11-15 2013-11-15 Method and device for mining image principal information

Publications (1)

Publication Number Publication Date
WO2015070678A1 true WO2015070678A1 (en) 2015-05-21

Family

ID=53056741

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/087954 WO2015070678A1 (en) 2013-11-15 2014-09-30 Image recognition method, and method and device for mining main body information about image

Country Status (1)

Country Link
WO (1) WO2015070678A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111061890A (en) * 2019-12-09 2020-04-24 腾讯云计算(北京)有限责任公司 Method for verifying labeling information, method and device for determining category
CN111539008A (en) * 2020-05-22 2020-08-14 支付宝(杭州)信息技术有限公司 Image processing method and device for protecting privacy

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020087538A1 (en) * 1998-06-22 2002-07-04 U.S.Philips Corporation Image retrieval system
CN102542067A (en) * 2012-01-06 2012-07-04 上海交通大学 Automatic image semantic annotation method based on scale learning and correlated label dissemination
CN103377381A (en) * 2012-04-26 2013-10-30 富士通株式会社 Method and device for identifying content attribute of image
CN103631889A (en) * 2013-11-15 2014-03-12 北京奇虎科技有限公司 Image recognizing method and device
CN103631890A (en) * 2013-11-15 2014-03-12 北京奇虎科技有限公司 Method and device for mining image principal information

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020087538A1 (en) * 1998-06-22 2002-07-04 U.S.Philips Corporation Image retrieval system
CN102542067A (en) * 2012-01-06 2012-07-04 上海交通大学 Automatic image semantic annotation method based on scale learning and correlated label dissemination
CN103377381A (en) * 2012-04-26 2013-10-30 富士通株式会社 Method and device for identifying content attribute of image
CN103631889A (en) * 2013-11-15 2014-03-12 北京奇虎科技有限公司 Image recognizing method and device
CN103631890A (en) * 2013-11-15 2014-03-12 北京奇虎科技有限公司 Method and device for mining image principal information

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111061890A (en) * 2019-12-09 2020-04-24 腾讯云计算(北京)有限责任公司 Method for verifying labeling information, method and device for determining category
CN111061890B (en) * 2019-12-09 2023-04-07 腾讯云计算(北京)有限责任公司 Method for verifying labeling information, method and device for determining category
CN111539008A (en) * 2020-05-22 2020-08-14 支付宝(杭州)信息技术有限公司 Image processing method and device for protecting privacy
CN111539008B (en) * 2020-05-22 2023-04-11 蚂蚁金服(杭州)网络技术有限公司 Image processing method and device for protecting privacy

Similar Documents

Publication Publication Date Title
US10268703B1 (en) System and method for associating images with semantic entities
US10394878B2 (en) Associating still images and videos
US9372920B2 (en) Identifying textual terms in response to a visual query
US9753951B1 (en) Presenting image search results
CN104899322B (en) Search engine and implementation method thereof
JP6759844B2 (en) Systems, methods, programs and equipment that associate images with facilities
WO2019218514A1 (en) Method for extracting webpage target information, device, and storage medium
US8892554B2 (en) Automatic word-cloud generation
US11461386B2 (en) Visual recognition using user tap locations
US8868569B2 (en) Methods for detecting and removing duplicates in video search results
US20100185691A1 (en) Scalable semi-structured named entity detection
JP2017157192A (en) Method of matching between image and content item based on key word
JP2013541793A (en) Multi-mode search query input method
WO2016000555A1 (en) Methods and systems for recommending social network-based content and news
JP2017220205A (en) Method and system for dynamically rankings images to be matched with content in response to search query
US9218366B1 (en) Query image model
CN109241319B (en) Picture retrieval method, device, server and storage medium
CN113806660A (en) Data evaluation method, training method, device, electronic device and storage medium
WO2015070678A1 (en) Image recognition method, and method and device for mining main body information about image
US20190121911A1 (en) Cognitive content suggestive sharing and display decay
CN107423294A (en) A kind of community image search method and system
CN107577667B (en) Entity word processing method and device
CN113657116B (en) Social media popularity prediction method and device based on visual semantic relationship
GB2608112A (en) System and method for providing media content
TW201416890A (en) Method and system for providing article information

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14862010

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14862010

Country of ref document: EP

Kind code of ref document: A1