WO2015172721A1 - Method and device for searching and ranking images and providing image search - Google Patents

Method and device for searching and ranking images and providing image search Download PDF

Info

Publication number
WO2015172721A1
WO2015172721A1 PCT/CN2015/078881 CN2015078881W WO2015172721A1 WO 2015172721 A1 WO2015172721 A1 WO 2015172721A1 CN 2015078881 W CN2015078881 W CN 2015078881W WO 2015172721 A1 WO2015172721 A1 WO 2015172721A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
images
module
family
webpage
Prior art date
Application number
PCT/CN2015/078881
Other languages
French (fr)
Chinese (zh)
Inventor
陶哲
Original Assignee
北京奇虎科技有限公司
奇智软件(北京)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from CN201410203342.3A external-priority patent/CN103995856B/en
Priority claimed from CN201410203700.0A external-priority patent/CN103995857A/en
Application filed by 北京奇虎科技有限公司, 奇智软件(北京)有限公司 filed Critical 北京奇虎科技有限公司
Publication of WO2015172721A1 publication Critical patent/WO2015172721A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor

Definitions

  • the present invention relates to the technical field of image data processing, and in particular, to a method and apparatus for implementing image search sorting, and a method and apparatus for providing image search.
  • the search engine is a software system applied on the network, which can be certain.
  • the method realizes the search and discovery of information on the network, and displays the search result after processing the searched information.
  • search results that can be provided to users are no longer just the search results of text information searched according to user input commands, but also can search for network images according to user needs, and will search out The results of the picture are presented to the user.
  • the present invention has been made in order to provide a method for realizing image search sorting and a corresponding apparatus for realizing image search sorting that overcomes the above problems or at least partially solves the above problems.
  • the present invention also provides a method of providing image search and a corresponding apparatus for providing image search.
  • a method for implementing image search ordering including: creating a plurality of image families corresponding to a plurality of source images; calculating a reference weight value of each image family; and using reference weights of each image family The size of the search results as a parameter for the search results of the search query feedback.
  • a method for providing an image search includes: after receiving an image query request, screening a plurality of image families associated with the query request; searching for each image family in the image family Corresponding source image and reference weight value of each image family; according to the order of the reference weights, the source images in each image family are sorted to draw search results corresponding to the query request.
  • an apparatus for implementing image search ordering comprising: a creating unit adapted to create a family of images corresponding to a plurality of source images; and a calculating unit adapted to calculate a reference weight of each image family And a sorting unit adapted to be a parameter for sorting the search results fed back by the search query according to the size of the reference weight of each image family.
  • an apparatus for providing an image search comprising: a screening unit adapted to After receiving the image query request, filtering a plurality of image families related to the query request; the searching unit is configured to search for a source image corresponding to each image family in the image family filtered by the screening unit, and each image family a reference weight; a rendering unit, adapted to receive the search result of the search unit, and sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
  • a program comprising readable code, when said readable code is run on a computing device, causing said computing device to perform image search ordering as described in an embodiment of the present invention method.
  • a program comprising readable code, when the readable code is run on a computing device, causing the computing device to perform the method of providing image search as described in an embodiment of the present invention
  • the embodiment of the present invention creates a plurality of image groups corresponding to the source image, and calculates a reference weight value of each image family, and then uses the size of the reference weight of each image family as a parameter for sorting the search results of the search query feedback. It can obtain more accurate and accurate sorting results, and make the image sorting result have priority order in the number of citations, which greatly improves the accuracy of the search and effectively improves the search efficiency.
  • the embodiment of the present invention creates a plurality of image groups corresponding to the source image, and calculates a reference weight value of each image family, and then uses the size of the reference weight of each image family as a parameter for sorting the search results of the search query feedback. It can obtain more accurate and accurate sorting results, and make the image sorting result have priority order in the number of citations, which greatly improves the accuracy of the search and effectively improves the search efficiency.
  • FIG. 1 is a flow chart showing the steps of a method for implementing image search ordering according to an embodiment of the present invention
  • FIG. 2 is a flow chart showing another method step of implementing image search sorting according to an embodiment of the present invention
  • FIG. 3 is a flow chart showing the steps of a method for providing image search according to an embodiment of the present invention.
  • FIG. 4 illustrates a flow chart of another method of providing image search in accordance with one embodiment of the present invention
  • FIG. 5 is a block diagram showing the structure of an apparatus for performing image search sorting according to an embodiment of the present invention.
  • FIG. 6 is a block diagram showing another structure of an apparatus for performing image search sorting according to an embodiment of the present invention.
  • FIG. 7 is a structural block diagram of another acquisition module in an apparatus for performing image search sorting according to an embodiment of the present invention.
  • FIG. 8 is a block diagram showing the structure of an apparatus for providing image search according to an embodiment of the present invention.
  • FIG. 9 is a block diagram showing another structure of an apparatus for providing image search according to an embodiment of the present invention.
  • FIG. 10 is a block diagram showing the structure of a pre-acquisition module in another apparatus for providing image search according to an embodiment of the present invention.
  • Figure 11 shows a block diagram of a computing device for performing an image search method in accordance with the present invention
  • Fig. 12 shows a storage unit for holding or carrying program code implementing the image search method according to the present invention.
  • Embodiment 1 of the method for performing image search sorting according to an embodiment of the present invention is shown, which may specifically include the following steps:
  • Step 110 Create an image family corresponding to multiple source images.
  • the same image family refers to images that are visually consistent from human beings. These images are images modified from a source image, since multiple images in the image family are modified by a source image. Therefore, each image in an image family should have the same source image; based on this, in the embodiment, it is proposed that the webpage corresponding to the source image is captured from the resource site by step S111, and the webpage is obtained by parsing the webpage.
  • the plurality of images corresponding to the source image; since each image in the same image family is derived from the same source image, after acquiring a plurality of images, the transmission between the multiple images corresponding to the source image may be acquired by step S112: Relationship, and using the propagation relationship between the plurality of images to create a plurality of image families.
  • the propagation relationship between multiple images corresponding to the source image is mainly included: reprinting, copying, and modifying, but is not limited thereto, and may have other propagation relationships. This is not described again; specifically, the embodiment obtains the propagation relationship by:
  • the corresponding relationship between the webpage URL (Uniform Resource Locator) and the plurality of image URLs is obtained by parsing the webpage page; Wherein, if the plurality of webpage URLs correspond to the same image URL, determining that the plurality of webpages including the image and the image are reprinted; or
  • Step 120 Calculate a reference weight of each image family
  • the value of different propagation relationships is different.
  • the value of the above three propagation relationships may be: modification > copy > reprint; wherein, the workload required for modification is greater than the simple preservation, The cost of saving the image and then providing the image service is greater than the reprint behavior; therefore, the cost means that the value of each image is different, that is, the basis weight of each propagation relationship; at the same time, after analysis, it is known that the difference is different.
  • the value of the image referenced by the site is also different. The site with a large amount of access has a large value of the image. Therefore, the site weight parameter is set in this embodiment. Specifically, this embodiment proposes to calculate each image by the following method. Family weights, including but not limited to:
  • the size of the modified relationship weight, the copy relationship weight, and the retransmission relationship weight are successively decremented; here, a formula is calculated to calculate the reference weight of the image family as follows;
  • a is the modification relationship weight
  • b is the replication relationship weight
  • c is the retransmission relationship weight
  • a>b>c is the weight of the site channel.
  • Step 130 According to the size of the reference weight of each image family, as a parameter of the search result feedback of the search query feedback.
  • the search as the feedback of the search query may be determined according to the size of the reference weight of each image family.
  • the sorted parameters of the result for example, the sorting of the search results is proportional to the size of the reference weights of the respective image families; of course, those skilled in the art can easily understand that the reference weights of the image families are used as search results.
  • the other parameters may be introduced as the sorting reference, which is not specifically limited in this embodiment.
  • Step 210 Create an image family including Figure A, Figure B, Figure C, Figure D, Figure E, and Figure F;
  • Step 220 Grab the webpage 7 to the webpage 14 from the resource site
  • Step 230 Parsing the webpage 7 to the webpage 14 to obtain a correspondence between the image url and the webpage url, that is, (picture url, webpage url) is: (A, 13), (B, 14), (C, 11), (D, 12), (F, 10), (E, 7), (E, 8), (E, 9); wherein the picture E corresponds to the web page 7, the web page 8, and the web page 9, so the web page 7, the web page 8 and the image in the web page 9 and the image E are reproduced relationships;
  • Step 240 After calculating the md5 values of the pictures A to F, it is known that the picture b, the picture E, and the md5 of the picture F are the same, so it can be determined that the picture B, the picture E, and the picture F are in a replication relationship;
  • Step 250 Perform an "approximate copy" calculation on picture A, picture B, picture C, and picture D having different md5 values, and determine that picture A, picture B, picture C, and picture D are an approximate copy, thereby determining picture A. , picture B, picture C and picture D are modified relationships;
  • Step 270 The size of the reference weight R of each image family is used as a parameter for sorting the search results fed back by the search query.
  • Embodiment 2 of the method for providing image search according to an embodiment of the present invention is shown, which may specifically include the following steps:
  • Step 310 After receiving the image query request, screening a plurality of image families related to the query request;
  • the method provided by the embodiment may further include: a step of pre-creating a plurality of image families corresponding to the source image and a step of calculating a reference weight for each image family; wherein the same image family refers to images that are visually consistent from a person, the images being a source image
  • the same image family refers to images that are visually consistent from a person, the images being a source image
  • the modified image since a plurality of images in the image family are modified by a source image, each image in one image family should have the same source image; based on this, in the present embodiment, a step can be adopted.
  • step S311 Grab the webpage corresponding to the source image from the resource site, and obtain multiple images corresponding to the source image by parsing the webpage page; since each image in the same image family is derived from the same source image, After the image is uploaded, the propagation relationship between the plurality of images corresponding to the source image may be acquired through step S312, and a plurality of image families are established by using the propagation relationship between the plurality of images.
  • the propagation relationship between multiple images corresponding to the source image is mainly included: reprinting, copying, and modifying, but is not limited thereto, and may have other propagation relationships.
  • the embodiment obtains the propagation relationship by: A.
  • the webpage URL is obtained by parsing the webpage page and Corresponding relationship between the plurality of image URLs; wherein, if the plurality of webpage URLs correspond to the same image URL, determining that the plurality of webpages including the image are in a reprint relationship with the image; or, B, grasping the source from the resource site
  • the plurality of images are obtained by parsing the webpage, and the information summary MD5 value of the plurality of images is calculated, wherein if the MD5 values of the multiple images are the same, it is determined that the plurality of images of the same MD5 are between To copy the relationship; otherwise, it is determined whether the multiple images are the same approximate copy, and if so, it is determined that the plurality of images having different MD5 values are modified.
  • the image family can be pre-created by other means, and the details are not described herein again.
  • the value of different propagation relationships is different.
  • the value of the above three propagation relationships may be: modification > copy > reprint; wherein the amount of work required for modification is greater than simple preservation, and the same image is saved. Then the cost of providing the image service is greater than the reprint behavior; therefore, this cost means that the value of each image is different, that is, the basis weight of each propagation relationship; at the same time, after analysis, it is known that different sites refer to The value of the image is also different. The site with a large amount of access has a large value of the image. Therefore, the site weight parameter is set in this embodiment. Specifically, this embodiment proposes to calculate the reference of each image family by the following method.
  • the weight including but not limited to: preset the weight of the resource site and different propagation relationships; wherein, if the propagation relationship includes reprinting, copying, and modifying, the weight relationship between the three is modified > copy > reprint , that is, setting the modified relationship weight, the copy relationship weight, and the resale relationship weight in the propagation relationship are successively decremented;
  • the formula weights reference image group as follows:
  • a is the modification relationship weight
  • b is the replication relationship weight
  • c is the retransmission relationship weight
  • a>b>c is the retransmission relationship weight
  • Step 320 Find a source image corresponding to each image family in the image family and a reference weight of each image family;
  • Step 330 Sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
  • the reference weights are first sorted, and then the matching between the source images is obtained by using a matching relationship between the source image corresponding to the image family and the reference weight of the image family, where the source image is
  • the ordering of the references is the same as the order of the weights of the reference weights, and then the ordering between the source images is one of the basis for drawing the search results of the query request; of course, those skilled in the art can easily understand that Other parameters may be added to the search result, which is not specifically limited in this embodiment.
  • Step 410 Receive an image query request, and filter an image family related to the query request; the image family includes a map A ⁇ , a diagram B ⁇ , a diagram C ⁇ , a diagram D ⁇ , a diagram E ⁇ , and a diagram F ⁇ ;
  • Step 420 Find a source image in the image family as Figure A ⁇ ;
  • Step 430 Grab the webpage 7 ⁇ page 14' from the resource site; parse the webpage 7 ⁇ page 14', and obtain the correspondence between the image url and the webpage url, that is, (picture url, webpage url) is: (A ⁇ , 13 ⁇ ), (B ⁇ , 14 ⁇ ), (C ⁇ , 11 ⁇ ), (D ⁇ , 12 ⁇ ), (F ⁇ , 10 ⁇ ), (E ⁇ , 7 ⁇ ), (E ⁇ , 8 ), (E ⁇ , 9 ⁇ ); wherein the picture E ⁇ corresponds to the web page 7 ⁇ , the web page 8' and the web page 9', so the image in the web page 7', the web page 8' and the web page 9' and the image E ⁇ is the reprint relationship;
  • Step 440 After calculating the md5 value of the picture A' ⁇ F', it is known that the picture B ⁇ , the picture E ⁇ , and the picture f'md5 are the same, so the picture B', the picture E', and the picture F' can be determined. Inter-replication relationship;
  • Step 450 Perform an "approximate copy" calculation on the picture A ⁇ , the picture B', the picture C', and the picture D' having different md5 values, and determine that the picture A', the picture B', the picture C', and the picture D' are an approximation. Copy, which determines the picture The relationship between A ⁇ , picture B ⁇ , picture C ⁇ and picture D ⁇ is modified;
  • the method may further include: a creating unit 510, configured to create an image family corresponding to multiple source images;
  • the calculating unit 520 is adapted to calculate a reference weight value of each image family;
  • the sorting unit 530 is adapted to use the size of the reference weight value of each image family as a parameter of the search result feedback of the search query feedback.
  • the method may include the following module: a creating unit 610, configured to create a family of images corresponding to multiple source images.
  • the calculation unit 620 is adapted to calculate a reference weight value of each image family;
  • the sorting unit 630 is adapted to use the size of the reference weight value of each image family as a parameter of the search result feedback of the search query feedback.
  • the creating unit 610 includes: a crawling module 6102, configured to capture the source image pair from a resource site.
  • the image processing module 6104 is configured to acquire a plurality of images corresponding to the source image by parsing the webpage page captured by the crawling module, and the acquiring module 6106 is configured to acquire the multiple images corresponding to the source image.
  • the propagation relationship; the family building module 6108 is adapted to establish a plurality of image families by using the propagation relationship between the plurality of images.
  • the method may further include the following modules: the obtaining module 6106 in the embodiment may further include The first processing module 610602 is configured to receive the analysis result of the parsing module, and obtain a correspondence between the webpage uniform resource locator URL and the image URL according to the parsing result; the first comparing module 610604 is adapted to compare the Corresponding relationship between the plurality of webpage URLs and the plurality of image URLs, and when the plurality of webpage URLs correspond to the same image URL, determining that the plurality of webpages including the image and the image are reprinted relationships.
  • the obtaining module 6106 further includes: a second processing module 610606, configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module; and a second comparing module 610608, configured to compare the plurality of images The MD5 value, and when the MD5 values of the plurality of images are the same, it is determined that the plurality of images having the same MD5 are in a copy relationship.
  • a second processing module 610606 configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module
  • a second comparing module 610608 configured to compare the plurality of images The MD5 value, and when the MD5 values of the plurality of images are the same, it is determined that the plurality of images having the same MD5 are in a copy relationship.
  • the obtaining module further includes: a third processing module 610610, configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module; and a third comparing module 610612, configured to compare the plurality of The MD5 value of the image, and when the MD5 values of the plurality of images are different, whether the plurality of images having different MD5 values are modified by the approximate copying manner.
  • a third processing module 610610 configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module
  • a third comparing module 610612 configured to compare the plurality of The MD5 value of the image, and when the MD5 values of the plurality of images are different, whether the plurality of images having different MD5 values are modified by the approximate copying manner.
  • the calculating unit 620 may further include: a setting module 6202, configured to preset a resource site of the crawling module to capture a webpage, and the different propagation relationship acquired by the acquiring module.
  • the weighting ratio matching module 6204 is adapted to calculate the reference weight of the image family by using the resource site in the same image family and the different propagation relationship weights.
  • FIG. 8 is a block diagram showing an embodiment of an apparatus for providing an image search according to an embodiment of the present invention.
  • the method may include the following module: a screening unit 810, configured to: after receiving an image query request, screening and a plurality of image families related to the query request; the searching unit 820 is adapted to search for a source image corresponding to each image family in the image family filtered by the filtering unit 810 and a reference weight of each image family; the drawing unit 830, The method is adapted to receive the search result of the searching unit 820, and sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
  • the method may include a module: a pre-creation unit 940, which is pre-configured to create an image corresponding to multiple source images.
  • a pre-computation unit 950 adapted to calculate a reference weight for each image family.
  • the filtering unit 910 is configured to: after receiving the image query request, filter a plurality of image families related to the query request; the searching unit 920 is configured to search for each image family in the image family selected by the screening unit 910. Source image and reference weight for each image family; drawing
  • the unit 930 is adapted to receive the search result of the searching unit 920, and sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
  • the pre-creation unit 940 includes: a pre-fetching module 9402, configured to capture a webpage corresponding to the source image from a resource site; and a pre-parsing module 9404, configured to parse the webpage page captured by the crawling module. Acquiring a plurality of images corresponding to the source image; the pre-acquisition module 9406 is adapted to acquire a propagation relationship between the plurality of images corresponding to the source image; and the pre-built family module 9408 is adapted to utilize the propagation between the multiple images Relationships establish multiple image families.
  • FIG. 10 is a block diagram showing a structure of a pre-acquisition module in an apparatus for providing an image search according to an embodiment of the present invention.
  • the pre-acquisition module 9406 may also be used in this embodiment.
  • the first pre-comparison module 940604 is adapted to receive the parsing result of the parsing module, and obtain the correspondence between the webpage uniform resource locator URL and the image URL according to the parsing result; the first pre-comparison module 940604 is adapted to Comparing the correspondence between the plurality of webpage URLs and the plurality of image URLs, and when the plurality of webpage URLs correspond to the same image URL, determining that the plurality of webpages including the image and the image are reprinted relationships.
  • the pre-acquisition module 9406 further includes: a second pre-processing module 940606, configured to calculate an information digest MD5 value of the plurality of images parsed by the parsing module; and a second pre-comparison module 940608, configured to compare the plurality of The MD5 value of the image, and when the MD5 values of the plurality of images are the same, it is determined that the plurality of images having the same MD5 are in a copy relationship.
  • a second pre-processing module 940606 configured to calculate an information digest MD5 value of the plurality of images parsed by the parsing module
  • a second pre-comparison module 940608 configured to compare the plurality of The MD5 value of the image, and when the MD5 values of the plurality of images are the same, it is determined that the plurality of images having the same MD5 are in a copy relationship.
  • the pre-acquisition module further includes: a third pre-processing module 940610, configured to calculate an information digest MD5 value of the plurality of images parsed by the parsing module; and a third pre-comparison module 940612, suitable for comparing the The MD5 value of the plurality of images is described, and when the MD5 values of the plurality of images are different, whether the plurality of images having different MD5 values are modified by the approximate copying manner is determined.
  • a third pre-processing module 940610 configured to calculate an information digest MD5 value of the plurality of images parsed by the parsing module
  • a third pre-comparison module 940612 suitable for comparing the The MD5 value of the plurality of images is described, and when the MD5 values of the plurality of images are different, whether the plurality of images having different MD5 values are modified by the approximate copying manner is determined.
  • the pre-calculation unit 950 may further include: a pre-setting module 9502, configured to preset a resource site of the crawling module to capture a webpage, and the different propagation acquired by the acquiring module.
  • the weighting of the relationship; the pre-matching module 9504 is adapted to calculate the reference weight of the image family by using the resource site in the same image family and the different propagation relationship weights.
  • modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment.
  • the modules or units or components of the embodiments may be combined into one module or unit or component, and further they may be divided into a plurality of sub-modules or sub-units or sub-components.
  • any combination of the features disclosed in the specification, including the accompanying claims, the abstract and the drawings, and any methods so disclosed, or All processes or units of the device are combined.
  • Each feature disclosed in this specification (including the accompanying claims, the abstract and the drawings) may be replaced by alternative features that provide the same, equivalent or similar purpose.
  • the various component embodiments of the present invention may be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof.
  • a microprocessor or digital signal processor may be used in practice to implement some or all of the functionality of some or all of the components of the web page loading device in accordance with embodiments of the present invention.
  • the invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein.
  • a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
  • FIG. 11 illustrates a computing device that can implement an image search method according to the present invention, wherein the image search method includes the method of implementing image search ordering described in the above embodiments, and the above-described embodiments.
  • the computing device conventionally includes a processor 1110 and a program product or readable medium in the form of a memory 1120.
  • the memory 1120 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, or a ROM.
  • the memory 1120 has a function for performing the above
  • the storage space 1130 for program code may include respective program codes 1131 for implementing various steps in the above methods, respectively.
  • These program codes can be read from or written to one or more program products.
  • These program products include program code carriers such as memory cards.
  • Such a program product is typically a portable or fixed storage unit as described with reference to FIG.
  • the storage unit may have a storage segment, a storage space, and the like that are similarly arranged to the storage 1120 in the computing device of FIG.
  • the program code can be compressed, for example, in an appropriate form.
  • the storage unit includes readable code 1131 ', ie, code that can be read by a processor, such as, for example, 1110, which when executed by a computing device causes the computing device to perform various steps in the methods described above .

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A method and device for searching and ranking images and providing image search. The method comprises: creating image clusters corresponding to a plurality of source images (110); calculating a reference weight of each image cluster (120); and taking the size of the reference weight of each image cluster as a ranking parameter of a search result fed back by search query (130). By means of the method and device, more excellent and accurate ranking results can be obtained, and the image ranking results are made to have a priority order in the number of references, thereby greatly improving the search accuracy, and effectively increasing the search efficiency.

Description

图像搜索排序及提供图像搜索的方法和装置Image search sorting and method and apparatus for providing image search 技术领域Technical field
本发明涉及图像数据处理的技术领域,具体涉及一种实现图像搜索排序的方法和装置,以及一种提供图像搜索的方法和装置。The present invention relates to the technical field of image data processing, and in particular, to a method and apparatus for implementing image search sorting, and a method and apparatus for providing image search.
背景技术Background technique
随着互联网和多媒体技术的飞速发展,互联网上的资源也日益丰富,从网络上获取资源也变得越来越容易;搜索引擎即是一种在网络上应用的软件系统,其能以一定的方式在网络上实现信息的搜索和发现,并在对搜索到的信息进行处理后显示出搜索结果。With the rapid development of the Internet and multimedia technologies, the resources on the Internet are becoming more and more abundant, and it is becoming easier to obtain resources from the network. The search engine is a software system applied on the network, which can be certain. The method realizes the search and discovery of information on the network, and displays the search result after processing the searched information.
而目前,随着搜索引擎技术的日益成熟,能够提供给用户的搜索结果已经不再只是根据用户输入命令搜索到的文本信息搜索结果,还可以根据用户需求对网络图片进行搜索,并将搜索出的图片结果呈献给用户。At present, with the increasing maturity of search engine technology, the search results that can be provided to users are no longer just the search results of text information searched according to user input commands, but also can search for network images according to user needs, and will search out The results of the picture are presented to the user.
然而,在目前现有技术的图片搜索方案中,呈献给用户的搜索结果往往没有任何规律,而只是将所有可能相关的图片简单罗列,其图片的搜索结果中并没有任何优先级顺序,这就会使输出的图片搜索结果显示无序状态,进而大大降低了搜索的准确性,从而影响了搜索效率。However, in the current state of the art image search scheme, the search results presented to the user often have no rules, but simply list all possible related images, and the search results of the images do not have any priority order, which is The output image search results will be displayed in an unordered state, which will greatly reduce the accuracy of the search and affect the search efficiency.
发明内容Summary of the invention
鉴于上述问题,提出了本发明以便提供一种克服上述问题或者至少部分地解决上述问题的一种实现图像搜索排序的方法和相应的一种实现图像搜索排序的装置。本发明还提供了一种提供图像搜索的方法和相应的一种提供图像搜索的装置。In view of the above problems, the present invention has been made in order to provide a method for realizing image search sorting and a corresponding apparatus for realizing image search sorting that overcomes the above problems or at least partially solves the above problems. The present invention also provides a method of providing image search and a corresponding apparatus for providing image search.
依据本发明的一个方面,提供了一种实现图像搜索排序的方法,包括:创建多个源图像对应的图像族;计算每个图像族的引用权值;根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数。According to an aspect of the present invention, a method for implementing image search ordering is provided, including: creating a plurality of image families corresponding to a plurality of source images; calculating a reference weight value of each image family; and using reference weights of each image family The size of the search results as a parameter for the search results of the search query feedback.
根据本发明的另一方面,提供了一种提供图像搜索的方法,包括:接收到图像查询请求后,筛选与所述查询请求相关的多个图像族;查找所述图像族中每个图像族对应的源图像及每个图像族的引用权值;根据所述引用权值大小顺序,将各图像族中的源图像进行排序绘制对应所述查询请求的搜索结果。According to another aspect of the present invention, a method for providing an image search includes: after receiving an image query request, screening a plurality of image families associated with the query request; searching for each image family in the image family Corresponding source image and reference weight value of each image family; according to the order of the reference weights, the source images in each image family are sorted to draw search results corresponding to the query request.
根据本发明的另一方面,提供了一种实现图像搜索排序的装置,包括:创建单元,适于创建多个源图像对应的图像族;计算单元,适于计算每个图像族的引用权值;排序单元,适于根据各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数。According to another aspect of the present invention, an apparatus for implementing image search ordering is provided, comprising: a creating unit adapted to create a family of images corresponding to a plurality of source images; and a calculating unit adapted to calculate a reference weight of each image family And a sorting unit adapted to be a parameter for sorting the search results fed back by the search query according to the size of the reference weight of each image family.
根据本发明的另一方面,提供了一种提供图像搜索的装置,包括:筛选单元,适于 在接收到图像查询请求后,筛选与所述查询请求相关的多个图像族;查找单元,适于查找所述筛选单元筛选过的图像族中每个图像族对应的源图像及每个图像族的引用权值;绘制单元,适于接收所述查找单元的查找结果,并根据所述引用权值大小顺序,将各图像族中的源图像进行排序绘制对应所述查询请求的搜索结果。According to another aspect of the present invention, an apparatus for providing an image search is provided, comprising: a screening unit adapted to After receiving the image query request, filtering a plurality of image families related to the query request; the searching unit is configured to search for a source image corresponding to each image family in the image family filtered by the screening unit, and each image family a reference weight; a rendering unit, adapted to receive the search result of the search unit, and sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
根据本发明的另一方面,提供了一种程序,包括可读代码,当所述可读代码在计算设备上运行时,导致所述计算设备执行本发明实施例所述的实现图像搜索排序的方法。According to another aspect of the present invention, there is provided a program comprising readable code, when said readable code is run on a computing device, causing said computing device to perform image search ordering as described in an embodiment of the present invention method.
根据本发明的另一方面,提供了一种可读介质,其中存储了上述程序。According to another aspect of the present invention, there is provided a readable medium in which the above program is stored.
根据本发明的另一方面,提供了一种程序,包括可读代码,当所述可读代码在计算设备上运行时,导致所述计算设备执行本发明实施例所述的提供图像搜索的方法According to another aspect of the present invention, there is provided a program comprising readable code, when the readable code is run on a computing device, causing the computing device to perform the method of providing image search as described in an embodiment of the present invention
根据本发明的另一方面,提供了一种可读介质,其中存储了上述程序。According to another aspect of the present invention, there is provided a readable medium in which the above program is stored.
本发明实施例通过创建多个源图像对应的图像族,并计算每个图像族的引用权值,然后再根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数,可以获得更为优质准确的排序结果,并使图像排序结果存在引用次数上的优先级顺序,大大改善了搜索的准确性,并有效提高了搜索效率。The embodiment of the present invention creates a plurality of image groups corresponding to the source image, and calculates a reference weight value of each image family, and then uses the size of the reference weight of each image family as a parameter for sorting the search results of the search query feedback. It can obtain more accurate and accurate sorting results, and make the image sorting result have priority order in the number of citations, which greatly improves the accuracy of the search and effectively improves the search efficiency.
本发明实施例通过创建多个源图像对应的图像族,并计算每个图像族的引用权值,然后再根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数,可以获得更为优质准确的排序结果,并使图像排序结果存在引用次数上的优先级顺序,大大改善了搜索的准确性,并有效提高了搜索效率。The embodiment of the present invention creates a plurality of image groups corresponding to the source image, and calculates a reference weight value of each image family, and then uses the size of the reference weight of each image family as a parameter for sorting the search results of the search query feedback. It can obtain more accurate and accurate sorting results, and make the image sorting result have priority order in the number of citations, which greatly improves the accuracy of the search and effectively improves the search efficiency.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solutions of the present invention, and the above-described and other objects, features and advantages of the present invention can be more clearly understood. Specific embodiments of the invention are set forth below.
附图说明DRAWINGS
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those skilled in the art from a The drawings are only for the purpose of illustrating the preferred embodiments and are not to be construed as limiting. Throughout the drawings, the same reference numerals are used to refer to the same parts. In the drawing:
图1示出了根据本发明一个实施例的一种实现图像搜索排序的方法步骤流程图;1 is a flow chart showing the steps of a method for implementing image search ordering according to an embodiment of the present invention;
图2示出了根据本发明一个实施例的另一种实现图像搜索排序的方法步骤流程图;2 is a flow chart showing another method step of implementing image search sorting according to an embodiment of the present invention;
图3示出了根据本发明一个实施例的一种提供图像搜索的方法步骤流程图;3 is a flow chart showing the steps of a method for providing image search according to an embodiment of the present invention;
图4示出了根据本发明一个实施例的另一种提供图像搜索的方法步骤流程图;4 illustrates a flow chart of another method of providing image search in accordance with one embodiment of the present invention;
图5示出了根据本发明一个实施例的一种实现图像搜索排序的装置结构框图;FIG. 5 is a block diagram showing the structure of an apparatus for performing image search sorting according to an embodiment of the present invention; FIG.
图6示出了根据本发明一个实施例的另一种实现图像搜索排序的装置结构框图; 6 is a block diagram showing another structure of an apparatus for performing image search sorting according to an embodiment of the present invention;
图7示出了根据本发明一个实施例的另一种实现图像搜索排序的装置中获取模块的结构框图;FIG. 7 is a structural block diagram of another acquisition module in an apparatus for performing image search sorting according to an embodiment of the present invention; FIG.
图8示出了根据本发明一个实施例的一种提供图像搜索的装置结构框图;FIG. 8 is a block diagram showing the structure of an apparatus for providing image search according to an embodiment of the present invention; FIG.
图9示出了根据本发明一个实施例的另一种提供图像搜索的装置结构框图;FIG. 9 is a block diagram showing another structure of an apparatus for providing image search according to an embodiment of the present invention; FIG.
图10示出了根据本发明一个实施例的另一种提供图像搜索的装置中预获取模块的结构框图;FIG. 10 is a block diagram showing the structure of a pre-acquisition module in another apparatus for providing image search according to an embodiment of the present invention; FIG.
图11示出了用于执行根据本发明的关于图像搜索方法的计算设备的框图;Figure 11 shows a block diagram of a computing device for performing an image search method in accordance with the present invention;
图12示出了用于保持或者携带实现根据本发明的关于图像搜索方法的程序代码的存储单元。Fig. 12 shows a storage unit for holding or carrying program code implementing the image search method according to the present invention.
具体实施方式detailed description
下面将参照附图更详细地描述本公开的示例性实施例。虽然附图中显示了本公开的示例性实施例,然而应当理解,可以以各种形式实现本公开而不应被这里阐述的实施例所限制。相反,提供这些实施例是为了能够更透彻地理解本公开,并且能够将本公开的范围完整的传达给本领域的技术人员。Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While the embodiments of the present invention have been shown in the drawings, the embodiments Rather, these embodiments are provided so that this disclosure will be more fully understood and the scope of the disclosure will be fully disclosed.
参照图1,示出了根据本发明一个实施例的一种实现图像搜索排序的方法实施例1的步骤流程图,具体可以包括如下步骤:Referring to FIG. 1 , a flow chart of the steps of Embodiment 1 of the method for performing image search sorting according to an embodiment of the present invention is shown, which may specifically include the following steps:
步骤110:创建多个源图像对应的图像族;Step 110: Create an image family corresponding to multiple source images.
需要说明的是,相同图像族就是指从人的视觉上看是一致的图像,这些图像是由一源图像修改而来的图像,由于图像族中的多个图像是由一源图像修改而来,因此一个图像族中的各图像应具有相同的源图像;基于此,在本实施例中提出可以通过步骤S111:从资源站点抓取所述源图像对应的网页,通过解析所述网页页面获取所述源图像对应的多张图像;由于同一图像族中各图像来源于同一源图像,因此在获取到多张图像后,可以通过步骤S112:获取所述源图像对应的多张图像间的传播关系,并利用所述多张图像间的传播关系建立多个图像族。It should be noted that the same image family refers to images that are visually consistent from human beings. These images are images modified from a source image, since multiple images in the image family are modified by a source image. Therefore, each image in an image family should have the same source image; based on this, in the embodiment, it is proposed that the webpage corresponding to the source image is captured from the resource site by step S111, and the webpage is obtained by parsing the webpage. The plurality of images corresponding to the source image; since each image in the same image family is derived from the same source image, after acquiring a plurality of images, the transmission between the multiple images corresponding to the source image may be acquired by step S112: Relationship, and using the propagation relationship between the plurality of images to create a plurality of image families.
当然,本领域普通技术人员很容易了解还可以通过其他方式来创建图像族,本实施例在此不再赘述。Of course, those skilled in the art can easily understand that the image family can be created by other means, and the details are not described herein again.
值得注意的是,在本实施例中获取所述源图像对应的多张图像间的传播关系主要包括:转载、复制和修改,但并不局限于此,还可以有其他传播关系,本实例在此不再赘述;具体的,本实施例通过以下方式来获取所述传播关系:It should be noted that, in this embodiment, the propagation relationship between multiple images corresponding to the source image is mainly included: reprinting, copying, and modifying, but is not limited thereto, and may have other propagation relationships. This is not described again; specifically, the embodiment obtains the propagation relationship by:
A、从资源站点抓取所述源图像对应的网页后,通过对所述网页页面进行解析来获取网页URL(Uniform Resource Locator,统一资源定位符)和多张图像URL的对应关系; 其中,如果多个网页URL与同一图像URL对应,则确定包含该图像的多个网页与所述图像为转载关系;或,After the webpage corresponding to the source image is captured from the resource site, the corresponding relationship between the webpage URL (Uniform Resource Locator) and the plurality of image URLs is obtained by parsing the webpage page; Wherein, if the plurality of webpage URLs correspond to the same image URL, determining that the plurality of webpages including the image and the image are reprinted; or
B、从资源站点抓取所述源图像对应的网页后,通过解析网页页面获取多张图像,计算多张图像的信息摘要MD5值(Message-Digest Algorithm 5),其中,如果多张图像的MD5值相同,则确定所述MD5相同的多张图像之间为复制关系;否则,判断多张图像之间是否为同一近似拷贝,如果是,则确定所述MD5值不同的多张图像间为修改关系。B. After capturing the webpage corresponding to the source image from the resource site, obtaining multiple images by parsing the webpage page, and calculating a message digest MD5 value (Message-Digest Algorithm 5) of the plurality of images, wherein if the MD5 of the multiple images is If the values are the same, it is determined that the multiple images of the same MD5 are in a copy relationship; otherwise, it is determined whether the multiple images are the same approximate copy, and if so, it is determined that the multiple images with different MD5 values are modified. relationship.
步骤120:计算每个图像族的引用权值;Step 120: Calculate a reference weight of each image family;
在实际应用中,不同的传播关系的价值是不同的,例如上述三种传播关系之间的价值大小可以为:修改>复制>转载;其中,修改需要耗费的工作量是大于简单的保存的,同样保存图片然后提供图片服务的代价是大于转载行为的;因此,此种代价意味着每张图像的价值不同,也就是每种传播关系的基础权值;而与此同时,经过分析可知,不同站点引用的图像的价值也是不同的,访问量大的站点其引用的图像价值较大,因此本实施例中设定了站点权值参数;具体的,本实施例提出通过以下方式计算每个图像族的引用权值,包括但不限于:In practical applications, the value of different propagation relationships is different. For example, the value of the above three propagation relationships may be: modification > copy > reprint; wherein, the workload required for modification is greater than the simple preservation, The cost of saving the image and then providing the image service is greater than the reprint behavior; therefore, the cost means that the value of each image is different, that is, the basis weight of each propagation relationship; at the same time, after analysis, it is known that the difference is different. The value of the image referenced by the site is also different. The site with a large amount of access has a large value of the image. Therefore, the site weight parameter is set in this embodiment. Specifically, this embodiment proposes to calculate each image by the following method. Family weights, including but not limited to:
预设所述资源站点及不同传播关系的权值;其中,如果传播关系中包含转载、复制和修改,则三者之间的权值关系为修改>复制>转载,即设置所述传播关系中修改关系权值、复制关系权值及转载关系权值的大小依次递减;此处例举一种公式计算所述图像族的引用权值如下;Presetting the weight of the resource site and different propagation relationships; wherein if the propagation relationship includes reprinting, copying, and modifying, the weight relationship between the three is modified > copy > reprint, that is, the propagation relationship is set The size of the modified relationship weight, the copy relationship weight, and the retransmission relationship weight are successively decremented; here, a formula is calculated to calculate the reference weight of the image family as follows;
Figure PCTCN2015078881-appb-000001
Figure PCTCN2015078881-appb-000001
其中,a为修改关系权值、b为复制关系权值、c为转载关系权值,且a>b>c,SITEj是站点频道的权值。Where a is the modification relationship weight, b is the replication relationship weight, c is the retransmission relationship weight, and a>b>c, SITEj is the weight of the site channel.
本发明并不限于此计算公式,只要依据本发明思想的其他公式变形也为本发明保护之列。The present invention is not limited to this calculation formula, as long as other formula modifications in accordance with the inventive concept are also protected by the present invention.
步骤130:根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数。Step 130: According to the size of the reference weight of each image family, as a parameter of the search result feedback of the search query feedback.
其中,当计算出每个图像族的引用权值后,当接收到用户图像搜索请求后,如果搜索查询命中,则可根据所述各图像族的引用权值的大小确定作为搜索查询反馈的搜索结果排序的参数;例如搜索结果排序与以所述各图像族的引用权值大小成正比例关系;当然,本领域普通技术人员很容易了解,在以所述各图像族的引用权值作为搜索结果排序的参数的同时,还可以引入其他参数作为排序参照,本实施例对此并不做具体限定。After the reference weight value of each image family is calculated, after the user image search request is received, if the search query hits, the search as the feedback of the search query may be determined according to the size of the reference weight of each image family. The sorted parameters of the result; for example, the sorting of the search results is proportional to the size of the reference weights of the respective image families; of course, those skilled in the art can easily understand that the reference weights of the image families are used as search results. At the same time, the other parameters may be introduced as the sorting reference, which is not specifically limited in this embodiment.
当然,上述特种信息及其判断方式只是作为示例,在实施本发明实施例时,可以根 据实际情况设置其他特种信息及其判断方式,本发明实施例对此不加以限制。另外,除了上述特种信息及其判断方式外,本领域技术人员还可以根据实际需要采用其他特种信息及其判断方式,本发明实施例对此也不加以限制。Of course, the above special information and its judgment manner are only examples, and when implementing the embodiments of the present invention, The other special information and the manner of judging thereof are set according to the actual situation, which is not limited by the embodiment of the present invention. In addition, in addition to the above-mentioned special information and the manner of its judgment, the person skilled in the art can also adopt other special information and its judgment manner according to actual needs, which is not limited by the embodiment of the present invention.
参照图2,通过一个具体的图片实例对上述实施例的一种实现图像搜索排序的方法进行详细描述,具体包括如下步骤:Referring to FIG. 2, a method for implementing image search sorting in the foregoing embodiment is described in detail by using a specific image example, and specifically includes the following steps:
步骤210:创建包含图A、图B、图C、图D、图E和图F的图像族;Step 210: Create an image family including Figure A, Figure B, Figure C, Figure D, Figure E, and Figure F;
步骤220:从资源站点抓取网页7~网页14;Step 220: Grab the webpage 7 to the webpage 14 from the resource site;
步骤230:对网页7~网页14进行解析,得到图片url和网页url的对应关系,即(图片url,网页url)为:(A,13),(B,14),(C,11),(D,12),(F,10),(E,7),(E,8),(E,9);其中,图片E对应了网页7、网页8和网页9,因此网页7、网页8和网页9中的图像与所述图像E即为转载关系;Step 230: Parsing the webpage 7 to the webpage 14 to obtain a correspondence between the image url and the webpage url, that is, (picture url, webpage url) is: (A, 13), (B, 14), (C, 11), (D, 12), (F, 10), (E, 7), (E, 8), (E, 9); wherein the picture E corresponds to the web page 7, the web page 8, and the web page 9, so the web page 7, the web page 8 and the image in the web page 9 and the image E are reproduced relationships;
步骤240:通过计算图片A~图F的md5值后获知,图片B、图片E和图片F的md5是相同的,因此可确定图片B、图片E和图片F之间为复制关系;Step 240: After calculating the md5 values of the pictures A to F, it is known that the picture b, the picture E, and the md5 of the picture F are the same, so it can be determined that the picture B, the picture E, and the picture F are in a replication relationship;
步骤250:通过对md5值不同的图片A、图片B、图片C和图片D进行“近似拷贝”计算,确定图片A、图片B、图片C和图片D是一个近似拷贝,由此可确定图片A、图片B、图片C和图片D之间为修改关系;Step 250: Perform an "approximate copy" calculation on picture A, picture B, picture C, and picture D having different md5 values, and determine that picture A, picture B, picture C, and picture D are an approximate copy, thereby determining picture A. , picture B, picture C and picture D are modified relationships;
步骤260:由于图A到图B、图C、图D是修改关系,因此图片A的引用权值W1=site(B)*3*1+site(C)*3*1+site(D)*3*1,其中site是图片所在站点的权重,设3为修改关系权重;而对于图B到图E和图F而言是复制关系,因此图B的引用权值W2=site(E)*2*1+site(F)*2*1,设2为复制关系权重;对于图E到网页8和网页9是转载关系,并设图E的原始网页为网页7,则图E的引用权值为W3=site(8)*1*1+site(9)*1*1,设1为链接关系权重;因此,该图像族的引用权值即为R=W1+W2+W3。Step 260: Since Figure A to Figure B, Figure C, and Figure D are modified relationships, the reference weight of the picture A is W1 = site(B) * 3 * 1 + site (C) * 3 * 1 + site (D) *3*1, where site is the weight of the site where the image is located, and 3 is the modification relationship weight; for Figure B to Figure E and Figure F is the replication relationship, so the reference weight of Figure B is W2=site(E) *2*1+site(F)*2*1, let 2 be the copy relationship weight; for Figure E to page 8 and page 9 is the reprint relationship, and set the original page of Figure E to page 7, then the reference to Figure E The weight is W3=site(8)*1*1+site(9)*1*1, and 1 is the link relationship weight; therefore, the reference weight of the image family is R=W1+W2+W3.
步骤270:根据所述各图像族的引用权值R的大小作为搜索查询反馈的搜索结果排序的参数。Step 270: The size of the reference weight R of each image family is used as a parameter for sorting the search results fed back by the search query.
可以看出,采用本发明实施例的方法,通过创建多个源图像对应的图像族,并计算每个图像族的引用权值,然后再根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数,可以获得更为优质准确的排序结果,并使图像排序结果存在引用次数上的优先级顺序,有效提高了搜索效率。It can be seen that, by using the method of the embodiment of the present invention, a plurality of image families corresponding to the source image are created, and a reference weight value of each image family is calculated, and then the search weight value of each image family is used as a search. Querying the parameters of the search result sorting results in a more accurate and accurate sorting result, and the image sorting result has a priority order on the number of citations, thereby effectively improving the search efficiency.
参照图3,示出了根据本发明一个实施例的一种提供图像搜索的方法实施例2的步骤流程图,具体可以包括如下步骤:Referring to FIG. 3, a flow chart of the steps of Embodiment 2 of the method for providing image search according to an embodiment of the present invention is shown, which may specifically include the following steps:
步骤310:接收到图像查询请求后,筛选与所述查询请求相关的多个图像族;Step 310: After receiving the image query request, screening a plurality of image families related to the query request;
在实际应用过程中,为了提高搜索结果显示效率,本实施例提出该方法还可以包括: 预先创建多个源图像对应的图像族的步骤以及计算每个图像族的引用权值的步骤;其中,相同图像族就是指从人的视觉上看是一致的图像,这些图像是由一源图像修改而来的图像,由于图像族中的多个图像是由一源图像修改而来,因此一个图像族中的各图像应具有相同的源图像;基于此,在本实施例中提出可以通过步骤S311:从资源站点抓取所述源图像对应的网页,通过解析所述网页页面获取所述源图像对应的多张图像;由于同一图像族中各图像来源于同一源图像,因此在获取到多张图像后,可以通过步骤S312:获取所述源图像对应的多张图像间的传播关系,并利用所述多张图像间的传播关系建立多个图像族。In the actual application process, in order to improve the display efficiency of the search result, the method provided by the embodiment may further include: a step of pre-creating a plurality of image families corresponding to the source image and a step of calculating a reference weight for each image family; wherein the same image family refers to images that are visually consistent from a person, the images being a source image The modified image, since a plurality of images in the image family are modified by a source image, each image in one image family should have the same source image; based on this, in the present embodiment, a step can be adopted. S311: Grab the webpage corresponding to the source image from the resource site, and obtain multiple images corresponding to the source image by parsing the webpage page; since each image in the same image family is derived from the same source image, After the image is uploaded, the propagation relationship between the plurality of images corresponding to the source image may be acquired through step S312, and a plurality of image families are established by using the propagation relationship between the plurality of images.
值得注意的是,在本实施例中获取所述源图像对应的多张图像间的传播关系主要包括:转载、复制和修改,但并不局限于此,还可以有其他传播关系,本实例在此不再赘述;具体的,本实施例通过以下方式来获取所述传播关系:A、从资源站点抓取所述源图像对应的网页后,通过对所述网页页面进行解析来获取网页URL和多张图像URL的对应关系;其中,如果多个网页URL与同一图像URL对应,则确定包含该图像的多个网页与所述图像为转载关系;或,B、从资源站点抓取所述源图像对应的网页后,通过解析网页页面获取多张图像,计算所述多张图像的信息摘要MD5值,其中,如果多张图像的MD5值相同,则确定所述MD5相同的多张图像之间为复制关系;否则,判断多张图像之间是否为同一近似拷贝,如果是,则确定所述MD5值不同的多张图像间为修改关系。当然,本领域普通技术人员很容易了解还可以通过其他方式来预先创建图像族,本实施例在此不再赘述。It should be noted that, in this embodiment, the propagation relationship between multiple images corresponding to the source image is mainly included: reprinting, copying, and modifying, but is not limited thereto, and may have other propagation relationships. Specifically, the embodiment obtains the propagation relationship by: A. After the webpage corresponding to the source image is captured from the resource site, the webpage URL is obtained by parsing the webpage page and Corresponding relationship between the plurality of image URLs; wherein, if the plurality of webpage URLs correspond to the same image URL, determining that the plurality of webpages including the image are in a reprint relationship with the image; or, B, grasping the source from the resource site After the webpage corresponding to the image, the plurality of images are obtained by parsing the webpage, and the information summary MD5 value of the plurality of images is calculated, wherein if the MD5 values of the multiple images are the same, it is determined that the plurality of images of the same MD5 are between To copy the relationship; otherwise, it is determined whether the multiple images are the same approximate copy, and if so, it is determined that the plurality of images having different MD5 values are modified. Of course, those skilled in the art can easily understand that the image family can be pre-created by other means, and the details are not described herein again.
此外,不同的传播关系的价值是不同的,例如上述三种传播关系之间的价值大小可以为:修改>复制>转载;其中,修改需要耗费的工作量是大于简单的保存的,同样保存图片然后提供图片服务的代价是大于转载行为的;因此,此种代价意味着每张图像的价值不同,也就是每种传播关系的基础权值;而与此同时,经过分析可知,不同站点引用的图像的价值也是不同的,访问量大的站点其引用的图像价值较大,因此本实施例中设定了站点权值参数;具体的,本实施例提出通过以下方式计算每个图像族的引用权值,包括但不限于:预设所述资源站点及不同传播关系的权值;其中,如果传播关系中包含转载、复制和修改,则三者之间的权值关系为修改>复制>转载,即设置所述传播关系中修改关系权值、复制关系权值及转载关系权值的大小依次递减;此处例举一种公式计算所述图像族的引用权值如下:In addition, the value of different propagation relationships is different. For example, the value of the above three propagation relationships may be: modification > copy > reprint; wherein the amount of work required for modification is greater than simple preservation, and the same image is saved. Then the cost of providing the image service is greater than the reprint behavior; therefore, this cost means that the value of each image is different, that is, the basis weight of each propagation relationship; at the same time, after analysis, it is known that different sites refer to The value of the image is also different. The site with a large amount of access has a large value of the image. Therefore, the site weight parameter is set in this embodiment. Specifically, this embodiment proposes to calculate the reference of each image family by the following method. The weight, including but not limited to: preset the weight of the resource site and different propagation relationships; wherein, if the propagation relationship includes reprinting, copying, and modifying, the weight relationship between the three is modified > copy > reprint , that is, setting the modified relationship weight, the copy relationship weight, and the resale relationship weight in the propagation relationship are successively decremented; Exemplified by one kind of the formula weights reference image group as follows:
Figure PCTCN2015078881-appb-000002
Figure PCTCN2015078881-appb-000002
其中,a为修改关系权值、b为复制关系权值、c为转载关系权值,且a>b>c,SITEj 是站点频道的权值。Where a is the modification relationship weight, b is the replication relationship weight, c is the retransmission relationship weight, and a>b>c, SITEj Is the weight of the site channel.
本发明并不限于此计算公式,只要依据本发明思想的其他公式变形也为本发明保护之列。The present invention is not limited to this calculation formula, as long as other formula modifications in accordance with the inventive concept are also protected by the present invention.
步骤320:查找所述图像族中每个图像族对应的源图像及每个图像族的引用权值;Step 320: Find a source image corresponding to each image family in the image family and a reference weight of each image family;
具体的,获取所有图像族中每个图像族对应的源图像以及该图像族的引用权值,并将所述源图像与所述引用权值匹配成对应关系,即所述引用权值与所述源图像的先后关系应一致。Specifically, acquiring a source image corresponding to each image family of all image families and a reference weight of the image family, and matching the source image with the reference weight into a corresponding relationship, that is, the reference weight and the The order of the source images should be consistent.
步骤330:根据所述引用权值大小顺序,将各图像族中的源图像进行排序绘制对应所述查询请求的搜索结果。Step 330: Sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
其中,首先对所述引用权值进行大小排序,再利用所述图像族对应的源图像与图像族的引用权值之间的匹配关系获取所述源图像之间的排序,所述源图像之间的排序与所述引用权值大小顺序相同,然后以所述源图像之间的排序作为所述查询请求的搜索结果的绘制基础之一;当然,本领域普通技术人员很容易了解,在绘制所述搜索结果时,还可以加入其它参数,在此本实施例并不做具体限定。First, the reference weights are first sorted, and then the matching between the source images is obtained by using a matching relationship between the source image corresponding to the image family and the reference weight of the image family, where the source image is The ordering of the references is the same as the order of the weights of the reference weights, and then the ordering between the source images is one of the basis for drawing the search results of the query request; of course, those skilled in the art can easily understand that Other parameters may be added to the search result, which is not specifically limited in this embodiment.
当然,上述特种信息及其判断方式只是作为示例,在实施本发明实施例时,可以根据实际情况设置其他特种信息及其判断方式,本发明实施例对此不加以限制。另外,除了上述特种信息及其判断方式外,本领域技术人员还可以根据实际需要采用其他特种信息及其判断方式,本发明实施例对此也不加以限制。Of course, the above-mentioned special information and its judgment manner are only examples. When the embodiment of the present invention is implemented, other special information and its judgment manner may be set according to actual conditions, which is not limited by the embodiment of the present invention. In addition, in addition to the above-mentioned special information and the manner of its judgment, the person skilled in the art can also adopt other special information and its judgment manner according to actual needs, which is not limited by the embodiment of the present invention.
参照图4,通过一个具体的实例对上述实施例中的一种提供图像搜索的方法进行详细描述,具体包括如下步骤:Referring to FIG. 4, a method for providing an image search in the foregoing embodiment is described in detail by using a specific example, and specifically includes the following steps:
步骤410:接收图像查询请求,筛选与所述查询请求相关的图像族;该图像族包含图A`、图B`、图C`、图D`、图E`和图F`;Step 410: Receive an image query request, and filter an image family related to the query request; the image family includes a map A`, a diagram B`, a diagram C`, a diagram D`, a diagram E`, and a diagram F`;
步骤420:查找所述图像族中的源图像为图A`;Step 420: Find a source image in the image family as Figure A`;
步骤430:从资源站点抓取网页7`~网页14`;对网页7`~网页14`进行解析,得到图片url和网页url的对应关系,即(图片url,网页url)为:(A`,13`),(B`,14`),(C`,11`),(D`,12`),(F`,10`),(E`,7`),(E`,8)`,(E`,9`);其中,图片E`对应了网页7`、网页8`和网页9`,因此网页7`、网页8`和网页9`中的图像与所述图像E`即为转载关系;Step 430: Grab the webpage 7`~page 14' from the resource site; parse the webpage 7`~page 14', and obtain the correspondence between the image url and the webpage url, that is, (picture url, webpage url) is: (A` , 13`), (B`, 14`), (C`, 11`), (D`, 12`), (F`, 10`), (E`, 7`), (E`, 8 ), (E`, 9`); wherein the picture E` corresponds to the web page 7`, the web page 8' and the web page 9', so the image in the web page 7', the web page 8' and the web page 9' and the image E `is the reprint relationship;
步骤440:通过计算图片A`~图F`的md5值后获知,图片B`、图片E`和图片F`的md5是相同的,因此可确定图片B`、图片E`和图片F`之间为复制关系;Step 440: After calculating the md5 value of the picture A'~F', it is known that the picture B`, the picture E`, and the picture f'md5 are the same, so the picture B', the picture E', and the picture F' can be determined. Inter-replication relationship;
步骤450:通过对md5值不同的图片A`、图片B`、图片C`和图片D`进行“近似拷贝”计算,确定图片A`、图片B`、图片C`和图片D`是一个近似拷贝,由此可确定图片 A`、图片B`、图片C`和图片D`之间为修改关系;Step 450: Perform an "approximate copy" calculation on the picture A`, the picture B', the picture C', and the picture D' having different md5 values, and determine that the picture A', the picture B', the picture C', and the picture D' are an approximation. Copy, which determines the picture The relationship between A`, picture B`, picture C` and picture D` is modified;
步骤460:由于图A`到图B`、图C`、图D`是修改关系,因此图片A`的引用权值W1`=site(B`)*3*1+site(C`)*3*1+site(D`)*3*1,其中site是图片所在站点的权重,设3为修改关系权重;而对于图B`到图E`和图F`而言是复制关系,因此图B`的引用权值W2`=site(E`)*2*1+site(F`)*2*1,设2为复制关系权重;对于图E`到网页8`和网页9`是转载关系,并设图E`的原始网页为网页7`,则图E`的引用权值为W3`=site(8`)*1*1+site(9`)*1*1,设1为链接关系权重;因此,该图像族的引用权值即为R`=W1`+W2`+W3`。Step 460: Since the figures A' to B', C', and D' are modified, the reference weight of the picture A' is W1`=site(B`)*3*1+site(C`)* 3*1+site(D`)*3*1, where site is the weight of the site where the image is located, and 3 is the modification relationship weight; for Figure B` to Figure E` and Figure F` is the replication relationship, therefore Figure B`s reference weight W2`=site(E`)*2*1+site(F`)*2*1, let 2 be the copy relationship weight; for Figure E`to page 8` and page 9` Reproduce the relationship, and set the original page of the picture E` as the page 7`, then the reference weight of the picture E` is W3`=site(8`)*1*1+site(9`)*1*1, set 1 For the link relationship weight; therefore, the reference weight of the image family is R`=W1`+W2`+W3`.
步骤470:由于改图像族的源图像为图A`,则图A`对应的引用权值即为R`;而对于不同的图像族的源图像,利用R`即可以进行排序;在本实施例中,假设所有站点的权值都为1,则上述图像族对应的源图像图A`的引用权值R`=W1`+W2`+W3`=9+4+2=15,假设按照上述方式查找到的另外一图像族对应的源图像的引用权值R`为10,则可按照15>10的顺序,上述图像族对应的源图像图A`相对于引用权值R`为10的源图像在搜索结果中的排序靠前,以此类推,本实施例在此不再赘述。Step 470: Since the source image of the changed image family is the graph A′, the reference weight corresponding to the graph A′ is R′; and for the source image of different image families, the sorting can be performed by using R′; In the example, assuming that the weights of all the stations are 1, the reference weight of the source image map A` corresponding to the image family is R`=W1`+W2`+W3`=9+4+2=15, assuming If the reference weight R' of the source image corresponding to another image family found in the above manner is 10, the source image map A' corresponding to the image family may be 10 with respect to the reference weight R' according to the order of 15>10. The source image is ranked first in the search results, and so on, and the details are not described herein again.
可以看出,采用本发明实施例的方法,通过创建多个源图像对应的图像族,并计算每个图像族的引用权值,然后再根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数,可以获得更为优质准确的排序结果,并使图像排序结果存在引用次数上的优先级顺序,有效提高了搜索效率。It can be seen that, by using the method of the embodiment of the present invention, a plurality of image families corresponding to the source image are created, and a reference weight value of each image family is calculated, and then the search weight value of each image family is used as a search. Querying the parameters of the search result sorting results in a more accurate and accurate sorting result, and the image sorting result has a priority order on the number of citations, thereby effectively improving the search efficiency.
对于方法实施例,为了简单描述,故将其都表述为一系列的动作组合,但是本领域技术人员应该知悉,本发明实施例并不受所描述的动作顺序的限制,因为依据本发明实施例,某些步骤可以采用其他顺序或者同时进行。其次,本领域技术人员也应该知悉,说明书中所描述的实施例均属于优选实施例,所涉及的动作并不一定是本发明实施例所必须的。For the method embodiments, for the sake of simple description, they are all expressed as a series of action combinations, but those skilled in the art should understand that the embodiments of the present invention are not limited by the described action sequence, because the embodiment according to the present invention Some steps can be performed in other orders or at the same time. In the following, those skilled in the art should also understand that the embodiments described in the specification are all preferred embodiments, and the actions involved are not necessarily required by the embodiments of the present invention.
参照图5,示出了根据本发明一个实施例的一种实现图像搜索排序的装置实施例的结构框图,具体可以包括如下模块:创建单元510,适于创建多个源图像对应的图像族;计算单元520,适于计算每个图像族的引用权值;排序单元530,适于根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数。Referring to FIG. 5, a block diagram of an embodiment of an apparatus for performing image search sorting according to an embodiment of the present invention is shown. The method may further include: a creating unit 510, configured to create an image family corresponding to multiple source images; The calculating unit 520 is adapted to calculate a reference weight value of each image family; the sorting unit 530 is adapted to use the size of the reference weight value of each image family as a parameter of the search result feedback of the search query feedback.
参照图6,示出了根据本发明一个实施例的另一种实现图像搜索排序的装置实施例的结构框图,具体可以包括如下模块:创建单元610,适于创建多个源图像对应的图像族;计算单元620,适于计算每个图像族的引用权值;排序单元630,适于根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数。Referring to FIG. 6, a block diagram of another embodiment of an apparatus for performing image search sorting according to an embodiment of the present invention is shown. Specifically, the method may include the following module: a creating unit 610, configured to create a family of images corresponding to multiple source images. The calculation unit 620 is adapted to calculate a reference weight value of each image family; the sorting unit 630 is adapted to use the size of the reference weight value of each image family as a parameter of the search result feedback of the search query feedback.
其中,所述创建单元610包括:抓取模块6102,适于从资源站点抓取所述源图像对 应的网页;解析模块6104,适于通过解析所述抓取模块抓取的网页页面获取所述源图像对应的多张图像;获取模块6106,适于获取所述源图像对应的多张图像间的传播关系;建族模块6108,适于利用所述多张图像间的传播关系建立多个图像族。The creating unit 610 includes: a crawling module 6102, configured to capture the source image pair from a resource site. The image processing module 6104 is configured to acquire a plurality of images corresponding to the source image by parsing the webpage page captured by the crawling module, and the acquiring module 6106 is configured to acquire the multiple images corresponding to the source image. The propagation relationship; the family building module 6108 is adapted to establish a plurality of image families by using the propagation relationship between the plurality of images.
参照图7,示出了根据本发明一个实施例的另一种实现图像搜索排序的装置实施例中获取模块的结构框图,具体可以包括如下模块:本实施例中所述获取模块6106还可包括:第一处理模块610602,适于通过接收所述解析模块的解析结果,并根据所述解析结果获取网页统一资源定位符URL和图像URL的对应关系;第一比较模块610604,适于比较所述多个网页URL与多张图像URL的对应关系,并当所述多个网页URL与同一图像URL对应时,确定包含该图像的多个网页与所述图像为转载关系。Referring to FIG. 7, a block diagram of a structure of an apparatus for implementing image search sorting according to an embodiment of the present invention is shown. The method may further include the following modules: the obtaining module 6106 in the embodiment may further include The first processing module 610602 is configured to receive the analysis result of the parsing module, and obtain a correspondence between the webpage uniform resource locator URL and the image URL according to the parsing result; the first comparing module 610604 is adapted to compare the Corresponding relationship between the plurality of webpage URLs and the plurality of image URLs, and when the plurality of webpage URLs correspond to the same image URL, determining that the plurality of webpages including the image and the image are reprinted relationships.
此外,所述获取模块6106还包括:第二处理模块610606,适于计算所述解析模块解析出的多张图像的信息摘要MD5值;第二比较模块610608,适于比较所述多张图像的MD5值,并当多张图像的MD5值相同时,确定所述MD5相同的多张图像之间为复制关系。In addition, the obtaining module 6106 further includes: a second processing module 610606, configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module; and a second comparing module 610608, configured to compare the plurality of images The MD5 value, and when the MD5 values of the plurality of images are the same, it is determined that the plurality of images having the same MD5 are in a copy relationship.
除此之外,所述获取模块还包括:第三处理模块610610,适于计算所述解析模块解析出的多张图像的信息摘要MD5值;第三比较模块610612,适于比较所述多张图像的MD5值,并当多张图像的MD5值不同时,通过近似拷贝方式确定所述MD5值不同的多张图像间是否为修改关系。In addition, the obtaining module further includes: a third processing module 610610, configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module; and a third comparing module 610612, configured to compare the plurality of The MD5 value of the image, and when the MD5 values of the plurality of images are different, whether the plurality of images having different MD5 values are modified by the approximate copying manner.
值得注意的是,在本实施例中所述计算单元620还可包括:设置模块6202,适于预设所述抓取模块抓取网页的资源站点及获取模块获取到的所述不同传播关系的权值;比配模块6204,适于利用同一图像族中所述资源站点及所述不同传播关系权值计算该图像族的引用权值。It is to be noted that, in the embodiment, the calculating unit 620 may further include: a setting module 6202, configured to preset a resource site of the crawling module to capture a webpage, and the different propagation relationship acquired by the acquiring module. The weighting ratio matching module 6204 is adapted to calculate the reference weight of the image family by using the resource site in the same image family and the different propagation relationship weights.
参照图8,示出了根据本发明一个实施例的一种提供图像搜索的装置实施例的结构框图,具体可以包括如下模块:筛选单元810,适于在接收到图像查询请求后,筛选与所述查询请求相关的多个图像族;查找单元820,适于查找所述筛选单元810筛选过的图像族中每个图像族对应的源图像及每个图像族的引用权值;绘制单元830,适于接收所述查找单元820的查找结果,并根据所述引用权值大小顺序,将各图像族中的源图像进行排序绘制对应所述查询请求的搜索结果。FIG. 8 is a block diagram showing an embodiment of an apparatus for providing an image search according to an embodiment of the present invention. Specifically, the method may include the following module: a screening unit 810, configured to: after receiving an image query request, screening and a plurality of image families related to the query request; the searching unit 820 is adapted to search for a source image corresponding to each image family in the image family filtered by the filtering unit 810 and a reference weight of each image family; the drawing unit 830, The method is adapted to receive the search result of the searching unit 820, and sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
参照图9,示出了根据本发明一个实施例的另一种提供图像搜索的装置实施例的结构框图,具体可以包括如下模块:预创建单元940,预先适于创建多个源图像对应的图像族;预计算单元950,适于计算每个图像族的引用权值。筛选单元910,适于在接收到图像查询请求后,筛选与所述查询请求相关的多个图像族;查找单元920,适于查找所述筛选单元910筛选过的图像族中每个图像族对应的源图像及每个图像族的引用权值;绘制 单元930,适于接收所述查找单元920的查找结果,并根据所述引用权值大小顺序,将各图像族中的源图像进行排序绘制对应所述查询请求的搜索结果。Referring to FIG. 9, a block diagram of another embodiment of an apparatus for providing image search according to an embodiment of the present invention is shown. Specifically, the method may include a module: a pre-creation unit 940, which is pre-configured to create an image corresponding to multiple source images. A pre-computation unit 950 adapted to calculate a reference weight for each image family. The filtering unit 910 is configured to: after receiving the image query request, filter a plurality of image families related to the query request; the searching unit 920 is configured to search for each image family in the image family selected by the screening unit 910. Source image and reference weight for each image family; drawing The unit 930 is adapted to receive the search result of the searching unit 920, and sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
其中,所述预创建单元940包括:预抓取模块9402,适于从资源站点抓取所述源图像对应的网页;预解析模块9404,适于通过解析所述抓取模块抓取的网页页面获取所述源图像对应的多张图像;预获取模块9406,适于获取所述源图像对应的多张图像间的传播关系;预建族模块9408,适于利用所述多张图像间的传播关系建立多个图像族。The pre-creation unit 940 includes: a pre-fetching module 9402, configured to capture a webpage corresponding to the source image from a resource site; and a pre-parsing module 9404, configured to parse the webpage page captured by the crawling module. Acquiring a plurality of images corresponding to the source image; the pre-acquisition module 9406 is adapted to acquire a propagation relationship between the plurality of images corresponding to the source image; and the pre-built family module 9408 is adapted to utilize the propagation between the multiple images Relationships establish multiple image families.
参照图10,示出了根据本发明一个实施例的另一种提供图像搜索的装置实施例中预获取模块的结构框图,具体可以包括如下模块:本实施例中所述预获取模块9406还可包括:第一预处理模块940602,适于通过接收所述解析模块的解析结果,并根据所述解析结果获取网页统一资源定位符URL和图像URL的对应关系;第一预比较模块940604,适于比较所述多个网页URL与多张图像URL的对应关系,并当所述多个网页URL与同一图像URL对应时,确定包含该图像的多个网页与所述图像为转载关系。FIG. 10 is a block diagram showing a structure of a pre-acquisition module in an apparatus for providing an image search according to an embodiment of the present invention. Specifically, the pre-acquisition module 9406 may also be used in this embodiment. The first pre-comparison module 940604 is adapted to receive the parsing result of the parsing module, and obtain the correspondence between the webpage uniform resource locator URL and the image URL according to the parsing result; the first pre-comparison module 940604 is adapted to Comparing the correspondence between the plurality of webpage URLs and the plurality of image URLs, and when the plurality of webpage URLs correspond to the same image URL, determining that the plurality of webpages including the image and the image are reprinted relationships.
此外,所述预获取模块9406还包括:第二预处理模块940606,适于计算所述解析模块解析出的多张图像的信息摘要MD5值;第二预比较模块940608,适于比较所述多张图像的MD5值,并当多张图像的MD5值相同时,确定所述MD5相同的多张图像之间为复制关系。In addition, the pre-acquisition module 9406 further includes: a second pre-processing module 940606, configured to calculate an information digest MD5 value of the plurality of images parsed by the parsing module; and a second pre-comparison module 940608, configured to compare the plurality of The MD5 value of the image, and when the MD5 values of the plurality of images are the same, it is determined that the plurality of images having the same MD5 are in a copy relationship.
除此之外,所述预获取模块还包括:第三预处理模块940610,适于计算所述解析模块解析出的多张图像的信息摘要MD5值;第三预比较模块940612,适于比较所述多张图像的MD5值,并当多张图像的MD5值不同时,通过近似拷贝方式确定所述MD5值不同的多张图像间是否为修改关系。In addition, the pre-acquisition module further includes: a third pre-processing module 940610, configured to calculate an information digest MD5 value of the plurality of images parsed by the parsing module; and a third pre-comparison module 940612, suitable for comparing the The MD5 value of the plurality of images is described, and when the MD5 values of the plurality of images are different, whether the plurality of images having different MD5 values are modified by the approximate copying manner is determined.
值得注意的是,在本实施例中所述预计算单元950还可包括:预设置模块9502,适于预设所述抓取模块抓取网页的资源站点及获取模块获取到的所述不同传播关系的权值;预比配模块9504,适于利用同一图像族中所述资源站点及所述不同传播关系权值计算该图像族的引用权值。It is to be noted that, in the embodiment, the pre-calculation unit 950 may further include: a pre-setting module 9502, configured to preset a resource site of the crawling module to capture a webpage, and the different propagation acquired by the acquiring module. The weighting of the relationship; the pre-matching module 9504 is adapted to calculate the reference weight of the image family by using the resource site in the same image family and the different propagation relationship weights.
在此提供的算法和显示不与任何特定计算机、虚拟系统或者其它设备固有相关。各种通用系统也可以与基于在此的示教一起使用。根据上面的描述,构造这类系统所要求的结构是显而易见的。此外,本发明也不针对任何特定编程语言。应当明白,可以利用各种编程语言实现在此描述的本发明的内容,并且上面对特定语言所做的描述是为了披露本发明的最佳实施方式。The algorithms and displays provided herein are not inherently related to any particular computer, virtual system, or other device. Various general purpose systems can also be used with the teaching based on the teachings herein. The structure required to construct such a system is apparent from the above description. Moreover, the invention is not directed to any particular programming language. It is to be understood that the invention may be embodied in a variety of programming language, and the description of the specific language has been described above in order to disclose the preferred embodiments of the invention.
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本发明的实施例可以在没有这些具体细节的情况下实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。 In the description provided herein, numerous specific details are set forth. However, it is understood that the embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures, and techniques are not shown in detail so as not to obscure the understanding of the description.
类似地,应当理解,为了精简本公开并帮助理解各个发明方面中的一个或多个,在上面对本发明的示例性实施例的描述中,本发明的各个特征有时被一起分组到单个实施例、图、或者对其的描述中。然而,并不应将该公开的方法解释成反映如下意图:即所要求保护的本发明要求比在每个权利要求中所明确记载的特征更多的特征。更确切地说,如下面的权利要求书所反映的那样,发明方面在于少于前面公开的单个实施例的所有特征。因此,遵循具体实施方式的权利要求书由此明确地并入该具体实施方式,其中每个权利要求本身都作为本发明的单独实施例。Similarly, the various features of the invention are sometimes grouped together into a single embodiment, in the above description of the exemplary embodiments of the invention, Figure, or a description of it. However, the method disclosed is not to be interpreted as reflecting the intention that the claimed invention requires more features than those recited in the claims. Rather, as the following claims reflect, inventive aspects reside in less than all features of the single embodiments disclosed herein. Therefore, the claims following the specific embodiments are hereby explicitly incorporated into the embodiments, and each of the claims as a separate embodiment of the invention.
本领域那些技术人员可以理解,可以对实施例中的设备中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个设备中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。除了这样的特征和/或过程或者单元中的至少一些是相互排斥之外,可以采用任何组合对本说明书(包括伴随的权利要求、摘要和附图)中公开的所有特征以及如此公开的任何方法或者设备的所有过程或单元进行组合。除非另外明确陈述,本说明书(包括伴随的权利要求、摘要和附图)中公开的每个特征可以由提供相同、等同或相似目的的替代特征来代替。Those skilled in the art will appreciate that the modules in the devices of the embodiments can be adaptively changed and placed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and further they may be divided into a plurality of sub-modules or sub-units or sub-components. In addition to such features and/or at least some of the processes or units being mutually exclusive, any combination of the features disclosed in the specification, including the accompanying claims, the abstract and the drawings, and any methods so disclosed, or All processes or units of the device are combined. Each feature disclosed in this specification (including the accompanying claims, the abstract and the drawings) may be replaced by alternative features that provide the same, equivalent or similar purpose.
此外,本领域的技术人员能够理解,尽管在此所述的一些实施例包括其它实施例中所包括的某些特征而不是其它特征,但是不同实施例的特征的组合意味着处于本发明的范围之内并且形成不同的实施例。例如,在下面的权利要求书中,所要求保护的实施例的任意之一都可以以任意的组合方式来使用。In addition, those skilled in the art will appreciate that, although some embodiments described herein include certain features that are included in other embodiments and not in other features, combinations of features of different embodiments are intended to be within the scope of the present invention. Different embodiments are formed and formed. For example, in the following claims, any one of the claimed embodiments can be used in any combination.
本发明的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用微处理器或者数字信号处理器(DSP)来实现根据本发明实施例的进行网页加载的设备中的一些或者全部部件的一些或者全部功能。本发明还可以实现为用于执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本发明的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。The various component embodiments of the present invention may be implemented in hardware, or in a software module running on one or more processors, or in a combination thereof. Those skilled in the art will appreciate that a microprocessor or digital signal processor (DSP) may be used in practice to implement some or all of the functionality of some or all of the components of the web page loading device in accordance with embodiments of the present invention. The invention can also be implemented as a device or device program (e.g., a computer program and a computer program product) for performing some or all of the methods described herein. Such a program implementing the invention may be stored on a computer readable medium or may be in the form of one or more signals. Such signals may be downloaded from an Internet website, provided on a carrier signal, or provided in any other form.
例如,图11示出了可以实现根据本发明的关于图像搜索方法的计算设备,其中,所述关于图像搜索方法包括上述实施例所述的实现图像搜索排序的方法,以及上述实施例所述的提供图像搜索的方法。该计算设备传统上包括处理器1110和以存储器1120形式的程序产品或者可读介质。存储器1120可以是诸如闪存、EEPROM(电可擦除可编程只读存储器)、EPROM或者ROM之类的电子存储器。存储器1120具有用于执行上述方 法中的任何方法步骤的程序代码1131的存储空间1130。例如,用于程序代码的存储空间1130可以包括分别用于实现上面的方法中的各种步骤的各个程序代码1131。这些程序代码可以从一个或者多个程序产品中读出或者写入到这一个或者多个程序产品中。这些程序产品包括诸如存储卡之类的程序代码载体。这样的程序产品通常为如参考图12所述的便携式或者固定存储单元。该存储单元可以具有与图11的计算设备中的存储器1120类似布置的存储段、存储空间等。程序代码可以例如以适当形式进行压缩。通常,存储单元包括可读代码1131’,即可以由例如诸如1110之类的处理器读取的代码,这些代码当由计算设备运行时,导致该计算设备执行上面所描述的方法中的各个步骤。For example, FIG. 11 illustrates a computing device that can implement an image search method according to the present invention, wherein the image search method includes the method of implementing image search ordering described in the above embodiments, and the above-described embodiments. Provide a method of image search. The computing device conventionally includes a processor 1110 and a program product or readable medium in the form of a memory 1120. The memory 1120 may be an electronic memory such as a flash memory, an EEPROM (Electrically Erasable Programmable Read Only Memory), an EPROM, or a ROM. The memory 1120 has a function for performing the above The storage space 1130 of the program code 1131 of any method step in the method. For example, the storage space 1130 for program code may include respective program codes 1131 for implementing various steps in the above methods, respectively. These program codes can be read from or written to one or more program products. These program products include program code carriers such as memory cards. Such a program product is typically a portable or fixed storage unit as described with reference to FIG. The storage unit may have a storage segment, a storage space, and the like that are similarly arranged to the storage 1120 in the computing device of FIG. The program code can be compressed, for example, in an appropriate form. Typically, the storage unit includes readable code 1131 ', ie, code that can be read by a processor, such as, for example, 1110, which when executed by a computing device causes the computing device to perform various steps in the methods described above .
应该注意的是上述实施例对本发明进行说明而不是对本发明进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本发明可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。 It is to be noted that the above-described embodiments are illustrative of the invention and are not intended to be limiting, and that the invention may be devised without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as a limitation. The word "comprising" does not exclude the presence of the elements or steps that are not recited in the claims. The word "a" or "an" The invention can be implemented by means of hardware comprising several distinct elements and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means can be embodied by the same hardware item. The use of the words first, second, and third does not indicate any order. These words can be interpreted as names.

Claims (28)

  1. 一种实现图像搜索排序的方法,包括:A method for implementing image search sorting, comprising:
    创建多个源图像对应的图像族;Create a family of images corresponding to multiple source images;
    计算每个图像族的引用权值;Calculate the reference weight of each image family;
    根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数。The size of the reference weight of each image family is used as a parameter for sorting the search results fed back by the search query.
  2. 如权利要求1所述的方法,其特征在于,所述创建多个源图像对应的图像族包括:The method of claim 1, wherein the creating a family of images corresponding to the plurality of source images comprises:
    从资源站点抓取所述源图像对应的网页;Grab the webpage corresponding to the source image from the resource site;
    通过解析所述网页页面获取所述源图像对应的多张图像;Obtaining a plurality of images corresponding to the source image by parsing the webpage page;
    获取所述源图像对应的多张图像间的传播关系;Obtaining a propagation relationship between the plurality of images corresponding to the source image;
    利用所述多张图像间的传播关系建立多个图像族。A plurality of image families are established using the propagation relationship between the plurality of images.
  3. 如权利要求1-2任一项所述的方法,其特征在于,所述获取所述源图像对应的多张图像间的传播关系包括:The method according to any one of claims 1-2, wherein the obtaining a propagation relationship between the plurality of images corresponding to the source image comprises:
    通过所述网页页面解析获取网页统一资源定位符URL和多张图像URL的对应关系;Obtaining a correspondence between a webpage uniform resource locator URL and a plurality of image URLs by using the webpage page parsing;
    如果多个网页URL与同一图像URL对应,则确定包含该图像的多个网页与所述图像为转载关系。If the plurality of web page URLs correspond to the same image URL, it is determined that the plurality of web pages including the image are in a reprint relationship with the image.
  4. 如权利要求1-3任一项所述的方法,其特征在于,所述获取所述源图像对应的多张图像间的传播关系包括:The method according to any one of claims 1 to 3, wherein the acquiring a propagation relationship between the plurality of images corresponding to the source image comprises:
    计算通过解析网页页面获取的多张图像的信息摘要MD5值;Calculating a message digest MD5 value of a plurality of images obtained by parsing a webpage page;
    如果多张图像的MD5值相同,则确定所述MD5相同的多张图像之间为复制关系。If the MD5 values of the plurality of images are the same, it is determined that the plurality of images having the same MD5 are in a copy relationship.
  5. 如权利要求1-4任一项所述的方法,其特征在于,所述获取所述源图像对应的多张图像间的传播关系包括:The method according to any one of claims 1 to 4, wherein the acquiring a propagation relationship between the plurality of images corresponding to the source image comprises:
    计算通过解析网页页面获取的多张图像的MD5值;Calculating the MD5 value of multiple images obtained by parsing the webpage page;
    如果多张图像的MD5值不同,则通过近似拷贝方式确定所述MD5值不同的多张图像间是否为修改关系。If the MD5 values of the plurality of images are different, it is determined whether the plurality of images having different MD5 values are modified by an approximate copying manner.
  6. 如权利要求1-5任一项所述的方法,其特征在于,所述计算每个图像族的引用权值包括:The method of any of claims 1-5, wherein said calculating a reference weight for each image family comprises:
    预设所述资源站点及不同传播关系的权值;Presetting the weight of the resource site and different propagation relationships;
    利用同一图像族中所述资源站点及所述不同传播关系权值计算该图像族的引用权值。Calculating the reference weight of the image family by using the resource site in the same image family and the different propagation relationship weights.
  7. 一种提供图像搜索的方法,包括:A method of providing image search, including:
    接收到图像查询请求后,筛选与所述查询请求相关的多个图像族; After receiving the image query request, filtering a plurality of image families related to the query request;
    查找所述图像族中每个图像族对应的源图像及每个图像族的引用权值;Finding a source image corresponding to each image family in the image family and a reference weight of each image family;
    根据所述引用权值大小顺序,将各图像族中的源图像进行排序绘制对应所述查询请求的搜索结果。And sorting the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
  8. 如权利要求7所述的方法,其特征在于,该方法还包括:The method of claim 7 further comprising:
    预先创建多个源图像对应的图像族;Pre-creating image families corresponding to multiple source images;
    计算每个图像族的引用权值。Calculate the reference weight for each image family.
  9. 如权利要求7-8任一项所述的方法,其特征在于,所述创建多个源图像对应的图像族包括:The method according to any one of claims 7-8, wherein the creating a family of images corresponding to the plurality of source images comprises:
    从资源站点抓取所述源图像对应的网页;Grab the webpage corresponding to the source image from the resource site;
    通过解析所述网页页面获取所述源图像对应的多张图像;Obtaining a plurality of images corresponding to the source image by parsing the webpage page;
    获取所述源图像对应的多张图像间的传播关系;Obtaining a propagation relationship between the plurality of images corresponding to the source image;
    利用所述多张图像间的传播关系建立多个图像族。A plurality of image families are established using the propagation relationship between the plurality of images.
  10. 如权利要求7-9任一项所述的方法,其特征在于,所述获取所述源图像对应的多张图像间的传播关系包括:The method according to any one of claims 7-9, wherein the obtaining a propagation relationship between the plurality of images corresponding to the source image comprises:
    通过所述网页页面解析获取网页统一资源定位符URL和多张图像URL的对应关系;Obtaining a correspondence between a webpage uniform resource locator URL and a plurality of image URLs by using the webpage page parsing;
    如果多个网页URL与同一图像URL对应,则确定包含该图像的多个网页与所述图像为转载关系。If the plurality of web page URLs correspond to the same image URL, it is determined that the plurality of web pages including the image are in a reprint relationship with the image.
  11. 如权利要求7-10任一项所述的方法,其特征在于,所述获取所述源图像对应的多张图像间的传播关系包括:The method according to any one of claims 7 to 10, wherein the acquiring the propagation relationship between the plurality of images corresponding to the source image comprises:
    计算通过解析网页页面获取的多张图像的信息摘要MD5值;Calculating a message digest MD5 value of a plurality of images obtained by parsing a webpage page;
    如果多张图像的MD5值相同,则确定所述MD5相同的多张图像之间为复制关系。If the MD5 values of the plurality of images are the same, it is determined that the plurality of images having the same MD5 are in a copy relationship.
  12. 如权利要求7-11任一项所述的方法,其特征在于,所述获取所述源图像对应的多张图像间的传播关系包括:The method according to any one of claims 7 to 11, wherein the obtaining a propagation relationship between the plurality of images corresponding to the source image comprises:
    计算通过解析网页页面获取的多张图像的MD5值;Calculating the MD5 value of multiple images obtained by parsing the webpage page;
    如果多张图像的MD5值不同,则通过近似拷贝方式确定所述MD5值不同的多张图像间是否为修改关系。If the MD5 values of the plurality of images are different, it is determined whether the plurality of images having different MD5 values are modified by an approximate copying manner.
  13. 如权利要求7-12任一项所述的方法,其特征在于,所述计算每个图像族的引用权值包括:The method according to any one of claims 7 to 12, wherein the calculating the reference weight of each image family comprises:
    预设所述资源站点及不同传播关系的权值;Presetting the weight of the resource site and different propagation relationships;
    利用同一图像族中所述资源站点及所述不同传播关系权值计算该图像族的引用权值。 Calculating the reference weight of the image family by using the resource site in the same image family and the different propagation relationship weights.
  14. 一种实现图像搜索排序的装置,包括:An apparatus for implementing image search sorting, comprising:
    创建单元,适于创建多个源图像对应的图像族;Creating a unit adapted to create an image family corresponding to multiple source images;
    计算单元,适于计算每个图像族的引用权值;a calculation unit adapted to calculate a reference weight value of each image family;
    排序单元,适于根据所述各图像族的引用权值的大小作为搜索查询反馈的搜索结果排序的参数。And a sorting unit, configured to use, according to the size of the reference weight of each image family, as a parameter for sorting the search results fed back by the search query.
  15. 如权利要求14所述的装置,其特征在于,所述创建单元包括:The apparatus of claim 14, wherein the creating unit comprises:
    抓取模块,适于从资源站点抓取所述源图像对应的网页;a capture module, configured to capture a webpage corresponding to the source image from a resource site;
    解析模块,适于通过解析所述抓取模块抓取的网页页面获取所述源图像对应的多张图像;The parsing module is adapted to obtain a plurality of images corresponding to the source image by parsing a webpage page captured by the crawling module;
    获取模块,适于获取所述源图像对应的多张图像间的传播关系;An acquiring module, configured to acquire a propagation relationship between multiple images corresponding to the source image;
    建族模块,适于利用所述多张图像间的传播关系建立多个图像族。The building module is adapted to establish a plurality of image families by using a propagation relationship between the plurality of images.
  16. 如权利要求14-15任一项所述的装置,其特征在于,所述获取模块还包括:The device according to any one of claims 14-15, wherein the obtaining module further comprises:
    第一处理模块,适于通过接收所述解析模块的解析结果,并根据所述解析结果获取网页统一资源定位符URL和图像URL的对应关系;The first processing module is configured to receive a parsing result of the parsing module, and obtain a correspondence between a webpage uniform resource locator URL and an image URL according to the parsing result;
    第一比较模块,适于比较所述多个网页URL与多张图像URL的对应关系,并当所述多个网页URL与同一图像URL对应时,确定包含该图像的多个网页与所述图像为转载关系。a first comparison module, configured to compare a correspondence between the plurality of webpage URLs and the plurality of image URLs, and when the plurality of webpage URLs correspond to the same image URL, determine a plurality of webpages including the image and the image For the reprint relationship.
  17. 如权利要求14-16任一项所述的装置,其特征在于,所述获取模块还包括:The device according to any one of claims 14-16, wherein the obtaining module further comprises:
    第二处理模块,适于计算所述解析模块解析出的多张图像的信息摘要MD5值;a second processing module, configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module;
    第二比较模块,适于比较所述多张图像的MD5值,并当多张图像的MD5值相同时,确定所述MD5相同的多张图像之间为复制关系。The second comparison module is adapted to compare the MD5 values of the plurality of images, and when the MD5 values of the plurality of images are the same, determine that the plurality of images having the same MD5 are in a replication relationship.
  18. 如权利要求14-17任一项所述的装置,其特征在于,所述获取模块还包括:The device according to any one of claims 14-17, wherein the obtaining module further comprises:
    第三处理模块,适于计算所述解析模块解析出的多张图像的信息摘要MD5值;a third processing module, configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module;
    第三比较模块,适于比较所述多张图像的MD5值,并当多张图像的MD5值不同时,通过近似拷贝方式确定所述MD5值不同的多张图像间是否为修改关系。The third comparison module is adapted to compare the MD5 values of the plurality of images, and when the MD5 values of the plurality of images are different, determine whether the plurality of images having different MD5 values are modified by an approximate copy manner.
  19. 如权利要求14-18任一项所述的装置,其特征在于,所述计算单元包括:The device of any of claims 14-18, wherein the computing unit comprises:
    设置模块,适于预设所述抓取模块抓取网页的资源站点及获取模块获取到的所述不同传播关系的权值;a setting module, configured to preset a resource site of the crawling module to capture a webpage, and a weight of the different propagation relationship acquired by the acquiring module;
    比配模块,适于利用同一图像族中所述资源站点及所述不同传播关系权值计算该图像族的引用权值。The matching module is adapted to calculate the reference weight of the image family by using the resource site in the same image family and the different propagation relationship weights.
  20. 一种提供图像搜索的装置,包括:An apparatus for providing image search, comprising:
    筛选单元,适于在接收到图像查询请求后,筛选与所述查询请求相关的多个图像族; a screening unit, configured to filter a plurality of image families related to the query request after receiving the image query request;
    查找单元,适于查找所述筛选单元筛选过的图像族中每个图像族对应的源图像及每个图像族的引用权值;a search unit, configured to search for a source image corresponding to each image family in the image family filtered by the screening unit, and a reference weight of each image family;
    绘制单元,适于接收所述查找单元的查找结果,并根据所述引用权值大小顺序,将各图像族中的源图像进行排序绘制对应所述查询请求的搜索结果。The drawing unit is adapted to receive the search result of the searching unit, and sort the source images in each image family according to the order of the reference weights to draw a search result corresponding to the query request.
  21. 如权利要求20所述的装置,其特征在于,还包括:The device of claim 20, further comprising:
    预创建单元,适于预先创建多个源图像对应的图像族;a pre-creation unit, configured to pre-create image families corresponding to multiple source images;
    预计算单元,适于计算每个图像族的引用权值。A pre-calculation unit adapted to calculate a reference weight for each image family.
  22. 如权利要求20-21任一项所述的装置,其特征在于,所述预创建单元包括:The apparatus according to any one of claims 20 to 21, wherein the pre-creation unit comprises:
    预抓取模块,适于从资源站点抓取所述源图像对应的网页;a pre-crawling module, configured to capture a webpage corresponding to the source image from a resource site;
    预解析模块,适于通过解析所述抓取模块抓取的网页页面获取所述源图像对应的多张图像;a pre-parsing module, configured to acquire a plurality of images corresponding to the source image by parsing a webpage page captured by the crawling module;
    预获取模块,适于获取所述源图像对应的多张图像间的传播关系;a pre-acquisition module, configured to acquire a propagation relationship between multiple images corresponding to the source image;
    预建族模块,适于利用所述多张图像间的传播关系建立多个图像族。The pre-built family module is adapted to establish a plurality of image families by using a propagation relationship between the plurality of images.
  23. 如权利要求21所述的装置,其特征在于,所述预获取模块还包括:The device of claim 21, wherein the pre-fetch module further comprises:
    第一预处理模块,适于通过接收所述解析模块的解析结果,并根据所述解析结果获取网页统一资源定位符URL和图像URL的对应关系;The first pre-processing module is configured to receive the parsing result of the parsing module, and obtain a correspondence between the webpage uniform resource locator URL and the image URL according to the parsing result;
    第一预比较模块,适于比较所述多个网页URL与多张图像URL的对应关系,并当所述多个网页URL与同一图像URL对应时,确定包含该图像的多个网页与所述图像为转载关系。a first pre-comparison module, configured to compare a correspondence between the plurality of webpage URLs and the plurality of image URLs, and when the plurality of webpage URLs correspond to the same image URL, determine a plurality of webpages including the image and the The image is a reprint relationship.
  24. 如权利要求21所述的装置,其特征在于,所述预获取模块还包括:The device of claim 21, wherein the pre-fetch module further comprises:
    第二预处理模块,适于计算所述解析模块解析出的多张图像的信息摘要MD5值;a second pre-processing module, configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module;
    第二预比较模块,适于比较所述多张图像的MD5值,并当多张图像的MD5值相同时,确定所述MD5相同的多张图像之间为复制关系。The second pre-comparison module is adapted to compare the MD5 values of the plurality of images, and when the MD5 values of the plurality of images are the same, determine that the plurality of images having the same MD5 are in a replication relationship.
  25. 如权利要求21所述的装置,其特征在于,所述预获取模块还包括:The device of claim 21, wherein the pre-fetch module further comprises:
    第三预处理模块,适于计算所述解析模块解析出的多张图像的信息摘要MD5值;a third pre-processing module, configured to calculate a message digest MD5 value of the plurality of images parsed by the parsing module;
    第三预比较模块,适于比较所述多张图像的MD5值,并当多张图像的MD5值不同时,通过近似拷贝方式确定所述MD5值不同的多张图像间是否为修改关系。The third pre-comparison module is adapted to compare the MD5 values of the plurality of images, and when the MD5 values of the plurality of images are different, determine whether the plurality of images having different MD5 values are modified relationships by an approximate copy manner.
  26. 如权利要求20所述的装置,其特征在于,所述预计算单元包括:The apparatus of claim 20 wherein said pre-computing unit comprises:
    预设置模块,适于预设所述抓取模块抓取网页的资源站点及获取模块获取到的所述不同传播关系的权值;a preset module, configured to preset a resource site of the crawling module to capture a webpage, and a weight of the different propagation relationship acquired by the acquiring module;
    预比配模块,适于利用同一图像族中所述资源站点及所述不同传播关系权值计算该图像族的引用权值。 The pre-matching module is adapted to calculate a reference weight of the image family by using the resource site in the same image family and the different propagation relationship weights.
  27. 一种程序,包括可读代码,当所述可读代码在计算设备上运行时,导致所述计算设备执行根据权利要求1-13中的任一个所述的关于图像搜索的方法。A program comprising readable code that, when executed on a computing device, causes the computing device to perform a method for image search according to any of claims 1-13.
  28. 一种可读介质,其中存储了如权利要求27所述的程序。 A readable medium storing the program of claim 27.
PCT/CN2015/078881 2014-05-14 2015-05-13 Method and device for searching and ranking images and providing image search WO2015172721A1 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
CN201410203342.3A CN103995856B (en) 2014-05-14 2014-05-14 Method and device for image search
CN201410203700.0A CN103995857A (en) 2014-05-14 2014-05-14 Method and device for achieving image search and sorting
CN201410203700.0 2014-05-14
CN201410203342.3 2014-05-14

Publications (1)

Publication Number Publication Date
WO2015172721A1 true WO2015172721A1 (en) 2015-11-19

Family

ID=54479338

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/078881 WO2015172721A1 (en) 2014-05-14 2015-05-13 Method and device for searching and ranking images and providing image search

Country Status (1)

Country Link
WO (1) WO2015172721A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101398832A (en) * 2007-09-30 2009-04-01 国际商业机器公司 Image searching method and system by utilizing human face detection
CN102713902A (en) * 2009-12-02 2012-10-03 萨基姆通讯宽带公司 Method for generating the result of a search carried out using a search engine
WO2013075310A1 (en) * 2011-11-24 2013-05-30 Microsoft Corporation Reranking using confident image samples
CN103995856A (en) * 2014-05-14 2014-08-20 北京奇虎科技有限公司 Method and device for image search
CN103995857A (en) * 2014-05-14 2014-08-20 北京奇虎科技有限公司 Method and device for achieving image search and sorting

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101398832A (en) * 2007-09-30 2009-04-01 国际商业机器公司 Image searching method and system by utilizing human face detection
CN102713902A (en) * 2009-12-02 2012-10-03 萨基姆通讯宽带公司 Method for generating the result of a search carried out using a search engine
WO2013075310A1 (en) * 2011-11-24 2013-05-30 Microsoft Corporation Reranking using confident image samples
CN103995856A (en) * 2014-05-14 2014-08-20 北京奇虎科技有限公司 Method and device for image search
CN103995857A (en) * 2014-05-14 2014-08-20 北京奇虎科技有限公司 Method and device for achieving image search and sorting

Similar Documents

Publication Publication Date Title
US11822600B2 (en) Content tagging
US10210179B2 (en) Dynamic feature weighting
US9008433B2 (en) Object tag metadata and image search
US9607014B2 (en) Image tagging
WO2019127832A1 (en) Intelligent search method and apparatus, terminal, server, and storage medium
JP6785921B2 (en) Picture search method, device, server and storage medium
CA2790421C (en) Indexing and searching employing virtual documents
CN106855952B (en) Neural network-based computing method and device
JP6932360B2 (en) Object search method, device and server
WO2016000507A1 (en) Traffic-saving mode search service method, server, client and system
US8861896B2 (en) Method and system for image-based identification
KR102361112B1 (en) Extracting similar group elements
CN112136123A (en) Characterizing documents for similarity search
CN106777201B (en) Method and device for sorting recommended data on search result page
US20120166412A1 (en) Super-clustering for efficient information extraction
JP6419969B2 (en) Method and apparatus for providing image presentation information
US20160188680A1 (en) Electronic device and information searching method for the electronic device
WO2019056797A1 (en) Network picture capturing method, program and application server
CN108665459A (en) A kind of image fuzzy detection method, computing device and readable storage medium storing program for executing
WO2015172721A1 (en) Method and device for searching and ranking images and providing image search
US20130230248A1 (en) Ensuring validity of the bookmark reference in a collaborative bookmarking system
CN107508705B (en) Resource tree construction method of HTTP element and computing equipment
CN111782945B (en) Book searching method, computing device and storage medium
US10089369B2 (en) Searching method, searching apparatus and device
CN103995856B (en) Method and device for image search

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15791932

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15791932

Country of ref document: EP

Kind code of ref document: A1