CN103995857A - Method and device for achieving image search and sorting - Google Patents

Method and device for achieving image search and sorting Download PDF

Info

Publication number
CN103995857A
CN103995857A CN201410203700.0A CN201410203700A CN103995857A CN 103995857 A CN103995857 A CN 103995857A CN 201410203700 A CN201410203700 A CN 201410203700A CN 103995857 A CN103995857 A CN 103995857A
Authority
CN
China
Prior art keywords
image
images
relation
multiple images
webpage
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201410203700.0A
Other languages
Chinese (zh)
Inventor
陶哲
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201410203700.0A priority Critical patent/CN103995857A/en
Publication of CN103995857A publication Critical patent/CN103995857A/en
Priority to PCT/CN2015/078881 priority patent/WO2015172721A1/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/951Indexing; Web crawling techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/51Indexing; Data structures therefor; Storage structures

Abstract

The invention discloses a method and device for achieving image search and sorting. The method comprises the steps that image families corresponding to a plurality of source images are established; a reference right value of each image family is calculated; reference right values of the image families serve as sorting parameters of search results fed back by search query. By the adoption of the method and device, the high-quality accurate sorting result can be obtained, the image sorting result has the priority sequence of reference frequencies, accuracy of search is greatly improved, and search efficiency is effectively improved.

Description

A kind of method and apparatus of realizing picture search sequence
Technical field
The present invention relates to the technical field of view data processing, be specifically related to a kind of method and apparatus of realizing picture search sequence.
Background technology
Along with the develop rapidly of internet and multimedia technology, the resource on internet also becomes increasingly abundant, and from network, Gains resources also becomes more and more easier; Search engine is a kind of software systems of applying on network, and it can realize search and the discovery of information in some way on network, and demonstrates Search Results after the information searching is processed.
And at present, increasingly mature along with search engine technique, can offer the text message Search Results that user's Search Results has no longer just searched according to user's input command, can also search for network picture according to user's request, and the picture result searching out is and dedicates user to.
But, in the picture searching scheme of currently available technology, be the Search Results of dedicating user to often without any rule, and just by likely relevant picture simply enumerate, in the Search Results of its picture, do not have any priority orders, this will make the picture search result of output show disordered state, and then greatly reduces the accuracy of search, thereby has affected search efficiency.
Summary of the invention
In view of the above problems, the present invention has been proposed to a kind of a kind of method and corresponding a kind of device of realizing picture search sequence of realizing picture search sequence that overcomes the problems referred to above or address the above problem is at least in part provided.
According to one aspect of the present invention, a kind of method that realizes picture search sequence is provided, comprising: create the image family that multiple source images are corresponding; Calculate the weights of quoting of each image family; Parameter according to the size of quoting weights of described each image family as the search results ranking of search inquiry feedback.
Optionally, image family corresponding to the multiple source images of described establishment comprises: capture from resource website the webpage that described source images is corresponding; Obtain by resolving described Webpage multiple images that described source images is corresponding; Obtain the propagation relation between multiple images that described source images is corresponding; Utilize propagation relation between described multiple images to set up multiple image family.
Optionally, the propagation relation of obtaining described between multiple images that described source images is corresponding comprises: resolve the corresponding relation that obtains webpage uniform resource position mark URL and multiple image URL by described Webpage; If multiple webpage URL are corresponding with same image URL, determine that the multiple webpages and the described image that comprise this image are reprinting relation.
Optionally, the propagation relation of obtaining described between multiple images that described source images is corresponding comprises: the informative abstract MD5 value of calculating multiple images that obtain by the analyzing web page page; If the MD5 value of multiple images is identical, determine that between multiple images that described MD5 is identical be replication relation.
Optionally, the propagation relation of obtaining described between multiple images that described source images is corresponding comprises: the MD5 value of calculating multiple images that obtain by the analyzing web page page; If the MD5 value of multiple images is different, determine between multiple images that described MD5 value is different whether be amendment relation by approximate copy mode.
Optionally, the weights of quoting of the each image of described calculating family comprise: the weights of default described resource website and different propagation relations; Utilize resource website described in same image family and described different propagation to be related to that weights calculate the weights of quoting of this image family.
According to a further aspect in the invention, provide a kind of device of realizing picture search sequence, having comprised: creating unit, is suitable for creating the image family that multiple source images are corresponding; Computing unit, is suitable for calculating the weights of quoting of each image family; Sequencing unit, is suitable for the parameter as the search results ranking of search inquiry feedback according to the size of quoting weights of described each image family.
Optionally, described creating unit comprises: handling module, is suitable for capturing from resource website the webpage that described source images is corresponding; Parsing module, the Webpage being suitable for by resolving described handling module crawl obtains multiple images that described source images is corresponding; Acquisition module, is suitable for obtaining the propagation relation between multiple images that described source images is corresponding; Build family's module, be suitable for utilizing propagation relation between described multiple images to set up multiple image family.
Optionally, described acquisition module also comprises: the first processing module, is suitable for the analysis result by receiving described parsing module, and obtains the corresponding relation of webpage uniform resource position mark URL and image URL according to described analysis result; The first comparison module, is suitable for the corresponding relation of more described multiple webpage URL and multiple image URL, and as described multiple webpage URL with same image URL at once, definite multiple webpages and described image that comprises this image is reprinting relation.
Optionally, described acquisition module also comprises: the second processing module, is suitable for calculating the informative abstract MD5 value of multiple images that described parsing module parses; The second comparison module, is suitable for the MD5 value of relatively more described multiple images, and in the time that the MD5 of multiple images value is identical, determines between multiple images that described MD5 is identical to be replication relation.
Optionally, described acquisition module also comprises: the 3rd processing module, is suitable for calculating the informative abstract MD5 value of multiple images that described parsing module parses; The 3rd comparison module, is suitable for the MD5 value of relatively more described multiple images, and in the time that the MD5 of multiple images value is different, determines between multiple images that described MD5 value is different whether be amendment relation by approximate copy mode.
Optionally, described computing unit comprises: module is set, is suitable for default described handling module and captures the resource website of webpage and the weights of the described different propagation relations that acquisition module gets; Match well module, be suitable for utilizing resource website described in same image family and described different propagation to be related to that weights calculate the weights of quoting of this image family.
The embodiment of the present invention is by creating image family corresponding to multiple source images, and calculate the weights of quoting of each image family, and then according to the size of quoting weights of described each image family the parameter as the search results ranking of search inquiry feedback, can obtain more high-quality ranking results accurately, and make image ranking results have the priority orders of quoting on number of times, greatly improve the accuracy of search, and effectively improved search efficiency.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to better understand technological means of the present invention, and can be implemented according to the content of instructions, and for above and other objects of the present invention, feature and advantage can be become apparent, below especially exemplified by the specific embodiment of the present invention.
Brief description of the drawings
By reading below detailed description of the preferred embodiment, various other advantage and benefits will become cheer and bright for those of ordinary skill in the art.Accompanying drawing is only for the object of preferred implementation is shown, and do not think limitation of the present invention.And in whole accompanying drawing, represent identical parts by identical reference symbol.In the accompanying drawings:
Fig. 1 shows a kind of according to an embodiment of the invention method step process flow diagram of realizing picture search sequence;
Fig. 2 shows the another kind of according to an embodiment of the invention method step process flow diagram of realizing picture search sequence;
Fig. 3 shows a kind of according to an embodiment of the invention apparatus structure block diagram of realizing picture search sequence.
Embodiment
Exemplary embodiment of the present disclosure is described below with reference to accompanying drawings in more detail.Although shown exemplary embodiment of the present disclosure in accompanying drawing, but should be appreciated that and can realize the disclosure and the embodiment that should do not set forth limits here with various forms.On the contrary, it is in order more thoroughly to understand the disclosure that these embodiment are provided, and can be by the those skilled in the art that conveys to complete the scope of the present disclosure.
With reference to Fig. 1, show a kind of according to an embodiment of the invention flow chart of steps of the embodiment of the method 1 that realizes picture search sequence, specifically can comprise the steps:
Step 110: create the image family that multiple source images are corresponding;
It should be noted that, identical image family just refers to that from visually seeing of people be consistent image, these images are the images that come by a source images amendment, and because the multiple images in image family are to be come by a source images amendment, therefore the each image in an image family should have identical source images; Based on this, proposing in the present embodiment can be by step S111: capture from resource website the webpage that described source images is corresponding, obtain by resolving described Webpage multiple images that described source images is corresponding; Because each image in same image family derives from same source images, therefore getting after multiple images, can be by step S112: obtain the propagation relation between multiple images that described source images is corresponding, and utilize propagation relation between described multiple images to set up multiple image family.
Certainly, those of ordinary skill in the art readily understand and can also create by other means image family, and the present embodiment does not repeat them here.
It should be noted that the propagation relation of obtaining in the present embodiment between multiple images that described source images is corresponding mainly comprises: reprint, copy and revise, but being not limited to this, can also have other propagation relations, this example does not repeat them here; Concrete, the present embodiment obtains described propagation relation in the following manner:
A, from resource website captures the webpage that described source images is corresponding, by described Webpage being resolved to obtain the corresponding relation of webpage URL and multiple image URL; Wherein, if multiple webpage URL is corresponding with same image URL, determine that the multiple webpages and the described image that comprise this image are reprinting relation; Or,
B, from resource website captures the webpage that described source images is corresponding, obtain multiple images by the analyzing web page page, calculate the informative abstract MD5 value of described multiple images, wherein, if the MD5 value of multiple images is identical, determine that between multiple images that described MD5 is identical be replication relation; Otherwise, judge between multiple images whether be same approximate copy, if so, determine between multiple images that described MD5 value is different to be amendment relation.
Step 120: the weights of quoting that calculate each image family;
In actual applications, the value of different propagation relations is different, and for example the value between above-mentioned three kinds of propagation relations can be: amendment > copies > and reprints; Wherein, the workload that amendment need to expend is greater than simple preservation, and preserving equally picture then provides the cost of picture service to be greater than reprinting behavior; Therefore, this kind of cost means the value difference of every image, namely propagates the basic weights of relation for every kind; And meanwhile, known by analysis, the value of the image that different websites are quoted is also different, its image of quoting of website that visit capacity is large is worth larger, therefore in the present embodiment, has set website weighting parameter; Concrete, the present embodiment proposes to calculate in the following manner the weights of quoting of each image family, includes but not limited to:
The weights of default described resource website and different propagation relations; Wherein, if comprise reprinting in propagation relation, copy and revise, to close be that amendment > copies > and reprints to the weights between three, arranges to revise in described propagation relation to be related to weights, replication relation weights and to reprint and be related to that the size of weights successively decreases successively; Exemplify herein a kind of formula calculate described image family to quote weights as follows;
Ri = Σ j = 0 n 1 a * SITEj * MD 5 ij + Σ j = 0 n 2 b * SITEj * IMGURLij + Σ j = 0 n 3 c * SITEj * PAGEURLij
Wherein, a is that amendment is related to that weights, b are that replication relation weights, c are that reprinting is related to weights, and a>b>c, and SITEj is the weights of website channel.
The present invention is not limited to this computing formula, as long as be out of shape the also row for the present invention's protection according to other formula of inventive concept.
Step 130: the parameter according to the size of quoting weights of described each image family as the search results ranking of search inquiry feedback.
Wherein, when calculating the quoting after weights of each image family, when receiving after user images searching request, if search query hit can be determined the parameter as the search results ranking of search inquiry feedback according to the size of quoting weights of described each image family; For example search results ranking and the weights size relation in direct ratio of quoting with described each image family; Certainly, those of ordinary skill in the art readily understand, using described each image family quote weights as in the parameter of search results ranking, can also introduce other parameters as sequence reference, the present embodiment is to this and be not specifically limited.
Certainly, above-mentioned special type information and judgment mode thereof, just as example, in the time implementing the embodiment of the present invention, can arrange other special type informations and judgment mode thereof according to actual conditions, and the embodiment of the present invention is not limited this.In addition, except above-mentioned special type information and judgment mode thereof, those skilled in the art can also adopt other special type informations and judgment mode thereof according to actual needs, and the embodiment of the present invention is not also limited this.
With reference to Fig. 2, by a concrete picture example, a kind of method that realizes picture search sequence of above-described embodiment is described in detail, specifically comprise the steps:
Step 210: create the image family that comprises figure A, figure B, figure C, figure D, figure E and figure F;
Step 220: capture webpage 7~webpage 14 from resource website;
Step 230: webpage 7~webpage 14 is resolved, obtain the corresponding relation of picture url and webpage url, (picture url, webpage url) is: (A, 13), (B, 14), (C, 11), (D, 12), (F, 10), (E, 7), (E, 8), (E, 9); Wherein, picture E correspondence webpage 7, webpage 8 and webpage 9, therefore the image in webpage 7, webpage 8 and webpage 9 and described image E are reprinting relation;
Step 240: know after the md5 value by calculating picture A~figure F, the md5 of picture B, picture E and picture F is identical, therefore can determine between picture B, picture E and picture F to be replication relation;
Step 250: calculate by the different picture A of md5 value, picture B, picture C and picture D being carried out to " approximate copy ", determine that picture A, picture B, picture C and picture D are approximate copies, can determine thus between picture A, picture B, picture C and picture D to be amendment relation;
Step 260: because figure A is amendment relation to figure B, figure C, figure D, therefore picture A quote weights W1=site (B) * 3*1+site (C) * 3*1+site (D) * 3*1, wherein site is the weight of picture place website, establishes 3 and is related to weight for amendment; And be replication relation to figure E and figure for figure B for F, that therefore schemes B quotes weights W2=site (E) * 2*1+site (F) * 2*1, and establishing 2 is replication relation weight; Are reprinting relations for figure E to webpage 8 and webpage 9, and the original web page of establishing figure E is webpage 7, the weights of quoting of scheming E are W3=site (8) * 1*1+site (9) * 1*1, and establishing 1 is linking relationship weight; Therefore, the weights of quoting of this image family are R=W1+W2+W3.
Step 270: the parameter according to the size of quoting weights R of described each image family as the search results ranking of search inquiry feedback.
Can find out, adopt the method for the embodiment of the present invention, by creating the image family that multiple source images are corresponding, and calculate the weights of quoting of each image family, and then according to the size of quoting weights of described each image family the parameter as the search results ranking of search inquiry feedback, can obtain more high-quality ranking results accurately, and make image ranking results have the priority orders of quoting on number of times, effectively improve search efficiency.
For embodiment of the method, for simple description, therefore it is all expressed as to a series of combination of actions, but those skilled in the art should know, the embodiment of the present invention is not subject to the restriction of described sequence of movement, because according to the embodiment of the present invention, some step can adopt other orders or carry out simultaneously.Secondly, those skilled in the art also should know, the embodiment described in instructions all belongs to preferred embodiment, and related action might not be that the embodiment of the present invention is necessary.
With reference to Fig. 3, show the structured flowchart of a kind of according to an embodiment of the invention device embodiment that realizes picture search sequence, specifically can comprise as lower module: creating unit 310, is suitable for creating the image family that multiple source images are corresponding; Computing unit 320, is suitable for calculating the weights of quoting of each image family; Sequencing unit 330, is suitable for the parameter as the search results ranking of search inquiry feedback according to the size of quoting weights of described each image family.
Wherein, described creating unit 310 comprises (not shown): handling module, is suitable for capturing from resource website the webpage that described source images is corresponding; Parsing module, the Webpage being suitable for by resolving described handling module crawl obtains multiple images that described source images is corresponding; Acquisition module, is suitable for obtaining the propagation relation between multiple images that described source images is corresponding; Build family's module, be suitable for utilizing propagation relation between described multiple images to set up multiple image family.
It should be noted that, acquisition module described in the present embodiment also can comprise (not shown): the first processing module, be suitable for the analysis result by receiving described parsing module, and obtain the corresponding relation of webpage uniform resource position mark URL and image URL according to described analysis result; The first comparison module, is suitable for the corresponding relation of more described multiple webpage URL and multiple image URL, and as described multiple webpage URL with same image URL at once, definite multiple webpages and described image that comprises this image is reprinting relation.
In addition, described acquisition module also comprises (not shown): the second processing module, is suitable for calculating the informative abstract MD5 value of multiple images that described parsing module parses; The second comparison module, is suitable for the MD5 value of relatively more described multiple images, and in the time that the MD5 of multiple images value is identical, determines between multiple images that described MD5 is identical to be replication relation.
In addition, described acquisition module also comprises (not shown): the 3rd processing module, is suitable for calculating the informative abstract MD5 value of multiple images that described parsing module parses; The 3rd comparison module, is suitable for the MD5 value of relatively more described multiple images, and in the time that the MD5 of multiple images value is different, determines between multiple images that described MD5 value is different whether be amendment relation by approximate copy mode.
It should be noted that, described computing unit 320 also can comprise (not shown) in the present embodiment: module is set, is suitable for default described handling module and captures the resource website of webpage and the weights of the described different propagation relations that acquisition module gets; Match well module, be suitable for utilizing resource website described in same image family and described different propagation to be related to that weights calculate the weights of quoting of this image family.
The algorithm providing at this is intrinsic not relevant to any certain computer, virtual system or miscellaneous equipment with demonstration.Various general-purpose systems also can with based on using together with this teaching.According to description above, it is apparent constructing the desired structure of this type systematic.In addition, the present invention is not also for any certain programmed language.It should be understood that and can utilize various programming languages to realize content of the present invention described here, and the description of above language-specific being done is in order to disclose preferred forms of the present invention.
In the instructions that provided herein, a large amount of details are described.But, can understand, embodiments of the invention can be put into practice in the situation that there is no these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand one or more in each inventive aspect, in the above in the description of exemplary embodiment of the present invention, each feature of the present invention is grouped together into single embodiment, figure or sometimes in its description.But, the method for the disclosure should be construed to the following intention of reflection: the present invention for required protection requires than the more feature of feature of clearly recording in each claim.Or rather, as reflected in claims below, inventive aspect is to be less than all features of disclosed single embodiment above.Therefore, claims of following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and can the module in the equipment in embodiment are adaptively changed and they are arranged in one or more equipment different from this embodiment.Module in embodiment or unit or assembly can be combined into a module or unit or assembly, and can put them in addition multiple submodules or subelement or sub-component.At least some in such feature and/or process or unit are mutually repelling, and can adopt any combination to combine all processes or the unit of disclosed all features in this instructions (comprising claim, summary and the accompanying drawing followed) and disclosed any method like this or equipment.Unless clearly statement in addition, in this instructions (comprising claim, summary and the accompanying drawing followed) disclosed each feature can be by providing identical, be equal to or the alternative features of similar object replaces.
In addition, those skilled in the art can understand, although embodiment more described herein comprise some feature instead of further feature included in other embodiment, the combination of the feature of different embodiment means within scope of the present invention and forms different embodiment.For example, in the following claims, the one of any of embodiment required for protection can be used with array mode arbitrarily.
All parts embodiment of the present invention can realize with hardware, or realizes with the software module of moving on one or more processor, or realizes with their combination.It will be understood by those of skill in the art that and can use in practice microprocessor or digital signal processor (DSP) to realize according to the some or all functions of the some or all parts in the equipment that carries out webpage loading of the embodiment of the present invention.The present invention can also be embodied as part or all equipment or the device program (for example, computer program and computer program) for carrying out method as described herein.Realizing program of the present invention and can be stored on computer-readable medium like this, or can there is the form of one or more signal.Such signal can be downloaded and obtain from internet website, or provides on carrier signal, or provides with any other form.
It should be noted above-described embodiment the present invention will be described instead of limit the invention, and those skilled in the art can design alternative embodiment in the case of not departing from the scope of claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and is not listed as element or step in the claims.Being positioned at word " " before element or " one " does not get rid of and has multiple such elements.The present invention can be by means of including the hardware of some different elements and realizing by means of the computing machine of suitably programming.In the unit claim of having enumerated some devices, several in these devices can be to carry out imbody by same hardware branch.The use of word first, second and C grade does not represent any order.Can be title by these word explanations.

Claims (10)

1. a method that realizes picture search sequence, comprising:
Create the image family that multiple source images are corresponding;
Calculate the weights of quoting of each image family;
Parameter according to the size of quoting weights of described each image family as the search results ranking of search inquiry feedback.
2. the method for claim 1, is characterized in that, the image family that the multiple source images of described establishment are corresponding comprises:
Capture from resource website the webpage that described source images is corresponding;
Obtain by resolving described Webpage multiple images that described source images is corresponding;
Obtain the propagation relation between multiple images that described source images is corresponding;
Utilize propagation relation between described multiple images to set up multiple image family.
3. the method as described in claim 1-2 any one, is characterized in that, described in the propagation relation obtained between multiple images that described source images is corresponding comprise:
Resolve the corresponding relation that obtains webpage uniform resource position mark URL and multiple image URL by described Webpage;
If multiple webpage URL are corresponding with same image URL, determine that the multiple webpages and the described image that comprise this image are reprinting relation.
4. the method as described in claim 1-3 any one, is characterized in that, described in the propagation relation obtained between multiple images that described source images is corresponding comprise:
Calculate the informative abstract MD5 value of multiple images that obtain by the analyzing web page page;
If the MD5 value of multiple images is identical, determine that between multiple images that described MD5 is identical be replication relation.
5. the method as described in claim 1-4 any one, is characterized in that, described in the propagation relation obtained between multiple images that described source images is corresponding comprise:
Calculate the MD5 value of multiple images that obtain by the analyzing web page page;
If the MD5 value of multiple images is different, determine between multiple images that described MD5 value is different whether be amendment relation by approximate copy mode.
6. the method as described in claim 1-5 any one, is characterized in that, the weights of quoting of the each image of described calculating family comprise:
The weights of default described resource website and different propagation relations;
Utilize resource website described in same image family and described different propagation to be related to that weights calculate the weights of quoting of this image family.
7. a device of realizing picture search sequence, comprising:
Creating unit, is suitable for creating the image family that multiple source images are corresponding;
Computing unit, is suitable for calculating the weights of quoting of each image family;
Sequencing unit, is suitable for the parameter as the search results ranking of search inquiry feedback according to the size of quoting weights of described each image family.
8. device as claimed in claim 7, is characterized in that, described creating unit comprises:
Handling module, is suitable for capturing from resource website the webpage that described source images is corresponding;
Parsing module, the Webpage being suitable for by resolving described handling module crawl obtains multiple images that described source images is corresponding;
Acquisition module, is suitable for obtaining the propagation relation between multiple images that described source images is corresponding;
Build family's module, be suitable for utilizing propagation relation between described multiple images to set up multiple image family.
9. the device as described in claim 7-8 any one, is characterized in that, described acquisition module also comprises:
The first processing module, is suitable for the analysis result by receiving described parsing module, and obtains the corresponding relation of webpage uniform resource position mark URL and image URL according to described analysis result;
The first comparison module, is suitable for the corresponding relation of more described multiple webpage URL and multiple image URL, and as described multiple webpage URL with same image URL at once, definite multiple webpages and described image that comprises this image is reprinting relation.
10. the device as described in claim 7-9 any one, is characterized in that, described acquisition module also comprises:
The second processing module, is suitable for calculating the informative abstract MD5 value of multiple images that described parsing module parses;
The second comparison module, is suitable for the MD5 value of relatively more described multiple images, and in the time that the MD5 of multiple images value is identical, determines between multiple images that described MD5 is identical to be replication relation.
CN201410203700.0A 2014-05-14 2014-05-14 Method and device for achieving image search and sorting Pending CN103995857A (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN201410203700.0A CN103995857A (en) 2014-05-14 2014-05-14 Method and device for achieving image search and sorting
PCT/CN2015/078881 WO2015172721A1 (en) 2014-05-14 2015-05-13 Method and device for searching and ranking images and providing image search

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410203700.0A CN103995857A (en) 2014-05-14 2014-05-14 Method and device for achieving image search and sorting

Publications (1)

Publication Number Publication Date
CN103995857A true CN103995857A (en) 2014-08-20

Family

ID=51310022

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410203700.0A Pending CN103995857A (en) 2014-05-14 2014-05-14 Method and device for achieving image search and sorting

Country Status (1)

Country Link
CN (1) CN103995857A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015172721A1 (en) * 2014-05-14 2015-11-19 北京奇虎科技有限公司 Method and device for searching and ranking images and providing image search

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102799635A (en) * 2012-06-27 2012-11-28 天津大学 Image set ordering method driven by user
CN103309864A (en) * 2012-03-07 2013-09-18 腾讯科技(深圳)有限公司 Method, device and system for displaying search result
CN103617262A (en) * 2013-12-02 2014-03-05 北京奇虎科技有限公司 Picture content attribute identification method and system
CN103699612A (en) * 2013-12-13 2014-04-02 中国科学院深圳先进技术研究院 Image retrieval ranking method and device

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103309864A (en) * 2012-03-07 2013-09-18 腾讯科技(深圳)有限公司 Method, device and system for displaying search result
CN102799635A (en) * 2012-06-27 2012-11-28 天津大学 Image set ordering method driven by user
CN103617262A (en) * 2013-12-02 2014-03-05 北京奇虎科技有限公司 Picture content attribute identification method and system
CN103699612A (en) * 2013-12-13 2014-04-02 中国科学院深圳先进技术研究院 Image retrieval ranking method and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2015172721A1 (en) * 2014-05-14 2015-11-19 北京奇虎科技有限公司 Method and device for searching and ranking images and providing image search

Similar Documents

Publication Publication Date Title
CN103914529B (en) Search exhibiting method and device
CN104484459A (en) Method and device for combining entities in knowledge map
CN103744853A (en) Method and device for providing web cache information in search engine
CN104077391A (en) Method, server, client and system for providing special news search
JP2011054159A (en) System to modify website for organic search optimization
RU2645266C1 (en) Method and device for planning web-crowlers in accordance with keyword search
CN109871311B (en) Method and device for recommending test cases
CN103617241A (en) Search information processing method, browser terminal and server
CN103870607A (en) Sequencing method and device of search results of multiple search engines
CN103984757A (en) Method and system for inserting news information articles in search result page
CN102955850A (en) Method and device for loading sequencing website
CN103605848A (en) Method and device for analyzing paths
CN104317931A (en) Webpage title determining method and device
CN107784003B (en) Data query anomaly detection method, device, equipment and system
CN104036003A (en) Search result integration method and device
CN103593406A (en) Static resource identifier processing method and device
CN111967234A (en) Visual report generation method and device, terminal equipment and storage medium
CN102982177A (en) Method and device for performing search in browser
CN103544271B (en) Load Image in a kind of browser the method and apparatus for processing window
US20160117398A1 (en) Systems and methods for extracting similar group elements
CN103744970A (en) Method and device for determining subject term of picture
CN104317929A (en) Search result display optimizing method and device
CN104778233A (en) Searching method and device based on click rate
CN105786910A (en) Term weight calculation method and device
CN104462556A (en) Method and device for recommending question and answer page related questions

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20140820

RJ01 Rejection of invention patent application after publication