CN102662957B - Apparatus and method for optimizing search result page of browser - Google Patents

Apparatus and method for optimizing search result page of browser

Info

Publication number
CN102662957B
Authority
CN
China
Prior art keywords
information
result
search
item
page searching
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201210054359.8A
Other languages
Chinese (zh)
Other versions
CN102662957A (en)
Inventor
阮星华
高亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Baidu Online Network Technology Beijing Co Ltd
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN201210054359.8A
Publication of CN102662957A
Application granted
Publication of CN102662957B
Active
Anticipated expiration

Classifications

    • Y: GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02: TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02D: CLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00: Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The present invention provides an apparatus and a method for optimizing a search result page of a browser. The apparatus includes: an extraction module for extracting structural information of the search result page or information of each information item on the search result page; a determination module for determining, according to the structural information or the information of each information item, whether search bad points exist in the search result page; and an automatic feedback module for automatically feeding the search bad points back to a backend server. On the one hand, the apparatus in embodiments of the present invention can mine out search results with poor relevance or poor presentation and feed them back to the backend server to help improve the search engine algorithm. On the other hand, the apparatus uses idle resources of search engine clients to analyze the search result page and thus saves energy. The analysis runs in step with the user's use of the search engine, so no separate analysis request needs to be issued to the search engine, the search product is put under no extra load, and the user's use is unaffected.

Description

Apparatus and method for optimizing a search result page of a browser
Technical field
The present invention relates to the technical field of web search, and in particular to an apparatus and a method for optimizing a search result page of a browser.
Background art
Internet search engines face hundreds of millions of web pages of every kind. Although existing search techniques and the corresponding search algorithms have improved enormously, unsatisfactory search results are unavoidable when processing such a massive and varied set of pages: for example, poorly relevant results, poorly presented results, dead links, garbled text, duplicate results, and inaccurate titles or abstracts. These phenomena are collectively called Bad Cases. Discovering Bad Cases promptly and improving the search algorithm accordingly is extremely important for an Internet search engine.
Summary of the invention
The present invention aims to solve at least one of the above technical problems.
To this end, one object of the present invention is to propose an apparatus for optimizing a search result page of a browser that, while a user is using a search engine, automatically analyzes the search results, mines out search bad points, and automatically feeds them back to a backend server so that the search engine can be improved.
Another object of the present invention is to propose a method for optimizing a search result page of a browser.
To achieve the above objects, an apparatus for optimizing a search result page of a browser according to an embodiment of a first aspect of the present invention comprises: an extraction module for extracting structural information of the search result page or information of each information item in the search result page; a judgment module for judging, according to the structural information or the information of each information item, whether a search bad point exists in the search result page; and an automatic feedback module for automatically feeding the search bad point back to a backend server.
With the apparatus for optimizing a search result page of a browser according to the embodiment of the present invention, on the one hand, the extraction module and the judgment module analyze the structural information of the search result page and check each information item for dead links and for problems with its title, abstract, and search keyword, so that search results with poor relevance or poor presentation are mined out and fed back to the backend server, which helps improve the search engine algorithm. On the other hand, the apparatus can use idle resources of the search engine client to extract, analyze, judge, and feed back on the search result page, which saves resources. The analysis runs in step with the user's use of the search engine, so no separate analysis request has to be sent to the search engine; it places no load on the search product itself and does not affect the user.
To achieve the above objects, a method for optimizing a search result page of a browser according to an embodiment of a second aspect of the present invention comprises the following steps: extracting structural information of the search result page or information of each information item in the search result page; judging, according to the structural information or the information of each information item, whether a search bad point exists in the search result page; and automatically feeding the search bad point back to a backend server.
With the method for optimizing a search result page of a browser according to the embodiment of the present invention, on the one hand, the structural information of the search result page or the information of each information item is extracted, the structural information is analyzed, and each information item is checked for dead links and for problems with its title, abstract, and search keyword, so that search results with poor relevance or poor presentation are mined out and fed back to the backend server, which helps improve the search engine algorithm. On the other hand, the method can use idle resources of the search engine client to extract, analyze, judge, and feed back on the search result page, which saves resources. The analysis runs in step with the user's use of the search engine, so no separate analysis request has to be sent to the search engine; it places no load on the search product itself and does not affect the user.
Additional aspects and advantages of the present invention will be given in part in the following description, and in part will become apparent from that description or be learned through practice of the present invention.
Brief description of the drawings
The above and/or additional aspects and advantages of the present invention will become apparent and readily understood from the following description of the embodiments taken together with the accompanying drawings, in which:
Fig. 1 is a structural block diagram of an apparatus for optimizing a search result page of a browser according to an embodiment of the present invention;
Fig. 2 is a structural block diagram of an apparatus for optimizing a search result page of a browser according to an embodiment of the present invention;
Fig. 3 is a structural block diagram of an apparatus for optimizing a search result page of a browser according to an embodiment of the present invention;
Fig. 4 is a flowchart of a method for optimizing a search result page of a browser according to an embodiment of the present invention;
Fig. 5 is a flowchart of a method for optimizing a search result page of a browser according to an embodiment of the present invention; and
Fig. 6 is a flowchart of a method for optimizing a search result page of a browser according to an embodiment of the present invention.
Detailed description of the embodiments
Embodiments of the present invention are described in detail below. Examples of the embodiments are shown in the drawings, in which the same or similar reference numerals throughout denote the same or similar elements or elements having the same or similar functions. The embodiments described below with reference to the drawings are exemplary, serve only to explain the present invention, and should not be construed as limiting it. On the contrary, the embodiments of the present invention include all changes, modifications, and equivalents falling within the spirit and scope of the appended claims.
In the description of the present invention, it should be understood that the terms "first", "second", and so on are used for descriptive purposes only and are not to be construed as indicating or implying relative importance. It should also be noted that, unless otherwise expressly specified and limited, the terms "connected" and "connection" are to be understood broadly: a connection may, for example, be fixed, detachable, or integral; it may be mechanical or electrical; and it may be direct or indirect through an intermediary. A person of ordinary skill in the art can understand the specific meaning of these terms in the present invention according to the specific circumstances. In addition, in the description of the present invention, unless otherwise stated, "multiple" means two or more.
Any process or method described in a flowchart or otherwise described herein can be understood as representing a module, segment, or portion of code that includes one or more executable instructions for implementing the steps of a specific logical function or process. The scope of the preferred embodiments of the present invention includes other implementations in which functions may be performed out of the order shown or discussed, including substantially simultaneously or in reverse order depending on the functions involved, as should be understood by those skilled in the art to which the embodiments of the present invention belong.
An apparatus for optimizing a search result page of a browser according to embodiments of the present invention is described below with reference to the drawings.
An apparatus for optimizing a search result page of a browser comprises: an extraction module for extracting structural information of the search result page or information of each information item in the search result page; a judgment module for judging, according to the structural information or the information of each information item, whether a search bad point exists in the search result page; and an automatic feedback module for automatically feeding the search bad point back to a backend server.
Fig. 1 is a structural block diagram of an apparatus for optimizing a search result page of a browser according to an embodiment of the present invention.
As shown in Fig. 1, the apparatus for optimizing a search result page of a browser according to the embodiment of the present invention comprises an extraction module 100, a judgment module 200, and an automatic feedback module 300.
Specifically, the extraction module 100 is used to extract the structural information of the search result page or the information of each information item in the search result page.
In one embodiment of the present invention, the structural information includes the number and positions of the advertising information items, and the number and positions of the general information items, contained in the search result page. Specifically, advertising information items may include brand advertisements, sponsored-link advertisements, promotion advertisements, brand protection results, and the like. General information items may be AS results, and may also include Aladdin results (Aladdin is a general open platform released by the Baidu search engine that opens its interface to owners of unique information data, addressing the "darknet" (hidden web) information that existing search engines cannot crawl or retrieve), and the like.
In one embodiment of the present invention, the information of each information item includes the link, title, abstract, and search keyword contained in that information item.
In one embodiment of the present invention, the functions of the extraction module 100 may be defined in JavaScript. For example, a batOverlay.js file is defined; inside it, a class bat (executable file) and its related methods are defined for initialization, together with various functions for analyzing search results. For example, the function Bat.page_analysis() analyzes the number and positions of the advertising information items and of the general information items contained in the search result page. Calling this function on a certain search result page might output {P:1, AS:5, AL:1, AS:3}, where P:1 denotes one brand advertisement in the first position, AS:5 denotes 5 general results following the brand advertisement, AL:1 denotes 1 Aladdin result following the 5 general results, and AS:3 denotes 3 general results following the Aladdin result.
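The kind of positional summary described for Bat.page_analysis() can be sketched as a run-length grouping of item types in page order. This is only an illustration under stated assumptions: the function name summarizeLayout and the input format (an ordered array of type codes) are hypothetical and are not taken from batOverlay.js. Because the type code AS can occur in more than one run, the sketch returns an ordered list of runs rather than a plain object, which could not hold the key AS twice.

```javascript
// Hypothetical sketch: group consecutive items of the same type into runs.
// The codes "P" (brand ad), "AS" (general result) and "AL" (Aladdin result)
// follow the example output in the text; everything else is assumed.
function summarizeLayout(itemTypes) {
  const runs = [];
  for (const type of itemTypes) {
    const last = runs[runs.length - 1];
    if (last && last.type === type) {
      last.count += 1; // same type as the previous item: extend the run
    } else {
      runs.push({ type, count: 1 }); // a new run starts at this position
    }
  }
  return runs;
}

const layout = summarizeLayout([
  "P",
  "AS", "AS", "AS", "AS", "AS",
  "AL",
  "AS", "AS", "AS",
]);
console.log(JSON.stringify(layout));
```

For the ten items above, the runs correspond to the P:1, AS:5, AL:1, AS:3 example in the text.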
The judgment module 200 is used to judge, according to the structural information or the information of each information item, whether a search bad point exists in the search result page.
Specifically, the judgment module 200 judges according to the structural information whether a search bad point exists in the search result page. Thresholds for the structural information of the search result page are set first. For example, according to the search keyword entered by the user, the threshold for the number of advertising information items in a search result page is set to 1 and the threshold for the number of Aladdin results is set to 1. If the number of advertising information items or the number of Aladdin results in a search result page exceeds the set threshold of 1, the judgment module 200 judges that a search bad point exists in that search result page.
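A minimal sketch of this structural check, assuming the page layout is available as an ordered list of typed runs. The names countStructure and hasStructuralBadPoint, the run format, and the treatment of "P" as the only advertising code are all assumptions made for illustration; only the threshold value of 1 comes from the text.

```javascript
// Hypothetical sketch: count advertising items and Aladdin results from
// a list of runs, then compare the counts with the thresholds in the text.
function countStructure(runs) {
  const counts = { ads: 0, aladdin: 0 };
  for (const { type, count } of runs) {
    if (type === "P") counts.ads += count;          // assumed: "P" is the only ad code here
    else if (type === "AL") counts.aladdin += count; // "AL" marks an Aladdin result
  }
  return counts;
}

function hasStructuralBadPoint(runs, thresholds = { ads: 1, aladdin: 1 }) {
  const counts = countStructure(runs);
  return counts.ads > thresholds.ads || counts.aladdin > thresholds.aladdin;
}

const normalPage = [
  { type: "P", count: 1 },
  { type: "AS", count: 5 },
  { type: "AL", count: 1 },
  { type: "AS", count: 3 },
];
const crowdedPage = [
  { type: "P", count: 1 },
  { type: "AL", count: 2 },
  { type: "AS", count: 7 },
];
console.log(hasStructuralBadPoint(normalPage), hasStructuralBadPoint(crowdedPage));
```

The first page stays within both thresholds, while the second has two Aladdin results and is therefore flagged.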
The judgment module 200 can also judge, according to the information of each information item, whether a search bad point exists in the search result page. For example, if the link contained in a first information item among the information items cannot be opened, or the first information item contains no abstract, or the title and/or abstract of the first information item is a duplicate, or the title or abstract of the first information item does not contain the search keyword, the judgment module 200 judges that this first information item is a search bad point.
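The per-item conditions listed above can be sketched as follows. The function names, the item shape ({ title, abstract }), and the problem labels are hypothetical. The sketch also assumes that the dead-link result is supplied by the caller and that an item counts as missing the keyword only when neither its title nor its abstract contains it, which is one possible reading of the text.

```javascript
// Hypothetical sketch of the per-item checks described in the text.
function checkItem(item, keyword, linkIsDead) {
  const problems = [];
  if (linkIsDead) problems.push("dead-link");
  const abstract = (item.abstract || "").trim();
  if (abstract === "") problems.push("missing-abstract");
  // Assumed reading: flag the item only if the keyword appears nowhere.
  const haystack = `${item.title || ""} ${abstract}`.toLowerCase();
  if (!haystack.includes(keyword.toLowerCase())) problems.push("keyword-missing");
  return problems;
}

// Returns the indexes of items whose title repeats an earlier item's title.
function findDuplicateTitles(items) {
  const firstSeen = new Map();
  const duplicates = [];
  items.forEach((item, index) => {
    const title = (item.title || "").trim();
    if (firstSeen.has(title)) duplicates.push(index);
    else firstSeen.set(title, index);
  });
  return duplicates;
}

const items = [
  { title: "Baidu patent search", abstract: "How patent search works" },
  { title: "Unrelated page", abstract: "" },
  { title: "Baidu patent search", abstract: "Another patent page" },
];
const firstProblems = checkItem(items[0], "patent", false);
const secondProblems = checkItem(items[1], "patent", true);
const duplicateIndexes = findDuplicateTitles(items);
console.log(firstProblems, secondProblems, duplicateIndexes);
```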
In one embodiment of the present invention, the functions of the judgment module 200 may also be defined in JavaScript, for example by defining, inside the batOverlay.js file, various functions for analyzing search results, such as Bat.linkcheck(type), Bat.is_equal(), and Bat.piaohong(). For example, Bat.linkcheck(type) performs a dead-link check on the information items of the search results: type=all checks all information items in the search result page, type=top checks the first three information items, and type=random checks three randomly chosen information items. Bat.is_equal() judges whether the search result page contains information items with identical titles or abstracts. Bat.piaohong() judges whether keyword highlighting ("piaohong", marking matched keywords in red) in the search result page is normal, that is, whether the title and abstract contain highlighting, which amounts to whether the title or abstract of an information item contains the search keyword. Bat.title_abstract() checks whether the titles and abstracts of the information items in the search result page are correct. For example, the output after calling this function might be {A:10, A1:{0,0,0}, A2:{0,1,0} ... A10:{0,0,0}}, meaning that 10 general results are presented in total and that the abstract flag of the 2nd result (A2:{0,1,0}) is 1, indicating that its abstract is abnormal.
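The three modes of the type parameter of Bat.linkcheck(type) could select items roughly as sketched below. The helper name selectItemsForLinkCheck and the item shape are hypothetical, and the actual network check for a dead link is deliberately left out; the sketch only shows which items each mode would pick.

```javascript
// Hypothetical sketch: choose which information items get a dead-link check.
function selectItemsForLinkCheck(items, type) {
  switch (type) {
    case "all":
      return items.slice(); // every item on the page
    case "top":
      return items.slice(0, 3); // the first three items
    case "random": {
      // Fisher-Yates shuffle on a copy, then take three items.
      const pool = items.slice();
      for (let i = pool.length - 1; i > 0; i--) {
        const j = Math.floor(Math.random() * (i + 1));
        [pool[i], pool[j]] = [pool[j], pool[i]];
      }
      return pool.slice(0, 3);
    }
    default:
      throw new Error(`unknown link check type: ${type}`);
  }
}

const results = Array.from({ length: 10 }, (_, i) => ({ position: i + 1 }));
console.log(selectItemsForLinkCheck(results, "top").map((r) => r.position));
```

Shuffling a copy keeps the original result order intact, so the positions reported for any dead links still refer to the page as the user saw it.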
The automatic feedback module 300 is used to automatically feed the search bad point back to the backend server.
In one embodiment of the present invention, the automatic feedback module 300 feeds back to the backend server by using HTTP requests. For example, the automatic feedback module 300 sends the search results that the judgment module 200 has judged to contain search bad points to the backend server in JSON (JavaScript Object Notation, a lightweight data interchange format) through an HTTP service port. The HTTP service port provides a write_db.php file, which writes the data transmitted by POST into a database. A plug-in in the browser can request the PHP page corresponding to write_db.php via XMLHttpRequest to upload the results to the database. This functionality can be encapsulated in the automatic feedback module 300, so that when the judgment module 200 judges that a search bad point exists, the search bad point is automatically fed back to the backend server.
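A minimal sketch of how a bad point might be serialized for this POST. The function name buildFeedbackRequest, the host backend.example, the payload fields, and the Content-Type header are assumptions; only the file name write_db.php, the use of POST, and the JSON format come from the text. The sketch builds a plain request description so that it runs outside a browser.

```javascript
// Hypothetical sketch: describe the POST request that carries one bad point.
function buildFeedbackRequest(endpoint, badPoint) {
  return {
    method: "POST",
    url: endpoint,
    headers: { "Content-Type": "application/json" }, // assumed header
    body: JSON.stringify(badPoint),
  };
}

const request = buildFeedbackRequest("http://backend.example/write_db.php", {
  keyword: "example query", // hypothetical payload fields
  position: 2,
  problems: ["dead-link"],
});
console.log(request.method, request.url, request.body);
```

In a browser plug-in, such a description could be handed to XMLHttpRequest by calling open(request.method, request.url), then setRequestHeader for each header, then send(request.body).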
With the apparatus for optimizing a search result page of a browser according to the embodiment of the present invention, on the one hand, the extraction module and the judgment module analyze the structural information of the search result page and check each information item for dead links and for problems with its title, abstract, and search keyword, so that search results with poor relevance or poor presentation are mined out and fed back to the backend server, which helps improve the search engine algorithm. On the other hand, the apparatus can use idle resources of the search engine client to extract, analyze, judge, and feed back on the search result page, which saves resources. The analysis runs in step with the user's use of the search engine, so no separate analysis request has to be sent to the search engine; it places no load on the search product itself and does not affect the user.
Fig. 2 is a structural block diagram of an apparatus for optimizing a search result page of a browser according to an embodiment of the present invention.
As shown in Fig. 2, the apparatus for optimizing a search result page of a browser according to the embodiment of the present invention comprises an extraction module 100, a judgment module 200, an automatic feedback module 300, and a manual feedback module 400.
Specifically, the extraction module 100 is used to extract the structural information of the search result page or the information of each information item in the search result page. The judgment module 200 is used to judge, according to the structural information or the information of each information item, whether a search bad point exists in the search result page. The automatic feedback module 300 is used to automatically feed the search bad point back to the backend server. The manual feedback module 400 provides an interface through which the user can send opinions on the search results to the backend server.
In one embodiment of the present invention, the automatic feedback module 300 or the manual feedback module 400 feeds back to the backend server by using HTTP requests. For example, the automatic feedback module 300 or the manual feedback module 400 sends the search results that the judgment module 200 has judged to contain search bad points to the backend server in JSON (JavaScript Object Notation, a lightweight data interchange format) through an HTTP service port. The HTTP service port provides a write_db.php file, which writes the data transmitted by POST into a database. A plug-in in the browser can request the PHP page corresponding to write_db.php via XMLHttpRequest to upload the results to the database. This functionality can be encapsulated in the automatic feedback module 300, so that when the judgment module 200 judges that a search bad point exists, the search bad point is automatically fed back to the backend server. The manual feedback module 400 can expose the same functionality as an interface, so that the user can give feedback quickly when the search results seem unsatisfactory. With a simple function control such as a right-click menu entry, the user can report a search bad point conveniently and promptly, achieving one-click feedback.
With the apparatus for optimizing a search result page of a browser according to the embodiment of the present invention, when a user considers the search results unsatisfactory, the user can quickly give feedback to the backend server through the manual feedback module. User participation speeds up the discovery of search bad points and improves efficiency. Because the manual feedback module is packaged as an interface, the user can report a search bad point conveniently and promptly with a simple function button, which lowers the cost of user participation in testing.
Fig. 3 is a structural block diagram of an apparatus for optimizing a search result page of a browser according to an embodiment of the present invention.
As shown in Fig. 3, the apparatus for optimizing a search result page of a browser according to the embodiment of the present invention comprises an extraction module 100, a judgment module 200, a setting unit 210, a comparison unit 220, a determination unit 230, and an automatic feedback module 300.
Specifically, the extraction module 100 is used to extract the structural information of the search result page or the information of each information item in the search result page.
The judgment module 200 is used to judge, according to the structural information or the information of each information item, whether a search bad point exists in the search result page. In one embodiment of the present invention, the judgment module 200 comprises the setting unit 210, the comparison unit 220, and the determination unit 230.
More specifically, the setting unit 210 is used to set the thresholds for the structural information of the search result page according to the search keyword. For example, according to the search keyword entered by the user, the threshold for the number of advertising information items in a search result page is set to 1 and the threshold for the number of Aladdin results is set to 1.
The comparison unit 220 is used to compare the structural information with the thresholds. Specifically, the comparison unit 220 compares the structural information extracted by the extraction module 100 with the thresholds for the structural information set by the setting unit 210.
The determination unit 230 is used to determine, according to the comparison result of the comparison unit 220 or according to the information of each information item, whether a search bad point exists in the search result page.
In one embodiment of the present invention, the determination unit 230 determines according to the comparison result of the comparison unit 220 whether a search bad point exists in the search result page. For example, if the comparison result shows that the number of advertising information items in a search result page is 2, or that the number of Aladdin results is 3, which exceeds the set threshold, the determination unit 230 determines that a search bad point exists in the search result page.
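The division of work among the three units can be sketched as three small functions. All names here are hypothetical, and the idea of a per-keyword override table on top of default thresholds is an assumption about what setting thresholds "according to the search keyword" might look like; the text itself only gives the example values.

```javascript
// Hypothetical sketch of the setting, comparison and determination units.

// Setting unit: pick thresholds for a keyword, falling back to defaults.
function setThresholds(keyword, perKeyword, defaults) {
  return { ...defaults, ...(perKeyword[keyword] || {}) };
}

// Comparison unit: list the structural counts that exceed their threshold.
function compareWithThresholds(structure, thresholds) {
  return Object.keys(thresholds).filter(
    (name) => (structure[name] || 0) > thresholds[name]
  );
}

// Determination unit: a bad point exists if any threshold was exceeded.
function determineBadPoint(exceeded) {
  return exceeded.length > 0;
}

const defaults = { ads: 1, aladdin: 1 };
const thresholds = setThresholds("example query", {}, defaults);
const exceeded = compareWithThresholds({ ads: 2, aladdin: 3 }, thresholds);
const hasBadPoint = determineBadPoint(exceeded);
console.log(exceeded, hasBadPoint);
```

Keeping the comparison result as a list of exceeded names, rather than a single boolean, lets the feedback sent to the backend server say which threshold was violated.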
In one embodiment of the present invention, the determination unit 230 also determines according to the information of each information item whether a search bad point exists in the search result page. For example, if the link contained in a first information item among the information items cannot be opened, or the first information item contains no abstract, or the title and/or abstract of the first information item is a duplicate, or the title or abstract of the first information item does not contain the search keyword, the determination unit 230 determines that this first information item is a search bad point.
In one embodiment of the present invention, the function by which the determination unit 230 determines, according to the information of each information item, whether a search bad point exists in the search result page may also be defined in JavaScript, for example by defining, inside the batOverlay.js file, various functions for analyzing search results, such as Bat.linkcheck(type), Bat.is_equal(), and Bat.piaohong(). For example, Bat.linkcheck(type) performs a dead-link check on the information items of the search results: type=all checks all information items in the search result page, type=top checks the first three information items, and type=random checks three randomly chosen information items. Bat.is_equal() judges whether the search result page contains information items with identical titles or abstracts. Bat.piaohong() judges whether keyword highlighting ("piaohong", marking matched keywords in red) in the search result page is normal, that is, whether the title and abstract contain highlighting, which amounts to whether the title or abstract of an information item contains the search keyword. Bat.title_abstract() checks whether the titles and abstracts of the information items in the search result page are correct. For example, the output after calling this function might be {A:10, A1:{0,0,0}, A2:{0,1,0} ... A10:{0,0,0}}, meaning that 10 general results are presented in total and that the abstract flag of the 2nd result (A2:{0,1,0}) is 1, indicating that its abstract is abnormal.
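A report shaped like the Bat.title_abstract() example could be scanned for abnormal abstracts as sketched below. This rests on several assumptions: the report is represented as a JavaScript object with one three-element array per result, and the second element of each array is taken to be the abstract flag because that is the position set to 1 in the A2 example. The text does not define the other two positions, so the sketch ignores them.

```javascript
// Hypothetical sketch: list the result numbers whose abstract flag is 1.
function findAbnormalAbstracts(report) {
  const abnormal = [];
  for (let i = 1; i <= report.A; i++) {
    const flags = report[`A${i}`];
    // Assumed layout: index 1 of each flag triple is the abstract flag.
    if (flags && flags[1] === 1) abnormal.push(i);
  }
  return abnormal;
}

// A shortened report with 3 results instead of the 10 in the text.
const report = {
  A: 3,
  A1: [0, 0, 0],
  A2: [0, 1, 0],
  A3: [0, 0, 0],
};
const abnormalResults = findAbnormalAbstracts(report);
console.log(abnormalResults);
```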
The automatic feedback module 300 is used to automatically feed the search bad point back to the backend server.
The manual feedback module 400 provides an interface through which the user can send opinions on the search results to the backend server.
In one embodiment of the present invention, the automatic feedback module 300 or the manual feedback module 400 feeds back to the backend server by using HTTP requests. For example, the automatic feedback module 300 or the manual feedback module 400 sends the results that the determination unit 230 has determined to contain search bad points to the backend server in JSON (JavaScript Object Notation, a lightweight data interchange format) through an HTTP service port. The HTTP service port provides a write_db.php file, which writes the data transmitted by POST into a database. A plug-in in the browser can request the PHP page corresponding to write_db.php via XMLHttpRequest to send the results back to the database. This functionality can be encapsulated in the automatic feedback module 300, so that when the determination unit 230 determines that a search bad point exists, the search bad point is automatically fed back to the backend server. The manual feedback module 400 can expose the same functionality as an interface, so that the user can quickly give feedback to the backend server when the search results seem unsatisfactory.
With the apparatus for optimizing a search result page of a browser according to the embodiment of the present invention, on the one hand, the extraction module extracts the structural information of the search result page and the information of each information item in the search result page, and the setting unit, the comparison unit, and the determination unit then analyze the structural information and check each information item for dead links and for problems with its title, abstract, and search keyword, so that search results with poor relevance or poor presentation are mined out and fed back to the backend server, which helps improve the search engine algorithm. On the other hand, the apparatus can use idle resources of the search engine client to extract, analyze, judge, and feed back on the search result page, which saves resources. The analysis runs in step with the user's use of the search engine, so no separate analysis request has to be sent to the search engine; it places no load on the search product itself and does not affect the user.
A method for optimizing a search result page of a browser according to embodiments of the present invention is described below with reference to the drawings.
A method for optimizing a search result page of a browser comprises the following steps: extracting structural information of the search result page or information of each information item in the search result page; judging, according to the structural information or the information of each information item, whether a search bad point exists in the search result page; and automatically feeding the search bad point back to a backend server.
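The three steps of the method can be sketched as a small pipeline in which each step is passed in as a function. The name optimizeSearchResultPage, the page shape, and the stub implementations of the steps are all hypothetical; the sketch only shows the order extract, judge, feed back.

```javascript
// Hypothetical sketch: run the three method steps in order.
function optimizeSearchResultPage(page, steps) {
  const info = steps.extract(page);     // step 1: extract information
  const badPoints = steps.judge(info);  // step 2: judge whether bad points exist
  for (const badPoint of badPoints) {
    steps.feedback(badPoint);           // step 3: feed each bad point back
  }
  return badPoints;
}

// Stub steps for illustration: an item with an empty abstract is a bad point,
// and "feedback" just records what would be sent to the backend server.
const sent = [];
const found = optimizeSearchResultPage(
  { items: [{ title: "A", abstract: "" }, { title: "B", abstract: "text" }] },
  {
    extract: (page) => page.items,
    judge: (items) => items.filter((item) => item.abstract === ""),
    feedback: (badPoint) => sent.push(badPoint),
  }
);
console.log(found.length, sent.length);
```

Passing the steps in as functions keeps the pipeline independent of how extraction, judgment, and feedback are actually implemented in a given embodiment.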
Fig. 4 is a flowchart of a method for optimizing a search result page of a browser according to an embodiment of the present invention.
As shown in Fig. 4, the method for optimizing a search result page of a browser according to the embodiment of the present invention comprises the following steps.
Step S101: extract the structural information of the search result page or the information of each information item in the search result page.
In one embodiment of the present invention, the structural information includes the number and positions of the advertising information items, and the number and positions of the general information items, contained in the search result page. Specifically, advertising information items may include brand advertisements, sponsored-link advertisements, promotion advertisements, brand protection results, and the like. General information items may be AS results, and may also include Aladdin results (Aladdin is a general open platform released by the Baidu search engine that opens its interface to owners of unique information data, addressing the "darknet" (hidden web) information that existing search engines cannot crawl or retrieve), and the like.
In one embodiment of the present invention, the information of each information item includes the link, title, abstract, and search keyword contained in that information item.
In one embodiment of the invention, the function of extracting the structural information of the search result page or the information of each information item in the search result page may be defined in JavaScript. For example, a batOverlay.js file is defined; inside this file, a class Bat and related methods are defined for initialization, together with the various functions used to analyze search results. For example, the function Bat.page_analysis() can analyze the number and positions of the advertising information items and the number and positions of the general information items contained in the search result page. Calling this function on a given search result page may output {P:1, AS:5, AL:1, AS:3}, where P:1 indicates one brand advertisement in the first position, AS:5 indicates 5 general results following the brand advertisement, AL:1 indicates 1 Aladdin result following the 5 general results, and AS:3 indicates 3 general results following the Aladdin result.
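The segment output described above can be sketched as follows. This is a minimal illustration rather than the patent's implementation: the function name pageAnalysis, the input array of type codes and the array-of-segments return value are assumptions. An array is used because a JavaScript object cannot hold the key AS twice, as the example output {P:1, AS:5, AL:1, AS:3} would require.

```javascript
// Hypothetical sketch: group consecutive result types into ordered segments.
// Input: type codes in page order (P = brand ad, AS = general, AL = Aladdin).
function pageAnalysis(types) {
  const segments = [];
  for (const type of types) {
    const last = segments[segments.length - 1];
    if (last && last.type === type) {
      last.count += 1; // same type as the previous item: extend the segment
    } else {
      segments.push({ type, count: 1 }); // a new segment starts here
    }
  }
  return segments;
}

// Page layout from the example: 1 brand ad, 5 general, 1 Aladdin, 3 general.
const layout = ["P", "AS", "AS", "AS", "AS", "AS", "AL", "AS", "AS", "AS"];
const segments = pageAnalysis(layout);
console.log(segments.map((s) => `${s.type}:${s.count}`).join(", "));
// prints "P:1, AS:5, AL:1, AS:3"
```

Keeping the segments in page order preserves the position information that the structural analysis relies on.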
Step S102: judge, according to the structural information or the information of each information item, whether a search bad point exists in the search result page.
Specifically, to judge from the structural information whether a search bad point exists in the search result page, thresholds for the structural information of the search result page must first be set. For example, according to the search keyword entered by the user, the threshold for the number of advertising information items in a search result page is set to 1 and the threshold for the number of Aladdin results is set to 1; if the number of advertising information items or the number of Aladdin results in a search result page exceeds the set threshold of 1, it is judged that a search bad point exists in the search result page.
Judging from the information of each information item whether a search bad point exists in the search result page relies mainly on specific symptoms. For example, if the link contained in a first information item among the information items cannot be opened, or the first information item contains no summary, or the title and/or summary contained in the first information item is duplicated, or the title or summary contained in the first information item does not contain the search keyword, the first information item is judged to be a search bad point.
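The per-item conditions above combine with a logical OR, which a short sketch can make concrete. The item shape ({ title, summary, linkOk, duplicated }) and the function names are hypothetical; linkOk and duplicated are assumed to be filled in by separate dead link and duplicate checks. The keyword condition is ambiguous in the text, so this sketch assumes an item is flagged only when the keyword appears in neither the title nor the summary.

```javascript
// Hypothetical sketch: list the reasons an information item is a bad point.
function findItemDefects(item, keyword) {
  const defects = [];
  if (!item.linkOk) defects.push("dead-link"); // link cannot be opened
  if (!item.summary) defects.push("missing-summary");
  if (item.duplicated) defects.push("duplicate-title-or-summary");
  // Assumption: flag only when neither field contains the keyword.
  const text = `${item.title || ""} ${item.summary || ""}`.toLowerCase();
  if (!text.includes(keyword.toLowerCase())) defects.push("keyword-missing");
  return defects;
}

// An item is treated as a search bad point if any condition holds.
function isBadPoint(item, keyword) {
  return findItemDefects(item, keyword).length > 0;
}

const sample = { title: "Cheap flights", summary: "", linkOk: true, duplicated: false };
console.log(findItemDefects(sample, "flights")); // only the summary is missing
```

Returning the list of reasons, rather than a bare boolean, would let the feedback sent to the background server say why an item was flagged.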
In one embodiment of the invention, judging from the information of each information item whether a search bad point exists in the search result page can use the search result analysis functions defined in JavaScript inside the batOverlay.js file, such as Bat.linkcheck(type), Bat.is_equal() and Bat.piaohong(). For example, the function Bat.linkcheck(type) performs a dead link check on the information items of the search results: type=all checks all information items in the search result page, type=top checks the first three information items, and type=random checks three randomly chosen information items. The function Bat.is_equal() judges whether the search result page contains information items with identical titles or summaries. The function Bat.piaohong() judges whether the keyword highlighting (piaohong, the display of matched keywords in red) in the search result page is normal, that is, whether highlighting is present in the title and summary, namely whether the title or summary of an information item contains the search keyword. The function Bat.title_abstract() checks whether the titles and summaries of the information items in the search result page are correct. For example, calling this function may output {A:10, A1:{0,0,0}, A2:{0,1,0} ... A10:{0,0,0}}, which indicates that 10 general results are presented in total and that the summary flag of the 2nd result (A2:{0,1,0}) is 1, meaning that its summary is abnormal.
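How the type parameter of Bat.linkcheck(type) might choose which items to check can be sketched as below. The helper selectForLinkCheck is hypothetical and performs only the selection; the actual reachability test is left out. Passing the random source in as a parameter is an assumption made here so that the selection can be tested deterministically.

```javascript
// Hypothetical sketch: pick the information items to run a dead link check on.
function selectForLinkCheck(items, type, random = Math.random) {
  switch (type) {
    case "all":
      return items.slice(); // every item on the page
    case "top":
      return items.slice(0, 3); // the first three items
    case "random": {
      // Partial Fisher-Yates shuffle on a copy, then keep up to three items.
      const pool = items.slice();
      const n = Math.min(3, pool.length);
      for (let i = 0; i < n; i++) {
        const j = i + Math.floor(random() * (pool.length - i));
        [pool[i], pool[j]] = [pool[j], pool[i]];
      }
      return pool.slice(0, n);
    }
    default:
      throw new Error(`unknown link check type: ${type}`);
  }
}

console.log(selectForLinkCheck(["r1", "r2", "r3", "r4", "r5"], "top"));
```

Checking only the top or a random three items keeps the number of extra requests small, which fits the stated goal of not burdening the user or the search product.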
Step S103: automatically feed the search bad point back to the background server.
In one embodiment of the invention, the feedback to the background server is sent using an HTTP request. For example, the search results judged to contain a search bad point are delivered to the background server in JSON (JavaScript Object Notation, a lightweight data interchange format) through an HTTP server. The HTTP server provides a write_db.php file, which writes the data transmitted by POST into a database; in the browser, a plug-in can request the PHP page corresponding to write_db.php via XMLHttpRequest to upload the results to the database. These functions can be encapsulated so that, whenever a search bad point is judged to exist, it is automatically fed back to the background server.
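The upload step can be illustrated with a short sketch built on the standard XMLHttpRequest methods open, setRequestHeader and send. The function name reportBadPoints, the badPoints payload field and the relative URL /write_db.php are assumptions for illustration; the text names only the write_db.php file, not its location or the payload layout. The request object is created by an injected factory so that the function can also be exercised outside a browser.

```javascript
// Hypothetical sketch: serialize bad points as JSON and POST them.
// createXhr is injected; in a browser plug-in it could be
// () => new XMLHttpRequest().
function reportBadPoints(badPoints, url, createXhr) {
  if (badPoints.length === 0) return false; // nothing to report
  const xhr = createXhr();
  xhr.open("POST", url, true); // true = asynchronous request
  xhr.setRequestHeader("Content-Type", "application/json");
  xhr.send(JSON.stringify({ badPoints }));
  return true;
}

// In a browser this might be called as (URL is an assumption):
// reportBadPoints(points, "/write_db.php", () => new XMLHttpRequest());
```

Sending the request asynchronously is one way to keep the feedback from blocking the page while the user continues to search.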
According to the method for optimizing the search result page of a browser of the embodiment of the present invention, on the one hand, the structural information of the search result page or the information of each information item in the search result page is extracted, the structural information of the search result page is analyzed, and the dead links, title, summary and search keyword of each information item are checked, so that search results with poor relevance or poor presentation are found and fed back to the background server, which helps improve the search engine algorithm. On the other hand, the method can use the idle resources of the search engine client to extract, analyze, judge and feed back the search result page, which saves resources. The analysis of the search result page is carried out while the user is using the search engine, so no separate analysis request needs to be sent to the search engine, no extra load is placed on the search product itself, and the user's use is not affected.
Fig. 5 is a flowchart of a method for optimizing the search result page of a browser according to an embodiment of the present invention.
As shown in Fig. 5, the method for optimizing the search result page of a browser according to the embodiment of the present invention comprises the following steps.
Step S201: extract the structural information of the search result page or the information of each information item in the search result page.
In one embodiment of the invention, the structural information comprises the number and positions of the advertising information items contained in the search result page and the number and positions of the general information items. Specifically, the advertising information items may include brand advertisements, sponsored link advertisements, promotion advertisements, brand protection results and the like. The general information items may be AS results and may also include Aladdin results (Aladdin is a general open platform launched by the Baidu search engine that opens an interface to the owners of unique information data, addressing the deep-web information that existing search engines cannot crawl and retrieve) and the like.
In one embodiment of the invention, the information of each information item comprises the link, title, summary and search keyword contained in the information item.
In one embodiment of the invention, the function of extracting the structural information of the search result page or the information of each information item in the search result page may be defined in JavaScript. For example, a batOverlay.js file is defined; inside this file, a class Bat and related methods are defined for initialization, together with the various functions used to analyze search results. For example, the function Bat.page_analysis() can analyze the number and positions of the advertising information items and the number and positions of the general information items contained in the search result page. Calling this function on a given search result page may output {P:1, AS:5, AL:1, AS:3}, where P:1 indicates one brand advertisement in the first position, AS:5 indicates 5 general results following the brand advertisement, AL:1 indicates 1 Aladdin result following the 5 general results, and AS:3 indicates 3 general results following the Aladdin result.
Step S202: judge, according to the structural information or the information of each information item, whether a search bad point exists in the search result page.
Specifically, to judge from the structural information whether a search bad point exists in the search result page, thresholds for the structural information of the search result page must first be set. For example, according to the search keyword entered by the user, the threshold for the number of advertising information items in a search result page is set to 1 and the threshold for the number of Aladdin results is set to 1; if the number of advertising information items or the number of Aladdin results in a search result page exceeds the set threshold of 1, it is judged that a search bad point exists in the search result page.
Judging from the information of each information item whether a search bad point exists in the search result page relies mainly on specific symptoms. For example, if the link contained in a first information item among the information items cannot be opened, or the first information item contains no summary, or the title and/or summary contained in the first information item is duplicated, or the title or summary contained in the first information item does not contain the search keyword, the first information item is judged to be a search bad point.
In one embodiment of the invention, judging from the information of each information item whether a search bad point exists in the search result page can use the search result analysis functions defined in JavaScript inside the batOverlay.js file, such as Bat.linkcheck(type), Bat.is_equal() and Bat.piaohong(). For example, the function Bat.linkcheck(type) performs a dead link check on the information items of the search results: type=all checks all information items in the search result page, type=top checks the first three information items, and type=random checks three randomly chosen information items. The function Bat.is_equal() judges whether the search result page contains information items with identical titles or summaries. The function Bat.piaohong() judges whether the keyword highlighting (piaohong, the display of matched keywords in red) in the search result page is normal, that is, whether highlighting is present in the title and summary, namely whether the title or summary of an information item contains the search keyword. The function Bat.title_abstract() checks whether the titles and summaries of the information items in the search result page are correct. For example, calling this function may output {A:10, A1:{0,0,0}, A2:{0,1,0} ... A10:{0,0,0}}, which indicates that 10 general results are presented in total and that the summary flag of the 2nd result (A2:{0,1,0}) is 1, meaning that its summary is abnormal.
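The duplicate check attributed to Bat.is_equal() can be sketched as a single pass that remembers the first position at which each title and summary was seen. The function name findDuplicates, the item fields and the shape of the returned pairs are hypothetical; the text states only that identical titles or summaries are detected.

```javascript
// Hypothetical sketch: report pairs of items sharing a title or a summary.
function findDuplicates(items) {
  const seenTitle = new Map(); // title -> index of first occurrence
  const seenSummary = new Map(); // summary -> index of first occurrence
  const pairs = [];
  items.forEach((item, index) => {
    for (const [field, seen] of [["title", seenTitle], ["summary", seenSummary]]) {
      const value = item[field];
      if (!value) continue; // empty fields are left to other checks
      if (seen.has(value)) {
        pairs.push({ field, first: seen.get(value), second: index });
      } else {
        seen.set(value, index);
      }
    }
  });
  return pairs;
}

const results = [
  { title: "Flower shop", summary: "Order online" },
  { title: "Flower delivery", summary: "Order online" },
];
console.log(findDuplicates(results)); // the two summaries match
```

Using maps keeps the check linear in the number of information items, which matters little for ten results but avoids comparing every pair.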
Step S203: automatically feed the search bad point back to the background server.
In one embodiment of the invention, the feedback to the background server is sent using an HTTP request. For example, the search results judged to contain a search bad point are delivered to the background server in JSON (JavaScript Object Notation, a lightweight data interchange format) through an HTTP server. The HTTP server provides a write_db.php file, which writes the data transmitted by POST into a database; in the browser, a plug-in can request the PHP page corresponding to write_db.php via XMLHttpRequest to upload the results to the database. These functions can be encapsulated so that, whenever a search bad point is judged to exist, it is automatically fed back to the background server.
Step S204: provide an interface through which the user can actively feed back opinions on the search results to the background server.
In one embodiment of the invention, the feedback to the background server is sent using an HTTP request. For example, the search results judged to contain a search bad point are delivered to the background server in JSON (JavaScript Object Notation, a lightweight data interchange format) through an HTTP server. The HTTP server provides a write_db.php file, which writes the data transmitted by POST into a database; in the browser, a plug-in requests the PHP page corresponding to write_db.php via XMLHttpRequest to upload the results to the database. These functions can be exposed as an interface so that the user can give feedback quickly when the search results seem poor: through a simple control such as a right-click menu item, the user can report a search bad point conveniently and promptly, achieving one-click feedback.
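One way to picture the one-click feedback is a small handler factory whose result is attached to a menu item or button. Everything named here is hypothetical (makeFeedbackHandler, the payload fields, the getContext and send callbacks); the text specifies only that a simple control triggers feedback to the background server over HTTP.

```javascript
// Hypothetical sketch: build a one-click feedback handler.
// getContext returns the current query and page URL; send delivers the
// payload (for example through an HTTP POST to the background server).
function makeFeedbackHandler(getContext, send) {
  return function onFeedbackClick(comment = "") {
    const { query, pageUrl } = getContext();
    const payload = {
      source: "manual", // distinguishes user feedback from automatic reports
      query,
      pageUrl,
      comment,
      reportedAt: new Date().toISOString(),
    };
    send(payload);
    return payload;
  };
}

// Example wiring with stand-in callbacks.
const onClick = makeFeedbackHandler(
  () => ({ query: "flowers", pageUrl: "http://example.com/s?wd=flowers" }),
  (payload) => console.log("would send", JSON.stringify(payload))
);
onClick("results look irrelevant");
```

Marking the payload with a source field is an assumption that would let the background server tell manual reports apart from automatically detected bad points.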
According to the method for optimizing the search result page of a browser of the embodiment of the present invention, when the user considers the search results poor, feedback can be sent quickly to the background server through the provided interface, so that users help speed up the discovery of search bad points and efficiency improves. In addition, manual feedback is packaged as an interface, so the user can report search bad points conveniently and promptly through a simple control, which lowers the cost of user participation in testing.
Fig. 6 is a flowchart of a method for optimizing the search result page of a browser according to an embodiment of the present invention.
As shown in Fig. 6, the method for optimizing the search result page of a browser according to the embodiment of the present invention comprises the following steps.
Step S301: extract the structural information of the search result page or the information of each information item in the search result page.
In one embodiment of the invention, the structural information comprises the number and positions of the advertising information items contained in the search result page and the number and positions of the general information items. Specifically, the advertising information items may include brand advertisements, sponsored link advertisements, promotion advertisements, brand protection results and the like. The general information items may be AS results and may also include Aladdin results (Aladdin is a general open platform launched by the Baidu search engine that opens an interface to the owners of unique information data, addressing the deep-web information that existing search engines cannot crawl and retrieve) and the like.
In one embodiment of the invention, the information of each information item comprises the link, title, summary and search keyword contained in the information item.
In one embodiment of the invention, the function of extracting the structural information of the search result page or the information of each information item in the search result page may be defined in JavaScript. For example, a batOverlay.js file is defined; inside this file, a class Bat and related methods are defined for initialization, together with the various functions used to analyze search results. For example, the function Bat.page_analysis() can analyze the number and positions of the advertising information items and the number and positions of the general information items contained in the search result page. Calling this function on a given search result page may output {P:1, AS:5, AL:1, AS:3}, where P:1 indicates one brand advertisement in the first position, AS:5 indicates 5 general results following the brand advertisement, AL:1 indicates 1 Aladdin result following the 5 general results, and AS:3 indicates 3 general results following the Aladdin result.
Step S302: set thresholds for the structural information of the search result page according to the search keyword.
For example, according to the search keyword entered by the user, the threshold for the number of advertising information items in a search result page is set to 1 and the threshold for the number of Aladdin results is set to 1.
Step S303: compare the structural information with the thresholds.
Specifically, the structural information extracted in step S301 is compared with the thresholds set in step S302.
Step S304: determine, according to the comparison result or according to the information of each information item, whether a search bad point exists in the search result page.
In one embodiment of the invention, whether a search bad point exists in the search result page is determined according to the comparison result. For example, when the structural information is compared with the thresholds and the number of advertising information items in a search result page is 2 or the number of Aladdin results is 3, which exceeds the set threshold, the determining unit 230 determines that a search bad point exists in the search result page.
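The set, compare and determine steps can be read as three small functions, sketched below. The names (setThresholds, compareStructure, hasStructuralBadPoint), the field names ads and aladdin, and the idea of a per-keyword override table are assumptions; the text gives only the example thresholds of 1 for advertising information items and 1 for Aladdin results.

```javascript
// Hypothetical defaults taken from the example thresholds in the text.
const DEFAULT_THRESHOLDS = { ads: 1, aladdin: 1 };

// Setting step: thresholds for a keyword, with optional per-keyword overrides.
function setThresholds(keyword, overrides = {}) {
  return { ...DEFAULT_THRESHOLDS, ...(overrides[keyword] || {}) };
}

// Comparing step: names of the counts that exceed their thresholds.
function compareStructure(structure, thresholds) {
  return Object.keys(thresholds).filter(
    (name) => (structure[name] || 0) > thresholds[name]
  );
}

// Determining step: any exceeded threshold means a search bad point exists.
function hasStructuralBadPoint(structure, thresholds) {
  return compareStructure(structure, thresholds).length > 0;
}

// The example from the text: 2 advertising items and 3 Aladdin results.
const thresholds = setThresholds("flowers");
console.log(compareStructure({ ads: 2, aladdin: 3 }, thresholds));
console.log(hasStructuralBadPoint({ ads: 2, aladdin: 3 }, thresholds)); // true
```

Keeping the comparison separate from the final decision mirrors the division between the comparing unit and the determining unit described for the device.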
In one embodiment of the invention, whether a search bad point exists in the search result page can also be determined according to the information of each information item. For example, if the link contained in a first information item among the information items cannot be opened, or the first information item contains no summary, or the title and/or summary contained in the first information item is duplicated, or the title or summary contained in the first information item does not contain the search keyword, the first information item is determined to be a search bad point.
The function that determines, according to the information of each information item, whether a search bad point exists in the search result page may also be defined in JavaScript, for example among the search result analysis functions defined inside the batOverlay.js file, such as Bat.linkcheck(type), Bat.is_equal() and Bat.piaohong(). For example, the function Bat.linkcheck(type) performs a dead link check on the information items of the search results: type=all checks all information items in the search result page, type=top checks the first three information items, and type=random checks three randomly chosen information items. The function Bat.is_equal() judges whether the search result page contains information items with identical titles or summaries. The function Bat.piaohong() judges whether the keyword highlighting (piaohong, the display of matched keywords in red) in the search result page is normal, that is, whether highlighting is present in the title and summary, namely whether the title or summary of an information item contains the search keyword. The function Bat.title_abstract() checks whether the titles and summaries of the information items in the search result page are correct. For example, calling this function may output {A:10, A1:{0,0,0}, A2:{0,1,0} ... A10:{0,0,0}}, which indicates that 10 general results are presented in total and that the summary flag of the 2nd result (A2:{0,1,0}) is 1, meaning that its summary is abnormal.
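The report format of Bat.title_abstract() can be approximated with the sketch below, which produces one flag triple per result under keys A1, A2 and so on. Only the meaning of the middle flag (summary) is given in the text; the assumed order [title, summary, keyword], the function name titleAbstractReport and the use of arrays for the triples are illustrative guesses.

```javascript
// Hypothetical sketch: one flag triple per result; 1 marks an anomaly.
// Assumed flag order: [title, summary, keyword]. Only the middle (summary)
// position is described in the text; the other two are assumptions.
function titleAbstractReport(items, keyword) {
  const report = { A: items.length }; // A = number of general results
  const needle = keyword.toLowerCase();
  items.forEach((item, i) => {
    const title = item.title || "";
    const summary = item.summary || "";
    const hasKeyword = `${title} ${summary}`.toLowerCase().includes(needle);
    report[`A${i + 1}`] = [title ? 0 : 1, summary ? 0 : 1, hasKeyword ? 0 : 1];
  });
  return report;
}

const page = [
  { title: "Flower shop", summary: "Order flowers online" },
  { title: "Flower delivery", summary: "" }, // summary missing
];
console.log(titleAbstractReport(page, "flower"));
// A2 is [0, 1, 0]: only the summary flag is set for the second result
```

A compact per-result triple of this kind keeps the payload small when the report is later serialized and sent to the background server.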
Step S305: automatically feed the search bad point back to the background server.
In one embodiment of the invention, the feedback to the background server is sent using an HTTP request. For example, the search results judged to contain a search bad point are delivered to the background server in JSON (JavaScript Object Notation, a lightweight data interchange format) through an HTTP server. The HTTP server provides a write_db.php file, which writes the data transmitted by POST into a database; in the browser, a plug-in can request the PHP page corresponding to write_db.php via XMLHttpRequest to upload the results to the database. These functions can be encapsulated so that, whenever a search bad point is judged to exist, it is automatically fed back to the background server.
Step S306: provide an interface through which the user can actively feed back opinions on the search results to the background server.
In one embodiment of the invention, the feedback to the background server is sent using an HTTP request. For example, the search results judged to contain a search bad point are delivered to the background server in JSON (JavaScript Object Notation, a lightweight data interchange format) through an HTTP server. The HTTP server provides a write_db.php file, which writes the data transmitted by POST into a database; in the browser, a plug-in requests the PHP page corresponding to write_db.php via XMLHttpRequest to upload the results to the database. These functions can be exposed as an interface so that the user can give feedback quickly when the search results seem poor: through a simple control such as a right-click menu item, the user can report a search bad point conveniently and promptly, achieving one-click feedback.
According to the method for optimizing the search result page of a browser of the embodiment of the present invention, on the one hand, the structural information of the search result page and the information of each information item in the search result page are extracted, the structural information of the search result page is analyzed, and the dead links, title, summary and search keyword of each information item are checked, so that search results with poor relevance or poor presentation are found and fed back to the background server, which helps improve the search engine algorithm. On the other hand, the method can use the idle resources of the search engine client to extract, analyze, judge and feed back the search result page, which saves resources. The analysis of the search result page is carried out while the user is using the search engine, so no separate analysis request needs to be sent to the search engine, no extra load is placed on the search product itself, and the user's use is not affected.
It should be understood that each part of the present invention may be implemented in hardware, software, firmware or a combination thereof. In the above embodiments, multiple steps or methods may be implemented in software or firmware that is stored in memory and executed by a suitable instruction execution system. For example, if implemented in hardware, as in another embodiment, any one or a combination of the following techniques well known in the art may be used: discrete logic circuits having logic gates for implementing logic functions on data signals, application-specific integrated circuits having suitable combinational logic gates, programmable gate arrays (PGA), field programmable gate arrays (FPGA), and the like.
In the description of this specification, a description referring to the terms "one embodiment", "some embodiments", "example", "specific example" or "some examples" means that a specific feature, structure, material or characteristic described in connection with that embodiment or example is included in at least one embodiment or example of the present invention. In this specification, such terms do not necessarily refer to the same embodiment or example. Moreover, the specific features, structures, materials or characteristics described may be combined in a suitable manner in any one or more embodiments or examples.
Although embodiments of the present invention have been shown and described, those of ordinary skill in the art will understand that various changes, modifications, substitutions and variations may be made to these embodiments without departing from the principles and spirit of the present invention; the scope of the present invention is defined by the appended claims and their equivalents.

Claims (8)

1. A device for optimizing the search result page of a browser, characterized by comprising:
an extraction module, configured to extract the structural information of the search result page or the information of each information item in the search result page, wherein the structural information comprises the number and positions of the advertising information items contained in the search result page and the number and positions of the general information items, and the information of each information item comprises the link, title, summary and search keyword contained in the information item;
a judging module, configured to judge, according to the structural information or the information of each information item, whether a search bad point exists in the search result page, wherein the judging module comprises:
a setting unit, configured to set a threshold for the structural information of the search result page according to the search keyword;
a comparing unit, configured to compare the structural information with the threshold; and
a determining unit, configured to determine, according to the comparison result of the comparing unit or according to the information of each information item, whether a search bad point exists in the search result page; and
an automatic feedback module, configured to automatically feed the search bad point back to a background server.
2. The device according to claim 1, characterized by further comprising:
a manual feedback module, which provides an interface through which a user can feed back opinions on the search results to the background server.
3. The device according to claim 1, characterized in that, if the link contained in a first information item among the information items cannot be opened, or the first information item contains no summary, or the title and/or summary contained in the first information item is duplicated, or the title or summary contained in the first information item does not contain the search keyword, the determining unit determines that the first information item is a search bad point.
4. The device according to claim 2, characterized in that the automatic feedback module or the manual feedback module feeds back to the background server using an HTTP request.
5. A method for optimizing the search result page of a browser, characterized by comprising the following steps:
extracting the structural information of the search result page or the information of each information item in the search result page, wherein the structural information comprises the number and positions of the advertising information items contained in the search result page and the number and positions of the general information items, and the information of each information item comprises the link, title, summary and search keyword contained in the information item;
judging, according to the structural information or the information of each information item, whether a search bad point exists in the search result page, wherein a threshold for the structural information of the search result page is set according to the search keyword, the structural information is compared with the threshold, and whether a search bad point exists in the search result page is determined according to the comparison result or according to the information of each information item; and
automatically feeding the search bad point back to a background server.
6. The method according to claim 5, characterized by further comprising the step of:
providing an interface through which a user can actively feed back opinions on the search results to the background server.
7. The method according to claim 5, characterized in that, if the link contained in a first information item among the information items cannot be opened, or the first information item contains no summary, or the title and/or summary contained in the first information item is duplicated, or the title or summary contained in the first information item does not contain the search keyword, the first information item is determined to be a search bad point.
8. The method according to claim 5 or 6, characterized in that the feedback to the background server is sent using an HTTP request.
CN201210054359.8A 2012-03-02 2012-03-02 Apparatus and method for optimizing search result page of browser Active CN102662957B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201210054359.8A CN102662957B (en) 2012-03-02 2012-03-02 Apparatus and method for optimizing search result page of browser

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201210054359.8A CN102662957B (en) 2012-03-02 2012-03-02 Apparatus and method for optimizing search result page of browser

Publications (2)

Publication Number Publication Date
CN102662957A CN102662957A (en) 2012-09-12
CN102662957B true CN102662957B (en) 2015-02-18

Family

ID=46772448

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201210054359.8A Active CN102662957B (en) 2012-03-02 2012-03-02 Apparatus and method for optimizing search result page of browser

Country Status (1)

Country Link
CN (1) CN102662957B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104158697B (en) * 2013-10-18 2017-07-21 深圳信息职业技术学院 A kind of dead chain detection method and device
CN106649407A (en) * 2015-11-04 2017-05-10 阿里巴巴集团控股有限公司 Retrieval result obtaining method and apparatus
CN106484841B (en) * 2016-09-30 2019-09-24 北京奇付通科技有限公司 It is furnished an answer the searching method and device of item based on search result
CN108153663B (en) * 2016-12-02 2022-02-18 阿里巴巴集团控股有限公司 Page data processing method and device

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7668823B2 (en) * 2007-04-03 2010-02-23 Google Inc. Identifying inadequate search content
CN102043834A (en) * 2010-11-25 2011-05-04 北京搜狗科技发展有限公司 Method for realizing searching by utilizing client and search client
CN102214185A (en) * 2010-04-07 2011-10-12 腾讯科技(深圳)有限公司 Webpage searching method and webpage searching system

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100456286C (en) * 2005-01-17 2009-01-28 马岩 Universal file search system and method
CN101071422B (en) * 2006-06-15 2010-10-13 腾讯科技(深圳)有限公司 Music file search processing system and method

Also Published As

Publication number Publication date
CN102662957A (en) 2012-09-12

Similar Documents

Publication Publication Date Title
CN102622435B (en) A kind of method and apparatus for detecting black chain
CN102662957B (en) Apparatus and method for optimizing search result page of browser
US20150121265A1 (en) Systems and methods for facilitating open source intelligence gathering
CN109510737A (en) Protocol interface test method, device, computer equipment and storage medium
CN109451147B (en) Information display method and device
CN103942279A (en) Method and device for showing search result
US8458584B1 (en) Extraction and analysis of user-generated content
CN108985064A (en) A kind of method and device identifying malice document
CN106326109A (en) New application testing method and device
CN110768977B (en) Method and system for capturing security vulnerability information
CN107193987A (en) Obtain the methods, devices and systems of the search term related to the page
CN102663060A (en) Method and device for identifying tampered webpage
CN109933514A (en) A kind of data test method and apparatus
CN110968873A (en) System and method for automatic penetration test based on artificial intelligence
CN103207906A (en) Method for delivering search results and search engine
CN111523677A (en) Method and device for explaining prediction result of machine learning model
CN110990057A (en) Extraction method, device, equipment and medium of small program sub-chain information
CN105808605B (en) A kind of search log merging method and system
WO2015024522A1 (en) Search method and system, search engine and client
CN111858834B (en) Case dispute focus determining method, device, equipment and medium based on AI
CN111932413B (en) Case element extraction method, case element extraction device, case element extraction equipment and case element extraction medium
CN113312504A (en) Management method, device, equipment and medium for content audit project
JP5040718B2 (en) Spam event detection apparatus, method, and program
JP2020102206A (en) Information processing device, program, and system
CN106503126A (en) Based on the search engine optimization collocation method of game temperature, device and server

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant