CN103530389A - Method and device for improving stopword searching effectiveness - Google Patents

Method and device for improving stopword searching effectiveness Download PDF

Info

Publication number
CN103530389A
CN103530389A CN201310499118.9A CN201310499118A CN103530389A CN 103530389 A CN103530389 A CN 103530389A CN 201310499118 A CN201310499118 A CN 201310499118A CN 103530389 A CN103530389 A CN 103530389A
Authority
CN
China
Prior art keywords
stop words
search
user
key word
keyword
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310499118.9A
Other languages
Chinese (zh)
Other versions
CN103530389B (en
Inventor
崔代超
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Qizhi Software Beijing Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd, Qizhi Software Beijing Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201310499118.9A priority Critical patent/CN103530389B/en
Publication of CN103530389A publication Critical patent/CN103530389A/en
Application granted granted Critical
Publication of CN103530389B publication Critical patent/CN103530389B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation

Abstract

The invention discloses a method and device for improving stopword searching effectiveness. The method comprises the steps of receiving searching key works provided by a user, searching the searching key words in a preset stopword bank, and acquiring appointed contents corresponding to the searching key words and returning the appointed contents which is shown to the user when the searching key words are searched in the stopword bank. According to the scheme of the method, a lot of useless searching or error searching is reasonably used for recommending the user the content related to the searching key words, the situation that return results completely have no practical significance due to error operation is avoided, user operation is reduced, and the contents which a searching service provider can be provided can be fully shown.

Description

A kind of method and apparatus that improves stop words search validity
Technical field
The present invention relates to network data communication technical field, be specifically related to a kind of method and apparatus that improves stop words search validity.
Background technology
Search engine is that user utilizes one of main path of Internet resources.Search engine uses specific computer program to collect info web from internet, extracts keyword from webpage, forms the index database of keyword.When user inputs keyword to be checked, search engine, from retrieve this keyword at index database, finds the webpage of this keyword of coupling, presents to user.
When user utilizes search engine to search for, the situation that there will be a large amount of meaningless search or search for or search for by mistake without specific purpose.For example, user is when the not yet complete input of keyword, owing to there is maloperation, initiated searching request, or only input single English character or numeral or punctuate or Chinese word character (" stop words ") and just initiated searching request, at this moment, search engine still can find and return the natural Search Results of the keyword of this input from index database, yet, Search Results or meaning that these natural results are normally not pointed.In Fig. 1 a and Fig. 1 b, provided two examples of this situation, and processing mode of the prior art.As shown in Fig. 1 a and Fig. 1 b, suppose user be to search for certain " l " or " d " beginning keyword (for example, " 163 mailbox ", " dota2 "), but in input " l " or " d ", clicked unintentionally afterwards search button, or according to each input, provide the service of instant search, that search engine returns to the Search Results to " l " and " d ", Search Results from figure, obviously it is to user's too large meaning of tool not, therefore, user is conventionally also very low to the clicking rate of this Search Results, according to statistics, this result clicking rate meaningless or search by mistake is generally below 0.05, the average result clicking rate of normal searching is more than 1.5 times, the request amount of this search simultaneously occurring in search engine is also very huge.
Visible, to a large amount of this meaningless search or mistake search, only according to its literal meaning return to the nature result, cannot meet consumers' demand, caused low-down flow conversion ratio, do not have fully to show the content that search service provider can provide, to user, carry out the recommendation of effective information yet.
Summary of the invention
In view of the above problems, the present invention has been proposed to a kind of a kind of method and apparatus that improves stop words search validity that overcomes the problems referred to above or address the above problem is at least in part provided.
According to an aspect of the present invention, provide a kind of method that improves stop words search validity, having comprised: receive the searched key word that user provides; Search keyword in the inactive dictionary of setting up in advance; If retrieve searched key word in the dictionary of stopping using, obtain the given content corresponding with searched key word, return to given content for showing to user.
Wherein, the foundation of inactive dictionary further comprises: whether the keyword of examining in search engine index storehouse is stop words; The keyword of examining as stop words is included into inactive dictionary.
Whether the keyword of further, examining in search engine index storehouse is that stop words comprises: according to the Search Results clicking rate of this keyword, examine, the keyword using Search Results clicking rate under predetermined threshold value is as stop words.
Further, each stop words that the method is also included as in the dictionary of stopping using is equipped with corresponding given content, particularly, can be equipped with the described given content that search engine server and/or third-party server provide for each stop words.
Further, this method also comprises: obtain the natural Search Results corresponding with searched key word; When returning to given content, return to described natural Search Results for showing to user.
According to a further aspect in the invention, provide a kind of device that improves stop words search validity, having comprised: receiver module, is suitable for receiving the searched key word that user provides; Retrieval module, is suitable for search keyword in the inactive dictionary of setting up in advance; Acquisition module, the in the situation that of being suitable for retrieving searched key word in the dictionary of stopping using, obtains the given content corresponding with searched key word, returns to given content for showing to user.
Further, this device also comprises validating module, and whether the keyword that is suitable for examining in search engine index storehouse is stop words, and the keyword of examining as stop words is included into inactive dictionary.
This validating module is further adapted for according to the Search Results clicking rate of this keyword and examines, and the keyword using Search Results clicking rate under predetermined threshold value is as stop words.
Further, this device also comprises relating module, and each stop words being suitable for for stopping using in dictionary is equipped with corresponding given content.This relating module is further adapted for and is equipped with the given content that search engine server and/or third-party server provide for each stop words.
Acquisition module is further adapted for and obtains the natural Search Results corresponding with searched key word, and when returning to given content, return to the nature Search Results is for showing to user.
In information retrieval field, search engine can automatically be ignored some word or word when index pages or processing searching request, and these words or word are called as stop words (Stop Words).Stop words mainly comprises the Chinese word character of English character, numeral, mathematical character, punctuation mark or frequency of utilization extra-high-speed etc.
According to scheme provided by the invention, after the searched key word that search engine server provides reception user, return to the nature Search Results immediately not, and in the inactive dictionary of building in advance, retrieve this searched key word, take and judge whether it is stop words, for the stop words retrieving, obtain corresponding given content, and given content is showed to user.According to this scheme, a large amount of meaningless search or mistake search are by reasonably for recommending the content relevant to searched key word to user, avoided user because returning results of causing of maloperation do not had the situation of practical significance completely, thereby reduced user's operation, also fully shown the content that search service provider can provide.
Above-mentioned explanation is only the general introduction of technical solution of the present invention, in order to better understand technological means of the present invention, and can be implemented according to the content of instructions, and for above and other objects of the present invention, feature and advantage can be become apparent, below especially exemplified by the specific embodiment of the present invention.
Accompanying drawing explanation
By reading below detailed description of the preferred embodiment, various other advantage and benefits will become cheer and bright for those of ordinary skills.Accompanying drawing is only for the object of preferred implementation is shown, and do not think limitation of the present invention.And in whole accompanying drawing, by identical reference symbol, represent identical parts.In the accompanying drawings:
Fig. 1 a and Fig. 1 b show the schematic diagram to the processing mode of stop words search in prior art;
Fig. 2 shows the process flow diagram of the method that improves according to an embodiment of the invention stop words search validity;
Fig. 3 shows the process flow diagram of the method for raising stop words search validity according to another embodiment of the invention;
Fig. 4 a shows an example processing stop words search according to method provided by the invention;
Fig. 4 b shows another example of processing stop words search according to method provided by the invention;
Fig. 4 c shows another example of processing stop words search according to method provided by the invention;
Fig. 5 shows the structural representation of raising stop words search validity device according to an embodiment of the invention.
Embodiment
Exemplary embodiment of the present disclosure is described below with reference to accompanying drawings in more detail.Although shown exemplary embodiment of the present disclosure in accompanying drawing, yet should be appreciated that and can realize the disclosure and the embodiment that should do not set forth limits here with various forms.On the contrary, it is in order more thoroughly to understand the disclosure that these embodiment are provided, and can by the scope of the present disclosure complete convey to those skilled in the art.
Fig. 2 shows the process flow diagram of the method that improves according to an embodiment of the invention stop words search validity.As shown in Figure 2, the method comprises the steps:
Step S101, receives the searched key word that user provides.
User is inputted search keyword in client, and client generates the searching request that comprises search keyword information, and is sent to search engine server.
Step S102, search keyword in the inactive dictionary of setting up in advance.
This step is examined out stop words from the searched key word of user's input, in the present embodiment subsequent step, result being obtained and done special processing.If a certain searched key word of input is verified as stop words, execution step S103.
Step S103, obtains the given content corresponding with searched key word, returns to given content for showing to user.
Search service provider is when set up stopping using dictionary, for each keyword is wherein equipped with given content.If searched key word is stop words, search engine server obtains given content from the dictionary of stopping using, as returning results or partly returning results and show user.
The method providing according to the above embodiment of the present invention, after the searched key word that search engine server provides reception user, return to the nature Search Results immediately not, and in the inactive dictionary of building in advance, retrieve this searched key word, take and judge whether it is stop words, for the stop words retrieving, obtain corresponding given content, given content is showed to user.According to this scheme, a large amount of meaningless search or mistake search are by reasonably for recommending the content relevant to searched key word to user, avoided user because returning results of causing of maloperation do not had a situation of practical significance, thereby reduced user's operation, also fully shown the content that search service provider can provide.
Fig. 3 shows the process flow diagram of the method for raising stop words search validity according to another embodiment of the invention.As shown in Figure 3, the method comprises the steps:
Step S201, receives the searched key word that user provides.
Search engine server receives the searched key word that user provides in every way, and for example, user is inputted search keyword in search homepage, and searched page generates the searching request that comprises searched key word.Searched page is provided by corresponding service provider, and its file layout is generally HTML, and the page is presented in browser, and request generating code is with html language, and the forms such as java script are included in page source code.Searched page is by browser, mutual based on the agreements such as HTTP and search engine server.
Step S202, the searched key word that retrieval user provides in the inactive dictionary of setting up in advance.
Step S203, judges whether to retrieve searched key word in the dictionary of stopping using.
This step adopts the method for keyword coupling to judge whether the searched key word of user's input is stop words.In inactive dictionary, include the searched key word that searched engine server is judged as stop words.Keyword in this searched key word and inactive dictionary is matched, if there is the situation that the match is successful to occur, the searched key word of this input is judged as to stop words, and performs step S204; If mate unsuccessfully, this search is processed to execution step S205 as normal searching.
Inactive dictionary is generally provided by search service provider.Setting up the dictionary of stopping using further comprises: whether the searched key word of examining in search engine index storehouse is stop words, and the searched key word of examining as stop words is included in inactive dictionary.
First, inactive dictionary can be examined the common numeral without certain sense, letter etc. into stop words, for example 1,2,130, a, b, ze, needs to get rid of the searched key words with its meaning such as 58,110,126 simultaneously.
Whether can examine a certain keyword according to user's desirability is stop words.User's demand can be by embodiments such as keyword search number of times, Search Results clicking rates.For example, search engine server can record searching keyword the information such as searching request quantity within a period of time, search rate, using request amount compared with large and search rate is less than the keyword of certain threshold value includes as stop words.The stop words of typical this type comprises: one, eh, Oh, the Chinese character such as little.
Alternatively, according to the Search Results clicking rate of searched key word, examine.Under normal circumstances, search engine server at least can meet user's part demand to returning results of non-stop words, and therefore, the result clicking rate of normal searching is higher.And stop words cannot reflect user's real intention, Search Results is conventionally nonsensical to user, is seldom clicked.Search engine server can be added up a large amount of searching request, draws the average result clicking rate of each searched key word in a period of time, and the searched key word using average result clicking rate under predetermined threshold value is as stop words.For example, data according to statistics, the average result clicking rate of normal searching is more than 1.5 times, and the average result clicking rate of stop words search is under 0.05, threshold value can be preset as to certain numerical value in 0-0.05.
Further, examine the performance data that stop words can be considered time, region characteristic and user.For example, part registered user may have clear and definite intention by the stop words under threshold value to a certain average result clicking rate, and clicking rate is higher, can identify the information such as user ID, to this user, this keyword is classified as to normal searching keyword.
Step S204, obtains the given content corresponding with searched key word, returns to given content for showing to user.
Search service provider is when set up stopping using dictionary, for each searched key word is wherein equipped with given content.If this searched key word is judged as stop words, search engine server obtains given content from the dictionary of stopping using, as returning results or partly returning results and show user.
The given content being equipped with for when user's request is not subject to meeting for user provides recommendation information.Considering the reason that stop words search produces, is that the maloperation by user causes as a rule, and therefore, the given content of outfit preferably has certain associated with user input content.Meanwhile, for improving the efficient of information recommendation, given content should be relevant to input content, the information that demand degree is higher.For example, at Fig. 4 a, 4b, in 4c, for stop words " l ", " y " and " little ", given content is respectively clicking rate and higher online broadcasting of central authorities of demand degree, and clothes class commodity displaying and trivial games are complete works of.
Being equipped with given content is further included as each stop words and is equipped with the given content that search engine server and/or third-party server provide.Search service provider can recommend the quality information of self by stop words search to user, for example, recommends and the related webpage URL of stop words address, and popular video, pictures etc., show homegrown resource to user, attract user's access.Also can be equipped with from third-party given contents such as partners for stop words, for example, in Fig. 4 a, for being equipped with online broadcasting of central authorities, stop words " l " serves, this service is from CNTV (Chinese Network TV Station), in Fig. 4 b, for stop words " y " has been equipped with " clothes are through " service, this service is provided by Taobao's clothes channel.
Also can be equipped with a plurality of given contents for each stop words in stop words storehouse, according to average click-through rate or specific user's personality data, a plurality of given contents be sorted, to user, recommend most suitable content.
Further, in this step, obtain the natural Search Results corresponding with searched key word.When returning to given content, return to the nature Search Results is for showing to user.
Here, natural Search Results comprises the various Search Results that in prior art, all kinds of search engines can return.For example, for universal search, natural Search Results refers to this keyword of returning corresponding url list in search engine index storehouse, and for vertical search, natural result is the information such as the picture, news of the particular type returned of corresponding channel or website.Natural result also comprises integration Search Results, for example, at Fig. 1 a, in Fig. 1 b, has comprised the webpage url list mating with this keyword in the results list page, has also comprised the common vertical search results of part such as picture, encyclopaedia.
As described in step S202, may there is clear and definite demand to being identified as the natural result of the searched key word of stop words in certain customers, therefore, also have the return to the nature of needs Search Results for such user when returning to given content.Alternatively, while returning to given content and natural result, can integrate result, preferentially show given content, show afterwards natural result.As shown in Fig. 4 c, for stop words " little ", the complete works of service of trivial games has preferentially been shown in result page the top, is relevant URL link subsequently, the vertical channel information such as encyclopaedia.
Step S205 obtains the natural Search Results that searched key word is corresponding from search engine index storehouse, and return to the nature Search Results is for showing to user.
While not retrieving searched key word in the dictionary of stopping using, this search is processed as normal searching, only returned to natural Search Results that searched key word is corresponding for showing to user.
The method providing according to the above embodiment of the present invention, after the searched key word that search engine server provides reception user, return to the nature Search Results immediately not, and in the inactive dictionary of building in advance, retrieve this keyword, take and judge whether it is stop words, for the stop words retrieving, obtain corresponding given content, given content is showed to user.Meanwhile, obtain the natural Search Results corresponding with searched key word, return to the nature Search Results when returning to given content.According to this scheme, a large amount of meaningless search or mistake search are by reasonably for recommending the content relevant to searched key word to user, avoided user because returning results of causing of maloperation do not had a situation of practical significance, thereby reduced user's operation, also fully shown the content that search service provider can provide.
Fig. 5 shows the structural representation of the device of raising stop words search validity according to an embodiment of the invention.As shown in Figure 5, this device comprises: receiver module 21, retrieval module 22 and acquisition module 23.
Receiver module 21 is suitable for receiving the searched key word that user provides.Receiver module 21 can receive the searched key word that user provides in every way, and for example, user is inputted search keyword in search homepage, and searched page generates the searching request that comprises searched key word.Searched page is provided by corresponding service provider, and its file layout is generally HTML, and the page is presented in browser, and request generating code is with html language, and the forms such as java script are included in page source code.Receiver module 21 receives searched page by browser, the searched key word sending based on agreements such as HTTP.
Retrieval module 22 is suitable for search keyword in the inactive dictionary of setting up in advance.
The method that retrieval module 22 mates by keyword judges whether the searched key word of user's input is stop words.In inactive dictionary, include the searched key word that searched engine is judged as stop words.Retrieval module 22 matches the keyword in this searched key word and inactive dictionary, if there is the situation that the match is successful to occur, the searched key word of this input is judged as to stop words.
The device that improves stop words search validity further comprises validating module 24, and whether the searched key word that validating module 24 is suitable for examining in search engine storehouse is stop words, and the searched key word of examining as stop words is included in inactive dictionary.
First, validating module 24 can be examined the common numeral without certain sense, letter etc. into stop words, for example 1,2,130, a, b, ze, needs to get rid of the searched key words with its meaning such as 58,110,126 simultaneously.
Whether validating module 24 is also suitable for examining a certain keyword according to user's desirability is stop words.User's demand can be by embodiments such as keyword search number of times, Search Results clicking rates.For example, validating module 24 can record searching keyword the information such as searching request quantity within a period of time, search rate, request amount is compared with large and search rate is less than the keyword of certain threshold value includes as stop words.The stop words of typical this type comprises: one, eh, Oh, the Chinese character such as little.
Alternatively, validating module 24 is suitable for examining according to the Search Results clicking rate of searched key word.Under normal circumstances, search engine at least can meet user's part demand to returning results of non-stop words, and therefore, the result clicking rate of normal searching is higher.And stop words cannot reflect user's real intention, Search Results is conventionally nonsensical to user, is seldom clicked.Validating module 24 can be added up a large amount of searching request, draws the average result clicking rate of each searched key word in a period of time, and the keyword using average result clicking rate under predetermined threshold value is as stop words.For example, data according to statistics, the average result clicking rate of normal searching is more than 1.5 times, and the average result clicking rate of stop words search is under 0.05, validating module 24 can be preset as threshold value certain numerical value in 0-0.5.
Further, validating module 24 examine stop words can using time, region characteristic and user's performance data as input data.For example, part registered user may have clear and definite intention by the stop words under threshold value to a certain average result clicking rate, and validating module 24 can, by information such as identification user ID, to this user, be classified as normal searching keyword by this keyword.
The in the situation that acquisition module 23 being suitable for retrieving searched key word in the dictionary of stopping using, obtain the given content corresponding with searched key word, return to given content for showing to user.
Search service provider is when set up stopping using dictionary, for each searched key word is wherein equipped with given content.If this searched key word is judged as stop words, acquisition module 23 obtains given content from the dictionary of stopping using, as returning results or partly returning results and show user.
If retrieval module 22 does not retrieve this searched key word in the dictionary of stopping using, acquisition module 23 is processed this search as normal searching, only from search engine index storehouse, obtains the natural result that searched key word is corresponding.
In the device of raising stop words search validity, also comprise relating module 25, each stop words being suitable for for stopping using in dictionary is equipped with corresponding given content.
The given content that relating module 25 is equipped with for stop words for when user's request is not subject to meeting for user provides recommendation information.Considering the reason that meaningless search or mistake search produce, is that the maloperation by user causes as a rule, and therefore, the given content that relating module 25 is equipped with preferably has certain associated with user input content.Meanwhile, for improving the efficient of information recommendation, given content should be relevant to input content, the information that demand degree is higher.For example, at Fig. 4 a, 4b, in 4c, acquisition module stop words " l ", " y " and " little ", given content is respectively clicking rate and higher online broadcasting of central authorities of demand degree, and clothes class commodity displaying and trivial games are complete works of.
Relating module 25 is further adapted for and is equipped with the given content that search engine server and/or third-party server provide for each stop words.Search service provider can recommend the quality information of self by meaningless search or mistake search to user, for example, recommends and the related webpage URL of stop words address, and popular video, pictures etc., show homegrown resource to user, attract user's access.Relating module 25 is also suitable for being equipped with from third-party given contents such as partners for stop words, for example, in Fig. 4 a, relating module 25 is served for stop words " l " has been equipped with online broadcasting of central authorities, this service is from CNTV (Chinese Network TV Station), in Fig. 4 b, for stop words " y " has been equipped with " clothes are through " service, this service is provided by Taobao's clothes channel.
Each stop words that relating module 25 is also suitable for stopping using in dictionary is equipped with a plurality of given contents, according to average click-through rate or specific user's personality data, a plurality of given contents is sorted, and to user, recommends most suitable content.
Acquisition module 23 is further adapted for and obtains the natural Search Results corresponding with searched key word, and when returning to given content, return to the nature Search Results is for showing to user.
Here, natural Search Results comprises the various Search Results that in prior art, all kinds of search engines can return.For example, for universal search, natural Search Results refers to this keyword of returning corresponding url list in search engine index storehouse, and for vertical search, natural result is the information such as the picture, news of the particular type returned of corresponding channel or website.Natural result also comprises integration Search Results, for example, at Fig. 1 a, in Fig. 1 b, has comprised the webpage url list mating with this keyword in the results list page, has also comprised the common vertical search results of part such as picture, encyclopaedia.
As mentioned before.May there is clear and definite demand to being identified as the natural result of the searched key word of stop words in certain customers, therefore, acquisition module 23 is further adapted for return to the nature Search Results when returning to given content.Alternatively, when acquisition module 23 returns to given content and natural result, can integrate result, preferentially show given content, show afterwards natural result.As shown in Fig. 4 c, for stop words " little ", the complete works of service of trivial games has preferentially been shown in result page the top, is relevant URL link subsequently, the vertical channel information such as encyclopaedia.
The device providing according to the above embodiment of the present invention, after the searched key word that receiver module provides reception user, retrieval module is retrieved this keyword in the inactive dictionary of building in advance, judge whether it is stop words, for the stop words retrieving, it is the corresponding given content that stop words is equipped with that acquisition module obtains by relating module, and given content is showed to user.Meanwhile, obtain the natural Search Results corresponding with searched key word, return to the nature Search Results when returning to given content.According to this scheme, a large amount of meaningless search or mistake search are by reasonably for recommending the content relevant to searched key word to user, avoided user because returning results of causing of maloperation do not had a situation of practical significance, thereby reduced user's operation, also fully shown the content that search service provider can provide.
The algorithm providing at this is intrinsic not relevant to any certain computer, virtual system or miscellaneous equipment with demonstration.Various general-purpose systems also can with based on using together with this teaching.According to description above, it is apparent constructing the desired structure of this type systematic.In addition, the present invention is not also for any certain programmed language.It should be understood that and can utilize various programming languages to realize content of the present invention described here, and the description of above language-specific being done is in order to disclose preferred forms of the present invention.
In the instructions that provided herein, a large amount of details have been described.Yet, can understand, embodiments of the invention can not put into practice in the situation that there is no these details.In some instances, be not shown specifically known method, structure and technology, so that not fuzzy understanding of this description.
Similarly, be to be understood that, in order to simplify the disclosure and to help to understand one or more in each inventive aspect, in the above in the description of exemplary embodiment of the present invention, each feature of the present invention is grouped together into single embodiment, figure or sometimes in its description.Yet, the method for the disclosure should be construed to the following intention of reflection: the present invention for required protection requires than the more feature of feature of clearly recording in each claim.Or rather, as reflected in claims below, inventive aspect is to be less than all features of disclosed single embodiment above.Therefore, claims of following embodiment are incorporated to this embodiment thus clearly, and wherein each claim itself is as independent embodiment of the present invention.
Those skilled in the art are appreciated that and can the module in the equipment in embodiment are adaptively changed and they are arranged in one or more equipment different from this embodiment.Module in embodiment or unit or assembly can be combined into a module or unit or assembly, and can put them into a plurality of submodules or subelement or sub-component in addition.At least some in such feature and/or process or unit are mutually repelling, and can adopt any combination to combine all processes or the unit of disclosed all features in this instructions (comprising claim, summary and the accompanying drawing followed) and disclosed any method like this or equipment.Unless clearly statement in addition, in this instructions (comprising claim, summary and the accompanying drawing followed) disclosed each feature can be by providing identical, be equal to or the alternative features of similar object replaces.
In addition, those skilled in the art can understand, although embodiment more described herein comprise some feature rather than further feature included in other embodiment, the combination of the feature of different embodiment means within scope of the present invention and forms different embodiment.For example, in the following claims, the one of any of embodiment required for protection can be used with array mode arbitrarily.
All parts embodiment of the present invention can realize with hardware, or realizes with the software module moved on one or more processor, or realizes with their combination.It will be understood by those of skill in the art that and can use in practice microprocessor or digital signal processor (DSP) to realize according to the some or all functions of the some or all parts in the device of the raising stop words search validity of the embodiment of the present invention.The present invention for example can also be embodied as, for carrying out part or all equipment or device program (, computer program and computer program) of method as described herein.Realizing program of the present invention and can be stored on computer-readable medium like this, or can there is the form of one or more signal.Such signal can be downloaded and obtain from internet website, or provides on carrier signal, or provides with any other form.
It should be noted above-described embodiment the present invention will be described rather than limit the invention, and those skilled in the art can design alternative embodiment in the situation that do not depart from the scope of claims.In the claims, any reference symbol between bracket should be configured to limitations on claims.Word " comprises " not to be got rid of existence and is not listed as element or step in the claims.Being positioned at word " " before element or " one " does not get rid of and has a plurality of such elements.The present invention can be by means of including the hardware of some different elements and realizing by means of the computing machine of suitably programming.In having enumerated the unit claim of some devices, several in these devices can be to carry out imbody by same hardware branch.The use of word first, second and C grade does not represent any order.Can be title by these word explanations.

Claims (10)

1. a method that improves stop words search validity, comprising:
Receive the searched key word that user provides;
In the inactive dictionary of setting up in advance, retrieve described searched key word;
If retrieve described searched key word in described inactive dictionary, obtain the given content corresponding with described searched key word, return to described given content for showing to user.
2. method according to claim 1, the foundation of described inactive dictionary further comprises:
Whether the keyword of examining in search engine index storehouse is stop words;
The keyword of examining as stop words is included into inactive dictionary.
3. method according to claim 2, described in the keyword examined in search engine index storehouse whether be that stop words comprises:
According to the Search Results clicking rate of this keyword, examine, the keyword using described Search Results clicking rate under predetermined threshold value is as stop words.
4. according to the method described in claim 1 or 2 or 3, also comprise: for each stop words in described inactive dictionary is equipped with corresponding given content.
5. method according to claim 4, describedly further comprises for each stop words of stopping using in dictionary is equipped with corresponding given content: for each stop words is equipped with the described given content that search engine server and/or third-party server provide.
6. according to the method described in claim 1-5 any one, also comprise: obtain the natural Search Results corresponding with described searched key word;
When returning to described given content, return to described natural Search Results for showing to user.
7. a device that improves stop words search validity, comprising:
Receiver module, is suitable for receiving the searched key word that user provides;
Retrieval module, is suitable for retrieving described searched key word in the inactive dictionary of setting up in advance;
Acquisition module, the in the situation that of being suitable for retrieving described searched key word in described inactive dictionary, obtains the given content corresponding with described searched key word, returns to described given content for showing to user.
8. device according to claim 7, also comprises:
Validating module, whether the keyword that is suitable for examining in search engine index storehouse is stop words, and the keyword of examining as stop words is included into inactive dictionary.
9. device according to claim 8, described validating module is further adapted for according to the Search Results clicking rate of this keyword and examines, and the keyword using described Search Results clicking rate under predetermined threshold value is as stop words.
10. according to the device described in claim 7 or 8 or 9, also comprise relating module, be suitable for being equipped with corresponding given content for each stop words in described inactive dictionary.
CN201310499118.9A 2013-10-22 2013-10-22 It is a kind of to improve the method and apparatus that stop words searches for validity Active CN103530389B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310499118.9A CN103530389B (en) 2013-10-22 2013-10-22 It is a kind of to improve the method and apparatus that stop words searches for validity

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310499118.9A CN103530389B (en) 2013-10-22 2013-10-22 It is a kind of to improve the method and apparatus that stop words searches for validity

Publications (2)

Publication Number Publication Date
CN103530389A true CN103530389A (en) 2014-01-22
CN103530389B CN103530389B (en) 2017-08-22

Family

ID=49932398

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310499118.9A Active CN103530389B (en) 2013-10-22 2013-10-22 It is a kind of to improve the method and apparatus that stop words searches for validity

Country Status (1)

Country Link
CN (1) CN103530389B (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104063418A (en) * 2014-03-17 2014-09-24 百度在线网络技术(北京)有限公司 Search recommendation method and device
CN104133908A (en) * 2014-08-07 2014-11-05 北京奇虎科技有限公司 Method, server, client and system for displaying or generating discussion box on page
CN104217033A (en) * 2014-09-29 2014-12-17 北京奇虎科技有限公司 Search method and device based on timeliness
WO2018192373A1 (en) * 2017-04-21 2018-10-25 北京搜狗科技发展有限公司 Search method and apparatus, and apparatus for searching
CN112328752A (en) * 2021-01-04 2021-02-05 平安科技(深圳)有限公司 Course recommendation method and device based on search content, computer equipment and medium
CN115238683A (en) * 2022-08-09 2022-10-25 平安科技(深圳)有限公司 Method, device, equipment and medium for recognizing stop words circularly and automatically paying attention

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7409383B1 (en) * 2004-03-31 2008-08-05 Google Inc. Locating meaningful stopwords or stop-phrases in keyword-based retrieval systems
CN102982118A (en) * 2012-11-09 2013-03-20 北京奇虎科技有限公司 Searching method and device based on favorites
CN103136339A (en) * 2013-02-01 2013-06-05 百度在线网络技术(北京)有限公司 Searching method, client-side and network server-side based on service information

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7409383B1 (en) * 2004-03-31 2008-08-05 Google Inc. Locating meaningful stopwords or stop-phrases in keyword-based retrieval systems
CN102982118A (en) * 2012-11-09 2013-03-20 北京奇虎科技有限公司 Searching method and device based on favorites
CN103136339A (en) * 2013-02-01 2013-06-05 百度在线网络技术(北京)有限公司 Searching method, client-side and network server-side based on service information

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
ARND CHRISTIAN KöNIG 等: "Click-Through Prediction for News Queries", 《SIGIR "09 PROCEEDINGS OF THE 32ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL》 *
彭冬莲: ""单汉字标引及其检索技术的优化"", 《农业图书情报学刊》 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104063418A (en) * 2014-03-17 2014-09-24 百度在线网络技术(北京)有限公司 Search recommendation method and device
CN104133908A (en) * 2014-08-07 2014-11-05 北京奇虎科技有限公司 Method, server, client and system for displaying or generating discussion box on page
CN104133908B (en) * 2014-08-07 2018-09-04 北京奇虎科技有限公司 Method, server, client and the system that frame is discussed are shown or generated in the page
CN104217033A (en) * 2014-09-29 2014-12-17 北京奇虎科技有限公司 Search method and device based on timeliness
WO2018192373A1 (en) * 2017-04-21 2018-10-25 北京搜狗科技发展有限公司 Search method and apparatus, and apparatus for searching
CN108733717A (en) * 2017-04-21 2018-11-02 北京搜狗科技发展有限公司 A kind of searching method and device, a kind of device for search
CN112328752A (en) * 2021-01-04 2021-02-05 平安科技(深圳)有限公司 Course recommendation method and device based on search content, computer equipment and medium
CN115238683A (en) * 2022-08-09 2022-10-25 平安科技(深圳)有限公司 Method, device, equipment and medium for recognizing stop words circularly and automatically paying attention
CN115238683B (en) * 2022-08-09 2023-06-20 平安科技(深圳)有限公司 Method, device, equipment and medium for recognizing stop words of circulating self-attention

Also Published As

Publication number Publication date
CN103530389B (en) 2017-08-22

Similar Documents

Publication Publication Date Title
CN103514299A (en) Information searching method and device
CN104063454A (en) Search push method and device for mining user demands
US9129009B2 (en) Related links
CN103577597A (en) Keyword searching system based on current browse webpage
CN103530389A (en) Method and device for improving stopword searching effectiveness
CN102915380A (en) Method and system for carrying out searching on data
CN102930054A (en) Data search method and data search system
CN103428076A (en) Method and device for transmitting information to multi-type terminals or applications
CN103488781A (en) Method and search engine server for providing information search
CN103488786A (en) Method and client terminal for providing information search
CN103577596A (en) Keyword searching method and device based on current browse webpage
WO2018156558A1 (en) Systems and methods for direct in-browser markup of elements in internet content
CN102982134A (en) System enabling recommended web site information to be displayed in browser address bar
CN103577595A (en) Keyword pushing method and device based on current browse webpage
CN103577392A (en) Keyword pushing method and device based on current browse webpage
CN102541853A (en) Method and device which are capable of obtaining application information by utilizing browser address bar
US11423096B2 (en) Method and apparatus for outputting information
CN103412901A (en) Method and device for clearing historical records
CN102955850A (en) Method and device for loading sequencing website
CN104021154A (en) Method and device for searching browser
CN106471497A (en) Auxiliary using context browses
CN103530385A (en) Method and device for searching for information based on vertical searching channels
CN103678706A (en) Picture recognition method, system, equipment and device based on screenshot information
CN103870573A (en) Method and device for website analysis
CN104699836A (en) Multi-keyword search prompting method and multi-keyword search prompting device

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20220725

Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015

Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park)

Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd.

Patentee before: Qizhi software (Beijing) Co.,Ltd.

TR01 Transfer of patent right