CN103530389B - It is a kind of to improve the method and apparatus that stop words searches for validity - Google Patents
It is a kind of to improve the method and apparatus that stop words searches for validity Download PDFInfo
- Publication number
- CN103530389B CN103530389B CN201310499118.9A CN201310499118A CN103530389B CN 103530389 B CN103530389 B CN 103530389B CN 201310499118 A CN201310499118 A CN 201310499118A CN 103530389 B CN103530389 B CN 103530389B
- Authority
- CN
- China
- Prior art keywords
- search
- keyword
- user
- stop words
- given content
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000000034 method Methods 0.000 title claims abstract description 35
- 230000009849 deactivation Effects 0.000 claims abstract description 28
- 235000013399 edible fruits Nutrition 0.000 claims description 3
- 238000012795 verification Methods 0.000 claims description 2
- 238000012545 processing Methods 0.000 description 7
- 230000008901 benefit Effects 0.000 description 3
- 238000004590 computer program Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000007717 exclusion Effects 0.000 description 2
- 230000008569 process Effects 0.000 description 2
- 238000013515 script Methods 0.000 description 2
- 238000004422 calculation algorithm Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 238000013461 design Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 238000001035 drying Methods 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 230000006870 function Effects 0.000 description 1
- 230000009466 transformation Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9535—Search customisation based on user profiles and personalisation
Abstract
The method and apparatus that stop words searches for validity are improved the invention discloses a kind of, wherein method includes:Receive the search keyword that user provides;The search keyword is retrieved in the deactivation dictionary pre-established;If retrieving the search keyword in the deactivation dictionary, given content corresponding with the search keyword is obtained, returning to the given content is used to show to user.The scheme provided according to the present invention, substantial amounts of meaningless search or by mistake search are reasonably used for recommending the content related to search keyword to user, avoid user due to caused by maloperation returning result do not have the situation of practical significance completely, so as to reduce the operation of user, the content that also fully displaying search service provider can provide.
Description
Technical field
The present invention relates to network data communication technical field, and in particular to a kind of method that raising stop words searches for validity
And device.
Background technology
Search engine is that user utilizes one of main path of Internet resources.Search engine uses specific computer journey
Sequence collects info web from internet, and keyword is extracted from webpage, forms the index database of keyword.User's input is to be checked
When asking keyword, search engine finds the webpage for matching the keyword, is presented to user from the index library searching keyword.
When user is scanned for using search engine, it may appear that a large amount of meaningless search are searched for or missed without specific purpose
The situation of search.For example, user has initiated searching request when keyword is not yet completely inputted due to there is maloperation, or
Only input single English character or numeral or punctuate or Chinese word character (" stop words ") has just initiated searching request, at this moment, search is drawn
The natural search result for the keyword that still can find and return the input from index database is held up, however, these natural results are logical
Often it is and without targetedly search result or meaning.Two examples of such case are given in Fig. 1 a and Fig. 1 b, and
Processing mode of the prior art.As illustrated in figs. 1A and ib, it is assumed that user is to search for the key of some " l " or " d " beginning
Word (for example, " 163 mailbox ", " dota2 "), but clicks unintentionally search button after input " l " or " d ", or according to
Each input provides the service searched for immediately, and that search engine is returned to " l " and the search result of " d ", from searching in figure
From the point of view of hitch fruit, it is clear that it does not have too big meaning to user, therefore, user generally to the clicking rate of this search result also very
Lowly, according to statistics, this meaningless or search by mistake result clicking rate is general below 0.05, and the average result of normal searching
Clicking rate is then more than 1.5 times, and the request amount of this search for simultaneously scanning for occurring in engine is also very huge.
It can be seen that, search for substantial amounts of this meaningless search or by mistake, be nothing according only to its literal meaning return to the nature result
Method meets user's request, result in low-down traffic transformation rate, can also be provided without fully displaying search service provider
Content, the recommendation of effective information is carried out to user.
The content of the invention
In view of the above problems, it is proposed that the present invention so as to provide one kind overcome above mentioned problem or at least in part solve on
State the method and apparatus that a kind of raising stop words of problem searches for validity.
According to an aspect of the invention, there is provided a kind of method for improving stop words search validity, including:Receive and use
The search keyword that family is provided;Search keyword is retrieved in the deactivation dictionary pre-established;If retrieved in dictionary is disabled
To search keyword, given content corresponding with search keyword is obtained, returning to given content is used to show to user.
Wherein, the foundation for disabling dictionary further comprises:Whether the keyword examined in search engine index storehouse is deactivation
Word;By examining deactivation dictionary is included into for the keyword of stop words.
Further, whether the keyword examined in search engine index storehouse is that stop words includes:According to the keyword
Search result clicking rate is examined, and regard keyword of the search result clicking rate under predetermined threshold value as stop words.
Further, this method also includes being equipped with corresponding given content for each stop words disabled in dictionary, specifically
Ground, can be equipped with the given content that search engine server and/or third-party server are provided by each stop words.
Further, this method also includes:Obtain natural search result corresponding with search keyword;Specify interior returning
The natural search result is returned while appearance to be used to show to user.
According to another aspect of the present invention there is provided a kind of device for improving stop words search validity, including:Receive mould
Block, suitable for receiving the search keyword that user provides;Module is retrieved, suitable for the retrieval search pass in the deactivation dictionary pre-established
Keyword;Acquisition module, suitable in the case where retrieving search keyword in disabling dictionary, obtaining corresponding with search keyword
Given content, returning to given content is used to show to user.
Further, the device also includes validating module, suitable for examine the keyword in search engine index storehouse whether be
Stop words, and it is included into deactivation dictionary by examining for the keyword of stop words.
The validating module is further adapted for being examined according to the search result clicking rate of the keyword, by search result point
Keyword of the rate under predetermined threshold value is hit as stop words.
Further, the device also includes relating module, and each stop words outfit for being suitable for disabling in dictionary is corresponding
Given content.The relating module is further adapted for each stop words and is equipped with search engine server and/or third-party server
The given content provided.
Acquisition module is further adapted for obtaining natural search result corresponding with search keyword, is returning to given content
Return to the nature search result is used to show to user simultaneously.
In information retrieval field, search engine can ignore automatically in index pages or processing searching request some words or
Word, these words or word are to be referred to as stop words (Stop Words).Stop words mainly include English character, numeral, mathematical character,
Punctuation mark or the extra-high Chinese word character of frequency of use etc..
The scheme provided according to the present invention, search engine server is after the search keyword that user provides is received, not
Return to the nature search result immediately, and the search keyword is retrieved in the deactivation dictionary of built in advance, to judge whether it is deactivation
Word, for the stop words retrieved, obtains corresponding given content, and given content is showed into user.According to the program, greatly
The meaningless search of amount or by mistake search be reasonably used for recommending the content related to search keyword to user, it is to avoid user
Because returning result does not have the situation of practical significance completely caused by maloperation, so as to reduce the operation of user, also fully
The content that displaying search service provider can provide.
Described above is only the general introduction of technical solution of the present invention, in order to better understand the technological means of the present invention,
And can be practiced according to the content of specification, and in order to allow above and other objects of the present invention, feature and advantage can
Become apparent, below especially exemplified by the embodiment of the present invention.
Brief description of the drawings
By reading the detailed description of hereafter preferred embodiment, various other advantages and benefit is common for this area
Technical staff will be clear understanding.Accompanying drawing is only used for showing the purpose of preferred embodiment, and is not considered as to the present invention
Limitation.And in whole accompanying drawing, identical part is denoted by the same reference numerals.In the accompanying drawings:
Fig. 1 a and Fig. 1 b show the schematic diagram for the processing mode searched in the prior art to stop words;
Fig. 2 shows the flow chart of the method for raising stop words search validity according to an embodiment of the invention;
Fig. 3 shows the flow of the method for improving stop words search validity according to another embodiment of the invention
Figure;
Fig. 4 a show the example that the method processing stop words provided according to the present invention is searched for;
Fig. 4 b show another example that the method processing stop words provided according to the present invention is searched for;
Fig. 4 c show another example that the method processing stop words provided according to the present invention is searched for;
Fig. 5 shows that raising stop words according to an embodiment of the invention searches for the structural representation of validity device
Figure.
Embodiment
The exemplary embodiment of the disclosure is more fully described below with reference to accompanying drawings.Although showing the disclosure in accompanying drawing
Exemplary embodiment, it being understood, however, that may be realized in various forms the disclosure without should be by embodiments set forth here
Limited.On the contrary, these embodiments are provided to facilitate a more thoroughly understanding of the present invention, and can be by the scope of the present disclosure
Complete conveys to those skilled in the art.
Fig. 2 shows the flow chart of the method for raising stop words search validity according to an embodiment of the invention.Such as
Shown in Fig. 2, this method comprises the following steps:
Step S101, receives the search keyword that user provides.
User inputs search keyword in the client, and client generates the searching request for including search keyword information,
And send to search engine server.
Step S102, search keyword is retrieved in the deactivation dictionary pre-established.
Examine out stop words in the search keyword that the step is inputted from user, in the present embodiment subsequent step to knot
Fruit obtains and does specially treated.If a certain search keyword of input is verified as stop words, execution step S103.
Step S103, obtains given content corresponding with search keyword, and returning to given content is used to show to user.
Search service provider set up disable dictionary when, be each keyword therein equipped with given content.If search
Keyword is stop words, and search engine server obtains given content from dictionary is disabled, and is returned as returning result or part
As a result user is showed.
The method provided according to the above embodiment of the present invention, search engine server is receiving the search key that user provides
After word, not return to the nature search result immediately, and the search keyword is retrieved in the deactivation dictionary of built in advance, to judge that it is
No is stop words, for the stop words retrieved, obtains corresponding given content, given content is showed into user.According to this
Scheme, by mistake substantial amounts of meaningless search or search are reasonably used for recommending the content related to search keyword to user, are kept away
Exempted from user due to caused by maloperation returning result do not have the situation of practical significance, so as to reduce the operation of user,
The content that fully displaying search service provider can provide.
Fig. 3 shows the flow of the method for improving stop words search validity according to another embodiment of the invention
Figure.As shown in figure 3, this method comprises the following steps:
Step S201, receives the search keyword that user provides.
Search engine server receives the search keyword that user provides in a variety of ways, for example, user is in search homepage
Middle input search keyword, searched page generates the searching request for including search keyword.Searched page is by corresponding service provider
There is provided, its file format is usually HTML, the page is shown in a browser, request generation code is with html language, java scripts
It is included in etc. form in page source code.Searched page is handed over by browser based on the agreements such as HTTP and search engine server
Mutually.
Step S202, in the search keyword for disabling retrieval user's offer in dictionary pre-established.
Step S203, judges whether to retrieve search keyword in dictionary is disabled.
Whether the step uses the method for Keywords matching to judge the search keyword of user's input for stop words.Stop words
Included in storehouse and be searched the search keyword that engine server is judged as stop words.By this search keyword and stop words
Keyword in storehouse matches, and if the situation that the match is successful occurs, then the search keyword by this input is judged as stopping
Word, and perform step S204;If matching is unsuccessful, this search is handled as normal searching, step S205 is performed.
Dictionary is disabled typically to be provided by search service provider.Deactivation dictionary is set up to further comprise:Examine search engine index
Whether the search keyword in storehouse is stop words, and the search keyword examined as stop words is included into deactivation dictionary.
First, disable dictionary can by common numeral, letter without certain sense etc. examine be stop words, such as 1,2,
130th, a, b, ze, while needing the search keyword with its meaning such as exclusion 58,110,126.
Whether it is stop words that a certain keyword can be examined according to the desirability of user.The demand of user can pass through
Keyword search number of times, search result clicking rate etc. embody.For example, search engine server can record search keyword one
The information such as searching request quantity, search rate in the section time, by request amount is larger and search rate is less than the pass of certain threshold value
Keyword is included as stop words.The stop words of this typical type includes:First, eh, Oh, the Chinese character such as small.
Alternatively, examined according to the search result clicking rate of search keyword.Under normal circumstances, search engine service
Device can at least meet the portion requirements of user to the returning result of non-stop words, therefore, and the result clicking rate of normal searching is higher.
And stop words can not reflect the real intention of user, search result is seldom clicked on to user's usually not meaning.Search is drawn
Holding up server can count to substantial amounts of searching request, draw the average result of each search keyword in a period of time
Clicking rate, regard search keyword of the average result clicking rate under predetermined threshold value as stop words.For example, data according to statistics,
The average result clicking rate of normal searching more than 1.5 times, and stop words search average result clicking rate under 0.05, then
Can be some numerical value in 0-0.05 by threshold preset.
Further, the performance data that stop words can contemplate time, region characteristic and user is examined.For example, part is noted
Volume user may have clear and definite intention to stop words of a certain average result clicking rate under threshold value, and clicking rate is higher, then may be used
To recognize the information such as ID, to this user, the keyword is classified as normal searching keyword.
Step S204, obtains given content corresponding with search keyword, and returning to given content is used to show to user.
Search service provider set up disable dictionary when, be each search keyword therein equipped with given content.If
This search keyword is judged as stop words, and search engine server obtains given content from dictionary is disabled, is used as return
Or part returning result shows user as a result.
Provisioned given content is used to provide the user recommendation information when user's request is not affected by and met.In view of stopping
Producing reason is searched in word, is that caused by the maloperation of user, therefore, the given content of outfit is best as a rule
There is certain association with user's input content.Meanwhile, to improve the effective percentage of information recommendation, given content should be and input content
Related, the higher information of demand degree.For example, in Fig. 4 a, 4b, in 4c, for stop words " l ", " y " and " small ", given content
Respectively one higher, center of clicking rate and demand degree is online plays, and clothes class merchandise display and trivial games are complete works of.
Given content is equipped with to further comprise being equipped with search engine server and/or third-party server for each stop words
The given content provided.Search service provider can search for by stop words recommends the quality information of itself to user, for example, pushing away
Recommend and show homegrown resource with the related webpage URL addresses of stop words, popular video, picture etc. to user, attract user to visit
Ask.It can also be equipped with for stop words from third-party given contents such as partners, for example, being that stop words " l " is matched somebody with somebody in fig .4
Serviced for online play in one, center, the service is that stop words " y " is matched somebody with somebody in CNTV (Chinese Network TV Station), Fig. 4 b
For " clothes go directly " service, the service is provided by Taobao's clothes channel.
Multiple given contents can also be equipped with for each stop words disabled in dictionary, according to average click-through rate or specific use
The personality data at family is sorted to multiple given contents, and most suitable content is recommended to user.
Further, in this step, natural search result corresponding with search keyword is obtained.Returning to given content
While return to the nature search result be used to show to user.
Here, natural search result includes the various search results that all kinds of search engines can be returned in the prior art.Example
Such as, for universal search, natural search result is to refer to the keyword returned corresponding URL column in search engine index storehouse
Table, for vertical search, natural result is the information such as certain types of picture, news that corresponding channel or website are returned.It is natural
As a result also include integrating search result, for example, in Fig. 1 a, Fig. 1 b, in the results list page including and the Keywords matching
Webpage url list, also include the common vertical search result in the parts such as picture, encyclopaedia.
As described in step S202, certain customers may be to being identified as the search keyword of stop words
There is clear and definite demand in natural result, therefore, for such user while given content is returned also return to the nature in need
Search result.Alternatively, result can be integrated when returning to given content and natural result, preferentially shows given content,
Natural result is shown afterwards.As illustrated in fig. 4 c, for stop words " small ", result page the top preferentially illustrates trivial games complete works clothes
Business, is followed by the URL link of correlation, the vertical channel information such as encyclopaedia.
Step S205, obtains the corresponding natural search result of search keyword, return to the nature from search engine index storehouse
Search result is used to show to user.
When not retrieving search keyword in disabling dictionary, this search is handled as normal searching, only returned
Returning the corresponding natural search result of search keyword is used to show to user.
The method provided according to the above embodiment of the present invention, search engine server is receiving the search key that user provides
After word, not return to the nature search result immediately, and the keyword is retrieved in the deactivation dictionary of built in advance, with judge its whether be
Stop words, for the stop words retrieved, obtains corresponding given content, given content is showed into user.Meanwhile, obtain with
The corresponding natural search result of search keyword, return to the nature search result while returning to given content.According to the program, greatly
The meaningless search of amount or by mistake search be reasonably used for recommending the content related to search keyword to user, it is to avoid user
Because returning result does not have the situation of practical significance caused by maloperation, so that the operation of user is reduced, also fully displaying
The content that search service provider can provide.
Fig. 5 shows the structural representation of the device of raising stop words search validity according to an embodiment of the invention
Figure.As shown in figure 5, the device includes:Receiving module 21, retrieval module 22 and acquisition module 23.
Receiving module 21 is suitable to receive the search keyword that user provides.Receiving module 21 can receive user with various sides
The search keyword that formula is provided, for example, user inputs search keyword in search homepage, searched page generation is closed comprising search
The searching request of keyword.Searched page is provided by corresponding service provider, and its file format is usually HTML, and the page, which is shown in, to be browsed
In device, request generation code is with html language, and the form such as java scripts is included in page source code.Receiving module 21 is received and searched
The rope page is by browser, the search keyword sent based on agreements such as HTTP.
Retrieval module 22 is suitable to retrieve search keyword in the deactivation dictionary pre-established.
Retrieval module 22 judges whether the search keyword that user inputs is stop words by the method for Keywords matching.Stop
The search keyword that engine is judged as stop words is searched with being included in dictionary.Module 22 is retrieved by this search keyword
Match with disabling the keyword in dictionary, if the situation that the match is successful occurs, then by the search keyword of this input
It is judged as stop words.
The device for improving stop words search validity further comprises validating module 24, and validating module 24 is searched suitable for verification
Whether the search keyword in rope engine library is stop words, and the search keyword examined as stop words is included into deactivation dictionary.
First, validating module 24 can by common numeral, letter without certain sense etc. examine be stop words, such as 1,
2nd, 130, a, b, ze, while needing exclusion 58,110,126 etc. that there is the search keyword of its meaning.
Validating module 24 is further adapted for examining whether a certain keyword is stop words according to the desirability of user.User's
Demand can be embodied by keyword search number of times, search result clicking rate etc..Closed for example, validating module 24 can record search
The information such as searching request quantity, search rate of the keyword within a period of time, request amount is larger and search rate is less than certain threshold
The keyword of value is included as stop words.The stop words of this typical type includes:First, eh, Oh, the Chinese character such as small.
Alternatively, validating module 24 is suitable to be examined according to the search result clicking rate of search keyword.Normal condition
Under, search engine can at least meet the portion requirements of user to the returning result of non-stop words, therefore, the result points of normal searching
Hit rate higher.And stop words can not reflect the real intention of user, search result is to user's usually not meaning, seldom by point
Hit.Validating module 24 can be counted to substantial amounts of searching request, draw the flat of each interior search keyword of a period of time
Result clicking rate, regard keyword of the average result clicking rate under predetermined threshold value as stop words.For example, counting according to statistics
According to, the average result clicking rate of normal searching more than 1.5 times, and the average result clicking rate of stop words search 0.05 it
Under, then threshold preset can be some numerical value in 0-0.5 by validating module 24.
Further, validating module 24 examine stop words can using the performance data of time, region characteristic and user as
Input data.For example, part registered user may have clearly to stop words of a certain average result clicking rate under threshold value
It is intended to, then the keyword can be classified as normal searching and closed by validating module 24 by recognizing the information such as ID, to this user
Keyword.
Acquisition module 23 is suitable to, in the case where retrieving search keyword in disabling dictionary, obtain and search keyword pair
The given content answered, returning to given content is used to show to user.
Search service provider set up disable dictionary when, be each search keyword therein equipped with given content.If
This search keyword is judged as stop words, and acquisition module 23 obtains given content from dictionary is disabled, is used as returning result
Or part returning result shows user.
If retrieval module 22 does not retrieve this search keyword in dictionary is disabled, acquisition module 23 searches this
The processing of Suo Zuowei normal searchings, only obtains the corresponding natural result of search keyword from search engine index storehouse.
Also include relating module 25 in the device for improving stop words search validity, be suitable for disabling each stopping in dictionary
Word is equipped with corresponding given content.
Relating module 25 is that the given content provisioned in stop words is used to be that user carries when user's request is not affected by and met
For recommendation information.Search Producing reason is searched for or miss in view of meaningless, led by the maloperation of user as a rule
Cause, therefore, the given content that relating module 25 is equipped with preferably has certain association with user's input content.Meanwhile, to improve letter
The effective percentage recommended is ceased, given content should be, demand degree higher information related to input content.For example, in Fig. 4 a,
In 4b, 4c, acquisition module stop words " l ", " y " and " small ", given content is respectively clicking rate and the higher center one of demand degree
Platform is played online, and clothes class merchandise display and trivial games are complete works of.
Relating module 25 is further adapted for each stop words and is equipped with search engine server and/or third-party server institute
The given content of offer.Search service provider can recommend the quality information of itself to user by meaningless search or by mistake search,
For example, recommending and the related webpage URL addresses of stop words, popular video, picture etc. shows homegrown resource to user, attracts
User accesses.Relating module 25 is further adapted to stop words and is equipped with from third-party given contents such as partners, for example, in Fig. 4 a
In, relating module 25 is that stop words " l " is equipped with the online broadcasting service in one, center, and the service is from CNTV (Chinese network electricity
Television stations), it is that stop words " y " is equipped with " clothes go directly " service, the service is provided by Taobao's clothes channel in Fig. 4 b.
Each stop words that relating module 25 is further adapted for disabling in dictionary is equipped with multiple given contents, according to average click-through rate
Or the personality data of specific user sorts to multiple given contents, most suitable content is recommended to user.
Acquisition module 23 is further adapted for obtaining natural search result corresponding with search keyword, is returning to given content
While return to the nature search result be used to show to user.
Here, natural search result includes the various search results that all kinds of search engines can be returned in the prior art.Example
Such as, for universal search, natural search result is to refer to the keyword returned corresponding URL column in search engine index storehouse
Table, for vertical search, natural result is the information such as certain types of picture, news that corresponding channel or website are returned.It is natural
As a result also include integrating search result, for example, in Fig. 1 a, Fig. 1 b, in the results list page including and the Keywords matching
Webpage url list, also include the common vertical search result in the parts such as picture, encyclopaedia.
As mentioned before.Certain customers may exist bright to the natural result for the search keyword for being identified as stop words
True demand, therefore, acquisition module 23 are further adapted for return to the nature search result while given content is returned.Alternatively,
Acquisition module 23 can be integrated when returning to given content and natural result to result, preferentially show given content, Zhi Houzhan
Show natural result.As illustrated in fig. 4 c, for stop words " small ", result page the top preferentially illustrates trivial games complete works service, with
It is related URL link afterwards, the vertical channel information such as encyclopaedia.
The device provided according to the above embodiment of the present invention, receiving module receive user provide search keyword after,
Retrieval module retrieves the keyword in the deactivation dictionary of built in advance, and whether be stop words, for the stop words retrieved if judging it,
Acquisition module is obtained by the corresponding given content that relating module is stop words outfit, and given content is showed into user.Meanwhile, obtain
Natural search result corresponding with search keyword is taken, return to the nature search result while returning to given content.According to the party
Case, by mistake substantial amounts of meaningless search or search are reasonably used for recommending the content related to search keyword to user, it is to avoid
User due to caused by maloperation returning result do not have the situation of practical significance, so as to reduce the operation of user, also fill
Divide and illustrate the content that search service provider can provide.
Algorithm and display be not inherently related to any certain computer, virtual system or miscellaneous equipment provided herein.
Various general-purpose systems can also be used together with based on teaching in this.As described above, construct required by this kind of system
Structure be obvious.In addition, the present invention is not also directed to any certain programmed language.It is understood that, it is possible to use it is various
Programming language realizes the content of invention described herein, and the description done above to language-specific is to disclose this hair
Bright preferred forms.
In the specification that this place is provided, numerous specific details are set forth.It is to be appreciated, however, that the implementation of the present invention
Example can be put into practice in the case of these no details.In some instances, known method, structure is not been shown in detail
And technology, so as not to obscure the understanding of this description.
Similarly, it will be appreciated that in order to simplify the disclosure and help to understand one or more of each inventive aspect, exist
Above in the description of the exemplary embodiment of the present invention, each feature of the invention is grouped together into single implementation sometimes
In example, figure or descriptions thereof.However, the method for the disclosure should be construed to reflect following intention:It is i.e. required to protect
The application claims of shield features more more than the feature being expressly recited in each claim.More precisely, such as following
Claims reflect as, inventive aspect is all features less than single embodiment disclosed above.Therefore,
Thus the claims for following embodiment are expressly incorporated in the embodiment, wherein each claim is in itself
All as the separate embodiments of the present invention.
Those skilled in the art, which are appreciated that, to be carried out adaptively to the module in the equipment in embodiment
Change and they are arranged in one or more equipment different from the embodiment.Can be the module or list in embodiment
Member or component be combined into a module or unit or component, and can be divided into addition multiple submodule or subelement or
Sub-component.In addition at least some in such feature and/or process or unit exclude each other, it can use any
Combination is disclosed to all features disclosed in this specification (including adjoint claim, summary and accompanying drawing) and so to appoint
Where all processes or unit of method or equipment are combined.Unless expressly stated otherwise, this specification (including adjoint power
Profit is required, summary and accompanying drawing) disclosed in each feature can or similar purpose identical, equivalent by offer alternative features come generation
Replace.
Although in addition, it will be appreciated by those of skill in the art that some embodiments described herein include other embodiments
In included some features rather than further feature, but the combination of the feature of be the same as Example does not mean in of the invention
Within the scope of and form different embodiments.For example, in the following claims, times of embodiment claimed
One of meaning mode can be used in any combination.
The present invention all parts embodiment can be realized with hardware, or with one or more processor run
Software module realize, or realized with combinations thereof.It will be understood by those of skill in the art that can use in practice
Microprocessor or digital signal processor (DSP) realize the according to embodiments of the present invention stop words search validity that improves
The some or all functions of some or all parts in device.The present invention is also implemented as being used to perform being retouched here
The some or all equipment or program of device (for example, computer program and computer program product) for the method stated.
Such program for realizing the present invention can be stored on a computer-readable medium, or can have one or more signal
Form.Such signal can be downloaded from internet website and obtained, either on carrier signal provide or with it is any its
He provides form.
It should be noted that the present invention will be described rather than limits the invention for above-described embodiment, and ability
Field technique personnel can design alternative embodiment without departing from the scope of the appended claims.In the claims,
Any reference symbol between bracket should not be configured to limitations on claims.Word "comprising" is not excluded the presence of not
Element or step listed in the claims.Word "a" or "an" before element does not exclude the presence of multiple such
Element.The present invention can be by means of including the hardware of some different elements and coming real by means of properly programmed computer
It is existing.In if the unit claim of equipment for drying is listed, several in these devices can be by same hardware branch
To embody.The use of word first, second, and third does not indicate that any order.These words can be explained and run after fame
Claim.
Claims (10)
1. a kind of method for improving stop words search validity, including:
Receive the search keyword that user provides;
The search keyword is retrieved in the deactivation dictionary pre-established;It is the deactivation when setting up the deactivation dictionary
Each search keyword in dictionary is equipped with given content;Provisioned given content is used to be not affected by satisfaction in user's request
When provide the user recommendation information;
If retrieving the search keyword in the deactivation dictionary, obtain in specify corresponding with the search keyword
Hold, returning to the given content is used to show to user;The given content is according to average click-through rate or the individual character of specific user
Data are ranked up.
2. according to the method described in claim 1, the foundation for disabling dictionary further comprises:
Whether the keyword examined in search engine index storehouse is stop words;
By examining deactivation dictionary is included into for the keyword of stop words.
3. whether the keyword in method according to claim 2, the verification search engine index storehouse is stop words bag
Include:
Examined according to the search result clicking rate of the keyword, by the search result clicking rate under predetermined threshold value
Keyword is used as stop words.
4. according to the method described in claim 1, each search keyword in the dictionary for deactivation is equipped with given content
Further comprise:The finger that search engine server and/or third-party server are provided is equipped with by each search keyword
Determine content.
5. the method according to claim any one of 1-4, in addition to:Acquisition is corresponding with the search keyword to search naturally
Hitch fruit;
The natural search result is returned while the given content is returned to be used to show to user.
6. a kind of device for improving stop words search validity, including:
Receiving module, suitable for receiving the search keyword that user provides;
Module is retrieved, suitable for retrieving the search keyword in the deactivation dictionary pre-established;
Acquisition module, in the case of retrieving the search keyword in the deactivation dictionary, is obtained and the search
The corresponding given content of keyword, returning to the given content is used to show to user;The given content is according to average click
Rate or the personality data of specific user are ranked up;
Relating module, for set up it is described deactivation dictionary when, be it is described deactivation dictionary in each search keyword equipped with
Given content;Provisioned given content is used to provide the user recommendation information when user's request is not affected by and met.
7. device according to claim 6, in addition to:
Whether validating module, be stop words suitable for examining the keyword in search engine index storehouse, and will be examined as stop words
Keyword is included into deactivation dictionary.
8. device according to claim 7, the validating module is further adapted for the search result point according to the keyword
The rate of hitting is examined, and regard keyword of the search result clicking rate under predetermined threshold value as stop words.
9. device according to claim 6, the relating module is further adapted for each search keyword and is equipped with search
The given content that engine server and/or third-party server are provided.
10. the device according to any one of claim 6-9, the acquisition module is further adapted for obtaining and the search
The corresponding natural search result of keyword, return the given content while return the natural search result be used for
Family is shown.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310499118.9A CN103530389B (en) | 2013-10-22 | 2013-10-22 | It is a kind of to improve the method and apparatus that stop words searches for validity |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310499118.9A CN103530389B (en) | 2013-10-22 | 2013-10-22 | It is a kind of to improve the method and apparatus that stop words searches for validity |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103530389A CN103530389A (en) | 2014-01-22 |
CN103530389B true CN103530389B (en) | 2017-08-22 |
Family
ID=49932398
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310499118.9A Active CN103530389B (en) | 2013-10-22 | 2013-10-22 | It is a kind of to improve the method and apparatus that stop words searches for validity |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103530389B (en) |
Families Citing this family (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104063418A (en) * | 2014-03-17 | 2014-09-24 | 百度在线网络技术(北京)有限公司 | Search recommendation method and device |
CN104133908B (en) * | 2014-08-07 | 2018-09-04 | 北京奇虎科技有限公司 | Method, server, client and the system that frame is discussed are shown or generated in the page |
CN104217033B (en) * | 2014-09-29 | 2017-11-07 | 北京奇虎科技有限公司 | Based on ageing searching method and device |
CN108733717A (en) * | 2017-04-21 | 2018-11-02 | 北京搜狗科技发展有限公司 | A kind of searching method and device, a kind of device for search |
CN112328752B (en) * | 2021-01-04 | 2021-06-15 | 平安科技(深圳)有限公司 | Course recommendation method and device based on search content, computer equipment and medium |
CN115238683B (en) * | 2022-08-09 | 2023-06-20 | 平安科技(深圳)有限公司 | Method, device, equipment and medium for recognizing stop words of circulating self-attention |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103136339A (en) * | 2013-02-01 | 2013-06-05 | 百度在线网络技术(北京)有限公司 | Searching method, client-side and network server-side based on service information |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7409383B1 (en) * | 2004-03-31 | 2008-08-05 | Google Inc. | Locating meaningful stopwords or stop-phrases in keyword-based retrieval systems |
CN102982118B (en) * | 2012-11-09 | 2017-04-19 | 北京奇虎科技有限公司 | Searching method and device based on favorites |
-
2013
- 2013-10-22 CN CN201310499118.9A patent/CN103530389B/en active Active
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103136339A (en) * | 2013-02-01 | 2013-06-05 | 百度在线网络技术(北京)有限公司 | Searching method, client-side and network server-side based on service information |
Non-Patent Citations (1)
Title |
---|
"单汉字标引及其检索技术的优化";彭冬莲;《农业图书情报学刊》;20050405;第17卷(第4期);第2.3节和3.2节 * |
Also Published As
Publication number | Publication date |
---|---|
CN103530389A (en) | 2014-01-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103530389B (en) | It is a kind of to improve the method and apparatus that stop words searches for validity | |
CN102298616B (en) | Method and device for providing related sub links in search result | |
CN103577597B (en) | Keyword search system based on current browse webpage | |
CN103631887B (en) | Browser side carries out the method and browser of web search | |
US9348934B2 (en) | Systems and methods for facilitating open source intelligence gathering | |
US8341157B2 (en) | System and method for intent-driven search result presentation | |
CN103577596B (en) | Keyword search methodology and device based on current browse webpage | |
CN103412881B (en) | The method and system of Search Results are provided | |
CN103577595B (en) | Keyword method for pushing and device based on current browse webpage | |
US20180210895A1 (en) | Generating descriptive text for images | |
CA3153598A1 (en) | Method of and device for predicting video playback integrity | |
CN103984740B (en) | Based on the method and system that the retrieved page of combination tag shows | |
CN103699669B (en) | The method of message push and a kind of browser terminal is carried out in a kind of browser | |
CN109101658B (en) | Information searching method and device, and equipment/terminal/server | |
KR20170140226A (en) | Information retrieval navigation method and apparatus | |
US9378276B1 (en) | Systems and methods for generating navigation filters | |
CN103823907B (en) | A kind of method, apparatus and engine for integrating online video resource address | |
CN105630907A (en) | Method for assembling android application based on content of application | |
US11423096B2 (en) | Method and apparatus for outputting information | |
CN102855256A (en) | Method, device and equipment for determining evaluation information of websites | |
US11630875B2 (en) | Searching and aggregating web pages | |
CN106446115A (en) | Mobile Internet user classification method and device | |
CN103838862B (en) | Video searching method, device and terminal | |
CN110175264A (en) | Construction method, server and the computer readable storage medium of video user portrait | |
CN104090757A (en) | Method and device for displaying rich media information in browser |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220725 Address after: Room 801, 8th floor, No. 104, floors 1-19, building 2, yard 6, Jiuxianqiao Road, Chaoyang District, Beijing 100015 Patentee after: BEIJING QIHOO TECHNOLOGY Co.,Ltd. Address before: 100088 room 112, block D, 28 new street, new street, Xicheng District, Beijing (Desheng Park) Patentee before: BEIJING QIHOO TECHNOLOGY Co.,Ltd. Patentee before: Qizhi software (Beijing) Co.,Ltd. |